Agenda | Kisaco Research

2023 Agenda

Check out the initial agenda for the Efficient Generative AI Summit (Sept 11, Santa Clara Marriott). Speakers will be added soon so stay tuned...


09:00
Registration & Morning Networking
10:00 AM

Generative AI is going to play a crucial role in building the embodied internet, aka Metaverse. Democratizing the metaverse would imply giving every creator a set of tools to dream up and build their own immersive canvas that they can share with others. In this talk, I will discuss the challenges and opportunities in generative AI for synthesizing 3D objects, creating/editing digital twins, building virtual worlds, and accelerating the synthesis pipeline.

Author:

Vikas Chandra

Senior Director, AI
Meta

Vikas Chandra is Senior Director at Meta Reality Labs where he works on AI research focusing on AR and VR products. Prior to Meta, he was Director of Applied Machine Learning at Arm Inc. He received his Ph.D. and M.S. degrees in Electrical and Computer Engineering from Carnegie Mellon University. He held the positions of Visiting Scholar (2011 – 2014) and Visiting Faculty (2016 - 2017) in the EE department at Stanford University. He has authored 120+ research publications and is an inventor on 40+ US and international patents. Dr. Chandra received the ACM-SIGDA Technical Leadership Award in 2009 and was invited to the 2017 Frontiers of Engineering Symposium organized by the National Academy of Engineering. He is a senior member of IEEE.

Vikas Chandra

Senior Director, AI
Meta

Vikas Chandra is Senior Director at Meta Reality Labs where he works on AI research focusing on AR and VR products. Prior to Meta, he was Director of Applied Machine Learning at Arm Inc. He received his Ph.D. and M.S. degrees in Electrical and Computer Engineering from Carnegie Mellon University. He held the positions of Visiting Scholar (2011 – 2014) and Visiting Faculty (2016 - 2017) in the EE department at Stanford University. He has authored 120+ research publications and is an inventor on 40+ US and international patents. Dr. Chandra received the ACM-SIGDA Technical Leadership Award in 2009 and was invited to the 2017 Frontiers of Engineering Symposium organized by the National Academy of Engineering. He is a senior member of IEEE.

10:30 AM

Enterprises today face increasing pressure to incorporate GPT-scale large language models, or LLMs, throughout their product suites. However, these LLMs are complex, cumbersome, and costly to deploy. Unlike the human brain, which consumes a mere 30 watts of power, LLMs require massive compute and power footprints that are harming the planet. To address these issues, significant research is being conducted into LLM optimization but skyrocketing demand is outpacing these improvements. In this talk, we show how a new approach, grounded in neuroscience research, can significantly improve the performance of LLMs and provide a sustainable path forward. 

In this presentation, Subutai will:

  • Provide an overview of the Generative AI landscape and the challenges companies face in deploying LLMs.
  • Review recent neuroscience discoveries that show how our brain achieves its incredible computational efficiency.
  • Demonstrate how Numenta’s AI platform can apply these discoveries to AI and enable enterprises to deploy powerful LLMs at scale in an extremely cost-effective manner.

Author:

Subutai Ahmad

CEO
Numenta

Subutai is passionate about neuroscience, deep learning, and building intelligent systems. An accomplished technologist, he has been instrumental in driving Numenta’s research, technology and business since 2005. He previously served as VP Engineering at YesVideo where he helped grow the company from a three-person start-up to a leader in automated digital media authoring. In 1997, Subutai co-founded ePlanet Interactive which developed the IntelPlay Me2Cam, the first computer vision product developed for consumers. Subutai holds a B.S. in Computer Science from Cornell University, and a Ph.D in Computer Science from the University of Illinois at Urbana-Champaign.

Subutai Ahmad

CEO
Numenta

Subutai is passionate about neuroscience, deep learning, and building intelligent systems. An accomplished technologist, he has been instrumental in driving Numenta’s research, technology and business since 2005. He previously served as VP Engineering at YesVideo where he helped grow the company from a three-person start-up to a leader in automated digital media authoring. In 1997, Subutai co-founded ePlanet Interactive which developed the IntelPlay Me2Cam, the first computer vision product developed for consumers. Subutai holds a B.S. in Computer Science from Cornell University, and a Ph.D in Computer Science from the University of Illinois at Urbana-Champaign.

11:00 AM

In this keynote, we discuss why commercially available general computing hardware architectures are usually not a good choice for executing innovative AI workloads. We also deep dive into understanding some secrets (no technical expertise needed). Most importantly, we address what we can do to ameliorate this situation. Which type of methodologies and approaches can we create or leverage to efficaciously map new classes of AI algorithms to huge, heterogeneous clusters as well as increase the performance of said algorithms (e.g., accuracy, specificity, etc.)? Which promising and commercially available hardware architectures can we use to quickly and cheaply execute AI workloads? Hint: they are not NVIDIA GPUs. And what could our future software-hardware ecosystems look like, architecturally, to simplify the creation, deployment, execution, and monitoring of AI workloads?

Author:

Fausto Artico

Head of Innovation and Data Science
GSK

Fausto has two PhDs (Information & Computer Science respectively), earning his second master’s and PhD at the University of California, Irvine. Fausto also holds multiple certifications from MIT, Columbia University, London School of Economics and Political Science, Kellogg School of Management, University of Cambridge and soon also from the University of California, Berkeley. He has worked in multi-disciplinary teams and has over 20 years of experience in academia and industry.

As a Physicist, Mathematician, Engineer, Computer Scientist, and High-Performance Computing (HPC) and Data Science expert, Fausto has worked on key projects at European and American government institutions and with key individuals, like Nobel Prize winner Michael J. Prather. After his time at NVIDIA corporation in Silicon Valley, Fausto worked at the IBM T J Watson Center in New York on Exascale Supercomputing Systems for the US government (e.g., Livermore and Oak Ridge Labs).

Fausto Artico

Head of Innovation and Data Science
GSK

Fausto has two PhDs (Information & Computer Science respectively), earning his second master’s and PhD at the University of California, Irvine. Fausto also holds multiple certifications from MIT, Columbia University, London School of Economics and Political Science, Kellogg School of Management, University of Cambridge and soon also from the University of California, Berkeley. He has worked in multi-disciplinary teams and has over 20 years of experience in academia and industry.

As a Physicist, Mathematician, Engineer, Computer Scientist, and High-Performance Computing (HPC) and Data Science expert, Fausto has worked on key projects at European and American government institutions and with key individuals, like Nobel Prize winner Michael J. Prather. After his time at NVIDIA corporation in Silicon Valley, Fausto worked at the IBM T J Watson Center in New York on Exascale Supercomputing Systems for the US government (e.g., Livermore and Oak Ridge Labs).

11:30 AM

Sree Ganesan, who leads Software Products at Intel’s Habana Labs, will present approaches to bringing efficiency to Generative AI—from the software perspective—and will share things you may not know about what you can do with Gaudi Acceleration. We’ll look at opportunities developers have to optimize performance and build value that can yield positive results. 

Author:

Sree Ganesan

Head of Software Products
Habana Labs

Sree Ganesan leads Software Product Management at Habana Labs, working alongside a diverse global team to deliver state-of-the-art deep learning capabilities of the Habana SynapseAI® software suite to the market. Previously, she was Engineering Director in Intel’s AI Products Group, where she was responsible for AI software strategy and deep learning framework integration for Nervana NNP AI accelerators.  Ms. Ganesan joined Intel in 2001 and has held a variety of technical and management roles in software engineering, VLSI CAD and SOC design methodology. Ms. Ganesan received a bachelor’s degree in electrical engineering from the Indian Institute of Technology Madras, India and a PhD in computer engineering from the University of Cincinnati, Ohio.

Sree Ganesan

Head of Software Products
Habana Labs

Sree Ganesan leads Software Product Management at Habana Labs, working alongside a diverse global team to deliver state-of-the-art deep learning capabilities of the Habana SynapseAI® software suite to the market. Previously, she was Engineering Director in Intel’s AI Products Group, where she was responsible for AI software strategy and deep learning framework integration for Nervana NNP AI accelerators.  Ms. Ganesan joined Intel in 2001 and has held a variety of technical and management roles in software engineering, VLSI CAD and SOC design methodology. Ms. Ganesan received a bachelor’s degree in electrical engineering from the Indian Institute of Technology Madras, India and a PhD in computer engineering from the University of Cincinnati, Ohio.

12:00 PM
Networking Lunch
1:15 PM

In this panel, we explore the challenges and opportunities that large, medium, and small companies are facing when they try to pre-train and/or fine-tune Large Language Models (LLMs). More specifically, given the fact that not all companies have the time and resources to pre-train LLMs from scratch, we present options for executing pre-training, or at least fine-tuning, activities in easier and, most importantly, cheaper ways than what is done today. We will therefore discuss how you too can start to efficiently and efficaciously leverage the amazing capabilities provided by LLMs, as well as some important factors that we discovered and that make a significant difference in the creation of high-performance, robust, and reliable LLMs.

Moderator

Author:

Fausto Artico

Head of Innovation and Data Science
GSK

Fausto has two PhDs (Information & Computer Science respectively), earning his second master’s and PhD at the University of California, Irvine. Fausto also holds multiple certifications from MIT, Columbia University, London School of Economics and Political Science, Kellogg School of Management, University of Cambridge and soon also from the University of California, Berkeley. He has worked in multi-disciplinary teams and has over 20 years of experience in academia and industry.

As a Physicist, Mathematician, Engineer, Computer Scientist, and High-Performance Computing (HPC) and Data Science expert, Fausto has worked on key projects at European and American government institutions and with key individuals, like Nobel Prize winner Michael J. Prather. After his time at NVIDIA corporation in Silicon Valley, Fausto worked at the IBM T J Watson Center in New York on Exascale Supercomputing Systems for the US government (e.g., Livermore and Oak Ridge Labs).

Fausto Artico

Head of Innovation and Data Science
GSK

Fausto has two PhDs (Information & Computer Science respectively), earning his second master’s and PhD at the University of California, Irvine. Fausto also holds multiple certifications from MIT, Columbia University, London School of Economics and Political Science, Kellogg School of Management, University of Cambridge and soon also from the University of California, Berkeley. He has worked in multi-disciplinary teams and has over 20 years of experience in academia and industry.

As a Physicist, Mathematician, Engineer, Computer Scientist, and High-Performance Computing (HPC) and Data Science expert, Fausto has worked on key projects at European and American government institutions and with key individuals, like Nobel Prize winner Michael J. Prather. After his time at NVIDIA corporation in Silicon Valley, Fausto worked at the IBM T J Watson Center in New York on Exascale Supercomputing Systems for the US government (e.g., Livermore and Oak Ridge Labs).

Panellists

Author:

Abhijeet Gulati

Head of AI, Senior Director of AI & ML Engineering
Mitchell International

Abhijeet Gulati is the Head of AI & Senior Director of Engineering at Mitchell’s Auto Physical Damage (APD) business unit. Abhijeet is an accomplished technologist with over two decades of experience in the semiconductor, wireless, software & technology industry, focusing on AI, Machine Learning, NLP, Generative AI and SaaS solutions. He is a driven Artificial Intelligence, Advanced Analytics and business intelligence leader.

As the Head of AI at Mitchell International, Abhijeet has spent the past 5 years improving InsureTech inefficiencies, minimize decision biases, developed proprietary, differentiated enterprise scale AI products and an intelligent open platform that democratizes adoption of AI in enterprise business workflows. Abhijeet has extensive experience directing large-scale initiatives involving R&D, business & product strategy, operations, and advanced video, image and data analytics. Abhijeet has authored several patents on the application of AI in InsureTech industry. Abhijeet sits on the board of several AI standards, Ethical & Responsible AI committees.

Abhijeet Gulati

Head of AI, Senior Director of AI & ML Engineering
Mitchell International

Abhijeet Gulati is the Head of AI & Senior Director of Engineering at Mitchell’s Auto Physical Damage (APD) business unit. Abhijeet is an accomplished technologist with over two decades of experience in the semiconductor, wireless, software & technology industry, focusing on AI, Machine Learning, NLP, Generative AI and SaaS solutions. He is a driven Artificial Intelligence, Advanced Analytics and business intelligence leader.

As the Head of AI at Mitchell International, Abhijeet has spent the past 5 years improving InsureTech inefficiencies, minimize decision biases, developed proprietary, differentiated enterprise scale AI products and an intelligent open platform that democratizes adoption of AI in enterprise business workflows. Abhijeet has extensive experience directing large-scale initiatives involving R&D, business & product strategy, operations, and advanced video, image and data analytics. Abhijeet has authored several patents on the application of AI in InsureTech industry. Abhijeet sits on the board of several AI standards, Ethical & Responsible AI committees.

Author:

Greg Makowski

Head of Data Science Services
Johnson Controls

Greg Makowski

Head of Data Science Services
Johnson Controls

Author:

Mayank Anand

Machine Learning Engineering Manager
Adobe

Mayank Anand is a Machine Learning Engineering Manager at Adobe, dedicated to enhancing digital marketing with AI and machine learning. His focus lies in harnessing AI for better content creation from texts and images. Presently, he's innovating in the field of generative AI to produce smart, brand-safe content. Mayank holds a Master's in Computer Science from USC, Los Angeles.

Mayank Anand

Machine Learning Engineering Manager
Adobe

Mayank Anand is a Machine Learning Engineering Manager at Adobe, dedicated to enhancing digital marketing with AI and machine learning. His focus lies in harnessing AI for better content creation from texts and images. Presently, he's innovating in the field of generative AI to produce smart, brand-safe content. Mayank holds a Master's in Computer Science from USC, Los Angeles.

2:00 PM
Moderator

Author:

Michael Stewart

Partner
M12

Michael Stewart

Partner
M12
Panellists

Author:

Rashmi Gopinath

General Partner
B Capital

Rashmi Gopinath is a General Partner at B Capital Group where she leads the fund’s enterprise software practice in cloud infrastructure, cybersecurity, devops, and AI/ML sectors. She brings over two decades of experience investing and operating in cutting-edge enterprise technologies. She led B Capital’s investments in over 24 companies such as DataRobot, FalconX, Clari, Phenom People, Synack, Innovaccer, Labelbox, Fabric, 6Sense, Highspot, Pendo, Starburst, OwnBackup, Figment, Perimeter81, Zesty, among others.

Rashmi was previously a Managing Director at M12, Microsoft’s venture fund, where she led investments globally in enterprise software and sat on several boards including Synack, Innovaccer, Contrast Security, Frame, UnravelData, Incorta, among others.

Prior to M12, Rashmi was an Investment Director with Intel Capital where she was involved in the firm’s investments in startups including MongoDB (Nasdaq: MDB), ForeScout (Nasdaq: FSCT), Maginatics (acq. by EMC), BlueData (acq. by HPE), among others. Rashmi held operating roles at high-growth startups such as BlueData (acq. by HPE) and Couchbase (Nasdaq: BASE) where she led global business development, product and marketing roles. She began her career in engineering and product roles at Oracle and GE Healthcare. She earned an M.B.A. from Northwestern University, and a B.S. in Electrical Engineering from University of Mumbai in India.

Rashmi Gopinath

General Partner
B Capital

Rashmi Gopinath is a General Partner at B Capital Group where she leads the fund’s enterprise software practice in cloud infrastructure, cybersecurity, devops, and AI/ML sectors. She brings over two decades of experience investing and operating in cutting-edge enterprise technologies. She led B Capital’s investments in over 24 companies such as DataRobot, FalconX, Clari, Phenom People, Synack, Innovaccer, Labelbox, Fabric, 6Sense, Highspot, Pendo, Starburst, OwnBackup, Figment, Perimeter81, Zesty, among others.

Rashmi was previously a Managing Director at M12, Microsoft’s venture fund, where she led investments globally in enterprise software and sat on several boards including Synack, Innovaccer, Contrast Security, Frame, UnravelData, Incorta, among others.

Prior to M12, Rashmi was an Investment Director with Intel Capital where she was involved in the firm’s investments in startups including MongoDB (Nasdaq: MDB), ForeScout (Nasdaq: FSCT), Maginatics (acq. by EMC), BlueData (acq. by HPE), among others. Rashmi held operating roles at high-growth startups such as BlueData (acq. by HPE) and Couchbase (Nasdaq: BASE) where she led global business development, product and marketing roles. She began her career in engineering and product roles at Oracle and GE Healthcare. She earned an M.B.A. from Northwestern University, and a B.S. in Electrical Engineering from University of Mumbai in India.

Author:

Gayathri Radhakrishnan

Partner
Hitachi Ventures

Gayathri is currently Partner at Hitachi Ventures. Prior to that, she was with Micron Ventures, actively investing in startups that apply AI to solve critical problems in the areas of Manufacturing, Healthcare and Automotive. She brings over 20 years of multi-disciplinary experience across product management, product marketing, corporate strategy, M&A and venture investments in large Fortune 500 companies such as Dell and Corning and in startups. She has also worked as an early stage investor at Earlybird Venture Capital, a premier European venture capital fund based in Germany. She has a Masters in EE from The Ohio State University and MBA from INSEAD in France. She is also a Kauffman Fellow - Class 16.

Gayathri Radhakrishnan

Partner
Hitachi Ventures

Gayathri is currently Partner at Hitachi Ventures. Prior to that, she was with Micron Ventures, actively investing in startups that apply AI to solve critical problems in the areas of Manufacturing, Healthcare and Automotive. She brings over 20 years of multi-disciplinary experience across product management, product marketing, corporate strategy, M&A and venture investments in large Fortune 500 companies such as Dell and Corning and in startups. She has also worked as an early stage investor at Earlybird Venture Capital, a premier European venture capital fund based in Germany. She has a Masters in EE from The Ohio State University and MBA from INSEAD in France. She is also a Kauffman Fellow - Class 16.

Author:

Yvonne Lutsch

Investment Principal
Bosch Ventures

Yvonne is an accomplished Investment Principal at Bosch Ventures affiliate office located in Sunnyvale, and sources, evaluates, and executes venture capital deals in North America. Her specialty are investments in deep tech fields such as AI, edge and next gen. computing incl. quantum, robotics, industrial IoT, mobility, climate tech, semiconductors, or sensors. She is an investor and non-executive board member of Bosch Ventures’ portfolio companies Syntiant, Zapata AI, UltraSense Systems, Aclima, and Recogni.
Prior to this position Yvonne was Director of Technology Scouting and Business Development, building up an Innovation Hub in Silicon Valley including startup scouting, business development while advising executives of the Bosch business units on their strategy. She has more than two decades of solid experience in manufacturing operations and engineering in the automotive and consumer electronics space – gained through different executive roles at Bosch in Germany.
Yvonne received a diploma in Experimental Physics from University of Siegen, Germany, and holds a PhD in Applied Physics from University of Tuebingen, Germany.

Yvonne Lutsch

Investment Principal
Bosch Ventures

Yvonne is an accomplished Investment Principal at Bosch Ventures affiliate office located in Sunnyvale, and sources, evaluates, and executes venture capital deals in North America. Her specialty are investments in deep tech fields such as AI, edge and next gen. computing incl. quantum, robotics, industrial IoT, mobility, climate tech, semiconductors, or sensors. She is an investor and non-executive board member of Bosch Ventures’ portfolio companies Syntiant, Zapata AI, UltraSense Systems, Aclima, and Recogni.
Prior to this position Yvonne was Director of Technology Scouting and Business Development, building up an Innovation Hub in Silicon Valley including startup scouting, business development while advising executives of the Bosch business units on their strategy. She has more than two decades of solid experience in manufacturing operations and engineering in the automotive and consumer electronics space – gained through different executive roles at Bosch in Germany.
Yvonne received a diploma in Experimental Physics from University of Siegen, Germany, and holds a PhD in Applied Physics from University of Tuebingen, Germany.

Author:

Vibhor Rastogi

Global Head AIML Data Investing
Citi Ventures

Vibhor Rastogi

Global Head AIML Data Investing
Citi Ventures
2.50 PM

ChatGPT is the ‘sputnik’ moment for AI in US and Europe – it is now obvious how AI will be fundamental. There is an imperative to future-proof industry by investing in the exploration of the impact of next-generation AI technologies. But, it is also fuelling an urgency for enterprises to capitalize on and deploy core AI technologies today. Models continue to evolve but deployment is becoming more and more important. Smart users are comparing the compute costs and data risks of using massive, proprietary LLMs with smaller, open-source models, fine-tuned to their specific needs. They are searching for much more efficient solutions for deployment today and in the future. In this talk we will discuss:
·         Why LLMs like GPT-4 may not be the best solution for many enterprise AI tasks
·         Identifying the most exciting, enterprise-ready AI solutions for deployment today
·         Why your compute platform matters when using generative AI

Author:

Helen Byrne

VP, Solution Architect
Graphcore

Helen leads the Solution Architects team at Graphcore, helping innovators build their AI solutions using Graphcore’s Intelligence Processing Units (IPUs). She has been at Graphcore for more than 5 years, previously leading AI Field Engineering and working in AI Research, working on problems in Distributed Machine Learning. Before landing in the technology industry, she worked in Investment Banking. Her background is in Mathematics and she has a MSc in Artificial Intelligence.

Helen Byrne

VP, Solution Architect
Graphcore

Helen leads the Solution Architects team at Graphcore, helping innovators build their AI solutions using Graphcore’s Intelligence Processing Units (IPUs). She has been at Graphcore for more than 5 years, previously leading AI Field Engineering and working in AI Research, working on problems in Distributed Machine Learning. Before landing in the technology industry, she worked in Investment Banking. Her background is in Mathematics and she has a MSc in Artificial Intelligence.

3:15 PM
Networking Break
4:00 PM
Moderator

Author:

Arun Nandi

Senior Director and Head of Data & Analytics
Unilever

Arun is a visionary AI and Analytics expert recognized as one of the Top 100 Influential AI & Analytics leaders. He is the Head of Data & Analytics at Unilever today. With over 15 years of experience driving analytics-driven value in organizations, he has built AI practices from the ground up on several occasions. Arun advocates the adoption of AI to overcome enterprise-wide challenges and create growth. Beyond his professional achievements, Arun loves to travel, having explored over 40 countries and is passionate about adventure motorbiking.

Arun Nandi

Senior Director and Head of Data & Analytics
Unilever

Arun is a visionary AI and Analytics expert recognized as one of the Top 100 Influential AI & Analytics leaders. He is the Head of Data & Analytics at Unilever today. With over 15 years of experience driving analytics-driven value in organizations, he has built AI practices from the ground up on several occasions. Arun advocates the adoption of AI to overcome enterprise-wide challenges and create growth. Beyond his professional achievements, Arun loves to travel, having explored over 40 countries and is passionate about adventure motorbiking.

Panellists

Author:

Nikesh Pahuja

App Engineer
Eximius Systems

Nikesh Kumar Pahuja currently serves as an App Engineer in a contract position at Chevron Corporation, with a primary focus on product development and Data Analytics/ Data Science / AI. With an MBA in Finance, Strategy, and Analytics from the University of California, Davis, and an MS in Analytics from Harrisburg University of Science and Technology, Mr. Pahuja brings a wealth of knowledge and expertise to his work. He is a passionate advocate for data, AI, and the creation of exceptional technological products, driven by a strong commitment to innovation and advancement.

Nikesh Pahuja

App Engineer
Eximius Systems

Nikesh Kumar Pahuja currently serves as an App Engineer in a contract position at Chevron Corporation, with a primary focus on product development and Data Analytics/ Data Science / AI. With an MBA in Finance, Strategy, and Analytics from the University of California, Davis, and an MS in Analytics from Harrisburg University of Science and Technology, Mr. Pahuja brings a wealth of knowledge and expertise to his work. He is a passionate advocate for data, AI, and the creation of exceptional technological products, driven by a strong commitment to innovation and advancement.

Author:

Jay Dawani

CEO
Lemurian Labs

Jay Dawani is co-founder & CEO of Lemurian Labs, a startup at the forefront of general purpose accelerated computing for making AI development affordable and generally available for all companies and people to equally benefit. Author of the influential book "Mathematics for Deep Learning", he has held leadership positions at companies such as BlocPlay and Geometric Energy Corporation, spearheading projects involving quantum computing, metaverse, blockchain, AI, space robotics, and more. Jay has also served as an advisor to NASA Frontier Development Lab, SiaClassic, and many leading AI firms.

Jay Dawani

CEO
Lemurian Labs

Jay Dawani is co-founder & CEO of Lemurian Labs, a startup at the forefront of general purpose accelerated computing for making AI development affordable and generally available for all companies and people to equally benefit. Author of the influential book "Mathematics for Deep Learning", he has held leadership positions at companies such as BlocPlay and Geometric Energy Corporation, spearheading projects involving quantum computing, metaverse, blockchain, AI, space robotics, and more. Jay has also served as an advisor to NASA Frontier Development Lab, SiaClassic, and many leading AI firms.

Author:

Praveen Kolli

Staff Machine Learning Engineer
DoorDash

Dr. Praveen Kolli is currently a Staff Machine Learning Engineer in the Ads Quality Team at DoorDash, where he focuses on developing advanced deep learning models to improve ad recommendations and enhance user experience. Prior to his role at DoorDash, he served as a Technical Lead in the Ads Ranking Team at Pinterest. At Pinterest, Dr. Kolli played a pivotal role in building cutting-edge models for personalized ad recommendations, contributing to the platform's success in delivering relevant ads to its users. He holds a PhD in Mathematics from Carnegie Mellon University, Masters in Mathematics of Finance from Columbia University and Bachelors in Electrical Engineering from Indian Institute of Technology, Kharagpur.

Praveen Kolli

Staff Machine Learning Engineer
DoorDash

Dr. Praveen Kolli is currently a Staff Machine Learning Engineer in the Ads Quality Team at DoorDash, where he focuses on developing advanced deep learning models to improve ad recommendations and enhance user experience. Prior to his role at DoorDash, he served as a Technical Lead in the Ads Ranking Team at Pinterest. At Pinterest, Dr. Kolli played a pivotal role in building cutting-edge models for personalized ad recommendations, contributing to the platform's success in delivering relevant ads to its users. He holds a PhD in Mathematics from Carnegie Mellon University, Masters in Mathematics of Finance from Columbia University and Bachelors in Electrical Engineering from Indian Institute of Technology, Kharagpur.

4:45 PM

The evolution of transformers marks a remarkable advancement in comprehending intricate language patterns. These models are progressively growing in size and precision, enabling them to grasp complex linguistic structures more effectively. However, the real challenge lies in efficiently integrating these advanced models into production systems without incurring exorbitant costs. Striking a balance between superior performance and streamlined resource usage during both training and runtime is crucial for making these large language models (LLMs) viable for deployment in practical production environments.

Author:

Rahul Sharnagat

Principal Machine Learning Engineer
Walmart Global Tech

Rahul Sharnagat is a Principal Data Scientist in Walmart's Search Technologies. He began his career at IBM research lab then worked in Microsoft Bing Ads before joining Walmart in 2018, with his expertise centering around natural language understanding (NER) and question answering. He currently leads the efforts to develop and deploy query understanding models and its application across the search stack. Rahul holds a master's degree in computer science from Indian Institute of Technology, Bombay and a bachelor's degree in computer science from Visvesvaraya National Institute of Technology, Nagpur.

Rahul Sharnagat

Principal Machine Learning Engineer
Walmart Global Tech

Rahul Sharnagat is a Principal Data Scientist in Walmart's Search Technologies. He began his career at IBM research lab then worked in Microsoft Bing Ads before joining Walmart in 2018, with his expertise centering around natural language understanding (NER) and question answering. He currently leads the efforts to develop and deploy query understanding models and its application across the search stack. Rahul holds a master's degree in computer science from Indian Institute of Technology, Bombay and a bachelor's degree in computer science from Visvesvaraya National Institute of Technology, Nagpur.

5:10 PM
5.35 PM
CONFERENCE CLOSE
Tuesday, 12 Sep, 2023
Sept 12-14, 2023

Jump to: Day 1 | Day 2

Download the Community Brochure

Learn more about the network and community we’ve built

Download