Tap into the power of generative AI and AI agents with insights from CXOs, developers, researchers, and more at the forefront of transformation.
AI agents are the new digital workforce, working for and with us. They can reason about a mission, create a plan, and retrieve data or use tools to generate a quality response. Data is the fuel for AI agents, but the magnitude and scale of enterprise data often make it too expensive and time-consuming to leverage effectively. For enterprises to thrive in the AI era, they must find a way to make use of all of their data. Join this session to learn about tools and frameworks that make it easier to build agentic AI systems, connect AI agents to a library of reusable tools that unlock your data, and drive efficiency gains across the organization.
Important: Near capacity, highly suggest arriving early. Attendees are let in on a first-come, first-served basis.
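The "library of reusable tools" pattern described above can be sketched in a few lines. This is a minimal, hypothetical illustration of the general idea (a shared tool registry that any agent's plan can draw on), not the API of any specific NVIDIA framework; all names here (`Tool`, `REGISTRY`, `run_agent`, `lookup_revenue`) are invented for the example.

```python
# Minimal sketch of a reusable-tool registry for agents.
# All names are illustrative, not a real framework API.
from dataclasses import dataclass
from typing import Callable

@dataclass
class Tool:
    name: str
    description: str
    fn: Callable[[str], str]

# A shared library of reusable tools that any agent can draw on.
REGISTRY: dict[str, Tool] = {}

def register(name: str, description: str):
    """Decorator that adds a function to the shared tool library."""
    def deco(fn):
        REGISTRY[name] = Tool(name, description, fn)
        return fn
    return deco

@register("lookup_revenue", "Return revenue for a region")
def lookup_revenue(region: str) -> str:
    data = {"emea": "$12M", "apac": "$9M"}  # stand-in for an enterprise data source
    return data.get(region.lower(), "unknown")

def run_agent(plan: list[tuple[str, str]]) -> list[str]:
    """Execute an agent's plan: a list of (tool_name, argument) steps."""
    return [REGISTRY[name].fn(arg) for name, arg in plan]

print(run_agent([("lookup_revenue", "EMEA")]))  # ['$12M']
```

Because tools are registered once and looked up by name, new data sources become available to every agent without changing agent code.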
As a leading developer platform for generative AI orchestration, LangChain has been at the forefront of the journey of AI agents from experimental concepts to essential components in production systems. We'll cover key lessons learned from LangChain’s experience enabling customers to develop, deploy, and manage enterprise AI agents in production at scale, and will explore emerging technologies that will shape the future.
Enterprises often struggle to implement generative AI, yet they need to transition to full-scale production to realize cost savings and new revenue models. Reliability, security, and scalability are crucial for mission-critical AI applications. We'll explore best practices for implementing AI strategies, highlighting AI's value and potential impact. Learn how customers leverage NVIDIA AI Enterprise, which includes NVIDIA NIM and NVIDIA NIM Agent Blueprints, to gain insights on challenges, lessons, and best practices for all stages of the AI journey.
Agentic AI is inspiring a paradigm shift from software-as-a-service to service-as-software. Agents combine reasoning with dynamic data retrieval, and can access tools to drive outcomes. Teams of AI agents can work together, achieving results that often far surpass those of a single agent. Using NVIDIA AI Blueprints, building blocks for agentic AI, we'll show how enterprises can easily compose teams of agents and securely deploy them with confidence.
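The "team of agents" idea above can be illustrated with a toy pipeline in which each agent owns one sub-task and a coordinator chains them together. This is a hedged sketch of the general composition pattern only; it is not NVIDIA AI Blueprints code, and the agent names are invented.

```python
# Illustrative sketch of composing a "team" of agents: each agent handles
# one sub-task, and a coordinator chains their outputs. Not Blueprints code.

def research_agent(task: str) -> str:
    """Stand-in for an agent that retrieves facts (would call an LLM + tools)."""
    return f"facts about {task}"

def writer_agent(facts: str) -> str:
    """Stand-in for an agent that drafts a response from retrieved facts."""
    return f"summary based on {facts}"

def reviewer_agent(draft: str) -> str:
    """Stand-in for an agent that checks and approves the draft."""
    return draft + " (reviewed)"

def run_team(task: str) -> str:
    """Coordinator: route the task through the team in sequence."""
    facts = research_agent(task)
    draft = writer_agent(facts)
    return reviewer_agent(draft)

print(run_team("GPU supply chain"))
# summary based on facts about GPU supply chain (reviewed)
```

The value of the team structure is that each stage can be evaluated, secured, and swapped independently, which is what makes results better than a single monolithic agent.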
Join us for a deep dive into how generative AI is being used to train foundation models at scale.
Join NVIDIA technical product architects for an in-depth tutorial demonstrating how to build agentic AI pipelines that integrate diverse data types, including text, images, audio, and video, into enterprise AI applications. This session will cover the full journey, from designing production-ready retrieval systems to effectively using them in enterprise use cases.
How can an LLM effectively navigate the diverse complexities of enterprise systems and data: emails, documents, calendars, messages, tasks, tickets, and line-of-business systems? How do you enable precise orchestration and execution of multi-step workflows while ensuring accuracy, predictability, and reliability in production? This session explores how Dropbox Dash leverages cutting-edge advances in RAG and agents to tackle these challenges. From integrating heterogeneous data sources to optimizing task orchestration with powerful LLMs, we'll share our journey in building scalable and secure AI solutions. We'll also discuss how robust evaluation frameworks and architectural innovations help us ensure reliable, enterprise-ready systems.
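A core piece of the challenge above, integrating heterogeneous sources into one retrieval step, can be sketched as fan-out search with score-based merging. This is a generic illustration of the pattern, not Dropbox Dash's implementation; the source functions and their scores are invented stand-ins for real connectors and rankers.

```python
# Hypothetical sketch: fan a query out across heterogeneous enterprise sources
# (emails, docs, tickets) and merge hits by relevance score. Not Dash code.

def search_emails(query: str):
    return [("email: quarterly sync notes", 0.4)]   # (snippet, score) stand-ins

def search_docs(query: str):
    return [("doc: Q3 report", 0.9)]

def search_tickets(query: str):
    return [("ticket: export bug", 0.2)]

SOURCES = [search_emails, search_docs, search_tickets]

def retrieve(query: str, k: int = 2) -> list[str]:
    """Fan out to every source, then keep the k highest-scoring hits overall."""
    hits = [hit for source in SOURCES for hit in source(query)]
    hits.sort(key=lambda h: h[1], reverse=True)
    return [text for text, _score in hits[:k]]
```

In a real system the per-source scores would come from embedding similarity or a learned ranker, and normalizing scores across sources is itself a hard sub-problem.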
As the context length of LLM serving continues to increase, inference time escalates dramatically. To address this, we devised three key techniques: a fine-grained sparse attention algorithm for the prefill phase, paired with a high-performance token-sparsity attention kernel built on Tensor Cores with CUTLASS CuTe, which delivers an 8X speedup at a 10X sparsity rate without sacrificing accuracy; block-sparse attention for the decoding phase, which further doubles the sparsity rate while preserving accuracy through novel KV-cache clustering; and dynamic chunked pipeline parallelism, which decomposes a long sequence into chunks whose sizes are adaptively updated based on an analytical cost model to balance pipeline stages, yielding roughly 2X acceleration over tensor parallelism on eight GPUs. Together, these optimizations deliver over 20X and 4X speedups for the prefill and decoding phases, respectively, while preserving accuracy at a context length of 1 million tokens.
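The core idea behind token-sparsity attention can be shown with a small NumPy reference: each query attends only to its top-k highest-scoring keys, so at a 10X sparsity rate roughly 90% of score/value work can be skipped. This is a dense-masked teaching sketch of the concept only; the session's actual kernel exploits the sparsity with Tensor Cores and CUTLASS CuTe rather than computing and masking full scores.

```python
# Reference sketch of per-query top-k token-sparsity attention (concept only;
# a real kernel would skip the masked work instead of computing full scores).
import numpy as np

def topk_sparse_attention(Q: np.ndarray, K: np.ndarray, V: np.ndarray, k: int) -> np.ndarray:
    """Each query attends only to its k highest-scoring keys."""
    scores = Q @ K.T / np.sqrt(Q.shape[-1])          # (n_q, n_k) scaled dot products
    if k < scores.shape[-1]:
        # Indices of the (n_k - k) lowest-scoring keys per query; mask them out.
        drop = np.argpartition(scores, -k, axis=-1)[:, :-k]
        np.put_along_axis(scores, drop, -np.inf, axis=-1)
    # Softmax over the surviving keys (masked entries contribute exp(-inf) = 0).
    weights = np.exp(scores - scores.max(axis=-1, keepdims=True))
    weights /= weights.sum(axis=-1, keepdims=True)
    return weights @ V
```

With `k` equal to the full key count this reduces to standard dense attention, which makes it easy to check the sparse path against the exact result.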
AI agents and agentic architectures offer significant potential as enterprises move from initial experiments to widespread generative AI deployments. In this session, moderated by NVIDIA’s vice president of enterprise AI and automation, AI experts will share their insights and experiences in developing and deploying generative AI and agentic solutions. They'll discuss the opportunities AI agents bring, real-world enterprise use cases, strategies for building user trust, and the challenges of scaling these technologies across organizations. Join us as we explore the future of AI in the enterprise.