NVIDIA NeMo

A comprehensive software suite to build, monitor, and optimize AI agents across their lifecycle at enterprise scale.

Get Started

Documentation

Overview
Benefits
Features
Use Cases
Starting Options
Customer Stories
Adopters
Resources
Next Steps

Overview
Benefits
Features
Use Cases
Starting Options
Customer Stories
Adopters
Resources
Next Steps

Get Started

Overview

What Is NVIDIA NeMo?

NVIDIA NeMo™ is a modular software suite for managing the AI agent lifecycle. It provides microservices and toolkits for data processing, model fine-tuning and evaluation, reinforcement learning, policy enforcement, and system observability. NeMo helps enterprises build, monitor, and optimize agentic AI systems at scale, on any GPU-accelerated infrastructure. It integrates with existing AI platforms and supports cloud, on-premises, and hybrid deployment, enabling enterprises to rapidly manage and effortlessly create data flywheels that continuously optimize AI agents.

New NVIDIA AI Blueprint for Building Data Flywheels

Now available, this new AI blueprint enables developers to build an automated data flywheel that continuously powers your generative and agentic AI applications with more accurate and efficient models.

Try Now

Benefits

Explore the Benefits of NVIDIA NeMo for Agentic AI

Modular AI Agent Lifecycle Management

Manage the AI agent lifecycle—from data curation, customization, and evaluation to guardrailing, observability, and optimization—with an enterprise-ready, interoperable software suite.

Seamless Deployment and Scaling

Easily build data flywheels that use enterprise data to improve AI agents, powering the entire flywheel with a simple Helm chart deployment or API calls for various parts of the workflow.

Increased ROI

Quickly train, customize, and deploy large language models (LLMs), vision-language models (VLMs), video AI, and speech AI at scale, reducing time to solution and increasing ROI.

Accelerated Performance

Maximize AI agent performance and throughput with GPU-accelerated optimization, multi-node scaling, and tuning for cost-efficient training, deployment, and continuous improvement.

Safer Agentic AI

Build safer agentic AI systems by vetting models, guardrailing prompts, and continuously scanning for vulnerabilities.

Production Ready

Deploy into production with a secure, optimized, full-stack solution that offers support, security, and API stability as part of NVIDIA AI Enterprise.

Build, monitor, and optimize AI agents anywhere—from the cloud and data center to the edge.

Features

Tools for Managing the AI Agent Lifecycle

The AI agent lifecycle is an end-to-end process for developing and improving AI agents in production applications. NVIDIA NeMo provides tools that enable each step of this workflow, so enterprises can build powerful, secure, and continuously learning agents.

Build
Prepare AI-ready data Process existing multimodal datasets into high-quality, AI-ready formats for development pipelines, and generate synthetic data to close critical data gaps.	NeMo Curator Clean, filter, and prepare multimodal data. NeMo Data Designer Create domain-specific datasets from scratch.
Select the right model Pick or build models suited to the use case, validate with academic benchmarks, run custom evaluations, and fine-tune if needed.	NVIDIA Nemotron State-of-the-art open multimodal reasoning models. NeMo Retriever Extraction, embedding, and reranking models for RAG pipelines. NeMo Evaluator Benchmark, test, and evaluate models and agents.
Build your AI agent Turn your custom model into a scalable application, seamlessly connect it to your enterprise stack and tools, and define workflows with flexible orchestration.	NeMo Agent Toolkit Framework-agnostic toolkit to build, profile, and optimize AI agents.
Deploy
Deploy your agent with maximum performance Optimize your agent for production with high-throughput, low-latency inference, ensuring it can scale to meet enterprise demands and deliver fast, reliable responses.	NVIDIA NIM Run AI models in optimized containers, exposed as OpenAI-compatible APIs.
Stay grounded in data and enforce guardrails Use retrieval-augmented generation (RAG) to anchor agent responses in trusted knowledge while applying safety, compliance, and content moderation guardrails.	NeMo Retriever Build accurate, privacy-preserving RAG pipelines. NeMo Guardrails Enforce safety, compliance, and control across AI interactions.
Optimize
Monitor and collect feedback Track the agent's real-world interactions with users and other systems. Systematically evaluate its performance and accuracy, finding opportunities to continuously improve.	NeMo Agent Toolkit Framework-agnostic toolkit to build, profile, and optimize AI agents. NeMo Customizer Fine-tune and align models with domain data. NeMo Framework Open-source toolkit for training and aligning LLMs and multimodal models.
Continuously improve with data flywheels Use the feedback and data gathered from monitoring to create a data-driven flywheel, iteratively retraining the agent to continuously optimize and stay effective over time.	NeMo RL Post-train and align models at scale with advanced reinforcement learning techniques. NeMo Gym Simulated training environments to generate high-quality agentic RL rollouts. NeMo Evaluator Benchmark, test, and evaluate models and agents. NeMo Customizer Fine-tune and align models with domain data.

Use Cases

How NeMo Is Being Used

See how NVIDIA NeMo supports industry use cases and jump-starts your AI development.

AI Agents

AI agents are transforming customer service across sectors, helping companies enhance customer conversations, achieve high resolution rates, and improve human representative productivity. AI agents can handle predictive tasks, reason and problem-solve, be trained to understand industry-specific terms, and pull relevant information from an organization’s knowledge bases, wherever that data resides.

Learn More About AI Agents

Synthetic Data Generation for Agentic AI

Specialized agentic systems need massive, high-quality datasets that are slow and expensive to collect from real-world sources. Synthetic data created through simulations or generative AI models can eliminate this bottleneck by creating unlimited training scenarios without privacy restrictions or quality issues. This enables faster development of reasoning LLMs, multi-step decision-makers, and multimodal AI assistants.

Learn More About SDG for Agentic AI

AI Assistant

Businesses are deploying AI assistants to efficiently address the queries of millions of customers and employees around the clock. Powered by customized NVIDIA NIM microservices for LLMs, RAG, and speech and translation AI, these AI teammates deliver immediate and accurate spoken responses, even in the presence of background noise, poor sound quality, and diverse dialects and accents.

Learn More About AI Assistants

Try Now

Information Retrieval

Trillions of PDF files are generated every year, each file likely consisting of multiple pages filled with various content types, including text, images, charts, and tables. This goldmine of data can only be used as quickly as humans can read and understand it. But with generative AI and RAG, this untapped data can be used to uncover business insights that can help employees work more efficiently and result in lower costs.

Learn More About Information Retrieval

Try Now

Content Generation

Generative AI makes it possible to generate highly relevant, bespoke, and accurate content grounded in the domain expertise and proprietary IP of your enterprise.

Learn More About Content Generation

Humanoid Robot

Humanoid robots are built to adapt quickly to existing human-centric urban and industrial work spaces, tackling tedious, repetitive, or physically demanding tasks. Their versatility has them in such varied locations as factory floors to healthcare facilities, where these robots are assisting humans and helping alleviate labor shortages with automation.

Learn More About Humanoid Robots

Apptronik

Starting Options

Ways to Get Started With NVIDIA NeMo

Manage the AI agent lifecycle with tools and technologies for building, monitoring, and optimizing AI agents in production.

1

Try NVIDIA-optimized foundation models like NVIDIA Nemotron.

Try Now

2

Build, monitor, and optimize AI agents with NVIDIA NeMo.

Get Started

3

Jump-start building your AI solutions with NVIDIA Blueprints.

Try the Blueprints

Compare Starting Options

Customer Stories

How Industry Leaders Are Driving Innovation With NeMo

Adopters

Leading Adopters Across All Industries

Customers
Partners

Resources

The Latest in NVIDIA NeMo Resources

Blogs
Sessions
Training
Videos

View All Blogs

View All Sessions

Get Started With LLM Customization

In this course, you’ll go beyond prompt-engineering LLMs and learn techniques to efficiently customize pretrained LLMs for your specific use cases. Using NVIDIA NIM microservices, NeMo Curator, and NeMo Framework, you’ll learn various parameter-efficient fine-tuning methods to customize LLM behavior for your organization.

Get Started

Elevate Your LLM Skills

Take advantage of our comprehensive LLM learning path, covering fundamental to advanced topics featuring hands-on training developed and delivered by NVIDIA experts. You can opt for the flexibility of self-paced courses or enroll in instructor-led workshops to earn a certificate of competency.

Explore LLM Training

Get Certified by NVIDIA

Showcase your Generative AI skills and advance your career by getting certified by NVIDIA. Our new professional certification program offers two developer exams focusing on proficiency in large language models (LLMs) and multimodal workflow skills.

Learn About Certification

Train a Reasoning-Capable LLM in One Weekend

Explore a simple and computationally efficient recipe for training reasoning models with small amounts of training data curated from the Llama Nemotron post-training dataset and NVIDIA NeMo.

Watch Reasoning Video

Optimize AI Agents Using a Data Flywheel

Learn how to optimize AI agents in production using the NVIDIA Data Flywheel Blueprint—a continuous loop of distillation, fine-tuning, and evaluation powered by NeMo and NIM microservices.

Watch Data Flywheel Video

Build AI Agents With NeMo Agent Open-Source Toolkit

Learn how to build, integrate, and optimize custom AI agents using the NVIDIA NeMo Agent open-source Python toolkit.

Watch Custom AI Agent Video

View All Videos

Next Steps

Ready to Get Started?

Use the right tools and technologies to take your agentic AI applications from development to production.

For Developers

Explore everything you need to start developing with NVIDIA NeMo, including the latest documentation, tutorials, technical blogs, and more.

Start Developing

Get in Touch

Talk to an NVIDIA product specialist about moving from pilot to production with the assurance of security, API stability, and support that comes with NVIDIA AI Enterprise.

Building and Deploying Generative AI Models

Enterprises are turning to generative AI to revolutionize the way they innovate, optimize operations, and build a competitive advantage. NeMo is an end-to-end platform for curating data; training, customizing, and evaluating multimodal models; and running inference at scale. It supports text, image, video, and speech generation.

Watch Now

Unlocking Synthetic Data Generation with Llama 3.1

Learn how to use the Meta Llama 3.1 405B model to generate tailored synthetic data for your specific domain and explore how to evaluate this data using the Nemotron-4 340B Reward model and ensure alignment with human preferences through NVIDIA NeMo.

Watch Now

Build World-Class AI Virtual Assistants for Customer Service with RAG

Learn how companies can use the AI virtual assistant for customer service NVIDIA AI Blueprint to improve the operational efficiency of existing contact center solutions or build new customer service-centric systems.

Watch Now

NVIDIA NeMo

What Is NVIDIA NeMo?

New NVIDIA AI Blueprint for Building Data Flywheels

Benefits

Explore the Benefits of NVIDIA NeMo for Agentic AI

Modular AI Agent Lifecycle Management

Seamless Deployment and Scaling

Increased ROI

Accelerated Performance

Safer Agentic AI

Production Ready

Features

Tools for Managing the AI Agent Lifecycle

How NeMo Is Being Used

AI Agents

Synthetic Data Generation for Agentic AI

AI Assistant

Information Retrieval

Content Generation

Humanoid Robot

Starting Options

Ways to Get Started With NVIDIA NeMo

1

2

3

Customer Stories

How Industry Leaders Are Driving Innovation With NeMo

Adopters

Leading Adopters Across All Industries

Customers

Partners

Resources

The Latest in NVIDIA NeMo Resources

Get Started With LLM Customization

Elevate Your LLM Skills

Get Certified by NVIDIA

Train a Reasoning-Capable LLM in One Weekend

Optimize AI Agents Using a Data Flywheel

Build AI Agents With NeMo Agent Open-Source Toolkit

Next Steps

Ready to Get Started?

For Developers

Get in Touch

Shell

AI Sweden

Accelerate Industry Applications With LLMs

Amazon

How Amazon and NVIDIA Help Sellers Create Better Product Listings With AI

Amdocs

NVIDIA and Amdocs Bring Custom Generative AI to Global Telco Industry

AT&T

AT&T Drives Customer Care AI Agents’ Accuracy, Efficiency, and Performance With NVIDIA NeMo

AWS

NVIDIA Powers Training for Some of the Largest Amazon Titan Foundation Models

Accenture

Accelerate Generative AI Adoption for Enterprises

Azure

Harnessing the Power of NVIDIA AI Enterprise on Azure Machine Learning

Bria

Bria Builds Responsible Generative AI for Enterprises Using NVIDIA NeMo, Picasso

Cohesity

Unlock Your Data Superpower: NVIDIA Microservices Unleash Enterprise-Grade Secure Generative AI for Cohesity

CrowdStrike

Shaping the Future of AI in the Cybersecurity Domain

Dell

Dell Validated Design for Generative AI With NVIDIA

Deloitte

Unlock the Value of Generative AI Across Enterprise Software Platforms

Domino Data Lab

Domino Offers Production-Ready Generative AI Powered by NVIDIA

Dropbox

Dropbox and NVIDIA to Bring Personalized Generative AI to Millions of Customers

Google Cloud

AI Titans Collaborate to Create Generative AI Magic

HuggingFace

Leading AI Community to Accelerate Data Curation Pipeline

KT

Creating New Customer Experiences With LLMs

Lenovo

New Reference Architecture for Generative AI Based on LLMs