NVIDIA NeMo

Build, customize, and deploy multimodal generative AI.

Get Started

Video | Solution Brief | Documentation

Overview
Benefits
Features
Use Cases
Starting Options
Customer Stories
Adopters
Resources
Next Steps

Overview

Overview
Benefits
Features
Use Cases
Starting Options
Customer Stories
Adopters
Resources
Next Steps

Get Started

Overview

What Is NVIDIA NeMo?

NVIDIA NeMo™ is an end-to-end platform for developing custom generative AI—including large language models (LLMs), vision language models (VLMs), video models, and speech AI—anywhere.

Deliver enterprise-ready models with precise data curation, cutting-edge customization, retrieval-augmented generation (RAG), and accelerated performance with NeMo, part of NVIDIA AI Foundry—a platform and service for building custom generative AI models with enterprise data and domain-specific knowledge.

Get Started With NVIDIA Cosmos for Advancing Physical AI

Explore how to get started with NVIDIA Cosmos™, a platform comprising state-of-the-art generative world foundation models, advanced tokenizers, guardrails, and an accelerated video processing pipeline built to accelerate the development of physical AI systems.

Read Blog

Generative AI Essentials

Get on the fast-track to enterprise transformation with generative AI. This series of on-demand webinars offers a roadmap to accelerated development and deployment, offering the knowledge you need to take full advantage of this breakthrough technology.

Watch Webinars

Benefits

Explore the Benefits of NVIDIA NeMo for Generative AI

Flexible

Train and deploy generative AI anywhere, across clouds, data centers, and the edge.

Production Ready

Deploy into production with a secure, optimized, full-stack solution that offers support, security, and API stability as part of NVIDIA AI Enterprise.

Increased ROI

Quickly train, customize, and deploy large language models (LLMs), VLMs, video, and speech AI at scale, reducing time to solution and increasing ROI.

Accelerated Performance

Maximize throughput and minimize training time with multi-node, multi-GPU training and inference.

End-to-End Pipeline

Experience the benefits of a complete generative AI pipeline—from data processing and training to inference and guardrails of AI models.

Superior Visual Generation

State-of-the-art reconstruction quality using Cosmos tokenizer across a wide spectrum of image and video categories.

Features

The Complete Solution for Building Enterprise-Ready Generative AI Models

NeMo Curator

Accelerate Data Curation

NVIDIA NeMo Curator improves generative AI model accuracy by processing text, image, and video data at scale. It also provides pre-built pipelines for generating synthetic data to customize and evaluate generative AI systems.

With NeMo Curator, you can accelerate video processing from years to days, compared to alternatives.

Start Developing

Video and Image Tokenizer

Generate High-Quality Visuals

NVIDIA Cosmos tokenizers are open models designed to simplify the development and customization of VLMs and video AI models. They offer high-quality compression and fast, excellent visual reconstruction, lowering TCO during model development and deployments.

Read the Blog

NeMo Customizer

Simplify Fine-Tuning

NVIDIA NeMo Customizer is a high-performance, scalable microservice that simplifies fine-tuning and alignment of LLMs for domain-specific use cases, making it easier to adopt generative AI across industries.

Read the Blog Apply for Early Access

NeMo Evaluator

Evaluate Models

NVIDIA NeMo Evaluator provides a microservice to assess generative AI models and pipelines across academic and custom benchmarks on any platform.

Read the Blog Apply for Early Access

NeMo Retriever

Seamless Data Retrieval

NVIDIA NeMo™ Retriever is a collection of microservices that provide world-class information retrieval with high accuracy and maximum data privacy.

Start Developing

NeMo Guardrails

Generative AI Guardrails

NVIDIA NeMo Guardrails is a scalable rail orchestration platform for ensuring the security, safety, accuracy, and topical relevance of LLM interactions.

Start Developing

NVIDIA NIM™

Generative AI Inference

NVIDIA NIM, part of NVIDIA AI Enterprise, is a set of easy-to-use microservices designed for secure, reliable deployment of high-performance AI model inferencing across clouds, data centers, and workstations.

Start Developing Start Prototyping

Use Cases

How NeMo Is Being Used

See how NVIDIA NeMo supports industry use cases and jump-starts your AI development.

AI Chatbots
AI Assistant
PDF Data Extraction
Content Generation
Humanoid Robot

AI Chatbots

Organizations are looking to build smarter AI chatbots using custom LLMs and retrieval-augmented generation (RAG). With RAG, chatbots can accurately answer domain-specific questions by retrieving current information from an organization’s knowledge base and providing real-time responses in natural language. These chatbots can be used to enhance customer support, personalize AI avatars, manage enterprise knowledge, streamline employee onboarding, provide intelligent IT support, create content, and more.

Learn More About AI Chatbots

AI Assistant

Businesses are deploying AI assistants to efficiently address the queries of millions of customers and employees around the clock. Powered by customized NVIDIA NIM microservices for LLMs, RAG, and speech and translation AI, these AI teammates deliver immediate and accurate spoken responses, even in the presence of background noise, poor sound quality, and diverse dialects and accents.

Learn More About AI Assistants

Try Now

Multimodal PDF Data Extraction

Trillions of PDF files are generated every year, each file likely consisting of multiple pages filled with various content types, including text, images, charts, and tables. This goldmine of data can only be used as quickly as humans can read and understand it. But with generative AI and RAG, this untapped data can be used to uncover business insights that can help employees work more efficiently and result in lower costs.

Learn More About Multimodal PDF Data Extraction

Try Now

An image of a human using a laptop generating data using retrieval-augmented generation using NVIDIA NIM.

Content Generation

Generative AI makes it possible to generate highly relevant, bespoke, and accurate content grounded in the domain expertise and proprietary IP of your enterprise.

Learn More About Content Generation

Humanoid Robot

Humanoid robots are built to adapt quickly to existing human-centric urban and industrial work spaces, tackling tedious, repetitive, or physically demanding tasks. Their versatility has them in such varied locations as factory floors to healthcare facilities, where these robots are assisting humans and helping alleviate labor shortages with automation.

Learn More About Humanoid Robots

Apptronik

Starting Options

Ways to Get Started With NVIDIA NeMo

Use the right tools and technologies to take generative AI models from development to production.

Try

Start prototyping with leading NVIDIA-built and open-source generative AI models that can be deployed using NVIDIA NIM™ microservices and customized with NeMo.

Try Now

Experience

Access NVIDIA-hosted infrastructure and guided hands-on labs that include step-by-step instructions and examples, available for free on NVIDIA LaunchPad.

Access Hands-On Lab

Build

Jump-start building your generative AI solutions with NVIDIA Blueprints, customizable reference applications, available for free on the NVIDIA API catalog.

Try Now

Develop

For those looking to use NeMo for development, the software is available to download for free or apply for early access.

Start Developing

Deploy

Get a free license to try NVIDIA AI Enterprise in production for 90 days using your existing infrastructure.

Request a 90-Day License

Compare Starting Options

Customer Stories

How Industry Leaders Are Driving Innovation With NeMo

Bringing Personalized Generative AI to Customers

Dropbox plans to leverage NVIDIA’s AI foundry to build custom models and improve AI-powered knowledge work with the Dropbox Dash universal search tool and Dropbox AI.

Learn More

Perplexity

Enhance Model Performance for AI-Powered Search Engines

Using NVIDIA NeMo, Perplexity aims to quickly customize frontier models to improve the accuracy and quality of search results and optimize them for lower latency and high throughput for a better user experience.

Learn More

Amdocs

Bringing Custom Generative AI to the Global Telco Industry

Amdocs plans to build custom LLMs for the $1.7 trillion global telecommunications industry using NVIDIA’s AI foundry on Microsoft Azure.

Learn More

Adopters

Leading Adopters Across All Industries

Customers
Partners

Resources

The Latest in NVIDIA NeMo Resources

Blogs
Sessions
Training
Videos

February 11, 2025

What Are Foundation Models?

Editor’s note: This article, originally published on March 13, 2023, has been updated. The mics were live and tape was rolling in the studio where the Miles Davis Quintet was recording dozens of tunes in 1956 for Prestige Records. When an engineer asked for the next song’s title, Davis shot back, “I’ll play it, and Read Article

February 04, 2025

NVIDIA Blackwell Now Generally Available in the Cloud

AI reasoning models and agents are set to transform industries, but delivering their full potential at scale requires massive compute and optimized software. The “reasoning” process involves multiple models, generating many additional tokens, and demands infrastructure with a combination of high-speed communication, memory and compute to ensure real-time, high-quality results. To meet this demand, CoreWeave Read Article

January 31, 2025

What Is Retrieval-Augmented Generation, aka RAG?

Editor’s note: This article, originally published on Nov. 15, 2023, has been updated. To understand the latest advancements in generative AI, imagine a courtroom. Judges hear and decide cases based on their general understanding of the law. Sometimes a case — like a malpractice suit or a labor dispute — requires special expertise, so judges Read Article

View All Blogs

View More Sessions

Develop on NVIDIA Generative AI Platform

Get Started Building Custom Generative AI

Kick-start your generative journey with access to NVIDIA NeMo—for free on NVIDIA LaunchPad.

Get Started

Elevate Your LLM Skills

Take advantage of our comprehensive LLM learning path, covering fundamental to advanced topics featuring hands-on training developed and delivered by NVIDIA experts. You can opt for the flexibility of self-paced courses or enroll in instructor-led workshops to earn a certificate of competency.

Explore LLM Training

Get Certified by NVIDIA

Showcase your Generative AI skills and advance your career by getting certified by NVIDIA. Our new professional certification program offers two developer exams focusing on proficiency in large language models (LLMs) and multimodal workflow skills.

Learn About Certification

View More Training

Building and Deploying Generative AI Models

Enterprises are turning to generative AI to revolutionize the way they innovate, optimize operations, and build a competitive advantage. NeMo is an end-to-end platform for curating data; training, customizing, and evaluating multimodal models; and running inference at scale. It supports text, image, video, and speech generation.

Watch Now

Unlocking Synthetic Data Generation with Llama 3.1

Learn how to use the Meta Llama 3.1 405B model to generate tailored synthetic data for your specific domain and explore how to evaluate this data using the Nemotron-4 340B Reward model and ensure alignment with human preferences through NVIDIA NeMo.

Watch Now

Build World-Class AI Virtual Assistants for Customer Service with RAG

Learn how companies can use the AI virtual assistant for customer service NVIDIA AI Blueprint to improve the operational efficiency of existing contact center solutions or build new customer service-centric systems.

Watch Now

View all Videos

Next Steps

Ready to Get Started?

Use the right tools and technologies to take generative AI models from development to production.

For Developers

Explore everything you need to start developing with NVIDIA NeMo, including the latest documentation, tutorials, technical blogs, and more.

Start Developing

Get in Touch

Talk to an NVIDIA product specialist about moving from pilot to production with the assurance of security, API stability, and support that comes with NVIDIA AI Enterprise.

NVIDIA NeMo

Overview

What Is NVIDIA NeMo?

Get Started With NVIDIA Cosmos for Advancing Physical AI

Generative AI Essentials

Benefits

Explore the Benefits of NVIDIA NeMo for Generative AI

Flexible

Production Ready

Increased ROI

Accelerated Performance

End-to-End Pipeline

Superior Visual Generation

Features

The Complete Solution for Building Enterprise-Ready Generative AI Models

Accelerate Data Curation

Generate High-Quality Visuals

Simplify Fine-Tuning

Evaluate Models

Seamless Data Retrieval

Generative AI Guardrails

Generative AI Inference

How NeMo Is Being Used

AI Chatbots

Starting Options

Ways to Get Started With NVIDIA NeMo

Try

Experience

Build

Develop

Deploy

Customer Stories

How Industry Leaders Are Driving Innovation With NeMo

Bringing Personalized Generative AI to Customers

Enhance Model Performance for AI-Powered Search Engines

Bringing Custom Generative AI to the Global Telco Industry

Adopters

Leading Adopters Across All Industries

Customers

Partners

Resources

The Latest in NVIDIA NeMo Resources

Next Steps

Ready to Get Started?

For Developers

Get in Touch

AI Sweden

Accelerate Industry Applications With LLMs

Amazon

How Amazon and NVIDIA Help Sellers Create Better Product Listings With AI

Amdocs

NVIDIA and Amdocs Bring Custom Generative AI to Global Telco Industry

AWS

NVIDIA Powers Training for Some of the Largest Amazon Titan Foundation Models

Accenture

Accelerate Generative AI Adoption for Enterprises

Azure

Harnessing the Power of NVIDIA AI Enterprise on Azure Machine Learning

Bria

Bria Builds Responsible Generative AI for Enterprises Using NVIDIA NeMo, Picasso

Cohesity

Unlock Your Data Superpower: NVIDIA Microservices Unleash Enterprise-Grade Secure Generative AI for Cohesity

CrowdStrike

Shaping the Future of AI in the Cybersecurity Domain

Dell

Dell Validated Design for Generative AI With NVIDIA

Deloitte

Unlock the Value of Generative AI Across Enterprise Software Platforms

Domino Data Lab

Domino Offers Production-Ready Generative AI Powered by NVIDIA

Dropbox

Dropbox and NVIDIA to Bring Personalized Generative AI to Millions of Customers

Google Cloud

AI Titans Collaborate to Create Generative AI Magic

HuggingFace

Leading AI Community to Accelerate Data Curation Pipeline

KT

Creating New Customer Experiences With LLMs

Lenovo

New Reference Architecture for Generative AI Based on LLMs