NVIDIA AI Foundation Models

Reasoning and instruct models optimized for enterprise agentic AI.

Introduction
Benefits
AI Models
Success Stories
Partners
Get Started

Introduction
Benefits
AI Models
Success Stories
Partners
Get Started

What Are NVIDIA AI Foundation Models?

NVIDIA AI foundation models are community- and NVIDIA-built models that are optimized to deliver the best performance on NVIDIA-accelerated infrastructure. Enterprises can customize and deploy these models with NVIDIA NIM^TM microservices and streamline the transition to production AI.

Explore the NVIDIA API catalog and experience the multimodal models directly from a browser, connect to NVIDIA-hosted endpoints and start POC for free, or download and run on your compute platform.

Supercharge Agents With Expert Reasoning

Power your enterprise agents with open NVIDIA Llama Nemotron with reasoning models that deliver leading accuracy and efficiency for a wide range of reasoning and non-reasoning agentic tasks.

Available as NVIDIA NIM microservices, the NVIDIA-developed and 3rd party models are optimized for performance and can be rapidly deployed on any NVIDIA-accelerated infrastructure.

Explore Reasoning Models

NVIDIA NeMo

A comprehensive software suite to build, monitor, and optimize AI agents across their lifecycle at enterprise scale.

Get Started

Documentation

The World Foundation Model Platform to Accelerate Physical AI

The development of physical-AI-embodied systems such as robots and autonomous vehicles is accelerated with the new NVIDIA Cosmos^TM platform.

Read the Press Release

Build Custom AI Models for Enterprise AI Agents

The NVIDIA AI Foundry —a collection of NVIDIA AI foundation models, NVIDIA NeMo^TM framework and microservices, NVIDIA NIM microservices, and NVIDIA DGX^TM Cloud —gives enterprises an end-to-end solution for creating custom generative AI models.

Start with State-of-the-Art Generative AI Models

Try leading multimodal models, including Llama, Llama Nemotron with reasoning, Cosmos Nemotron, and Phi, optimized for the highest performance and efficiency.

Experience NVIDIA AI Foundation Models

Customize the Foundation Models

Tune and test the models with proprietary data using NVIDIA NeMo.

Customize With NVIDIA NeMo

Build Models Faster in the Cloud

Customize models on DGX Cloud, a serverless AI-training-as-a-service platform for enterprise developers.

Train on NVIDIA DGX Cloud

Run Models in Production

Deploy custom and NVIDIA AI foundation models anywhere with enterprise-grade NVIDIA NIM.

Scale With NVIDIA NIM

Benefits of NVIDIA AI Foundation Models and Endpoints

Performance Optimized

Lower your TCO and increase energy efficiency by running inference up to 4x faster.

Enterprise-Grade

Use lean, high-performing LLMs built from responsibly sourced datasets.

Try Models on the Fly

Experience a model’s peak performance directly from a browser with a GUI or API.

Ready-to-Integrate APIs

Connect your applications to API endpoints and test their real-world performance running on a fully-accelerated stack.

Deploy Your Models Anywhere

Run the model anywhere, from cloud to data center to workstations, with NVIDIA AI Enterprise.

Experience-Optimized Generative AI Models

NVIDIA AI foundation models include leading community- and NVIDIA-built models to support various use cases, including content generation, image creation, drug discovery, and IT service automation.

Llama Nemotron

Highest reasoning accuracy and efficiency models to advance autonomy of complex agentic AI systems.

View Llama Nemotron Models

Cosmos

NVIDIA Cosmos^TM, a platform of state-of-the-art generative world foundation models, advanced tokenizers, guardrails, and an accelerated data processing and curation pipeline built to accelerate the development of physical AI systems such as autonomous vehicles (AVs) and robots.

Try Cosmos

Mistral Large

Mistral Large excels in complex multilingual reasoning tasks, including text understanding, and code generation.

Try Mistral Large

View All Models

Power Your Enterprise Applications With Retrieval-Augmented Generation (RAG)

Build AI chatbots that connect with your custom LLMs and knowledge bases to accurately and naturally answer domain-specific questions in real time.

Explore the RAG AI Workflow

Success Stories

Generative AI is impacting every industry today—from IT services and telecommunications to finance and retail. Putting generative AI into practice requires enterprises to have access to an AI foundry to build custom models using proprietary data and deploy them at scale. See how the world’s leading organizations are serving their customers with NVIDIA AI.

ServiceNow

ServiceNow is bringing intelligent workflow automation to their Now Platform with custom LLMs using NVIDIA AI foundation models and NVIDIA NeMo on NVIDIA DGX.

Learn More

Amdocs

Amdocs is building custom LLMs for the $1.7 trillion global telecommunications industry using the NVIDIA AI foundry service on Microsoft Azure.

Learn More

cont-1
cont-2

Ecosystem Partners

Let’s Get Started

Try the latest, fully optimized NVIDIA AI foundation models today from the NGC catalog, Azure ML model catalog, or Hugging Face.

Experience the Models

Notify me as new models are optimized and added to NVIDIA’s collection of AI foundation models.

Notify Me

Explore additional generative AI resources and tools.

Learn More