NVIDIA AI Foundation Models

Reasoning and instruct models optimized for enterprise agentic AI.

What Are NVIDIA AI Foundation Models?

NVIDIA AI foundation models are community- and NVIDIA-built models that are optimized to deliver the best performance on NVIDIA-accelerated infrastructure. Enterprises can customize and deploy these models with NVIDIA NIMTM microservices and streamline the transition to production AI.

Explore the NVIDIA API catalog and experience the multimodal models directly from a browser, connect to NVIDIA-hosted endpoints and start POC for free, or download and run on your compute platform.

Supercharge Agents With Expert Reasoning

Power your enterprise agents with open NVIDIA Llama Nemotron with reasoning models that deliver leading accuracy  and efficiency for a wide range of reasoning and non-reasoning agentic tasks.

Available as NVIDIA NIM microservices, the NVIDIA-developed and 3rd party models are optimized for performance and can be rapidly deployed on any NVIDIA-accelerated infrastructure.

The World Foundation Model Platform to Accelerate Physical AI

The development of physical-AI-embodied systems such as robots and autonomous vehicles is accelerated with the new NVIDIA CosmosTM platform.

Build Custom AI Models for Enterprise AI Agents

The NVIDIA AI Foundry —a collection of NVIDIA AI foundation models, NVIDIA NeMoTM framework and microservices, NVIDIA NIM microservices, and NVIDIA DGXTM Cloud —gives enterprises an end-to-end solution for creating custom generative AI models.

Start with State-of-the-Art Generative AI Models

Try leading multimodal models, including Llama, Llama Nemotron with reasoning, Cosmos Nemotron, and Phi, optimized for the highest performance and efficiency.

Customize the Foundation Models

Tune and test the models with proprietary data using NVIDIA NeMo.

Build Models Faster in the Cloud

Customize models on DGX Cloud, a serverless AI-training-as-a-service platform for enterprise developers.

Run Models in Production

Deploy custom and NVIDIA AI foundation models anywhere with enterprise-grade NVIDIA NIM.

Benefits of NVIDIA AI Foundation Models and Endpoints

Performance Optimized

Lower your TCO and increase energy efficiency by running inference up to 4x faster.

Enterprise-Grade

Use lean, high-performing LLMs built from responsibly sourced datasets.

Try Models on the Fly

Experience a model’s peak performance directly from a browser with a GUI or API.

Ready-to-Integrate APIs

Connect your applications to API endpoints and test their real-world performance running on a fully-accelerated stack.

Deploy Your Models Anywhere

Run the model anywhere, from cloud to data center to workstations, with NVIDIA AI Enterprise.

Experience-Optimized Generative AI Models

NVIDIA AI foundation models include leading community- and NVIDIA-built models to support various use cases, including content generation, image creation, drug discovery, and IT service automation.

Llama 2 : An AI agent showing its thinking with a reasoning model

Llama Nemotron

Highest reasoning accuracy and efficiency models to advance autonomy of complex agentic AI systems.

Cosmos

Cosmos

NVIDIA CosmosTM, a platform of state-of-the-art generative world foundation models, advanced tokenizers, guardrails, and an accelerated data processing and curation pipeline built to accelerate the development of physical AI systems such as autonomous vehicles (AVs) and robots.

NVIDIA's Nemotron-3 8B : An enterprise-grade Question-Answering LLM

Mistral Large

Mistral Large excels in complex multilingual reasoning tasks, including text understanding, and code generation.

Power Your Enterprise Applications With Retrieval-Augmented Generation (RAG)

Build AI chatbots that connect with your custom LLMs and knowledge bases to accurately and naturally answer domain-specific questions in real time.

Success Stories

Generative AI is impacting every industry today—from IT services and telecommunications to finance and retail.  Putting generative AI into practice requires enterprises to have access to an AI foundry to build custom models using proprietary data and deploy them at scale. See how the world’s leading organizations are serving their customers with NVIDIA AI.

Ecosystem Partners

Google Cloud
Hugging Face
Microsoft Azure
Oracle Cloud

Let’s Get Started

Try the latest, fully optimized NVIDIA AI foundation models today from the NGC catalog, Azure ML model catalog, or Hugging Face.

Notify me as new models are optimized and added to NVIDIA’s collection of AI foundation models.

Explore additional generative AI resources and tools.

Select Location
Middle East