NVIDIA AI Foundation Models and Endpoints

Optimized for enterprise generative AI.

What Are NVIDIA AI Foundation Models and Endpoints?

NVIDIA AI Foundation models are community and NVIDIA-built models and are NVIDIA-optimized to deliver the best performance on NVIDIA accelerated infrastructure. Enterprises can customize and deploy these models with NVIDIA microservices and streamline the transition to production AI.

Explore the NVIDIA API catalog and experience the models directly from a browser or connect to NVIDIA-hosted endpoints and start POC for free.

Accelerate Time to Production AI

Deploy the NVIDIA AI Foundation models at scale with NVIDIA NIM—a set of easy-to-use microservices that ensures seamless, scalable inference, on-premises or in the cloud, leveraging industry-standard APIs.

Build Custom Generative AI Models With NVIDIA AI Foundry

Access foundation models, enterprise software, accelerated computing, and AI expertise to build, fine-tune, and deploy custom models for your enterprise applications.

Build Custom Generative AI Models for Enterprise Applications

The NVIDIA AI foundry service—a collection of NVIDIA AI Foundation Models, NVIDIA NeMo™ framework and tools, and NVIDIA DGX™ Cloud gives enterprises an end-to-end solution for creating custom generative AI models.

Start with State-of-the-Art Generative AI Models

Try leading foundation models, including Llama 2, Stable Diffusion, and NVIDIA’s Nemotron-3 8B family, optimized for the highest performance efficiency.

Customize the Foundation Models

Tune and test the models with proprietary data using NVIDIA NeMo.

Build Models Faster in the Cloud

Customize models on DGX Cloud, a serverless AI-training-as-a-service platform for enterprise developers.

Run Models in Production

Deploy custom and NVIDIA AI Foundation Models anywhere with enterprise-grade NVIDIA NIM.

Start with State-of-the-Art Generative AI Models

Try leading foundation models, including Llama 2, Stable Diffusion, and NVIDIA’s Nemotron-3 8B family, optimized for the highest performance efficiency.

Customize the Foundation Models

Tune and test the models with proprietary data using NVIDIA NeMo.

Build Models Faster in the Cloud

Customize models on DGX Cloud, a serverless AI-training-as-a-service platform for enterprise developers.

Run Models in Production

Deploy custom and NVIDIA AI Foundation Models anywhere with enterprise-grade NVIDIA NIM.

Benefits of NVIDIA AI Foundation Models and Endpoints

Performance Optimized

Lower your TCO and increase energy efficiency by running inference up to 4x faster.

Enterprise-Grade

Use lean, high-performing large language models (LLMs) built from responsibly sourced datasets.

Try Models on the Fly

Experience a models’ peak performance directly from a browser with a GUI or API.

Ready-to-Integrate APIs

Connect your applications to API endpoints and test their real-world performance running on a fully-accelerated stack.

Deploy Your Models Anywhere

Run the model anywhere, from cloud to data center to workstations, with NVIDIA AI Enterprise.

Experience-Optimized Generative AI Models

NVIDIA AI Foundation Models include leading community- and NVIDIA-built models to support various use cases, including content generation, image creation, drug discovery, and IT service automation.

Llama 2

Llama 2 is a large language AI model capable of generating text and code in response to prompts.

Stable Diffusion XL

Stable Diffusion XL (SDXL) generates expressive images with shorter prompts and inserts words inside images.

Nemotron-3-8B-QA

Nemotron-3 8B is an enterprise-grade Question-Answering LLM that enterprises can customize for their domains.

Power Your Enterprise Applications With Retrieval-Augmented Generation (RAG)

Build AI chatbots that connect with your custom LLMs and knowledge bases to accurately and naturally answer domain-specific questions in real time.

Success Stories

Generative AI is impacting every industry today—from IT services and telecommunications to finance and retail.  Putting generative AI into practice requires enterprises to have access to an AI foundry to build custom models using proprietary data and deploy them at scale. See how the world’s leading organizations are serving their customers with NVIDIA AI.

Ecosystem Partners

Let’s Get Started

Try the latest, fully optimized NVIDIA AI Foundation Models today from the NGC catalog, Azure ML model catalog, or Hugging Face.

Notify me as new models are optimized and added to NVIDIA’s collection of AI foundation models.

Explore additional generative AI resources and tools.