Optimized for enterprise generative AI.
NVIDIA AI Foundation models are community and NVIDIA-built models and are NVIDIA-optimized to deliver the best performance on NVIDIA accelerated infrastructure. Enterprises can customize and deploy these models with NVIDIA microservices and streamline the transition to production AI.
Explore the NVIDIA API catalog and experience the models directly from a browser or connect to NVIDIA-hosted endpoints and start POC for free.
Deploy the NVIDIA AI Foundation models at scale with NVIDIA NIM—a set of easy-to-use microservices that ensures seamless, scalable inference, on-premises or in the cloud, leveraging industry-standard APIs.
Access foundation models, enterprise software, accelerated computing, and AI expertise to build, fine-tune, and deploy custom models for your enterprise applications.
The NVIDIA AI foundry service—a collection of NVIDIA AI Foundation Models, NVIDIA NeMo™ framework and tools, and NVIDIA DGX™ Cloud gives enterprises an end-to-end solution for creating custom generative AI models.
Try leading foundation models, including Llama 2, Stable Diffusion, and NVIDIA’s Nemotron-3 8B family, optimized for the highest performance efficiency.
Tune and test the models with proprietary data using NVIDIA NeMo.
Customize models on DGX Cloud, a serverless AI-training-as-a-service platform for enterprise developers.
Deploy custom and NVIDIA AI Foundation Models anywhere with enterprise-grade NVIDIA NIM.
Lower your TCO and increase energy efficiency by running inference up to 4x faster.
Use lean, high-performing large language models (LLMs) built from responsibly sourced datasets.
Experience a models’ peak performance directly from a browser with a GUI or API.
Connect your applications to API endpoints and test their real-world performance running on a fully-accelerated stack.
Run the model anywhere, from cloud to data center to workstations, with NVIDIA AI Enterprise.
NVIDIA AI Foundation Models include leading community- and NVIDIA-built models to support various use cases, including content generation, image creation, drug discovery, and IT service automation.
Llama 3 is a large language AI model capable of generating text and code in response to prompts.
Stable Diffusion XL (SDXL) generates expressive images with shorter prompts and inserts words inside images.
Mistral Large excels in complex multilingual reasoning tasks, including text understanding, and code generation.
Build AI chatbots that connect with your custom LLMs and knowledge bases to accurately and naturally answer domain-specific questions in real time.
Generative AI is impacting every industry today—from IT services and telecommunications to finance and retail. Putting generative AI into practice requires enterprises to have access to an AI foundry to build custom models using proprietary data and deploy them at scale. See how the world’s leading organizations are serving their customers with NVIDIA AI.
ServiceNow is bringing intelligent workflow automation to their Now Platform with custom LLMs using NVIDIA AI Foundation Models and NVIDIA NeMo on NVIDIA DGX.
Amdocs is building custom LLMs for the $1.7 trillion global telecommunications industry using the NVIDIA AI foundry service on Microsoft Azure.
Try the latest, fully optimized NVIDIA AI Foundation Models today from the NGC catalog, Azure ML model catalog, or Hugging Face.
Notify me as new models are optimized and added to NVIDIA’s collection of AI foundation models.
Explore additional generative AI resources and tools.