Discover

Experience Test-Time Scaling with DeepSeek
State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.
Most Popular Models
The leading open models built by the community, optimized and accelerated by NVIDIA's enterprise-ready inference runtime.

deepseek-ai / deepseek-r1
State-of-the-art, high-efficiency LLM excelling in reasoning, math, and coding.

nvidia / cosmos-1.0-diffusion-7b
Generates physics-aware video world states from text and image prompts for physical AI development.

nvidia / cosmos-1.0-autoregressive-5b
Generates future frames of a physics-aware world state from just an image or short video prompt for physical AI development.

meta / llama-3.3-70b-instruct
Advanced LLM for reasoning, math, general knowledge, and function calling.

nvidia / llama-3.1-nemotron-70b-instruct
Llama-3.1-Nemotron-70B-Instruct is a large language model customized by NVIDIA to improve the helpfulness of LLM-generated responses.

meta / llama-3.1-405b-instruct
Advanced LLM for synthetic data generation, distillation, and inference for chatbots, coding, and domain-specific tasks.

meta / llama-3.2-90b-vision-instruct
Cutting-edge vision-language model excelling in high-quality reasoning from images.

meta / llama-3.2-3b-instruct
Advanced state-of-the-art small language model with language understanding, superior reasoning, and text generation.

nvidia / nemotron-4-340b-reward
Grades responses on five attributes: helpfulness, correctness, coherence, complexity, and verbosity.

meta / llama-3.1-8b-instruct
Advanced state-of-the-art model with language understanding, superior reasoning, and text generation.

nv-mistralai / mistral-nemo-12b-instruct
Most advanced language model for reasoning, code, and multilingual tasks; runs on a single GPU.

mistralai / mixtral-8x22b-instruct-v0.1
A mixture-of-experts (MoE) LLM that follows instructions, completes requests, and generates creative text.

stabilityai / sdxl-turbo
A fast generative text-to-image model that can synthesize photorealistic images from a text prompt in a single network evaluation.

nvidia / nemotron-4-340b-instruct
Creates diverse synthetic data that mimics the characteristics of real-world data.

google / gemma-2-9b-it
Cutting-edge text generation model for text understanding, transformation, and code generation.

microsoft / phi-3-vision-128k-instruct
Cutting-edge open multimodal model excelling in high-quality reasoning from images.
Create AI Agents
Blueprints for building and deploying agentic AI applications, digital twins, and more.

nvidia / PDF to Podcast
Transform PDFs into AI podcasts for engaging on-the-go audio content.

nvidia / Multimodal PDF Data Extraction
Ingest and extract highly accurate insights contained in text, graphs, charts, and tables within massive volumes of PDF documents.

nvidia / Vulnerability Analysis for Container Security
Rapidly identify and mitigate container security vulnerabilities with generative AI.

nvidia / Build an AI Virtual Assistant
Create intelligent virtual assistants for customer service across every industry.

nvidia / Build a Video Search and Summarization Agent
Ingest massive volumes of live or archived videos and extract insights for summarization and interactive Q&A.

nvidia / Build a Digital Human
Create intelligent, interactive avatars for customer service across industries.
Explore Agents from Partners
Reference blueprints co-developed with leading agentic AI platform providers.

crewai / Code Documentation for Software Development
Document your GitHub repositories with AI agents using CrewAI and the Llama 3.3 70B NIM.

langchain / Structured Report Generation
Generate detailed, structured reports on any topic using LangGraph and the Llama 3.3 70B NIM.

llamaindex / Document Research Assistant for Blog Creation
Automate research and generate blogs with AI agents using LlamaIndex and the Llama 3.3 70B NIM.

pipecat / Voice Agent Framework for Conversational AI
Automate voice AI agents with NVIDIA NIM microservices and Pipecat.

wandb / Traceability for Agentic AI
Trace and evaluate AI agents with Weights & Biases.