NVIDIA pioneered accelerated computing to push the boundaries of innovation for developers, designers, and creators around the globe and transform the world’s largest industries. NVIDIA accelerated computing combined with the flexibility, global reach, and scale of Google Cloud speeds up time to solution and drives down infrastructure TCO for computationally intensive workloads like generative AI, data analytics, high-performance computing (HPC), graphics, and gaming wherever they need to run.
NVIDIA and Google Cloud partner across every layer of the generative AI stack, providing access to next-generation infrastructure, enterprise-grade software, and inference microservices, and optimizing foundation models to shorten the time from prototype to production deployment.
NVIDIA and Google Cloud have joined forces to offer cutting-edge data analytics solutions, enabling enterprises to gain valuable insights from massive datasets and unlock new possibilities with data-driven decision making and innovation.
The NVIDIA accelerated computing platform on Google Cloud helps developers, scientists, engineers, and researchers tackle complex workloads in fields like life sciences, climate modeling, manufacturing, energy, quantum simulations, and financial services.
Read how Let’s Enhance, a leading computer vision startup, uses the NVIDIA AI platform on Google Kubernetes Engine (GKE) to deploy their AI-powered photo editing service into production, increasing throughput by 80 percent and reducing costs by 34 percent.
Learn how Writer, a full-stack generative AI platform for enterprises, leverages NVIDIA H100 and L4 Tensor Core GPUs on GKE with the NVIDIA NeMo™ framework and TensorRT™-LLM to train and deploy over 17 large language models (LLMs) that scale up to 70 billion parameters.
By leveraging the power of NVIDIA NIM™ inference microservices on GKE with NVIDIA GPUs, LiveX AI has achieved a 6.1X increase in average token speed. This enhancement lets LiveX AI deliver personalized experiences to customers in real time, including seamless customer support, instant product recommendations, and reduced returns.
Select from a broad portfolio of the latest NVIDIA GPUs on Google Compute Engine (GCE) to accelerate a wide range of compute-intensive workloads, including distributed LLM training, real-time AI inference, data-intensive analytics on big data frameworks, scientific simulations and modeling in HPC, and rendering photorealistic 3D graphics and immersive virtual environments.
The Google Cloud A3 VM is powered by eight NVIDIA H100 Tensor Core GPUs and is ideal for training and serving LLMs and generative AI workloads. The A3 Mega VM offers double the GPU-to-GPU networking bandwidth of the A3 VM and is ideal for distributed AI training and inference workloads.
The Google Cloud G2 VM offers access to one, two, four, or eight NVIDIA L4 Tensor Core GPUs and is ideal for accelerating a wide range of workloads, including generative AI inference, AI video processing, HPC, graphics rendering, and visualization.
Google Cloud will be among the first cloud providers to offer the NVIDIA Blackwell platform in two configurations—NVIDIA GB200 NVL72 and HGX™ B200—to enable a new era of computing with real-time LLM inference and massive-scale training performance for trillion-parameter scale models. NVIDIA GB200 will be available first with NVIDIA DGX™ Cloud on Google Cloud.
NVIDIA offers a comprehensive, performance-optimized software stack directly on Google Cloud Marketplace to unlock the full potential of cutting-edge NVIDIA accelerated infrastructure and reduce the complexity of building accelerated solutions on Google Cloud. This lowers TCO through improved performance, simplified deployment, and streamlined development.
NVIDIA DGX Cloud is an AI platform, co-engineered at every layer with Google Cloud, that offers developers dedicated, scalable access to the latest NVIDIA architecture. Optimized to deliver the highest performance for today’s AI workloads, DGX Cloud includes direct access to NVIDIA AI experts who help maximize resource efficiency and utilization. DGX Cloud is currently available on Google Cloud, with NVIDIA Grace™ Blackwell coming soon.
NVIDIA AI Enterprise is a cloud native platform that streamlines development and deployment of production-grade AI solutions including generative AI, computer vision, speech AI, and more. Easy-to-use microservices provide optimized model performance with enterprise-grade security, support, and stability to ensure a smooth transition from prototype to production for enterprises that run their businesses on AI.
NVIDIA NIM, part of NVIDIA AI Enterprise, is a set of easy-to-use inference microservices for accelerating the deployment of AI applications that require natural language understanding and generation. By offering developers access to industry-standard APIs, NIM enables the creation of powerful copilots, chatbots, and AI assistants, while making it easy for IT and DevOps teams to self-host AI models in their own managed environments. NVIDIA NIM can be deployed on GCE, GKE, or Google Cloud Run.
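Because NIM microservices expose industry-standard, OpenAI-compatible APIs, existing client code can point at a self-hosted endpoint with minimal changes. Below is a minimal sketch using the openai Python client; the endpoint URL and model name are placeholders for your own deployment.

```python
# Minimal sketch: querying a self-hosted NIM endpoint through its
# OpenAI-compatible API. The host, port, and model name below are
# placeholders for your own deployment.
from openai import OpenAI

client = OpenAI(
    base_url="http://localhost:8000/v1",  # your NIM service URL
    api_key="not-used",  # a self-hosted NIM does not require a real key
)

response = client.chat.completions.create(
    model="meta/llama3-8b-instruct",  # example NIM model name
    messages=[{"role": "user", "content": "Summarize NVIDIA NIM in one sentence."}],
    max_tokens=128,
)
print(response.choices[0].message.content)
```

The same client code works whether the NIM container runs on GCE, GKE, or Cloud Run; only the base_url changes.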
NVIDIA and Google Cloud collaborate closely on integrations that bring the power of the full-stack NVIDIA AI platform to a broad range of native Google Cloud services, giving developers the flexibility to choose the level of abstraction they need. With these integrations, Google Cloud customers can combine the power of both enterprise-grade NVIDIA AI software and the computational power of NVIDIA GPUs to maximize application performance within the Google Cloud services they’re already familiar with.
Combine the power of the NVIDIA AI platform with the flexibility and scalability of GKE to efficiently manage and scale generative AI training, inference, and other compute-intensive workloads. GKE’s on-demand provisioning, automated scaling, NVIDIA Multi-Instance GPU (MIG) support, and GPU time-sharing capabilities ensure optimal resource utilization. This minimizes operational costs while delivering the necessary computational power for demanding AI workloads.
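As a rough illustration of how a GKE workload requests NVIDIA GPU resources, the following sketch uses the official Kubernetes Python client. The image name is a placeholder, and the node-selector value assumes a node pool with NVIDIA L4 GPUs.

```python
# Minimal sketch: requesting an NVIDIA GPU for a pod on GKE with the
# Kubernetes Python client. The node selector key is the standard GKE
# accelerator label; the image and names are placeholders.
from kubernetes import client, config

config.load_kube_config()  # or load_incluster_config() inside the cluster

pod = client.V1Pod(
    metadata=client.V1ObjectMeta(name="gpu-inference"),
    spec=client.V1PodSpec(
        restart_policy="Never",
        node_selector={"cloud.google.com/gke-accelerator": "nvidia-l4"},
        containers=[
            client.V1Container(
                name="app",
                image="us-docker.pkg.dev/my-project/my-repo/inference:latest",  # placeholder
                resources=client.V1ResourceRequirements(
                    # One full GPU; with MIG or GPU time-sharing configured on the
                    # node pool, this same request maps to a GPU slice instead.
                    limits={"nvidia.com/gpu": "1"}
                ),
            )
        ],
    ),
)
client.CoreV1Api().create_namespaced_pod(namespace="default", body=pod)
```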
Combine the power of NVIDIA accelerated computing with Google Cloud’s Vertex AI, a fully managed, unified MLOps platform for building, deploying, and scaling AI models in production. Leverage the latest NVIDIA GPUs and NVIDIA AI software, like Triton™ Inference Server, within Vertex AI Training, Prediction, Pipelines, and Notebooks to accelerate generative AI development and deployment without the complexities of infrastructure management.
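As an illustrative sketch (the project, region, model ID, and machine shape are placeholders for your own environment), deploying a model to a GPU-backed Vertex AI endpoint with the google-cloud-aiplatform SDK looks roughly like this:

```python
# Minimal sketch: deploying a model to a GPU-backed Vertex AI endpoint.
# Project, region, model resource name, and machine shape are placeholders.
from google.cloud import aiplatform

aiplatform.init(project="my-project", location="us-central1")

model = aiplatform.Model(
    "projects/my-project/locations/us-central1/models/1234567890"  # placeholder
)
endpoint = model.deploy(
    machine_type="g2-standard-8",   # G2 VM backed by an NVIDIA L4 GPU
    accelerator_type="NVIDIA_L4",
    accelerator_count=1,
)
print(endpoint.resource_name)
```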
Leverage the NVIDIA RAPIDS™ Accelerator for Apache Spark to accelerate Spark and Dask workloads on Dataproc, Google Cloud’s fully managed data processing service, without code changes. This speeds up data processing, extract, transform, and load (ETL) operations, and machine learning pipelines while substantially lowering infrastructure costs. With the RAPIDS Accelerator for Spark, users can also speed up batch workloads within Dataproc Serverless without provisioning clusters.
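As a minimal sketch, the RAPIDS Accelerator is typically enabled through Spark configuration alone, so existing DataFrame and SQL code runs unchanged. The properties below are the standard plugin settings; the bucket path is a placeholder, and the example assumes the RAPIDS Accelerator jar is available on the cluster (Dataproc provides an initialization action for this).

```python
# Minimal sketch: enabling the RAPIDS Accelerator for Apache Spark in a
# PySpark session. On Dataproc, the same properties can be passed as
# cluster or job properties instead.
from pyspark.sql import SparkSession

spark = (
    SparkSession.builder
    .appName("rapids-etl")
    .config("spark.plugins", "com.nvidia.spark.SQLPlugin")  # loads the RAPIDS plugin
    .config("spark.rapids.sql.enabled", "true")
    .config("spark.executor.resource.gpu.amount", "1")      # one GPU per executor
    .getOrCreate()
)

# Existing DataFrame/SQL code runs unchanged; supported operators are
# transparently executed on the GPU.
df = spark.read.parquet("gs://my-bucket/events/")  # placeholder path
df.groupBy("user_id").count().show()
```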
Accelerate machine learning inference with NVIDIA AI on Google Cloud Dataflow, a managed service for executing a wide variety of data processing patterns, including both streaming and batch analytics. Users can optimize the inference performance of AI models using NVIDIA TensorRT’s integration with Apache Beam SDK and speed up complex inference scenarios within a data processing pipeline using NVIDIA GPUs supported in Dataflow.
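A minimal sketch of this pattern follows, assuming Beam's TensorRT engine handler and a prebuilt TensorRT engine file; the engine path and input shape are placeholders.

```python
# Minimal sketch: GPU-accelerated inference inside a Beam pipeline using
# the RunInference transform with Beam's TensorRT engine handler. On
# Dataflow, pair this with a GPU-enabled worker configuration (e.g., the
# worker_accelerator service option).
import numpy as np
import apache_beam as beam
from apache_beam.ml.inference.base import RunInference
from apache_beam.ml.inference.tensorrt_inference import TensorRTEngineHandlerNumPy

handler = TensorRTEngineHandlerNumPy(
    min_batch_size=1,
    max_batch_size=4,
    engine_path="gs://my-bucket/models/model.trt",  # placeholder prebuilt engine
)

with beam.Pipeline() as p:
    (
        p
        | "CreateInputs" >> beam.Create([np.zeros((3, 224, 224), dtype=np.float32)])
        | "Infer" >> RunInference(handler)
        | "Print" >> beam.Map(print)
    )
```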
Accelerate the path to deploying generative AI with NVIDIA NIM on Google Cloud Run, a fully managed, serverless compute platform for deploying containers on Google Cloud’s infrastructure. With support for NVIDIA GPUs in Cloud Run, users can leverage NIM to optimize performance and accelerate the deployment of generative AI models into production in a serverless environment that abstracts away infrastructure management.
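As a rough sketch of such a deployment using the google-cloud-run client: the image, project, and region are placeholders, and the GPU fields follow the Cloud Run v2 API under the assumption of a client version with Cloud Run GPU support.

```python
# Minimal sketch: deploying a NIM container to Cloud Run with a GPU.
# Field names follow the Cloud Run v2 API; the GPU-related fields assume
# a google-cloud-run release that includes Cloud Run GPU support.
from google.cloud import run_v2

client = run_v2.ServicesClient()
operation = client.create_service(
    parent="projects/my-project/locations/us-central1",  # placeholder
    service_id="nim-llama3",
    service=run_v2.Service(
        template=run_v2.RevisionTemplate(
            containers=[
                run_v2.Container(
                    image="nvcr.io/nim/meta/llama3-8b-instruct:latest",  # example NIM image
                    resources=run_v2.ResourceRequirements(
                        limits={"cpu": "8", "memory": "32Gi", "nvidia.com/gpu": "1"}
                    ),
                )
            ],
            node_selector=run_v2.NodeSelector(accelerator="nvidia-l4"),
        )
    ),
)
print(operation.result().uri)  # URL of the deployed serverless NIM service
```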
Get easy access to NVIDIA GPU capacity on Google Cloud for short-duration workloads like AI training, fine-tuning, and experimentation using Dynamic Workload Scheduler. With flexible scheduling and atomic provisioning, users can get access to the compute resources they need within services like GKE, Vertex AI, and Batch while enhancing resource utilization and optimizing costs associated with running AI workloads.
NVIDIA is collaborating with Google to launch Gemma, a newly optimized family of open models built from the same research and technology used to create the Gemini models. An optimized release with TensorRT-LLM enables users to develop with LLMs using only a desktop with an NVIDIA RTX™ GPU.
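As a hedged sketch of local development with an optimized Gemma checkpoint, assuming the TensorRT-LLM high-level LLM API (class names follow recent TensorRT-LLM releases and may differ by version):

```python
# Minimal sketch: generating text with a Gemma checkpoint through the
# TensorRT-LLM high-level API on a local NVIDIA RTX GPU. The model ID is
# illustrative; engine build/load details vary by TensorRT-LLM version.
from tensorrt_llm import LLM, SamplingParams

llm = LLM(model="google/gemma-2b")  # builds or loads an optimized engine

outputs = llm.generate(
    ["What is accelerated computing?"],
    SamplingParams(max_tokens=64),
)
print(outputs[0].outputs[0].text)
```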
RAPIDS cuDF is now integrated into Google Colab. Developers can instantly accelerate pandas code up to 50X on Google Colab GPU instances and continue using pandas as data grows—without sacrificing performance.
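Enabling the accelerator is a one-line change. In a notebook it is the %load_ext cudf.pandas magic; the plain-Python equivalent is sketched below with a placeholder dataset.

```python
# Minimal sketch: turning on the RAPIDS cuDF accelerator for pandas in a
# GPU runtime such as Google Colab. In a notebook cell, the equivalent is:
#   %load_ext cudf.pandas
import cudf.pandas
cudf.pandas.install()  # subsequent pandas imports are GPU-accelerated

import pandas as pd  # unchanged pandas code now runs on the GPU where supported

df = pd.read_csv("sample.csv")  # placeholder dataset
print(df.describe())
```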
The NVIDIA Inception program helps startups accelerate innovation with developer resources and training, access to cloud credits, exclusive pricing on NVIDIA software and hardware, and opportunities for exposure to the VC community.