AWS logo

Accelerate Innovation in the Cloud

Diagnosing cancer. Predicting hurricanes. Automating business operations. These are some of the breakthroughs possible when you use accelerated computing to unveil the insights hiding in vast volumes of data. Amazon Web Services (AWS) and NVIDIA have collaborated for over 13 years to deliver the most powerful and advanced GPU-accelerated cloud to help customers build a more intelligent future.

Power New Capabilities With AWS and NVIDIA

Healthcare

Healthcare

Deliver personalized medicine and accelerate breakthroughs in biomedical research with AWS and NVIDIA solutions.

Media and Entertainment

Media and Entertainment

Realize the potential of cloud computing for digital content creation. Adapt your resources as your studio’s demands grow, and access the best creative talent across the globe.

Financial Services

Financial Services

Boost risk management, improve data-backed decisions and security, and enhance customer experiences with generative AI, deep learning, machine learning, and natural language processing (NLP) solutions.

Digital Twins and the Metaverse

Digital Twins and the Metaverse

Harness the power of large-scale simulation for industrial and scientific applications.

Enterprise AI and Machine Learning

Enterprise AI and Machine Learning

Reduce development time, lower costs, improve accuracy and performance, and have more confidence in AI outcomes with NVIDIA solutions running on AWS.

High-Performance Computing

High-Performance Computing

Learn how AWS and NVIDIA high-performance computing (HPC) solutions are optimized to work together, cost-effectively solving the world’s most complex problems.

Explore Customer Stories

Read.ai logo

Video Call Transcription

Software company Read.ai built their video call transcription platform on NVIDIA® Riva and reduced costs by 20–30 percent using Amazon EC2 G5 instances powered by NVIDIA A10G Tensor Core GPUs.

Paige.ai logo

Machine Learning in Life Sciences

Life sciences company Paige is furthering cancer treatment with a hybrid machine learning workflow built using Amazon EC2 P4d instances powered by NVIDIA A100 Tensor Core GPUs.

Netflix logo

VFX Studio in the Cloud

Netflix deployed their visual effects (VFX) studio to facilitate remote collaboration among a global workforce using Amazon EC2 G5 instances powered by NVIDIA A10G GPUs.

Internal logo

Generative AI for Content

Iternal Technologies used Amazon EC2 Instances powered by NVIDIA GPUs to help their customers supercharge their marketing, improving ROI by 30X with generative AI. Because Iternal is part of NVIDIA Inception, they were among the first to gain access to NVIDIA Riva’s voice cloning capabilities to get a proof-of-concept generative AI voice product up and running in two weeks.

reezocar logo

HPC and Machine Learning for Retail

Automotive company Reezocar estimates vehicle repairs swiftly and accurately using AWS HPC and machine learning infrastructure powered by NVIDIA GPUs. With this infrastructure, the company can meticulously detect car dents and imperfections and estimate repair costs in milliseconds, helping to extend the serviceable life of vehicles.

Codeway logo

Generative AI for Gaming

Codeway optimized price performance for their generative AI application, Wonder, using NVIDIA GPU-powered Amazon EC2 G5 instances, saving 48 percent on compute costs.

NVIDIA Accelerated Infrastructure—From Cloud to Edge—on AWS

Amazon Elastic Cloud Compute (EC2)

Access a broad range of NVIDIA GPU-accelerated instances on Amazon EC2 on demand to meet the diverse computational requirements of AI, machine learning, data analytics, graphics, cloud gaming, virtual desktops, and HPC applications. Starting from single-GPU instances to thousands of GPUs in EC2 UltraClusters, AWS customers can provision the right-sized GPU to accelerate time to solution and reduce total costs of running their cloud workloads.

NVIDIA RTX™ technology, EC2 G5 instances
Amazon EC2 G5 With NVIDIA A10G

Featuring NVIDIA A10G Tensor Core GPUs and support for NVIDIA RTX™ technology, EC2 G5 instances are ideal for graphics-intensive applications like video editing, rendering, 3D visualization, and photorealistic simulations. Additionally, they can be used to accelerate AI inference and single-GPU AI training workloads.

NVIDIA T4G Tensor Core GPUs and AWS Graviton2
Amazon EC2 G5g With NVIDIA T4G

Featuring NVIDIA T4G Tensor Core GPUs and AWS Graviton2 processors, EC2 G5g instances are best suited for cloud game development and Android-in-the-cloud gaming services. They can also be used for cost-effective AI inference using Arm®-enabled software from the NVIDIA NGC™ catalog.

NVIDIA A100 40GB Tensor Core GPUs
Amazon EC2 P4d With NVIDIA A100 40GB

Featuring eight NVIDIA A100 40GB Tensor Core GPUs, EC2 P4d instances deliver the highest performance for AI and HPC. For multi-node AI training and distributed HPC workloads, you can scale from few to thousands of NVIDIA A100 GPUs in EC2 UltraClusters.

Amazon EC2 for deep learning and HPC applications
Amazon EC2 P5 With NVIDIA H100 80GB:

Tensor Core GPUs deliver the highest performance in Amazon EC2 for deep learning and HPC applications. They help you accelerate your time to solution by up to 6X compared to previous-generation GPU-based EC2 instances and reduce the cost to train machine learning models by up to 40 percent.

Simplify Development and Maximize Performance With NVIDIA-Optimized Software

NVIDIA-Optimized Software on AWS

Access the computational power of NVIDIA GPU-accelerated instances on AWS to develop and deploy your applications at scale with fewer compute resources, accelerating time to solution and reducing TCO. To maximize performance and developer productivity, NVIDIA offers a wide range of GPU-optimized software for a broad range of workloads, including data science, data analytics, AI and machine learning training, AI and machine learning inference, HPC, and graphics.

NVIDIA AI Enterprise.
NVIDIA NGC

NVIDIA NGC is the portal of enterprise services, software, management tools, and support for end-to-end AI and digital twin workflows. The NGC software catalog provides a range of resources that meet the needs of data scientists, developers, and researchers with varying levels of expertise, including containers, pretrained models, domain-specific SDKs, use case-based collections, and Helm charts for the fastest AI implementations. To take AI workloads to production with NGC software, you can access enterprise-grade support, training, and services with NVIDIA AI Enterprise.

NVIDIA AI Enterprise Support Cloud Services
NVIDIA AI Enterprise on AWS

NVIDIA AI Enterprise is a secure, end-to-end, cloud-native suite of AI software. It accelerates data science pipelines and streamlines the development, deployment, and management of predictive AI models to automate essential processes and deliver rapid insights from data. NVIDIA AI Enterprise includes an extensive library of full-stack software, including NVIDIA AI workflows, frameworks, pretrained models, and infrastructure optimization. Global enterprise support and regular security reviews ensure business continuity and that AI projects stay on track.

Virtual Workstations on AWS and NVIDIA
NVIDIA RTX Virtual Workstation

The NVIDIA RTX Virtual Workstation (RTX vWS) for GPU-accelerated graphics helps creative and technical professionals maximize their productivity from anywhere by providing  access to the most demanding professional design and engineering applications from the cloud.  Amazon EC2 G5 (NVIDIA A10G) and G4dn (NVIDIA T4) instances, combined with the RTX vWS Amazon Machine Image (AMI), enables the industry’s most advanced 3D graphics platform, including the latest real-time ray tracing with RTX technology in virtual machines.

Developer Resources and Quick-Start Guides

Image of multiple diacoms of the human brain.
MONAI Label Workshops

Learn how you can make use of MONAI—an open-source AI framework for healthcare—in your work. Join us to get a hands-on experience.

Image of molecules
BioNeMo Now on AWS

Researchers and developers at leading pharmaceutical and techbio companies can now easily deploy NVIDIA Clara™ software and services, including NVIDIA BioNeMo™, for accelerated healthcare through AWS.

Image of individuals all in different technology roles to show how a startup could have many different types of focus.
Accelerate Your Startup

Explore the program that provides cutting-edge startups around the world with critical access to go-to-market support, technical expertise, training, and funding opportunities.

NVIDIA and Amazon joint logo together.
AI Capabilities Using TensorRT-LLM

Previously, creating detailed product listings required significant time and effort for sellers, but this simplified process gives them more time to focus on other tasks. The NVIDIA TensorRT-LLM software is available today on GitHub and can be accessed through NVIDIA AI Enterprise, which offers enterprise-grade security, support, and reliability for production AI.

Image to show how the cloud connects people.
NVIDIA CloudXR

NVIDIA CloudXR™ is NVIDIA’s extended reality (XR) streaming technology, built on RTX and RTX Virtual Workstation software. By using CloudXR alongside Amazon NICE DCV streaming protocols, you can use on-demand compute resources for all aspects of your immersive application development.

Multiple servers to show how Sagemaker connects.
NVIDIA Triton Inference Server in Amazon SageMaker

This blog provides an overview of NVIDIA Triton Inference Server and SageMaker, shows the benefits of using Triton Inference Server containers, and showcases how easy it is to deploy your own machine learning models. To work from a sample notebook that supports this blog post, download it here.

NVIDIA Riva speech skills on Amazon EKS
NVIDIA Riva at Scale With Amazon EKS

This step-by-step guide show you how to deploy and scale NVIDIA Riva speech skills on Amazon EKS with Traefik-based load balancing.

NVIDIA Triton Inference Server,
Amazon Music Uses SageMaker With NVIDIA to Optimize Machine Learning Training and Inference

Take a look inside the journey Amazon Music took to optimize performance and cost using SageMaker, NVIDIA Triton Inference Server, and NVIDIA TensorRT®. We show how the seemingly simple, yet intricate, search bar works, ensuring a seamless Amazon Music experience with little-to-zero typo delays and relevant real-time search results.

NVIDIA Triton Inference Server and NVIDIA TensorRT
NVIDIA Clara Parabricks on AWS

Amazon.com, one of the most visited ecommerce websites in the world, uses an AI model that automatically corrects misspelled words in search queries to let customers more effortlessly shop. Amazon measures the success of their accelerated search results based on latency—how fast typos are corrected—and the number of successful sessions.

Access the Power of AWS and NVIDIA

Amazon EC2 P5 Instances

NVIDIA AI Enterprise

NVIDIA RTX Virtual Workstations

Select Location
Middle East