Speech and Translation AI
Build and deploy fully customizable multilingual speech and translation AI for applications based on large language models and retrieval-augmented generation.
NVIDIA® Riva is a set of GPU-accelerated multilingual speech and translation microservices for building fully customizable, real-time conversational AI pipelines. Riva includes automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) and is deployable in all clouds, in data centers, at the edge, and on embedded devices. With Riva, organizations can add speech and translation interfaces with large language models (LLMs) and retrieval-augmented generation (RAG) to transform chatbots into engaging, expressive multilingual assistants and avatars.
Achieve high multilingual transcription and translation accuracy, and provide out-of-the-box, expressive, professional female and male voices with state-of-the-art models pretrained on thousands of hours of audio on NVIDIA supercomputers.
Customize ASR pipelines for different languages, accents, domains, vocabulary, and context to get the best possible accuracy for your use case, and customize TTS pipelines for the brand voice and intonation you want.
Provide consistent experiences to hundreds of thousands of concurrent users with higher inference performance than existing technology, and deploy anywhere—in data centers, on premises, in the cloud, at the edge, or in embedded devices.
Accelerate the development and deployment of production-grade, multilingual, voice-enabled AI applications with NVIDIA AI Enterprise, an end-to-end, cloud-native software platform for enterprise-grade secure and stable generative AI.
Experience the new ASR, TTS, and NMT microservices, now available and designed to provide optimized AI inference for speech and translation AI. These include Parakeet models that deliver record-setting ASR accuracy and performance.
Find out how industry leaders are driving innovation with Riva.
Companies are deploying Q&A assistants to automatically address the queries of millions of customers and employees around the clock. With Riva’s speech and translation AI microservices, these assistants provide helpful and natural responses at every turn of the conversation despite background noise, poor sound quality, and diverse speaker dialects and accents.
Use the right tools and technologies to build and deploy fully customizable, multilingual speech and translation AI applications.
Explore everything you need to start developing with NVIDIA Riva, including the latest documentation, tutorials, technical blogs, and more.
Talk to an NVIDIA product specialist about moving from pilot to production with the security, API stability, and support of NVIDIA AI Enterprise.