Riva provides deep learning-based automatic speech recognition (ASR), text-to-speech (TTS), and neural machine translation (NMT) models for AI practitioners and developers. Together, these models form the voice interface of speech AI applications such as call center agent assist, digital assistants, video call transcription, and AI superchats driven by large language models (LLMs) and retrieval-augmented generation (RAG).
ASR converts speech to text and is usually the first step in a speech pipeline, so its transcription accuracy influences every downstream task. TTS generates human-like speech from text. NMT translates text from one language to another.
Riva is used across industries, from telecommunications and finance to healthcare, retail, and automotive, wherever companies interact with customers.