Modern speech AI systems use deep neural network (DNN) models trained on massive datasets. Over time, the size of speech AI models has grown so much that training such models can take weeks of intensive compute time, even when using deep learning frameworks, such as PyTorch, TensorFlow, and MXNet, on high-performance GPUs.
NVIDIA speech and translation AI offers pretrained, production-quality models in the NVIDIA NGC™ catalog that are trained on several public and proprietary datasets for over hundreds of thousands of hours on NVIDIA DGX™ systems.