Once generative AI models are built, fine-tuned, and trained, NeMo enables seamless deployment through optimized inference on virtually any data center or cloud. NeMo Retriever, a collection of generative AI microservices, provides world-class information retrieval with the lowest latency, highest throughput, and maximum data privacy, enabling organizations to generate insights in real time. NeMo Retriever enhances generative AI applications with enterprise-grade retrieval-augmented generation (RAG), which can be connected to business data wherever it resides.
NVIDIA DGX Cloud is an AI-training-as-a-service platform, offering a serverless experience for enterprise developers that’s optimized for generative AI. Enterprises can experience performance-optimized, enterprise-grade NVIDIA AI Foundation models directly from a browser and customize them using proprietary data with NeMo on DGX Cloud.
NVIDIA AI Enterprise for Accelerated Data Science and Logistics Optimization
The NVIDIA AI Enterprise software suite enables quicker time to results for AI and machine learning initiatives, while improving cost-effectiveness. Using analytics and machine learning, telecom operators can maximize the number of completed jobs per field technician, dispatch the right personnel for each job, dynamically optimize routing based on real-time weather conditions, scale to thousands of locations, and save billions of dollars in maintenance.
AT&T is transforming their operations and enhancing sustainability by using NVIDIA-powered AI for processing data, optimizing fleet routing, and building digital avatars for employee support and training. AT&T first adopted the NVIDIA RAPIDS™ Accelerator for Apache Spark to capitalize on energy-efficient GPUs across their AI and data science pipelines. Of the data and AI pipelines targeted with Spark RAPIDS, AT&T saves about half of their cloud computing spend and sees faster performance, while reducing their carbon footprint.
AT&T, which operates one of the largest field dispatch teams, is currently testing NVIDIA® cuOpt™ software to to handle more complex technician routing and optimization challenges. In early trials, cuOpt delivered solutions in 10 seconds, while the same computation on x86 CPUs took 1,000 seconds. The results yielded a 90 percent reduction in cloud costs and allowed technicians to complete more service calls each day.
Quantiphi, an innovative AI-first digital engineering company, is working with leading telcos to build custom LLMs to support field technicians. Through LLM-powered virtual assistants acting as copilots, Quantiphi is helping field technicians resolve network-related issues and manage service tickets raised by end customers.