Making Large Language Models and Retrieval-Augmented Generation Work With Ease (Presented by Softserve, Inc.)
AVP of AI and Data Science, Softserve, Inc.
Large language models (LLMs) open new possibilities for engaging, intelligent conversational systems. However, productionizing these models, managing them, and ensuring they work to your advantage can be challenging. Two key strategies that help are retrieval-augmented generation (RAG) workflows and NeMo Guardrails.
Retrieval-augmented generation (RAG) is a powerful technique that combines the strengths of retrieval-based and generative models by grounding the LLM in customers' proprietary data. At inference time, the workflow retrieves relevant documents and supplies them to the LLM as additional context in the prompt, significantly improving answer quality and enabling the model to handle a wider range of tasks with greater flexibility.
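The retrieve-then-prompt workflow described above can be sketched in a few lines. This is a minimal illustration, not a production implementation: the toy keyword retriever stands in for a vector store, and the final LLM call is omitted since any model could consume the assembled prompt.

```python
# Minimal RAG sketch: a toy keyword retriever plus prompt assembly.
# In production, retrieve() would query a vector store over proprietary data,
# and the returned prompt would be sent to an LLM for inference.

def retrieve(query: str, documents: list[str], k: int = 2) -> list[str]:
    """Rank documents by word overlap with the query (toy relevance scoring)."""
    terms = set(query.lower().split())
    ranked = sorted(
        documents,
        key=lambda d: len(terms & set(d.lower().split())),
        reverse=True,
    )
    return ranked[:k]

def build_prompt(query: str, documents: list[str]) -> str:
    """Augment the user query with retrieved context before LLM inference."""
    context = "\n".join(f"- {d}" for d in retrieve(query, documents))
    return (
        "Answer using only the context below.\n"
        f"Context:\n{context}\n"
        f"Question: {query}"
    )

docs = [
    "Our refund policy allows returns within 30 days.",
    "Support hours are 9am to 5pm on weekdays.",
    "Shipping is free on orders over $50.",
]

prompt = build_prompt("What is the refund policy for returns?", docs)
print(prompt)
```

Because the model answers from the retrieved context rather than from its parametric memory alone, the same base LLM can serve many domains simply by swapping the document collection.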
NeMo Guardrails provides a comprehensive framework for integrating programmable guardrails into LLM-based conversational systems. These guardrails act as protective measures: they mitigate attacks such as prompt injection, keep conversations on approved topics, and address LLM issues like hallucinations, helping ensure a trustworthy and secure conversational system.
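In NeMo Guardrails, such protective flows are defined in Colang, the toolkit's dialogue modeling language. The fragment below is an illustrative sketch of a topical rail (the example utterances and flow name are invented for illustration): it teaches the system to recognize an off-limits request and respond with a refusal instead of passing the request to the LLM.

```colang
# Illustrative Colang rail: keep the assistant away from financial advice.

define user ask for financial advice
  "Which stocks should I buy?"
  "Should I invest in crypto?"

define bot refuse financial advice
  "I'm not able to provide financial advice. Please consult a licensed professional."

define flow block financial advice
  user ask for financial advice
  bot refuse financial advice
```

A rails configuration like this is loaded alongside the application's LLM, so the guardrails run as a programmable layer around the model rather than relying on prompt instructions alone.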