Training Optimization for LLMs with NVIDIA NeMo and AWS
, Senior Applied Scientist, Amazon Store Foundational AI
, Senior Manager, Amazon Store Foundational AI
Training a large language model at scale while ensuring efficiency and reliability poses numerous challenges. In this presentation, we'll share our experience training LLMs at Amazon Search using the NVIDIA NeMo Framework in collaboration with AWS. We'll discuss selecting the appropriate training framework, establishing the training infrastructure by harnessing the power of NeMo and AWS, and implementing zero-touch training through automated job monitoring and recovery mechanisms. We'll also share practical insights into tuning hyperparameters and selecting model architectures to optimize training efficiency. Finally, we'll examine potential paths to further streamline the training process for large language models.