Scaling and Optimizing Your LLM Pipeline for End-to-End Efficiency
, Product Manager, AI on Google Kubernetes Engine, Google
, Cloud Architect, Google
Are you having trouble getting large language models (LLMs) to work in your organization? You're not alone. In this session, we'll look at how to deploy an open-source LLM on Google Kubernetes Engine (GKE). We'll show data scientists and machine learning engineers how to use NVIDIA NeMo and TensorRT-LLM (TRT-LLM) with notebooks running on GKE, and how GKE's orchestration capabilities make running AI workloads efficient and convenient. We'll also demonstrate how to train and tune an LLM using NeMo, and give a live technical demo of how data science teams can serve these models for inference on GPUs with TRT-LLM and GKE.