Transform the Retail Experience: Architecting LLM Inferencing Systems for Edge Deployment (Presented by Supermicro)
Senior Director, Edge AI Solutions, Supermicro, Inc.
As large language models (LLMs) are rapidly integrated into every industry, edge AI is crucial for enabling faster, more secure, and efficient data processing, particularly for applications where real-time analytics and decision-making are essential. We'll explore the transformative potential of deploying powerful systems at the edge to enhance in-store experiences; these systems must deliver high-quality responses with minimal latency. We'll showcase a retail concierge solution, powered by an LLM inference engine, that enables customers to interact live with an avatar. Finally, we'll provide an overview of designing solutions that address the unique environmental considerations at the edge, including form factor, power delivery, network latency, and serviceability.
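To illustrate the latency budget such a concierge must meet, the minimal sketch below measures time to first token and total generation time for a streamed response. It assumes an OpenAI-compatible inference server running on the edge system; the endpoint URL, model name, and prompt are hypothetical placeholders, not details from the session.

```python
# Minimal latency-measurement sketch for an edge LLM inference endpoint.
# Assumes an OpenAI-compatible server is reachable at BASE_URL; the URL,
# model name, and prompt are illustrative placeholders.
import time

from openai import OpenAI

BASE_URL = "http://edge-node.local:8000/v1"   # hypothetical edge endpoint
MODEL = "retail-concierge-llm"                # hypothetical model name

client = OpenAI(base_url=BASE_URL, api_key="not-needed-for-local")

prompt = "Where can I find running shoes in this store?"

start = time.perf_counter()
first_token_at = None
reply_parts = []

# Stream the response so the avatar can begin speaking as soon as the
# first tokens arrive, rather than waiting for the full completion.
stream = client.chat.completions.create(
    model=MODEL,
    messages=[{"role": "user", "content": prompt}],
    stream=True,
)
for chunk in stream:
    delta = chunk.choices[0].delta.content
    if delta:
        if first_token_at is None:
            first_token_at = time.perf_counter()
        reply_parts.append(delta)

end = time.perf_counter()
if first_token_at is not None:
    print(f"Time to first token: {(first_token_at - start) * 1000:.0f} ms")
print(f"Total generation time: {(end - start) * 1000:.0f} ms")
print("Reply:", "".join(reply_parts))
```

Time to first token is the metric that matters most for a conversational avatar, since it determines how quickly the concierge appears to respond; streaming lets speech synthesis begin before the full answer is generated.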