Name: CUTLASS: A Performant, Flexible, and Portable Way to Target Hopper Tensor Cores | GTC 24 2024 | NVIDIA On-Demand
Uploaded: 2024-03-19T15:00:00Z
Duration: 2976 s
Description: NVIDIA’s H100 introduced fourth-generation Tensor Cores to GPU computing, with over twice the peak performance of the previous generation

Watch NVIDIA CEO Jensen Huang's GTC keynote replay to catch all the announcements and more.

Watch Now

NVIDIA 隨選內容

This site requires Javascript in order to view all its content. Please enable Javascript in order to access all the functionality of this web site. Here are the instructions how to enable JavaScript in your web browser.

詳細內容

字幕

NVIDIA’s H100 introduced fourth-generation Tensor Cores to GPU computing, with over twice the peak performance of the previous generation. This session will build on our GTC’23 session. We'll describe how the latest version of CUTLASS leverages Hopper features for peak performance, covering major new features since its release last year including convolutions, fused epilogue visitors, Python interface, and more. Our discussion is aimed at those who wish to implement custom kernels for machine learning and HPC applications that achieve peak performance.

活動:

日期:

領域:

技術水平需求:

產業:

NVIDIA technology: CUDA,Hopper

語言: English

地區:

Fill out this form to enjoy this content

Section

Section

名

姓

電子郵件

組織 / 大學名稱

我願意收到下列有關 NVIDIA 的最新消息與公告：

企業業務解決方案

開發人員技術和工具

(非必選) 您可以隨時取消訂閱。

NVIDIA 隱私權政策

Follow Nvidia