2024 LLVM Dev Mtg - Simplifying GPU Programming with Parametric Tile-Level Tensors In Mojo

Name: 2024 LLVM Dev Mtg - Simplifying GPU Programming with Parametric Tile-Level Tensors In Mojo
Uploaded: Dec 5, 2024
Duration: 1554 s

LLVM18.3K subscribers

1.6K views

Dec 5, 2024

25:54

2024 LLVM Developers' Meeting https://llvm.org/devmtg/2024-10/ ------ Simplifying GPU Programming with Parametric Tile-Level Tensors In Mojo Speaker: Ahmed Taei ------ Slides: https://llvm.org/devmtg/2024-10/slides/techtalk/Taei-Simplifying-GPU-Programming-with-Parametric-Tile-Level-Tensors-In-Mojo.pdf ----- Today’s AI GPU workloads are dominated by operations such as matrix multiplication (matmul) and flash-attention, with state-of-the-art implementations designed to leverage the compute and memory hierarchy of modern GPUs at a tile-level granularity. Expressing these algorithms at this level, rather than using the low-level SIMT (Single Instruction, Multiple Threads) model, presents a significant challenge for kernel developers. In this talk, we will demonstrate how Mojo, a systems programming language built on MLIR, addresses this challenge through its powerful metaprogramming capabilities. Mojo enables the creation of simple yet powerful composable abstractions for parametric Tensor types, which can be tiled, distributed across the compute hierarchy, and vectorized. Additionally, the language provides GPU library authors with direct access to MLIR, making it easier for library authors to specialize high-level library operations for specific hardware targets, which facilitates the efficient development of state-of-the-art GPU kernels that outperform vendor libraries like cuBLAS. ----- Videos Edited by Bash Films: http://www.BashFilms.com

Download

0 formats

No download links available.