Back to Browse

ML Performance Reading Group Session 8: Megatron-LM

1.2K views
Mar 10, 2025
1:09:20

ML Performance Reading Group Session 8, where we covered the paper "Megatron-LM: Training Multi-Billion Parameter Language Models Using Model Parallelism" (https://arxiv.org/abs/1909.08053) Presenter: Daniel Vega-Myhre

Download

1 formats

Video Formats

360pmp4114.3 MB

Right-click 'Download' and select 'Save Link As' if the file opens in a new tab.

ML Performance Reading Group Session 8: Megatron-LM | NatokHD