Back to Browse

Mastering Nvidia Nsight GPU Profiling

2.0K views
Jan 20, 2026
1:01:18

Talk #0: Introductions and Meetup Updates by Chris Fregly Best Selling O'Reilly book, "AI Systems Performance Engineering" is now available (eBook and physical!), 1000 pages, 200 figures, 700 examples!!! Amazon: https://www.amazon.com/Systems-Performance-Engineering-Optimizing-Algorithms/dp/B0F47689K8/ GitHub: https://github.com/cfregly/ai-performance-engineering Talk #1: Diving deep into NVIDIA Nsight Systems GPU profiling tools for PyTorch LLM and computer vision workloads by Chaim Rand In this talk, Chaim Rand (repeat speaker on this webinar series!) revisits the NVIDIA Nsight profiling tools to augment the PyTorch Profiler for LLM and vision workloads. This talk is based on Chaim's recent blog posts on Optimizing Data Transfer in AI/ML Workloads part 1 (https://chaimrand.medium.com/optimizing-data-transfer-in-ai-ml-workloads-60df62fe1278) and part 2 (https://chaimrand.medium.com/optimizing-data-transfer-in-batched-ai-ml-inference-workloads-a9f4165208b8). Zoom link: https://us02web.zoom.us/j/82308186562 Related Links Github Repo: http://github.com/cfregly/ai-performance-engineering/ O'Reilly Book: https://www.amazon.com/Systems-Performance-Engineering-Optimizing-Algorithms/dp/B0F47689K8/ YouTube: https://www.youtube.com/@AIPerformanceEngineering Generative AI Free Course on DeepLearning.ai: https://bit.ly/gllm

Download

0 formats

No download links available.

Mastering Nvidia Nsight GPU Profiling | NatokHD