55:21Optimizing AI Inference for Heterogeneous Clusters by Natalie Serrino, Founder @ Gimlet LabsAI Performance Engineering886 views·3 weeks ago
1:16:45NVIDIA GTC 2026 Conf Recap + Inference Engines + Scaling Disagg Prefill-Decode + RadixAttentionAI Performance Engineering570 views·1 month ago
1:16:07OpenClawMCP for AI Systems Performance Tuning + NVFP4 Low Precision AI System OptimizationsAI Performance Engineering858 views·2 months ago
1:05:55Advanced and Accelerated Data Curation + Visualizations for LLMs with NVIDIA CuML, DBSCAN, and tSNEAI Performance Engineering209 views·5 months ago
14:46Automated Browser Use with Amazon AGI by Antje BarthAI Performance Engineering128 views·5 months ago
1:39:13Speed of Light Inference w NVIDIA + AMD GPUs and Modular by Abdul Dakkak, Head of Gen AI @ ModularAI Performance Engineering1.1K views·6 months ago
1:30:36AI-Powered GPU Kernel Optimization(Mako.dev) + Distributed PyTorch with nbdistributed (Hugging Face)AI Performance Engineering1.2K views·6 months ago
1:04:49NVIDIA Dynamo + Disaggregated Prefill-Decode LLM Serving + PyTorchCUDA Performance with LuminalAI Performance Engineering1.2K views·8 months ago
1:22:21Maximize LLM Inference Performance + Auto-ProfileOptimize PyTorchCUDA CodeAI Performance Engineering1.7K views·8 months ago
1:25:13DynamicAdaptive RL-based Inference CUDA Kernel Optimization +Accelerated PyTorch +Modular MojoMAXAI Performance Engineering926 views·9 months ago
1:22:57AI Agent Inference Performance Optimizations + vLLM vs. SGLang vs. TensorRT w Charles Frye (Modal)AI Performance Engineering2.2K views·11 months ago
1:32:34PyTorch Data Loader Tuning + GPU Cross-Architecture Optimizations CUDA and AMDAI Performance Engineering839 views·11 months ago
1:11:39Nvidia GTC 2025 Recap + PyTorch Model Tuning +AI Systems Performance Engineering TipsAI Performance Engineering2.0K views·1 year ago
1:23:30GPUs @ KubeCon 2024 + New DeepLearning.ai Data Engineering Course + LLMs with Amazon EKSRay ServeAI Performance Engineering743 views·1 year ago
1:21:34Quantum Artificial General Intelligence (AGI) + Multi-Modal Chatbot + Karini GenAI SaaS Startup!AI Performance Engineering787 views·1 year ago
1:00:32Hands-on with Devin.Al and Crew.AI + Text-to-Video GenAI Pipelines with Vikit.AIAI Performance Engineering814 views·1 year ago
2:13:20Segment Anything Model 2 +Building XR Applications +Code-Savvy Assistants +Multi-Modal RAG EmbeddingAI Performance Engineering530 views·1 year ago
1:05:41Multi-Modal RAG - Segment Anything Model v2, ImageBind, Embeddings, Fine-TuningAI Performance Engineering933 views·1 year ago
1:24:37Multi-Modal RAG w Milvus + Multi-Agent Optimization w DSPy + LLM DistillationAI Performance Engineering1.2K views·1 year ago
1:06:43From RLHF with PPODPO to ORPO + How to build ORPO on TrainiumNeuron SDKAI Performance Engineering946 views·1 year ago
7:23Databricks Data + AI Summit 2024 Highlights - LLM Performance, Tracing, Debugging, SQLAI Performance Engineering799 views·1 year ago
5:02Apple Reveals Foundation Model Details Datasets, Frameworks, and Evaluation Benchmarks!AI Performance Engineering651 views·1 year ago
1:08:11Mistral AI Updates incl Mixtral 8x22B + OpenLLMetry Evaluation OptimizationAI Performance Engineering598 views·1 year ago
1:02:06Anthropic 2024 Updates including Claude 3 + GenAI Observability and LLM Evaluation with TrueraAI Performance Engineering1.2K views·2 years ago
1:05:45Nvidia GTC 2024 Recap + Generative AI Live Demo w Nvidia Jetson Edge GPU Device + Nvidia H200, B200AI Performance Engineering450 views·2 years ago
1:00:13Advanced RAG by Jay Alammar (Cohere) + Parameter-Efficient Fine-Tuning (PEFT)AI Performance Engineering2.9K views·2 years ago