
Large-Scale Data Curation for LLM Training

349 views
Aug 22, 2025
1:25:56

We are happy to share the recording of the second session in the webinar series on training large language models (LLMs) from scratch, jointly organized by NVIDIA and C-DAC, Pune. This session covers large-scale data curation for LLM training with NVIDIA NeMo Curator, highlighting how scalable pipelines and high-quality dataset preparation play a key role in building accurate and robust AI applications. #NPSF #GPU #CDACPune #HPCAI #AI #PARAMSiddhiAI #LLM

