NatokHD
Back to Browse
Lec 60 Reinforcement Learning for Aligning Large Language Models
NPTEL - Indian Institute of Science, Bengaluru
87.4K subscribers
Share
1.0K views
Feb 23, 2026
26:19
RLHF, PPO, DPO, preference learning
Download
0 formats
No download links available.
Lec 60 Reinforcement Learning for Aligning Large Language Models | NatokHD