W4_L2: More on dynamic programming (DP)
Welcome to Week 4 Lecture 2 of the course "Special topics in ML (Reinforcement Learning)" by Prof. Balaraman Ravindran. Full Course: https://study.iitm.ac.in/ds/course_pages/BSDA5007.html Video Overview This lecture explores advanced topics in dynamic programming for reinforcement learning, focusing on asynchronous DP methods that update states selectively rather than all at once. It also introduces Real-Time Dynamic Programming (RTDP), an online variant designed for large state spaces, and explains Generalized Policy Iteration (GPI), the unifying framework that ties policy evaluation and improvement together. About IIT Madras' online Bachelor of Science programme IIT Madras offers four-year BS programmes that aim to provide quality education to all, irrespective of age, educational background, or location. The BS programme has multiple levels, which provide flexibility to students to exit at any of these levels. Depending on the courses completed and credits earned, the learner can receive a Foundation Certificate from IITM CODE (Centre for Outreach and Digital Education), Diploma(s) from IIT Madras, or BSc/BS Degrees from IIT Madras. For more details, Visit: https://www.iitm.ac.in/academics/study-at-iitm/non-campus-bs-programmes #reinforcementlearning #dynamicprogramming #asynchronousdp #rtdp #gpi #mdp #rlmethods #machinelearning #iitmadrasbs
Download
0 formatsNo download links available.