Back to Browse

W4_L1: Dynamic programming (DP): value iteration

3.7K views
Aug 3, 2023
19:39

Welcome to Week 4 Lecture 1 of the course "Special topics in ML (Reinforcement Learning)" by Prof. Balaraman Ravindran. Full Course: https://study.iitm.ac.in/ds/course_pages/BSDA5007.html Video Overview This lecture introduces the value iteration algorithm, a fundamental dynamic programming method in reinforcement learning. It explains how value iteration combines policy evaluation and improvement into a single update step, converging efficiently toward the optimal value function. The session also includes a practical example to illustrate how value iteration operates in real environments. About IIT Madras' online Bachelor of Science programme IIT Madras offers four-year BS programmes that aim to provide quality education to all, irrespective of age, educational background, or location. The BS programme has multiple levels, which provide flexibility to students to exit at any of these levels. Depending on the courses completed and credits earned, the learner can receive a Foundation Certificate from IITM CODE (Centre for Outreach and Digital Education), Diploma(s) from IIT Madras, or BSc/BS Degrees from IIT Madras. For more details, Visit: https://www.iitm.ac.in/academics/study-at-iitm/non-campus-bs-programmes #reinforcementlearning #valueiteration #dynamicprogramming #mdp #optimalvaluefunction #rlbasics #machinelearning #iitmadrasbs

Download

0 formats

No download links available.

W4_L1: Dynamic programming (DP): value iteration | NatokHD