Back to Browse

34 - Q learning

216 views
Nov 3, 2021
5:05

Policy iteration is a method for finding a solution to a Markov decision process. Deep Q-learning introduces a practical, model-free, way to determine the value function in off-policy settings. Learn more about the Duckietown massive online open course "Self-Driving Cars with Duckietown" on https://www.duckietown.org/mooc

Download

0 formats

No download links available.

34 - Q learning | NatokHD