A maze example using Q learning. Introducing the updating rule in Q learning.
If you like this, please like my code on Github as well.
Code: https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow
Support me by Patreon: https://www.patreon.com/morvan