Discuss the on policy algorithm Sarsa and Sarsa(lambda) with eligibility trace. Take about why he Sarsa(lambda) is more efficient.
If you like this, please like my code on Github as well.
Code: https://github.com/MorvanZhou/Reinforcement-learning-with-tensorflow
Support me by Patreon: https://www.patreon.com/morvan