50:19Deep Reinforcement Learning From Human Preferences in tensorflowDaniel Eid1.9K views·4 years ago
1:05:09Pytorch Continuous A2C RNN agent, bonus sample code, and 'Deep Mimic' multi skill decoder tinkeringDaniel Eid867 views·4 years ago