Genetic Algorithm wrapped DQN preserves learning in Continual RL

Name: Genetic Algorithm wrapped DQN preserves learning in Continual RL
Uploaded: Mar 16, 2022
Duration: 43 s
Description: PPO model loaded for tested after training on visual fox task. The fox has to reach the closest target when it spawns in. A reward of +1 is given if it runs into the correct target, -1 if the incorrect target, and -0.01 for every action step.

Rachel St Clair290 subscribers

159 views

Mar 16, 2022

0:43

PPO model loaded for tested after training on visual fox task. The fox has to reach the closest target when it spawns in. A reward of +1 is given if it runs into the correct target, -1 if the incorrect target, and -0.01 for every action step.

Download

0 formats

No download links available.