A demo for the tool-use sheep environment I made in Unity3D with ML-Agents to test if reinforcement learning can control an agent's navigation to a target that can only be retrieved by using an environmental tool.
The agent learns to navigate to tool and jump on it in order to get to the target, but does not always achieve this quickly. There is some evidence that the agent learns to push the tool.