Deep reinforcement learning
In this environment, The agents must move its body toward the goal direction without falling. The environment has 12 agents, each observes a state with length 129, and outputs a vector of action of size 12. This task is episodic.
Untrained Agents in Tennis Environment
Testing the Agents after training the agents for 1800 episodes
Testing on Single Agent after training multiple agents for 50 episodes
Testing on Multiple Agents after training multiple agents for 50 episodes
An A* Curriculum Approach to Reinforcement Learning for RGBD Indoor Robot Navigation
Authors: Kaushik Balakrishnan, Punarjay Chakravarty, and Shubham Shrivastava