Deep-Reinforcement-Learning-Rainbow-Navigation

The state space has 37 dimensions, covering the agent's velocity and ray-based perception of nearby objects.

The agent has 4 available actions: move forward, move backward, turn left, and turn right.
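As a rough illustration of how an agent might choose among these four actions during training, here is a minimal epsilon-greedy sketch. The index-to-action ordering and the name `select_action` are assumptions made for this example; they are not taken from `dqnagent.py`, and the environment's actual ordering may differ.

```python
import random

# Assumed index-to-action mapping, for illustration only.
ACTIONS = ("forward", "backward", "turn_left", "turn_right")


def select_action(q_values, epsilon, rng=random):
    """Return an action index chosen epsilon-greedily from 4 Q-values."""
    if len(q_values) != len(ACTIONS):
        raise ValueError("expected one Q-value per action")
    if rng.random() < epsilon:
        # Explore: pick uniformly among all actions.
        return rng.randrange(len(ACTIONS))
    # Exploit: pick the action with the highest estimated value.
    return max(range(len(ACTIONS)), key=lambda a: q_values[a])
```

With `epsilon=0.0` the choice is always greedy; with `epsilon=1.0` it is always random. A typical training loop would decay `epsilon` between those extremes over time.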

The goal is to collect as many yellow bananas as possible and to avoid the blue bananas.

The environment is considered solved when the agent's average score over 100 consecutive episodes is above 13. One episode lasts for 300 frames.
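The solved criterion can be checked with a rolling window over episode scores. The sketch below is one way to do it; the class name `SolvedTracker` and its defaults are hypothetical and are not part of `runner.py`.

```python
from collections import deque


class SolvedTracker:
    """Track episode scores and report when the rolling average passes a target."""

    def __init__(self, target=13.0, window=100):
        self.target = target
        # Only the most recent `window` scores are kept.
        self.scores = deque(maxlen=window)

    def add(self, score):
        """Record one episode score and return whether the task is solved."""
        self.scores.append(score)
        return self.is_solved()

    def average(self):
        """Mean of the scores currently in the window (0.0 if empty)."""
        return sum(self.scores) / len(self.scores) if self.scores else 0.0

    def is_solved(self):
        # Require a full window so early lucky episodes do not count.
        full = len(self.scores) == self.scores.maxlen
        return full and self.average() > self.target
```

Calling `tracker.add(score)` at the end of each episode returns `True` once 100 episodes have been recorded and their average is above 13.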

First, clone the Udacity deep reinforcement learning repo and navigate to its directory, then run:

cd python
pip install .

This installs the required dependencies.

Download the Banana world Unity ML-Agents environment from one of the following links:

Put it in the root of the cloned project folder and unzip the file.

You should now be able to build and run the project.

Run the following command to train the agent:

python runner.py
