Udacity's Deep Reinforcement Learning Nanodegree Project 1: Navigation

Program Details

Trains an agent to collect yellow bananas and avoid purple bananas.

State and action space details

The environment provides the state as a 37-dimensional vector containing the agent's velocity and ray-based perception of objects around the agent's forward direction. The environment gives a reward of +1 for collecting a yellow banana and -1 for collecting a purple banana.

The agent returns an integer in [0, 3] representing one of the following actions:

0 - move forward.
1 - move backward.
2 - turn left.
3 - turn right.
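
The action encoding above can be written down as a small enum. This is only an illustrative sketch: the names `Action` and `describe` are hypothetical and do not come from the repository's code.

```python
from enum import IntEnum


class Action(IntEnum):
    """Hypothetical names for the four discrete actions listed above."""
    FORWARD = 0
    BACKWARD = 1
    TURN_LEFT = 2
    TURN_RIGHT = 3


def describe(action_index: int) -> str:
    """Return a human-readable label for an action index in [0, 3]."""
    # Action(...) raises ValueError for indices outside the action space.
    return Action(action_index).name.lower().replace("_", " ")
```

For example, `describe(2)` gives `"turn left"`, and an out-of-range index such as `describe(4)` raises `ValueError`.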

The environment is considered solved when the average cumulative reward over 100 consecutive episodes is above 13. The current agent solves the environment after around 550 episodes.
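
The solve criterion can be checked with a sliding window over per-episode scores. The sketch below assumes the scores are available as a plain list of floats; the function name `first_solved_episode` and the constants are hypothetical and are not taken from train.py.

```python
from collections import deque

SOLVED_SCORE = 13.0  # threshold stated above
WINDOW = 100         # number of consecutive episodes averaged


def first_solved_episode(episode_scores):
    """Return the 1-based episode at which the 100-episode average first
    exceeds SOLVED_SCORE, or None if that never happens."""
    window = deque(maxlen=WINDOW)  # keeps only the most recent scores
    for episode, score in enumerate(episode_scores, start=1):
        window.append(score)
        if len(window) == WINDOW and sum(window) / WINDOW > SOLVED_SCORE:
            return episode
    return None
```

The check only starts once a full window of 100 episodes exists, so the earliest possible result is episode 100.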

Implementation details

Uses Double DQN with a 3-layer fully connected (FC) network. See Report.md for more details.
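
For readers unfamiliar with Double DQN, the key difference from plain DQN is how the learning target is formed: the online network picks the next action and the target network evaluates it. The following is a minimal, framework-free sketch of that target for a single transition. It is not the code in agent.py; the function name, the list-based Q-values, and the default `gamma=0.99` are assumptions for illustration.

```python
def double_dqn_target(reward, done, next_q_online, next_q_target, gamma=0.99):
    """Compute the Double DQN target for one transition.

    next_q_online: Q-values of the next state from the online network.
    next_q_target: Q-values of the next state from the target network.
    gamma: discount factor (assumed value, see Report.md for the real one).
    """
    if done:
        # Terminal transition: no bootstrapped future value.
        return reward
    # Action selection uses the online network...
    best_action = max(range(len(next_q_online)), key=next_q_online.__getitem__)
    # ...while action evaluation uses the target network.
    return reward + gamma * next_q_target[best_action]
```

Decoupling selection from evaluation in this way is intended to reduce the overestimation of Q-values that can occur when a single network does both.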

Getting Started

Download the environment for your operating system below.
- Linux: click here
- Mac OSX: click here
- Windows (32-bit): click here
- Windows (64-bit): click here
Extract the contents into banana_app/.
Install Anaconda.
Install PyTorch and unityagents.

Instructions

Run main.py or open navigation.ipynb to run the environment with the existing model or to retrain it.



lesaun/dqn-banana-hunting
