Project 1: Navigation

Project Details

The goal of this project is to train an agent to navigate and collect bananas within a square environment using the DQN algorithm (Deep Q-Network).

The agent receives a reward of +1 for every yellow banana and -1 for every blue banana that it collects. The goal is to collect the maximum amount of yellow bananas while avoiding the blue ones.

The state space has 37 dimensions, which contains the agent's velocity and ray-based perception of objects around the agent's forward direction. Each state space value ranges between 0 and 1. With this information the agent needs to learn the best four discrete actions:

0 - move forward
1 - move backward
2 - turn left
3 - turn right

The task is episodic, and to solve the environment the agent must get an average score greater than 13 over 100 consecutive episodes.

Getting Started

You will need to set up your python environment.

Create (and activate) a new environment with Python 3.6.

Linux or Mac:

conda create --name drlnd python=3.6
source activate drlnd

Windows:

conda create --name drlnd python=3.6 
activate drlnd

Perform a minimal install of OpenAI gym with:

pip install gym

Install the classic control environment group by following the instructions here
Install the box2d environment group by following the instructions here

Clone the Udacity's Deep Reinforcement Learning repository

git clone https://github.com/udacity/deep-reinforcement-learning.git
cd deep-reinforcement-learning/python
pip install .

Create an IPython kernel for drlnd environment

python -m ipykernel install --user --name drlnd --display-name "drlnd"

Before running code in a notebook, change the kernel to drlnd environment by using the drop-down Kernel menu.
For this project you will need to download the pre-built environment prepared by Udacity, and you can download it from one of the links below. You need to download the file that matches your operating system:
- Linux: here
- Mac OSX: here
- Windows (32-bit): here
- Windows (64-bit): here
Download this repository within your working directory.

Instructions

Follow the instruction in Navigation.ipynb to get started with training your agent!

Files

Navigation.ipynb This is the Jupyter notebook that contains the implementation of the DQN algorithm.
dqn_agent.py This Python file contains two classes: Agent and ReplayBuffer. The Agent class contains an act method used to return an action for a given state and current policy. It also has a learn method used to updated the Q-network parameters given a batch of experience tuples. The ReplayBuffer class has an add method to add a new experience to the memory buffer, and a sample method used to randomly fetch a batch of experiences from memory.
model.py This Python file contains the Q-Network model, which maps 37 input states to 4 action values. This neural network contains two hidden layers, each with 64 nodes. A ReLU activation function is used after the output of the first and second hidden layers, and an identity activation function for the output.
checkpoint.pth This file contains the DQN weights of the trained agent.

License

The source code is released under an MIT license.

Acknowledgements

I would like to thank the Udacity community for the technical support and for providing coding exercises that helped me understand the implementation of this algorithm.

Author

Andres Campos

Name		Name	Last commit message	Last commit date
Latest commit History 15 Commits
Figures		Figures
LICENSE		LICENSE
Navigation.ipynb		Navigation.ipynb
README.md		README.md
Report.md		Report.md
checkpoint.pth		checkpoint.pth
dqn_agent.py		dqn_agent.py
model.py		model.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Project 1: Navigation

Project Details

Getting Started

Instructions

Files

License

Acknowledgements

Author

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Project 1: Navigation

Project Details

Getting Started

Instructions

Files

License

Acknowledgements

Author

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages