MLND-P4-Train-a-Smartcab-How-to-Drive

Udacity Machine Learning Engineer Nanodegree Project 4

A smartcab is a self-driving car from the not-so-distant future that ferries people from one arbitrary location to another. In this project, you will use reinforcement learning to train a smartcab how to drive.

Environment

Your smartcab operates in an idealized grid-like city, with roads going North-South and East-West. Other vehicles may be present on the roads, but no pedestrians. There is a traffic light at each intersection that can be in one of two states: North-South open or East-West open.

US right-of-way rules apply: On a green light, you can turn left only if there is no oncoming traffic at the intersection coming straight. On a red light, you can turn right if there is no oncoming traffic turning left or traffic from the left going straight.
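The right-of-way rules above can be sketched as a small predicate. This is only an illustration of the rules as worded in this README, not the simulator's own check; the function name `is_move_legal`, its parameters, and the string values used for lights and moves are all assumptions made for the example.

```python
def is_move_legal(action, light, oncoming, left):
    """Return True if `action` obeys the right-of-way rules as stated above.

    All names and value encodings here are hypothetical:
      action:   None (stay put), 'forward', 'left', or 'right'
      light:    'green' or 'red' for the cab's own heading
      oncoming: intended move of an oncoming car, or None if there is none
      left:     intended move of a car arriving from the left, or None
    """
    if action is None:
        # Staying put never violates a traffic rule.
        return True
    if light == 'green':
        if action == 'left':
            # Left on green only if no oncoming car is going straight.
            return oncoming != 'forward'
        return True
    # Red light: only a right turn may be allowed, and only if no oncoming
    # car is turning left and no car from the left is going straight.
    if action == 'right':
        return oncoming != 'left' and left != 'forward'
    return False
```

For instance, `is_move_legal('right', 'red', None, 'forward')` is `False` because traffic from the left is going straight.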

Inputs

Assume that a higher-level planner assigns a route to the smartcab, splitting it into waypoints at each intersection. Time in this world is quantized: at any instant, the smartcab is at some intersection. Therefore, the next waypoint is always either one block straight ahead, one block left, one block right, one block back, or exactly there (the destination has been reached).

The smartcab only has an egocentric view of the intersection it is currently at (sorry, no accurate GPS, no global location). It can sense whether the traffic light is green for its direction of movement (heading), whether there is a car at the intersection on each of the incoming roadways, and which direction each of those cars is trying to go.
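One way to turn these inputs into something a learning agent can use is to pack them into a hashable state. The sketch below assumes the sensed inputs arrive as a dictionary with the keys `'light'`, `'oncoming'`, `'left'`, and `'right'`; those keys, the `State` tuple, and `build_state` are assumptions for illustration and may not match the project code.

```python
from collections import namedtuple

# Hypothetical state layout: the planner's next waypoint plus what the cab
# can sense at the current intersection.
State = namedtuple('State', ['next_waypoint', 'light', 'oncoming', 'left', 'right'])


def build_state(next_waypoint, inputs):
    """Combine the next waypoint and the sensed inputs into a hashable state.

    `inputs` is assumed to be a dict; a missing car is represented by None.
    """
    return State(
        next_waypoint=next_waypoint,
        light=inputs['light'],
        oncoming=inputs.get('oncoming'),
        left=inputs.get('left'),
        right=inputs.get('right'),
    )
```

Because a namedtuple is hashable, a state built this way can be used directly as a dictionary key, for example in a table of learned values.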

In addition, each trip has an associated timer that counts down at every time step. If the timer reaches 0 before the destination has been reached, the trip is over, and a new one may start.

Outputs

At any instant, the smartcab can either stay put at the current intersection, move one block forward, one block left, or one block right (no backward movement).

Rewards

The smartcab gets a reward for each successfully completed trip. A trip is considered “successfully completed” if the passenger is dropped off at the desired destination (some intersection) within a pre-specified time bound (computed with a route plan).

It also gets a smaller reward for each correct move executed at an intersection. It gets a small penalty for an incorrect move, and a larger penalty for violating traffic rules and/or causing an accident.

Goal

Design the AI driving agent for the smartcab. It should receive the above-mentioned inputs at each time step t, and generate an output move. Based on the rewards and penalties it gets, the agent should learn an optimal policy for driving on city roads, obeying traffic rules correctly, and trying to reach the destination within a goal time.
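This README does not prescribe a particular learning algorithm. As one common choice, the sketch below shows tabular Q-learning with epsilon-greedy exploration. The class name `QTable`, the action encoding in `ACTIONS`, and the default values of `alpha`, `gamma`, and `epsilon` are assumptions for illustration, not part of the project code.

```python
import random
from collections import defaultdict

# Hypothetical encoding of the four outputs: stay put, forward, left, right.
ACTIONS = [None, 'forward', 'left', 'right']


class QTable(object):
    """Minimal tabular Q-learning sketch (illustrative, not the project API)."""

    def __init__(self, alpha=0.5, gamma=0.9, epsilon=0.1, seed=None):
        self.alpha = alpha        # learning rate
        self.gamma = gamma        # discount factor
        self.epsilon = epsilon    # probability of a random exploratory move
        self.q = defaultdict(float)  # (state, action) -> estimated value
        self.rng = random.Random(seed)

    def best_value(self, state):
        """Highest estimated value over all actions in `state`."""
        return max(self.q[(state, a)] for a in ACTIONS)

    def choose_action(self, state):
        """Epsilon-greedy choice; ties among best actions are broken randomly."""
        if self.rng.random() < self.epsilon:
            return self.rng.choice(ACTIONS)
        best = self.best_value(state)
        best_actions = [a for a in ACTIONS if self.q[(state, a)] == best]
        return self.rng.choice(best_actions)

    def update(self, state, action, reward, next_state):
        """Move Q(state, action) toward reward + gamma * max Q(next_state, .)."""
        target = reward + self.gamma * self.best_value(next_state)
        self.q[(state, action)] += self.alpha * (target - self.q[(state, action)])
```

At each time step the agent would call `choose_action` on the current state, execute the move, and then call `update` with the reward it received and the state it observes next. States must be hashable, so a tuple of the sensed inputs works.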

Install

This project requires Python 2.7 with the pygame library installed:

https://www.pygame.org/wiki/GettingStarted

To install on OSX:

brew install sdl sdl_ttf sdl_image sdl_mixer portmidi

conda install -c https://conda.anaconda.org/quasiben pygame

Code

Open smartcab/agent.py and implement LearningAgent. Follow TODOs for further instructions.

Run

Make sure you are in the top-level project directory smartcab/ (that contains this README). Then run:

python smartcab/agent.py

OR:

python -m smartcab.agent
