This repository demonstrates how to train and run a Deep Q-Network (DQN) model for controlling an autonomous car in the AirSim simulation environment. The primary goal is to teach the car to navigate an environment without collisions, maximizing a reward function that encourages safe and efficient driving.
- Project Overview
- Installation
- Requirements
- Usage
- How It Works?
- Neural Network Architecture
- Reward Function
- Results
In this project, a deep Q-learning algorithm is used to train a car to navigate autonomously. The car receives observations from the environment, such as the camera view of obstacles, its velocity, coordinates, and sensor readings, and uses a DQN to choose the best steering and acceleration actions. The car was trained for 1,000 episodes (about 12 hours) and learned to reach the end of the street and turn left without any collisions.
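For illustration, the snippet below is a minimal sketch of how such observations can be collected with the standard `airsim` Python client. The camera name ("0"), the grayscale 84×84 preprocessing, and the observation layout are assumptions and may not match this repository's scripts exactly.

```python
# A minimal sketch of collecting observations from AirSim, assuming the
# standard airsim Python client and camera "0"; the exact camera and
# preprocessing used by this repository may differ.
import airsim
import cv2
import numpy as np

client = airsim.CarClient()
client.confirmConnection()
client.enableApiControl(True)

# Request one uncompressed scene image from the front camera.
response = client.simGetImages([
    airsim.ImageRequest("0", airsim.ImageType.Scene, False, False)
])[0]
frame = np.frombuffer(response.image_data_uint8, dtype=np.uint8)
frame = frame.reshape(response.height, response.width, 3)
frame = cv2.cvtColor(frame, cv2.COLOR_BGR2GRAY)   # single-channel image
frame = cv2.resize(frame, (84, 84))               # network input size

# Vehicle state: speed and (X, Y, Z) position.
state = client.getCarState()
speed = state.speed
pos = state.kinematics_estimated.position
position = np.array([pos.x_val, pos.y_val, pos.z_val], dtype=np.float32)
```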
- Python
- PyTorch
- NumPy
- OpenCV-python
- Matplotlib
- AirSim (You can download it from https://github.com/microsoft/AirSim/releases/download/v1.8.1-windows/AirSimNH.zip)
You can install the required libraries from `requirements.txt` with this command: `pip install -r requirements.txt`.
- Using an existing model: run `python drive.py`. Do not forget to change the model path.
- Training a model: run `python train.py`.
- The DQN model takes the current image observation from the simulation, resized to 84×84, as input. Three convolutional layers extract features from the image; their filter sizes decrease from 8×8 to 3×3, their strides decrease from 4×4 to 1×1, and each layer is followed by a ReLU activation. After the last convolutional layer, the outputs are concatenated with the current state of the simulation, represented by the X, Y, and Z coordinates, so the network is aware of the vehicle's current position. The concatenated features are passed through a fully connected layer with 512 neurons, followed by another ReLU activation. The final layer outputs five actions, representing the possible navigation decisions for the vehicle (a PyTorch sketch of this architecture is given after this list).
- The last layer outputs Q-values for each possible action.
- By selecting the action with the highest Q-value, the agent makes a decision at each step.
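Below is a minimal PyTorch sketch of the architecture described above. The filter counts (32, 64, 64) and the single-channel 84×84 input are assumptions based on the standard DQN layout; the repository's exact hyperparameters may differ.

```python
# A minimal sketch of the DQN described above, assuming a single-channel
# 84x84 input and standard DQN filter counts (32, 64, 64).
import torch
import torch.nn as nn

class CarDQN(nn.Module):
    def __init__(self, num_actions=5, state_dim=3):
        super().__init__()
        # Three convolutional layers: filter sizes shrink 8x8 -> 4x4 -> 3x3,
        # strides shrink 4 -> 2 -> 1, each followed by ReLU.
        self.conv = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
        )
        # 84x84 input -> 20x20 -> 9x9 -> 7x7 feature maps with 64 channels.
        conv_out = 64 * 7 * 7
        # Image features are concatenated with the (X, Y, Z) position
        # before the fully connected layers.
        self.fc = nn.Sequential(
            nn.Linear(conv_out + state_dim, 512), nn.ReLU(),
            nn.Linear(512, num_actions),  # one Q-value per action
        )

    def forward(self, image, position):
        features = self.conv(image).flatten(start_dim=1)
        return self.fc(torch.cat([features, position], dim=1))

# Example: a batch of one 84x84 grayscale frame plus the car's (X, Y, Z).
q_values = CarDQN()(torch.zeros(1, 1, 84, 84), torch.zeros(1, 3))
action = q_values.argmax(dim=1)  # greedy action selection
```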
- The reward function provides critical feedback to guide the agent's learning in the environment.
- It heavily penalizes collisions (a -100 reward) and ends the episode on collision, discouraging unsafe actions.
- It rewards the agent for reaching waypoints (+100 reward) and gives an additional bonus for completing all waypoints (+500 reward), encouraging goal-oriented behavior.
- It provides continuous feedback based on how much the distance to the target waypoint decreases, rewarding movement toward the goal and penalizing movement away from it.
- By combining these penalties and rewards, the function guides the agent to learn safe navigation while efficiently reaching its objectives (see the sketch after this list).
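As a rough illustration, the sketch below combines these terms into a single reward function. The argument names (`collided`, `reached_waypoint`, `all_done`, `prev_distance`, `distance`) are hypothetical; the constants follow the description above, but the exact distance-shaping term in the repository may differ.

```python
# A minimal sketch of a reward function with the shape described above.
# All argument names are hypothetical placeholders.
def compute_reward(collided, reached_waypoint, all_done,
                   prev_distance, distance):
    """Return (reward, episode_done) for one environment step."""
    if collided:
        return -100.0, True            # heavy penalty, end the episode
    reward = prev_distance - distance  # positive when moving toward the goal
    if reached_waypoint:
        reward += 100.0                # bonus for reaching a waypoint
    if all_done:
        reward += 500.0                # extra bonus for completing all waypoints
        return reward, True
    return reward, False
```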
- Reference Path and Reached Path (figures)
- Autonomous Driving Video (accelerated x2): Full.Path.mp4
- Mean Max Q-value per Episode (plot)
- Total Reward Per Episode (plot)