After training the network, the model makes lane changes only when required. Here is an example:
In the initial phases, the Model takes random actions that result in unnecessary lane changes and collision. The network is penalized for unnecessary lane changes and lane changes that result in collision. Here is an example:
This project utilizes a three-lane highway traffic simulator for reinforcement learning based lane change decision making. The simulator provides current vechicle position and surrounding vehicles position information. Deep Q Learning Network is employed for the decision making model. The action space is discretized into 3 actions, namely, stay on the current lane, make left lane change and make right lane change.
The simulator is taken from one of the udacity courses. Here is a snapshot of the simulator: