The goal was reached in around 250 epochs. Fifteen agents were involved in the training. An additional reward was granted for movement speed: 0.2 for low speed and 0.5 for high speed; the sum of both bonuses is less than the reward for reaching the goal.
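A minimal sketch of this reward shaping is shown below. The 0.2 and 0.5 bonuses and the constraint that their sum stays below the goal reward come from the text; the speed thresholds, the GOAL_REWARD value, and whether the two bonuses stack are assumptions made here for illustration.

```python
GOAL_REWARD = 1.0           # assumed value; must exceed 0.2 + 0.5 = 0.7
LOW_SPEED_BONUS = 0.2       # from the text: bonus for low-speed movement
HIGH_SPEED_BONUS = 0.5      # from the text: bonus for high-speed movement
HIGH_SPEED_THRESHOLD = 5.0  # assumed threshold, units depend on the environment

def shaped_reward(reached_goal: bool, speed: float) -> float:
    """Base reward for reaching the goal plus speed bonuses.

    The combined speed bonuses (0.2 + 0.5 = 0.7) stay below the goal
    reward, so reaching the goal remains the dominant incentive.
    """
    reward = GOAL_REWARD if reached_goal else 0.0
    if speed > 0.0:
        reward += LOW_SPEED_BONUS    # any movement earns the small bonus
    if speed >= HIGH_SPEED_THRESHOLD:
        reward += HIGH_SPEED_BONUS   # fast movement earns the extra bonus
    return reward
```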
The main model was updated every 10 training steps.
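The periodic main-model update reads like the target-network synchronization used in DQN-style training. A minimal sketch under that assumption, using PyTorch; the online_net/target_net names are hypothetical:

```python
import torch.nn as nn

UPDATE_EVERY = 10  # main model synced every 10 training steps, per the text

def maybe_sync_target(online_net: nn.Module, target_net: nn.Module,
                      train_step: int) -> None:
    """Copy the online network's weights into the target network
    every UPDATE_EVERY training steps (hard update)."""
    if train_step % UPDATE_EVERY == 0:
        target_net.load_state_dict(online_net.state_dict())
```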
The goal was reached in around 200 epochs. Twenty agents were involved in the training. As before, an additional reward was granted for movement speed: 0.2 for low speed and 0.5 for high speed, with the sum of both bonuses less than the reward for reaching the goal.
The main model was updated every 2 training steps. We can see the difference if we look at the Q-values: there are fewer red moves, which means the agents stood idle less often and chose other actions instead. As you can see, the red periods in the model with minibatch_size=2000 are shorter, due to the more frequent model updates.
The samples come from training: whenever an action was not random, the sample was collected. The graph shows only the actions that were chosen at a given moment. As training progresses, we can see the red color disappearing.
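A sketch of this collection rule, assuming an epsilon-greedy policy in which only the greedy (non-random) choices are logged for the plot; the q_values argument and the log list are hypothetical names:

```python
import random
import numpy as np

def select_action(q_values: np.ndarray, epsilon: float, log: list) -> int:
    """Epsilon-greedy action selection; only non-random (greedy) choices
    are recorded, matching the plotted samples described above."""
    if random.random() < epsilon:
        return random.randrange(len(q_values))  # exploratory move, not logged
    action = int(np.argmax(q_values))           # greedy move chosen from Q-values
    log.append(action)                          # collected as a plotted sample
    return action
```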