This is the reposository for Thomas Tarler's practicuum 2 in reinforcement learning.
Directory
- Final Report: Latex documents with final results.
- : (Note to instructor:) Use this PDF when evaluating writeup, as the
ReadMe.MD
is rather sparse - scripts: Scripts that can be run on a Python cluster to determine final results
Final Neural Networks:
To Run Scripts
- Zip and upload
DDPG
orDQN
to the multiprocessing cluster of your choice. Must have the following: a.anaconda3
withkeras-rl
,box2d
installed b. If you want to render, ensure thatffmpeg
(usebrew
,yum
, orapt
depending on server) c. Tensorflow 1.0 compatability installed - For DQN, run
python DQN_BipedalWalker.py
, no command line arguments are necessary. If you do not have rendering established, putn
when prompted. - For DDPG, run
python ddpg_batch.py EPISODES BATCH_SIZE LSTM ARG EPC
. Again, selectn
for rendering. Command line arguments are: a.EPISODES
in integer is number of episodes (I ran 10000) b.BATCH_SIZE
in integer is the batch size (typically 16-32) c.LSTM
is a boolean, depending if you want to call a LSTM network for the recall buffer d.ARG
in integer is how often you want training results displayed via CLI e.EPC
in integer is how many epochs you want to use.