GitHub - lingjiekong/CS238Project: Course project for Stanford CS238: Decision Making Under Uncertainty (Autumn 2017)

lingjiekong / CS238Project Public

Notifications You must be signed in to change notification settings
Fork 1
Star 0

Course project for Stanford CS238: Decision Making Under Uncertainty (Autumn 2017)

Notifications

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
a3c		a3c
double_a3c		double_a3c
dqn		dqn
exp_results		exp_results
material		material
old_a3c		old_a3c
proposal		proposal
src		src
tensorpack-master		tensorpack-master
README		README

Repository files navigation

========== Source code directory ==========

double_a3c/: Implementation of A3C, double A3C, and less shared double A3C. Please run them with 
             Python3 (tensorflow required).
             Usage: python3 train-atari.py --env Breakout-v0 --gpu 0
                    python3 double-train-atari.py --env Breakout-v0 --gpu 0
                    python3 shared-double-train-atari.py --env Breakout-v0 --gpu 0
             The codes of double A3C and LS A3C are based on the A3C implementation in:
             https://github.com/ppwwyyxx/tensorpack/blob/master/examples/A3C-Gym/train-atari.py

dqn/: Implementation of DQN. Utilize starter codes of homework 2 in Stanford CS234, implemented by
      Yangxin Zhong, one of the authors in this project. Please run it with Python2 (tensorflow required).
      Usage: python q5_train_atari_nature.py
      One needs to edit 'configs/q5_train_atari_nature.py' to change the environment and parameters
      setting.


========== Report directory ==========

exp_results/: Log files of experiments.

proposal/: Project proposol and report. See 'proposal.docx' and 'Report_Combine.docx'