Skip to content

k-duan/rl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

36 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

rl

This repo is for learning RL basics. I intend to keep things simple, for example:

  • Focus on simple gym environment
  • Minimum dependencies and avoid advanced implementations in pytorch
  • Only use implementation tricks that are necessary to make things work
  • Tensorization as much as possible, e.g. episode masks to handle variable sequence length

Implementations

  • REINFORCE
  • A2C
  • PPO

Notes

  • A2C suffers from instability issues. A few helpful tricks:
    • advantage normalization
    • use a learning schedule
    • gradient norm clipping
    • carefully tune learning rate and other parameters
  • PPO makes training much more stable. See A2C and PPO (both using GAE with same lambda to calculate advantages) comparison on CartPole-V1 (experimented using 8181d9f ):

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages