RLCodebase

RLCodebase is a modularized codebase for deep reinforcement learning algorithms based on PyTorch. This repo aims to provide an user-friendly reinforcement learning codebase for beginners to get started and for researchers to try their ideas quickly and efficiently.

For now, it has implemented DQN(PER), A2C, PPO, DDPG, TD3 and SAC algorithms, and has been tested on Atari, Procgen, Mujoco, PyBullet and DMControl Suite environments.

Introduction

The design of RLCodebase is shown as below.

Config: Config is a class that contains parameters for reinforcement learning algorithms such as discount factor, learning rate, etc. and general configurations such as random seed, saving path, etc.
Trainer: Trainer is a wrapped class that controls the workflow of reinforcement learning training. It manages the interactions between submodules (Agent, Env, memory).
Agent: Agent chooses actions to take given states. It also defines how to update the model given a batch of data.
Model: Model gathers all neural networks to train.
Env: Env is a vectorized gym environment.
Memory: Memory stores experiences utilized for RL training.

Installtion

All required packages have been included in setup.py and requirements.txt. Mujoco is needed for mujoco_py and dm control suite. To support mujoco_py and dm control, please refer to https://github.com/openai/mujoco-py and https://github.com/deepmind/dm_control. For mujoco_py 2.1.2.14 and dm_control (commit fe44496), you may download mujoco like below

cd ~  
mkdir .mujoco  
cd .mujoco  
# for mujoco_py
wget https://mujoco.org/download/mujoco210-linux-x86_64.tar.gz
tar -xf mujoco210-linux-x86_64.tar.gz  
# for dm control
wget https://github.com/deepmind/mujoco/releases/download/2.1.1/mujoco-2.1.1-linux-x86_64.tar.gz
tar -xf mujoco-2.1.1-linux-x86_64.tar.gz

To install RLCodebase, follow

# create virtual env
conda create -n rlcodebase python=3.8
conda activate rlcodebase

# install rlcodebase
git clone [email protected]:KarlXing/RLCodebase.git RLCodebase
cd RLCodebase
pip install -e .
pip install -r requirements.txt

# try it
python examples/example_ppo.py

Supported Algorithms

DQN (PER)
A2C
PPO
DDPG
TD3
SAC

Supported Environments (tested)

Atari
Mujoco
PyBullet
Procgen

Results

1. PPO & A2C In Atari Games

2. DDPG & TD3 & SAC In PyBullet Environments

3. DQN & DQN+PER In PongNoFrameskip-v4

4. Procgen

Citation

Please use the bibtex below if you want to cite this repository in your publications:

@misc{rlcodebase,
  author = {Jinwei Xing},
  title = {RLCodebase: PyTorch Codebase For Deep Reinforcement Learning Algorithms},
  year = {2020},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/KarlXing/RLCodebase}},
}

References for implementation and design

RLCodebase is inspired by resources below.

Name		Name	Last commit message	Last commit date
Latest commit History 65 Commits
examples		examples
imgs		imgs
rlcodebase		rlcodebase
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

RLCodebase

Introduction

Installtion

Supported Algorithms

Supported Environments (tested)

Results

1. PPO & A2C In Atari Games

2. DDPG & TD3 & SAC In PyBullet Environments

3. DQN & DQN+PER In PongNoFrameskip-v4

4. Procgen

Citation

References for implementation and design

About

Releases 4

Packages

Contributors 2

Languages

KarlXing/RLCodebase

Folders and files

Latest commit

History

Repository files navigation

RLCodebase

Introduction

Installtion

Supported Algorithms

Supported Environments (tested)

Results

1. PPO & A2C In Atari Games

2. DDPG & TD3 & SAC In PyBullet Environments

3. DQN & DQN+PER In PongNoFrameskip-v4

4. Procgen

Citation

References for implementation and design

About

Topics

Resources

Stars

Watchers

Forks

Releases 4

Packages 0

Contributors 2

Languages

Packages