This is the code for reproducing the results of the paper *A Policy-Guided Imitation Approach for Offline Reinforcement Learning*, accepted as an oral presentation at NeurIPS 2022. The paper and slides can be found at the paper and slide links.
Policy-guided Offline RL (POR) is a new offline RL paradigm: it enables state-stitching from the dataset, rather than the action-stitching performed by prior offline RL methods. POR enjoys training stability by using in-sample learning while still allowing logical out-of-sample generalization. We hope that POR can shed light on how to enable state-stitching in offline RL, which connects well to goal-conditioned RL and hierarchical RL.
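As a rough illustration of the guide/execute decomposition, here is a minimal sketch on a toy 1-D dataset: a guide policy is fit to predict a promising next state, and an execute (inverse-dynamics) policy maps the current and target states to an action. Both are trained purely in-sample by regression on dataset transitions. The linear models, toy dynamics, and plain least-squares fitting are illustrative assumptions; the actual POR implementation uses neural networks and value-weighted regression.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy offline dataset of (s, a, s') transitions from a 1-D system
# with dynamics s' = s + a; the behavior policy steers toward a goal.
goal = 1.0
s = rng.uniform(-1, 1, size=(1000, 1))
a = 0.5 * (goal - s) + 0.05 * rng.normal(size=s.shape)
s_next = s + a

# Guide policy g(s) -> s': fit by regression on dataset next states
# (POR weights this regression by value estimates; unweighted here).
X = np.hstack([s, np.ones_like(s)])
w_guide, *_ = np.linalg.lstsq(X, s_next, rcond=None)

# Execute policy pi(a | s, s'): an inverse-dynamics model, also fit
# in-sample on the same transitions.
Xe = np.hstack([s, s_next, np.ones_like(s)])
w_exec, *_ = np.linalg.lstsq(Xe, a, rcond=None)

def act(state):
    """Guide proposes a target next state; execute outputs the action."""
    target = np.array([state, 1.0]) @ w_guide
    action = np.array([state, target[0], 1.0]) @ w_exec
    return float(target[0]), float(action[0])

target, action = act(0.0)
print(target, action)
```

Because the guide only ever predicts states seen in the data while the execute policy imitates in-sample actions, stitching happens in state space rather than action space.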
Paper results can be reproduced by running `./run_mujoco.sh` and `./run_antmaze.sh`.
```bibtex
@inproceedings{xu2022policyguided,
  title     = {A Policy-Guided Imitation Approach for Offline Reinforcement Learning},
  author    = {Haoran Xu and Li Jiang and Jianxiong Li and Xianyuan Zhan},
  booktitle = {Advances in Neural Information Processing Systems},
  year      = {2022},
}
```