Skip to content
/ IVR Public

[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

License

Notifications You must be signed in to change notification settings

ryanxhr/IVR

Folders and files

NameName
Last commit message
Last commit date

Latest commit

author
xuhaoran4
Jul 27, 2023
58edd26 · Jul 27, 2023

History

26 Commits
Mar 23, 2023
Mar 23, 2023
Mar 8, 2023
Mar 14, 2023
Jul 27, 2023
Mar 25, 2023
Mar 14, 2023
Mar 25, 2023
Mar 8, 2023
Mar 25, 2023
Mar 22, 2023
Mar 8, 2023
Mar 22, 2023
Mar 25, 2023
Mar 25, 2023
Mar 25, 2023
Mar 8, 2023
Mar 25, 2023
Mar 22, 2023

Repository files navigation

Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization

This is the code for reproducing the results of the paper Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization accepted as Notable-top-5% at ICLR'2023.

The discrete version of IVR on Atari datasets can be found at https://github.com/ryanxhr/Discrete_IVR.

Usage

Our code is built on the jax version code of IQL (https://github.com/ikostrikov/implicit_q_learning). Paper reuslts can be reproduced by running ./run_mujoco.sh, ./run_antmaze.sh and ./run_kitchen.sh.

Bibtex

@inproceedings{xu2023offline,
  title  = {Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization},
  author = {Haoran Xu, Li Jiang, Jianxiong Li, Zhuoran Yang, Zhaoran Wang, Victor Wai Kin Chan, Xianyuan Zhan},
  year   = {2023},
  booktitle = {International Conference on Learning Representations},
}

About

[ICLR 2023 Oral] The official implementation of SQL and EQL in "Offline RL with No OOD Actions: In-Sample Learning via Implicit Value Regularization"

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published