SafePolicyImprovementNonstationary

This a repository containing the code for the paper "Towards Safe Policy Improvement forNon-Stationary MDPs"

Yash Chandak, Scott M. Jordan, Georgios Theocharous, Martha White, Philip S. Thomas, Towards Safe Policy Improvement for Non-Stationary MDPs (NeurIPS, 2020) [NeurIPS] [ArXiv] [Code]

For dependencies, see the file Project.toml.

Setup Conda environment for Julia

ENV["PYTHON"] = "/path/to/miniconda3/envs/<env_name>/bin/python"
] build PyCall
build IJulia

add https://github.com/ScottJordan/EvaluationOfRLAlgs.git

Install Glucose Simulator

conda activate <env_name> #Activate the virtual specified above. 
cd python/SimGlucose
pip install -e .

If you get MKL errors when trying to use the simulator from Julia. Uninstall the conda numpy library that has MKL (should be the default) and then add the one without MKL.

To reproduce results in the paper run the files experiments/bandit_swarm.jl and experiments/glucose_swarm.jl.

The jupyter notebook experiments/plots.ipynb contains code for plotting and analyzing the results.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
experiments		experiments
python/SimGlucose		python/SimGlucose
.gitignore		.gitignore
Manifest.toml		Manifest.toml
Project.toml		Project.toml
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

SafePolicyImprovementNonstationary

Setup Conda environment for Julia

Install Glucose Simulator

About

Releases

Packages

Languages

ScottJordan/SafePolicyImprovementNonstationary

Folders and files

Latest commit

History

Repository files navigation

SafePolicyImprovementNonstationary

Setup Conda environment for Julia

Install Glucose Simulator

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages