Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation

This repository contains the code release for the paper Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation by Cansu Sancaktar, Sebastian Blaes, and Georg Martius, published as a poster at NeurIPS 2022. Please use the provided citation when making use of our code or ideas.

Installation

Install and activate a new python3.8 virtualenv.

virtualenv mbrl_venv --python=python3.8

source mbrl_venv/bin/activate

For the following steps, make sure you are sourced inside the mbrl_venv virtualenv.

Install torch with CUDA. Here is an example for CUDA version 11.3.

pip3 install torch==1.10.0+cu113 torchvision==0.11.1+cu113 torchaudio==0.10.0+cu113 -f https://download.pytorch.org/whl/cu113/torch_stable.html

You can change the CUDA version according to your system requirements, however we only tested for the versions specified here.

Prepare for mujoco-py installation.
1. Download mujoco200
2. cd ~
3. mkdir .mujoco
4. Move mujoco200 folder to .mujoco
5. Move mujoco license key mjkey.txt to ~/.mujoco/mjkey.txt
6. Set LD_LIBRARY_PATH (add to your .bashrc (or .zshrc) ):
export LD_LIBRARY_PATH="$LD_LIBRARY_PATH:$HOME/.mujoco/mujoco200_linux/bin"
1. For Ubuntu, run:
sudo apt install libosmesa6-dev libgl1-mesa-glx libglfw3

sudo apt install -y patchelf
Install supporting packages

pip3 install -r requirements.txt

From the project root:

pip install -e .

Set PYTHONPATH:

export PYTHONPATH=$PYTHONPATH:<path/to/repository>

Note: These settings have only been tested on Ubuntu 20. It is recommended to use Ubuntu 20.

How to run

python mbrl/main.py experiments/cee_us/settings/[env]/curious_exploration/[settings_file].yaml

The settings files are stored in the experiments folder. Parameters for models, environments, controllers, free play vs. zero-shot downstream task generalization are all specified in these files. In the corresponding folders, you will also find the settings files for the baselines.

For example, in order to run CEE-US free play in the construction environment run:

python mbrl/main.py experiments/cee_us/settings/construction/curious_exploration/gnn_ensemble_cee_us.yaml

After the free play phase to perform zero-shot dowmstream task generalization on stacking with 2 objects, run:

python mbrl/main.py experiments/cee_us/settings/construction/zero_shot_generalization/gnn_ensemble_cee_us_zero_shot_stack.yaml

You need to add the path to the trained model in this settings file! (e.g. see gnn_ensemble_cee_us_zero_shot_stack)

Usage Examples

Our method CEE-US as well as the baselines can be run using the settings files in in experiments/cee_us/settings. E.g. for free play in the construction environment:

gnn_ensemble_cee_us.yaml: (CEE-US) Uses disagreement of GNN ensemble as intrinsic reward, MPC with iCEM
mlp_ensemble_cee_us.yaml: Uses disagreement of MLP ensemble as intrinsic reward, MPC with iCEM
gnn_rnd_icem.yaml: Uses GNN model with Random Network Distillation as intrinsic reward, MPC with iCEM
mlp_rnd_icem.yaml: Uses MLP model with Random Network Distillation as intrinsic reward, MPC with iCEM

See the full paper for more details.

Code style

Run to set up the git hook scripts

pre-commit install

This command will install a number of git hooks that will check your code quality before you can commit.

The main configuration file is located in

/.pre-commit-config

Individual config files for the different hooks are located in the base directory of the rep. For instance, the configuration file of flake8 is /.flake8.

Citation

Please use the following bibtex entry to cite us:

@inproceedings{sancaktar22curious,
  Author = {Sancaktar, Cansu and
  Blaes, Sebastian and Martius, Georg},
  Title = {Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation},
  Booktitle = {Advances in Neural Information Processing Systems 35 (NeurIPS 2022)},
  Year = {2022}
}

Credits

We adapted C-SWM by Thomas Kipf for the GNN implementation and fetch-block-construction by Richard Li for the construction environment, both under MIT license. The RoboDesk environment was taken from RoboDesk and adapted to mujoco-py and to be object-centric.

Name		Name	Last commit message	Last commit date
Latest commit History 22 Commits
batch		batch
datasets		datasets
docs		docs
experiments		experiments
mbrl		mbrl
.flake8		.flake8
.gitignore		.gitignore
.pre-commit-config.yaml		.pre-commit-config.yaml
LICENSE		LICENSE
MANIFEST.in		MANIFEST.in
README.md		README.md
package.xml		package.xml
pyproject.toml		pyproject.toml
requirements.txt		requirements.txt
setup.cfg		setup.cfg
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation

Installation

How to run

Usage Examples

Code style

Citation

Credits

About

Releases

Packages

Contributors 2

Languages

License

Simon-Reif/cee-us

Folders and files

Latest commit

History

Repository files navigation

Curious Exploration via Structured World Models Yields Zero-Shot Object Manipulation

Installation

How to run

Usage Examples

Code style

Citation

Credits

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages