We use self-supervision to learn a compact and multimodal representation of our sensory inputs, which can then be used to improve the sample efficiency of our policy learning. We train a policy in PyBullet (on a Kuka LBR iiwa robot arm) using PPO for peg-in-hole tasks. This implementation can also be used to study force-torque (F/T) control for contact-rich manipulation tasks, since at each step the F/T reading is captured at the joint connected to the end-effector.
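For reference, here is a minimal PyBullet sketch of how such a wrist F/T reading can be captured. This is not the repo's exact code; the joint index is an assumption.

```python
import pybullet as p
import pybullet_data

# Minimal sketch (not the repo's exact code): enable and read the F/T sensor at
# the Kuka wrist joint in PyBullet. Joint index 6 is an assumption for the
# joint connected to the end-effector.
p.connect(p.DIRECT)
p.setAdditionalSearchPath(pybullet_data.getDataPath())
p.setGravity(0, 0, -9.81)
kuka = p.loadURDF("kuka_iiwa/model.urdf", useFixedBase=True)

wrist_joint = 6
p.enableJointForceTorqueSensor(kuka, wrist_joint, enableSensor=True)

p.stepSimulation()
# getJointState returns (position, velocity, reaction forces, applied motor torque);
# the reaction forces are the 6-D force-torque reading (Fx, Fy, Fz, Mx, My, Mz).
_, _, ft_reading, _ = p.getJointState(kuka, wrist_joint)
print("F/T reading:", ft_reading)
```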
To add a Robotiq gripper, see: https://github.com/Alchemist77/pybullet-ur5-equipped-with-robotiq-140/blob/master/urdf/robotiq_140_gripper_description/urdf/robotiq_140.urdf
To add the S-RL Toolbox, see: https://s-rl-toolbox.readthedocs.io/en/latest/
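A rough sketch of how the downloaded Robotiq URDF could be loaded and attached to the Kuka in PyBullet (the file path, link index, and mounting offset are assumptions, not the repo's actual setup):

```python
import pybullet as p
import pybullet_data

# Rough sketch (assumed paths and indices, not the repo's setup): load the
# Robotiq 140 URDF downloaded from the repository above and attach it to the
# Kuka's last link with a fixed constraint.
p.connect(p.DIRECT)
p.setAdditionalSearchPath(pybullet_data.getDataPath())
kuka = p.loadURDF("kuka_iiwa/model.urdf", useFixedBase=True)
gripper = p.loadURDF("robotiq_140.urdf")  # assumed local path to the downloaded URDF and meshes

p.createConstraint(
    parentBodyUniqueId=kuka,
    parentLinkIndex=6,                 # assumed index of the Kuka's end-effector link
    childBodyUniqueId=gripper,
    childLinkIndex=-1,                 # gripper base link
    jointType=p.JOINT_FIXED,
    jointAxis=[0, 0, 0],
    parentFramePosition=[0, 0, 0.05],  # assumed mounting offset
    childFramePosition=[0, 0, 0],
)
```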
- Download the project-master folder in the master branch. Note: you will need Anaconda to run this program. I recommend installing it, creating/initializing an environment, and installing Python 3.6 in it, as that is the version you'll need.
- `cd` into the folder on your local laptop.
- Run `pip install -r requirements.txt`. You will also have to install pybullet (`pip install pybullet`), gym (`pip install gym`), opencv (`pip install opencv-python`), and pytorch (`pip install torch torchvision`).
- Run `python train_peg_insertion.py` to train the agent. If you get any errors, you might have to change paths that are set for my machine to your own.
- To collect the multimodal dataset for encoder pre-training, run `python environments/kuka_peg_env.py`. You can get more data by changing the random seed.
- To pre-train the fusion encoder, run `python multimodal/train_my_fusion_model.py`. You have to specify the path to the root directory of the multimodal dataset. A rough sketch of what such a fusion encoder might look like is shown after this list.
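The actual fusion architecture lives in `multimodal/train_my_fusion_model.py`; as an illustration only, here is a minimal sketch of a fusion encoder that combines an RGB image with the 6-D F/T reading into one compact latent vector. All layer sizes, input shapes, and names are assumptions, not the repo's actual model.

```python
import torch
import torch.nn as nn

# Minimal sketch of a multimodal fusion encoder (assumed architecture and
# dimensions; the repo's model may differ): an image branch and a force-torque
# branch are encoded separately and fused into one compact latent vector that
# the policy can consume.
class FusionEncoder(nn.Module):
    def __init__(self, latent_dim=128):
        super().__init__()
        # Image branch: small CNN for a 3x64x64 RGB observation (assumed size).
        self.image_encoder = nn.Sequential(
            nn.Conv2d(3, 16, 4, stride=2), nn.ReLU(),
            nn.Conv2d(16, 32, 4, stride=2), nn.ReLU(),
            nn.Conv2d(32, 64, 4, stride=2), nn.ReLU(),
            nn.Flatten(),
        )
        # Force-torque branch: MLP over the 6-D wrist F/T reading.
        self.ft_encoder = nn.Sequential(
            nn.Linear(6, 32), nn.ReLU(),
            nn.Linear(32, 32), nn.ReLU(),
        )
        # Fusion head: concatenate both embeddings and project to the latent.
        self.fusion = nn.Sequential(
            nn.Linear(64 * 6 * 6 + 32, 256), nn.ReLU(),
            nn.Linear(256, latent_dim),
        )

    def forward(self, image, force_torque):
        z_img = self.image_encoder(image)
        z_ft = self.ft_encoder(force_torque)
        return self.fusion(torch.cat([z_img, z_ft], dim=1))


# Smoke test with random inputs.
encoder = FusionEncoder()
z = encoder(torch.randn(2, 3, 64, 64), torch.randn(2, 6))
print(z.shape)  # torch.Size([2, 128])
```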
Quick Notes:
- This code was built on the implementation here: https://github.com/Henry1iu/ierg5350_rl_course_project
- DDPG code implementation: https://github.com/ghliu/pytorch-ddpg