Kushal Kedia*, Prithwish Dan*, Angela Chao, Maximus A. Pace, Sanjiban Choudhury
Cornell University, *Equal Contribution
Follow these steps to install RHyME:
- Create and activate the conda environment:
  ```
  cd rhyme
  conda env create -f environment.yml
  conda activate rhyme
  pip install -e .
  ```
To set up the simulation dataset:
- Instructions TBD
Datasets (Visual Encoder):
- robot
- twohands
- robot_segments_paired_twohands and twohands_segments_paired_twohands (Optional)
Datasets (Diffusion Policy):
- robot
- imagined demonstrator dataset (will be created)
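The exact on-disk layout is not specified here; as a rough sketch, assuming everything lives under the `datasets/` folder referenced in the pipeline below, the structure might look like:

```
datasets/
├── robot/                                # robot play data
├── twohands/                             # demonstrator play data
├── robot_segments_paired_twohands/       # optional paired segments
├── twohands_segments_paired_twohands/    # optional paired segments
└── <imagined demonstrator dataset>/      # created by scripts/reconstruction.py
```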
- Pretrain the visual encoder:

  ```
  python scripts/skill_discovery.py
  ```

  Additional options include:
  - `exp_name` (name of the model)
  - `cross_embodiment` (`human`, `singlehand`, `twohands`)
  - `use_paired_data` (`True`/`False`)
  - `paired_dataset.percentage_pairing` (0-1)
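  For example, a pretraining run might look like the following (`MY_ENCODER` is a placeholder name, and the `key=value` override syntax is assumed to match the full commands shown later in this README):

  ```
  python scripts/skill_discovery.py \
      exp_name=MY_ENCODER \
      cross_embodiment=twohands \
      use_paired_data=True \
      paired_dataset.percentage_pairing=0.5
  ```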
- Convert images into latent vectors using the pretrained visual encoder:

  ```
  python scripts/label_sim_kitchen_dataset.py
  ```

  Additional options include:
  - `cross_embodiment` (`human`, `singlehand`, `twohands`)
  - `pretrain_model_name`
  - `ckpt`
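  For example (placeholder values; `pretrain_model_name` and `ckpt` should point at the encoder trained in the previous step):

  ```
  python scripts/label_sim_kitchen_dataset.py \
      cross_embodiment=twohands \
      pretrain_model_name=MY_ENCODER \
      ckpt=40
  ```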
- Compute and store sequence-level distance metrics between cross-embodiment play data and robot data:

  ```
  python scripts/chopped_segment_wise_dists.py
  ```

  Additional options include:
  - `cross_embodiment_segments` (e.g. `twohands_segments_paired_sample`)
  - `pretrain_model_name`
  - `ckpt`
  - `num_chops` (number of clips to retrieve per robot video)
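  For example (illustrative values; `num_chops=2` mirrors the `_ot_2_` suffix in the generated dataset names used later in this README):

  ```
  python scripts/chopped_segment_wise_dists.py \
      cross_embodiment_segments=twohands_segments_paired_sample \
      pretrain_model_name=MY_ENCODER \
      ckpt=40 \
      num_chops=2
  ```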
- "Imagine" the paired demonstrator dataset, and store it in the datasets folder:
Additional options include:
python scripts/reconstruction.py
cross_embodiment_segments (e.g. twohands_segments_paired_sample) pretrain_model_name ckpt ot_lookup (True/False) tcc_lookup (True/False) num_chops (number of clips to retrieve per robot video)
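  For example, to retrieve clips with optimal transport (OT) rather than TCC (illustrative values; they mirror the `_generated_ot_2_ckpt40` dataset name used in the next step):

  ```
  python scripts/reconstruction.py \
      cross_embodiment_segments=twohands_segments_paired_sample \
      pretrain_model_name=MY_ENCODER \
      ckpt=40 \
      ot_lookup=True \
      tcc_lookup=False \
      num_chops=2
  ```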
- Convert the imagined dataset into latent vectors:

  ```
  python scripts/label_sim_kitchen_dataset.py \
      include_robot=False \
      pretrain_model_name=NO_PAIRING_TWOHANDS \
      cross_embodiment=NO_PAIRING_TWOHANDS_twohands_segments_paired_sample_generated_ot_2_ckpt40
  ```

  Additional options include:
  - `include_robot` (`True`/`False`)
  - `pretrain_model_name`
  - `cross_embodiment` (now the name of the dataset reconstructed via OT)
- Train the conditional diffusion policy to translate imagined demonstrator videos into robot actions:

  ```
  python scripts/skill_transfer_composing.py \
      pretrain_model_name=NO_PAIRING_TWOHANDS \
      pretrain_ckpt=40 \
      eval_cfg.demo_type=twohands \
      cross_embodiment=NO_PAIRING_TWOHANDS_twohands_segments_paired_sample_generated_ot_2_ckpt40 \
      dataset.paired_data=True \
      dataset.paired_percent=0.5
  ```

  Additional options include:
  - `pretrain_model_name`
  - `pretrain_ckpt`
  - `eval_cfg.demo_type` (specifies which demonstrator to evaluate on)
  - `cross_embodiment` (the dataset reconstructed via OT)
  - `dataset.paired_data` (`True` if using the imagined paired dataset)
  - `dataset.paired_percent` (fraction of imagined data for hybrid training on the robot/imagined datasets)
```
@article{kedia2024one,
  title={One-Shot Imitation under Mismatched Execution},
  author={Kedia, Kushal and Dan, Prithwish and Choudhury, Sanjiban},
  journal={arXiv preprint arXiv:2409.06615},
  year={2024}
}
```
- Much of the training pipeline is adapted from XSkill.
- The diffusion policy implementation is adapted from Diffusion Policy.
- Many useful utilities are adapted from XIRL.