ReALFRED-Seq2Seq

This is the code repository of the paper ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks, adapted for reproduction on ReALFRED.
Our code is largely built upon the ALFRED codebase.

Environment

Clone repository

$ git clone https://github.com/snumprlab/realfred.git
$ cd realfred/seq2seq
$ export ALFRED_ROOT=$(pwd)

Install requirements

$ conda create -n realfred python=3.6
$ conda activate realfred

$ cd $ALFRED_ROOT
$ pip install --upgrade pip
$ pip install -r requirements.txt

You also need to install PyTorch for your system, e.g., PyTorch v1.10.0 + CUDA 11.1.
Refer to the official PyTorch installation instructions for other configurations.

pip install torch==1.10.0+cu111 torchvision==0.11.0+cu111 torchaudio==0.10.0 -f https://download.pytorch.org/whl/torch_stable.html
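After installing, you can quickly check that PyTorch was built with GPU support (a minimal sanity check; the exact version string depends on the build you installed):

$ python -c "import torch; print(torch.__version__, torch.cuda.is_available())"

If the CUDA build installed correctly, this should print something like 1.10.0+cu111 True.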

Download

Download the ResNet-18 features and annotation files from the Hugging Face repo.
Note: they take up a large amount of disk space (~2.3TB).

git clone https://huggingface.co/datasets/SNUMPR/realfred_feat data
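Note: Hugging Face typically stores large files via Git LFS, so if the clone above only fetches small pointer files, you likely need git-lfs installed and initialized first. A minimal sketch, assuming a Debian/Ubuntu system:

# Install and enable Git LFS before cloning (assumes an apt-based system)
$ sudo apt-get install git-lfs
$ git lfs install
$ git clone https://huggingface.co/datasets/SNUMPR/realfred_feat data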

Training

To train seq2seq, run train_seq2seq.py with the hyper-parameters below.

python models/train/train_seq2seq.py --data <path_to_dataset> --model seq2seq_im_mask --dout <path_to_save_weight> --splits data/splits/oct24.json --gpu --batch <batch_size> --pm_aux_loss_wt <pm_aux_loss_wt_coeff> --subgoal_aux_loss_wt <subgoal_aux_loss_wt_coeff>

Note: As mentioned in the ALFRED repository, run with --preprocess once (on the first run) to generate the preprocessed JSON files; see the sketch below.
Note: All hyperparameters used for the experiments in the paper are set as default.
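For instance, a first run that also preprocesses the data (a sketch; <path_to_dataset> and <path_to_save_weight> are placeholders as above) might look like:

python models/train/train_seq2seq.py --data <path_to_dataset> --model seq2seq_im_mask --dout <path_to_save_weight> --splits data/splits/oct24.json --gpu --preprocess

Subsequent runs can drop --preprocess, since the JSON files are already preprocessed.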

For example, if you want to train seq2seq and save the weights for all epochs in "exp/seq" with all hyperparameters used in the experiments in the paper, you may use the command below

python models/train/train_seq2seq.py --gpu --dout exp/seq --save_every_epoch

or simply run

bash train.sh

Note: The --save_every_epoch option saves weights for all epochs and can therefore take up a lot of disk space.

Evaluation

Task Evaluation

To evaluate seq2seq, run eval_seq2seq.py with the hyper-parameters below.
To evaluate a model in seen or unseen environments, pass valid_seen, valid_unseen, tests_seen, or tests_unseen to --eval_split.

python models/eval/eval_seq2seq.py --data <path_to_dataset> --model models.model.seq2seq_im_mask --model_path <path_to_weight> --eval_split <eval_split> --gpu --num_threads <thread_num>

Note: All hyperparameters used for the experiments in the paper are set as default.

If you want to evaluate our pretrained model saved in exp/pretrained/pretrained.pth on the seen validation split, you may use the command below.

python models/eval/eval_seq2seq.py --model_path "exp/pretrained/pretrained.pth" --eval_split valid_seen --gpu --num_threads 4
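Similarly, evaluating the same checkpoint in unseen validation environments only requires changing the split:

python models/eval/eval_seq2seq.py --model_path "exp/pretrained/pretrained.pth" --eval_split valid_unseen --gpu --num_threads 4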

Hardware

Trained and Tested on:

  • GPU - RTX A6000
  • CPU - Intel(R) Core(TM) i7-12700K CPU @ 3.60GHz
  • RAM - 64GB
  • OS - Ubuntu 20.04

Citation

ReALFRED

@inproceedings{kim2024realfred,
  author    = {Kim, Taewoong and Min, Cheolhong and Kim, Byeonghwi and Kim, Jinyeon and Jeung, Wonje and Choi, Jonghyun},
  title     = {ReALFRED: An Embodied Instruction Following Benchmark in Photo-Realistic Environment},
  booktitle = {ECCV},
  year      = {2024}
}

ALFRED

@inproceedings{ALFRED20,
  author    = {Mohit Shridhar and Jesse Thomason and Daniel Gordon and Yonatan Bisk and Winson Han and Roozbeh Mottaghi and Luke Zettlemoyer and Dieter Fox},
  title     = {{ALFRED: A Benchmark for Interpreting Grounded Instructions for Everyday Tasks}},
  booktitle = {CVPR},
  year      = {2020}
}