X-modaler

X-modaler is a versatile and high-performance codebase for cross-modal analytics. This codebase unifies comprehensive high-quality modules in state-of-the-art vision-language techniques, which are organized in a standardized and user-friendly fashion.

Installation

See installation instructions.

Requiremenets

Linux or macOS with Python ≥ 3.6
PyTorch ≥ 1.8 and torchvision that matches the PyTorch installation. Install them together at pytorch.org to make sure of this
fvcore
pytorch_transformers
jsonlines
pycocotools

Getting Started

See Getting Started with X-modaler

Training & Evaluation in Command Line

We provide a script in "train_net.py", that is made to train all the configs provided in X-modaler. You may want to use it as a reference to write your own training script.

To train a model(e.g., UpDown) with "train_net.py", first setup the corresponding datasets following datasets, then run:

# Teacher Force
python train_net.py --num-gpus 4 \
 	--config-file configs/image_caption/updown.yaml

# Reinforcement Learning
python train_net.py --num-gpus 4 \
 	--config-file configs/image_caption/updown_rl.yaml

Model Zoo and Baselines

A large set of baseline results and trained models are available here.

API Documentation

xmodaler.checkpoint
xmodaler.config
xmodaler.datasets
xmodaler.engine
xmodaler.evaluation
xmodaler.functional
xmodaler.losses
xmodaler.lr_scheduler
xmodaler.modeling
xmodaler.optim
xmodaler.scorer
xmodaler.tokenization
xmodaler.utils

License

X-modaler is released under the Apache License, Version 2.0.

Citing X-modaler

If you use X-modaler in your research, please use the following BibTeX entry.

@inproceedings{Xmodaler2021,
  author =       {Yehao Li, Yingwei Pan, Jingwen Chen, Ting Yao, and Tao Mei},
  title =        {X-modaler: A Versatile and High-performance Codebase for Cross-modal Analytics},
  booktitle =    {Proceedings of the 29th ACM international conference on Multimedia},
  year =         {2021}
}

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
configs		configs
data/temp		data/temp
docs		docs
images		images
tools		tools
xmodaler		xmodaler
.gitignore		.gitignore
.readthedocs.yml		.readthedocs.yml
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
train_net.py		train_net.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

X-modaler

Installation

Requiremenets

Getting Started

Training & Evaluation in Command Line

Model Zoo and Baselines

API Documentation

License

Citing X-modaler

About

Releases

Packages

Languages

License

hotlion1987/xmodaler

Folders and files

Latest commit

History

Repository files navigation

X-modaler

Installation

Requiremenets

Getting Started

Training & Evaluation in Command Line

Model Zoo and Baselines

API Documentation

License

Citing X-modaler

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages