GitHub - VectorSpaceLab/Video-XL: 🔥🔥First-ever hour scale video understanding models

Video-XL Family: Efficient VLMs for Extremely Long Video Understanding

News

[2025/06/03] 🔥 Video-XL2 is released, which achieves state-of-the-art results on several long video understanding benchmarks.
[2025/04/19] 🎉 Most of the Video-XL-Pro training data is released.
[2025/04/07] 🎉 Video-XL has been selected for an Oral presentation at CVPR 2025.
[2025/03/16] 🎉 Video-XL-Pro is released. It can process 10,000 frames on a single 80 GB GPU and achieves promising results with only 3B parameters.
[2025/02/27] 🎉 Video-XL has been accepted by CVPR 2025!
[2024/12/22] 🔥 Most of the training data is released.
[2024/10/17] 🔥 The Video-XL-7B weights are released; the model can process up to 1024 frames.
[2024/10/15] 🔥 Video-XL is released, including model, training and evaluation code.

Citation

If you find this repository useful, please consider giving it a star ⭐ and citing our work:

@article{shu2024video,
  title={Video-XL: Extra-Long Vision Language Model for Hour-Scale Video Understanding},
  author={Shu, Yan and Zhang, Peitian and Liu, Zheng and Qin, Minghao and Zhou, Junjie and Huang, Tiejun and Zhao, Bo},
  journal={arXiv preprint arXiv:2409.14485},
  year={2024}
}

@article{liu2025video,
  title={Video-XL-Pro: Reconstructive Token Compression for Extremely Long Video Understanding},
  author={Liu, Xiangrui and Shu, Yan and Liu, Zheng and Li, Ao and Tian, Yang and Zhao, Bo},
  journal={arXiv preprint arXiv:2503.18478},
  year={2025}
}

Acknowledgement

LongVA: the codebase we built upon.
LMMs-Eval: the codebase we used for evaluation.
Activation Beacon: the compression method we refer to.

License

This project uses certain datasets and checkpoints that are subject to their respective original licenses. Users must comply with all terms and conditions of those licenses. The content of this project itself is licensed under the Apache License 2.0.

