Unofficial ROLLER Implementation Using TVM 0.21+

This repository contains an unofficial modern implementation of the paper named ROLLER: Fast and Efficient Tensor Compilation for Deep Learning, rebuilt using the latest version of TVM (Apache TVM 0.21+).

A. About ROLLER

ROLLER is a fast and efficient tensor compilation system for deep learning workloads. Unlike search-based approaches that can take hours to find optimal kernels, ROLLER uses a construction-based approach that generates highly efficient kernels in seconds.

1. Original Paper:

ROLLER: Fast and Efficient Tensor Compilation for Deep Learning (OSDI'22)

2. Key Features:

Novel rTile abstraction that encapsulates tensor shapes aligned with accelerator characteristics.
Recursive construction algorithm for generating efficient rTile-based programs.
Micro-performance model for rapid evaluation without device execution.
Support for various accelerators including GPUs and emerging AI chips.

B. Installation

This implementation requires TVM 0.21 or newer. Please install the latest version of TVM following the official installation guide.

C. Usage

python test_op.py

D. Citation

If you use ROLLER in your research, please cite the original paper:

@inproceedings {280896,
  author = {Hongyu Zhu and Ruofan Wu and Yijia Diao and Shanbin Ke and Haoyu Li and Chen Zhang and Jilong Xue and Lingxiao Ma and Yuqing Xia and Wei Cui and Fan Yang and Mao Yang and Lidong Zhou and Asaf Cidon and Gennady Pekhimenko},
  title = {{ROLLER}: Fast and Efficient Tensor Compilation for Deep Learning},
  booktitle = {16th USENIX Symposium on Operating Systems Design and Implementation (OSDI 22)},
  year = {2022},
  isbn = {978-1-939133-28-1},
  address = {Carlsbad, CA},
  pages = {233--248},
  url = {https://www.usenix.org/conference/osdi22/presentation/zhu},
  publisher = {USENIX Association},
  month = jul
}

If you wish, you may also cite this repository:

@misc{unofficial_roller_impl,
  author = {Jianchao Yang},
  title = {Unofficial ROLLER Implementation Using TVM 0.21+},
  year = {2025},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/ConvolutedDog/Roller}}
}

E. Roadmap & Contributing

The codebase has not been fully organized and refactored.
Scripts in the original Roller repository are not included and have not been thoroughly tested in this new environment.

This repo is currently in its early stages. We are actively working towards a stable and feature-complete release. We welcome and appreciate any contributions!

F. Acknowledgments

This implementation is based on the original ROLLER research from Microsoft Research and collaborating institutions. The original implementation can be found at: microsoft/nnfusion.

Name		Name	Last commit message	Last commit date
Latest commit History 17 Commits
.github/workflows		.github/workflows
arch		arch
codegen/op_impl		codegen/op_impl
compile-time		compile-time
compute_db		compute_db
config		config
cost_model		cost_model
microbenchmark		microbenchmark
op		op
policy		policy
script_v1		script_v1
script_v2_rocm		script_v2_rocm
test_config		test_config
tests		tests
utils		utils
.gitignore		.gitignore
Constrution.py		Constrution.py
README.md		README.md
construct_algo.py		construct_algo.py
cu_helper.h		cu_helper.h
dbg_logger.py		dbg_logger.py
parse_compute_util_log.py		parse_compute_util_log.py
test_op.py		test_op.py
test_op_mp.py		test_op_mp.py
test_op_rocm.py		test_op_rocm.py
test_op_rocm_mp.py		test_op_rocm_mp.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Unofficial ROLLER Implementation Using TVM 0.21+

A. About ROLLER

1. Original Paper:

2. Key Features:

B. Installation

C. Usage

D. Citation

E. Roadmap & Contributing

F. Acknowledgments

About

Uh oh!

Releases

Packages

Languages

ConvolutedDog/Roller

Folders and files

Latest commit

History

Repository files navigation

Unofficial ROLLER Implementation Using ​TVM 0.21+

A. About ROLLER

​​1. Original Paper:​

2. Key Features:​​

B. Installation

C. Usage

D. Citation

E. Roadmap & Contributing

F. Acknowledgments

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Unofficial ROLLER Implementation Using TVM 0.21+

1. Original Paper:

2. Key Features:

Packages