Latest commit

454f397 · May 25, 2023

MMRazor for Large Models

Introduction

MMRazor is a toolkit for general-purpose model compression. Beyond conventional computer vision (CV) models, MMRazor now also supports large models. This project will provide examples of applying MMRazor's compression algorithms to various large models, including LLaMA, Stable Diffusion, and more.

Overview of the code structure for large models:

mmrazor
└── implementations           # core algorithm components
    ├── pruning
    └── quantization
projects
└── mmrazor_large
    ├── algorithms            # introductions to algorithm usage
    └── examples              # examples of applying the algorithms to various models
        ├── language_models
        │   ├── LLaMA
        │   └── OPT
        └── ResNet

Model-Algorithm Example Matrix

|           | ResNet | OPT | LLaMA | Stable Diffusion |
| --------- | ------ | --- | ----- | ---------------- |
| SparseGPT |        |     |       |                  |
| GPTQ      |        |     |       |                  |

Paper List

We maintain a paper list for researchers working on model compression for large models. To add your paper to this list, please submit a PR.

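| Paper     | Title                                                                              | Type         | MMRazor |
| --------- | ---------------------------------------------------------------------------------- | ------------ | ------- |
| SparseGPT | SparseGPT: Massive Language Models Can Be Accurately Pruned in One-Shot            | Pruning      |         |
| GPTQ      | GPTQ: Accurate Post-Training Quantization for Generative Pre-trained Transformers | Quantization |         |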
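
For readers new to the two compression types named in the table, the sketch below may help show what each one does to a weight vector. It is deliberately simplified and is an assumption for illustration only: `prune_by_magnitude` and `quantize_rtn` are hypothetical helpers, not part of MMRazor, and they use plain magnitude pruning and round-to-nearest quantization as stand-ins rather than the SparseGPT or GPTQ algorithms themselves.

```python
def prune_by_magnitude(weights, sparsity):
    """One-shot pruning stand-in: zero the smallest-magnitude weights.

    Hypothetical helper for illustration; this is NOT the SparseGPT algorithm.
    """
    n_prune = int(len(weights) * sparsity)
    # Indices ordered from smallest to largest absolute value.
    order = sorted(range(len(weights)), key=lambda i: abs(weights[i]))
    pruned = list(weights)
    for i in order[:n_prune]:
        pruned[i] = 0.0
    return pruned


def quantize_rtn(weights, bits):
    """Post-training quantization stand-in: symmetric round-to-nearest.

    Hypothetical helper for illustration; this is NOT the GPTQ algorithm.
    Returns the dequantized values so they can be compared with the input.
    """
    qmax = 2 ** (bits - 1) - 1            # e.g. 7 for 4-bit signed integers
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    quantized = []
    for w in weights:
        q = round(w / scale)              # nearest integer level
        q = max(-qmax, min(qmax, q))      # clamp to the representable range
        quantized.append(q * scale)       # map back to floating point
    return quantized


weights = [0.8, -0.05, 0.3, -1.2, 0.01, 0.6]
pruned = prune_by_magnitude(weights, sparsity=0.5)
quantized = quantize_rtn(weights, bits=4)
print(pruned)
print(quantized)
```

Pruning removes weights (here, half of them become exactly zero), while quantization keeps every weight but restricts it to a small set of levels. The methods in the paper list pursue the same goals with more careful, accuracy-preserving procedures.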