DistillKitPlus

DistillKitPlus is an open-source toolkit for knowledge distillation (KD) of LLMs. The repo was inspired by arcee-ai/DistillKit. The main motivation behind the toolkit is to support offline distillation and parameter-efficient fine-tuning (PEFT) in low-compute settings.

For background on knowledge distillation, see "Knowledge Distillation: A Survey": https://arxiv.org/abs/2006.05525

Features

  • Logit Distillation: Supports same-architecture teacher and student models (see the loss sketch after this list).
  • Pre-Computed Logits: Enables memory-efficient training by generating logits in advance.
  • LoRA Fine-Tuning Integration: Efficient low-rank adaptation fine-tuning support.
  • Quantization Support: 4-bit model quantization for faster inference and reduced memory usage.
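For intuition, logit distillation of this kind typically combines a temperature-scaled KL-divergence term against the teacher's logits with the standard cross-entropy loss, blended by alpha (both parameters appear under the distillation section of the config). The snippet below is a minimal PyTorch sketch of that idea, not DistillKitPlus's actual loss; the function name and default values are illustrative.

import torch.nn.functional as F

def distillation_loss(student_logits, teacher_logits, labels, temperature=2.0, alpha=0.5):
    # Soften both distributions with the temperature and compare them with KL divergence.
    kd = F.kl_div(
        F.log_softmax(student_logits / temperature, dim=-1),
        F.softmax(teacher_logits / temperature, dim=-1),
        reduction="batchmean",
    ) * (temperature ** 2)
    # Standard next-token cross-entropy on the ground-truth labels.
    ce = F.cross_entropy(student_logits.view(-1, student_logits.size(-1)), labels.view(-1))
    # Blend the two objectives; alpha weights the distillation term.
    return alpha * kd + (1.0 - alpha) * ce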

Installation

git clone https://github.com/agokrani/distillkitplus.git
cd distillkitplus
pip install -r requirements.txt
pip install .

Quick Start

  • Configure your distillation settings in config/default_config.json
  • Generate teacher logits (a sketch of what this step does follows the list):
    python scripts/local/generate_logits.py --config config/default_config.json
  • Run distillation:
    python scripts/local/distill_logits.py --config config/default_config.json
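Conceptually, the logit-generation step runs the teacher once over the dataset and saves its output distributions to disk, so the teacher never has to be held in memory during student training. The sketch below illustrates that idea with a Hugging Face causal LM; it is not the actual generate_logits.py, and the model name, output format, and file path are illustrative.

import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

@torch.no_grad()
def precompute_teacher_logits(model_name, texts, out_path, max_length=512):
    # Load the teacher a single time; only the saved logits are needed later.
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(model_name, torch_dtype=torch.bfloat16)
    model.eval()
    saved = []
    for text in texts:
        inputs = tokenizer(text, return_tensors="pt", truncation=True, max_length=max_length)
        logits = model(**inputs).logits  # (1, seq_len, vocab_size)
        saved.append(logits.squeeze(0).cpu())
    # Persist the pre-computed logits for the offline distillation step.
    torch.save(saved, out_path)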

Optional: Modal Integration

DistillKitPlus also supports running its scripts on Modal. Use the following commands to perform knowledge distillation with Modal (a generic sketch of the pattern follows the list):

  • Generate teacher logits:
    python scripts/modal/generate_logits.py --config config/default_config.json
  • Run distillation:
    python scripts/modal/distill_logits.py --config config/default_config.json
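The actual Modal entry points live in scripts/modal/ and read the same JSON config. Purely as an illustration of the pattern (a Modal function that runs the heavy GPU work remotely), a minimal sketch might look like the following; the image contents, GPU type, and function names are assumptions, not the repository's real setup.

import modal

app = modal.App("distillkitplus-example")
# Container image with the dependencies the remote function needs.
image = modal.Image.debian_slim().pip_install("torch", "transformers", "peft")

@app.function(image=image, gpu="A100", timeout=60 * 60)
def generate_logits(config_path: str):
    # The teacher forward passes would run here, inside the remote GPU container.
    print(f"would generate logits using {config_path}")

@app.local_entrypoint()
def main(config: str = "config/default_config.json"):
    generate_logits.remote(config)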

Configuration

The toolkit uses a JSON configuration file with the following main sections:

  • project_name: Name of your distillation project
  • dataset: Dataset configuration including source and processing settings
  • models: Teacher and student model specifications
  • tokenizer: Tokenizer settings including max length and padding
  • training: Training hyperparameters
  • distillation: Distillation-specific parameters (temperature, alpha)
  • lora: LoRA configuration for efficient fine-tuning
  • quantization: Model quantization settings

See config/default_config.json for a complete example.
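To give a rough sense of the shape, the top-level sections map onto a structure like the one below; the section names come from the list above, but the individual fields and values are placeholders, not the real schema (see config/default_config.json for that).

# Illustrative structure only; the fields inside each section are placeholders.
example_config = {
    "project_name": "my-distillation-run",
    "dataset": {"name": "dataset-or-path", "split": "train"},
    "models": {"teacher": "teacher-model-id", "student": "student-model-id"},
    "tokenizer": {"max_length": 2048, "padding": "max_length"},
    "training": {"num_train_epochs": 1, "per_device_train_batch_size": 1},
    "distillation": {"temperature": 2.0, "alpha": 0.5},
    "lora": {"r": 16, "lora_alpha": 32},
    "quantization": {"load_in_4bit": True},
}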

Contributing

We welcome contributions from the community! If you have ideas for improvements, new features, or bug fixes, please feel free to open an issue or submit a pull request.

Contact

For any technical questions or issues, please open an issue in this repository. We appreciate your feedback and support!
