Smolmodels

Collection of experiments related to tuning small language models for specific tasks.

Installation

pip install uv
uv venv
source .venv/bin/activate
uv sync --group torch
uv sync --no-build-isolation --group training

CUDA:

export CUDA_HOME=/usr/local/cuda
sudo apt purge cmake
uv pip install setuptools_scm cmake
uv pip install vllm -vv --no-build-isolation

Mac:

pip install vllm==0.7.0 --use-deprecated=legacy-resolver

llama-cpp can also be installed with:

uv pip install "llama-cpp-python[server]" --extra-index-url https://abetlen.github.io/llama-cpp-python/whl/metal

pyright --createstub transformers

modal run modal_trl.py

Name		Name	Last commit message	Last commit date
Latest commit History 602 Commits
.vscode		.vscode
chat_templates		chat_templates
data		data
dataset		dataset
evaluation		evaluation
model		model
notebooks		notebooks
scripts		scripts
seed_data_files		seed_data_files
synthetic_data		synthetic_data
tests		tests
trl_wrapper		trl_wrapper
.gitattributes		.gitattributes
.gitignore		.gitignore
.python-version		.python-version
README.md		README.md
evaluate.py		evaluate.py
generate.py		generate.py
get_logprobs.py		get_logprobs.py
gguf_inference.py		gguf_inference.py
gguf_inference.sh		gguf_inference.sh
gradio_ui.py		gradio_ui.py
modal_entrypoint.py		modal_entrypoint.py
pyproject.toml		pyproject.toml
submit.sh		submit.sh
train_lightning.py		train_lightning.py
train_sequence_rank.py		train_sequence_rank.py
train_trl.py		train_trl.py
train_vit.py		train_vit.py
util_scripts.py		util_scripts.py
uv.lock		uv.lock