Batch InfeRence Runtime
A simplified, local-only release of the toolchain used by Ai2 to perform large-scale inference with LLMs and VLMs.
The runtime orchestrates inference jobs based on a user-provided YAML config file.
It leverages:
- Ray for concurrency management
- vLLM as the inference backend
It consumes JSONL work files and outputs result files to a chosen destination.
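To give a sense of the data flow, here is a minimal sketch of the work-file to result-file loop this implies, written against vLLM's offline `LLM.chat` API. The model name, file paths, and the `completion` field are illustrative assumptions only; the actual runner adds Ray-based concurrency, batching, and configuration handling that this sketch omits.

```python
# Illustrative sketch only, not the project's implementation: read one JSONL
# work file, run each row's chat through vLLM's offline API, and write a
# JSONL result file. Model name, paths, and the "completion" field are made up.
import json

from vllm import LLM, SamplingParams

llm = LLM(model="Qwen/Qwen2.5-0.5B-Instruct")  # example model, not a project default
params = SamplingParams(temperature=0.0, max_tokens=256)

with open("work/part-000.jsonl") as src, open("results/part-000.jsonl", "w") as dst:
    for line in src:
        row = json.loads(line)
        # LLM.chat accepts a single conversation (a list of role/content dicts)
        # and returns a list of RequestOutput objects.
        outputs = llm.chat(row["chat_messages"], params)
        row["completion"] = outputs[0].outputs[0].text
        dst.write(json.dumps(row) + "\n")
```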
Set up a local environment:

```bash
cd <project_root>
python3 -m venv venv
source venv/bin/activate
pip install .[batch_inference,vllm]
```
Records you want to run inference over must be partitioned into one or more JSONL files in a flat directory. Each row should have the following structure:

```json
{"chat_messages": [{"role": "user", "content": "asdf"}]}
```
Additional fields may be provided in each row (e.g. IDs, metadata) and will be preserved in the output.
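As an illustration, a small script along these lines produces a valid work file; the `work/` directory, the `id` field, and the prompts are all placeholders.

```python
# Write a minimal JSONL work file; the "work/" directory and "id" field are examples.
import json
import os

records = [
    {"id": 0, "chat_messages": [{"role": "user", "content": "What is 2 + 2?"}]},
    {"id": 1, "chat_messages": [{"role": "user", "content": "Summarize the plot of Hamlet."}]},
]

os.makedirs("work", exist_ok=True)
with open("work/part-000.jsonl", "w") as f:
    for record in records:
        f.write(json.dumps(record) + "\n")
```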
Author a configuration file for your job; an example file can be found here:
https://github.com/allenai/birr/blob/main/configs/inference/example.yaml
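Before launching a job, a quick way to confirm your file is at least well-formed YAML is a check like the one below. The config path is a placeholder, the actual schema is defined by the example file linked above, and this assumes PyYAML is available in the environment.

```python
# Sanity-check that the config parses as YAML; this does not validate the schema.
import yaml  # assumes PyYAML is installed in the venv

with open("configs/inference/my_job.yaml") as f:  # placeholder path
    config = yaml.safe_load(f)

print(sorted(config.keys()))
```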
Then launch the runner:

```bash
# in activated venv
python src/birr/batch_inference/runner.py --config-file <path_to_config_file>
```
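Once the run completes, the result files can be inspected like any other data files. The snippet below assumes results are also written as JSONL (which the field-preservation behavior above suggests) and that they live under a `results/` directory; both are assumptions, and the exact fields present depend on your config.

```python
# Peek at the first few result rows; the path and JSONL format are assumptions.
import json

with open("results/part-000.jsonl") as f:
    for _, line in zip(range(3), f):
        row = json.loads(line)
        print(sorted(row.keys()))
```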