Skip to content

Latest commit

 

History

History
12 lines (10 loc) · 393 Bytes

README.md

File metadata and controls

12 lines (10 loc) · 393 Bytes

Phonira

An audio model based on Soundstorm.

Installation

pip install -r requirements.txt

Usage

accelerate launch phonira/trainer.py  --dataset /media/works/data/data/ --split train --column_code codes.npy --column_prompt prompt.txt --dataset_size 200000 --batch_size 1 --gradient_accumulation_steps 32 --depth 16 --hidden_size 1024 --num_heads 16 --dropout 0