Meta Spirit LM: Interleaved Spoken and Written Language Model

This repository contains the model weights, inference code and evaluation scripts for the Spirit LM paper. You can find more generation samples on our demo page.

Spirit LM Model Overview

Installation Setup

Conda

conda env create -f env.yml
pip install -e '.[eval]'

Pip

pip install -e requirements.txt
pip install -e '.[eval]'

Dev

(Optionally, use only if you want to run the tests.)

pip install -e '.[dev]'

Checkpoints Setup

See checkpoints/README.md

Quick Start

Model Card

More details of the model can be found in MODEL_CARD.md.

License

The present code is provided under the FAIR Noncommercial Research License found in LICENSE.

Citation

@misc{nguyen2024spiritlminterleavedspokenwritten,
      title={SpiRit-LM: Interleaved Spoken and Written Language Model},
      author={Tu Anh Nguyen and Benjamin Muller and Bokai Yu and Marta R. Costa-jussa and Maha Elbayad and Sravya Popuri and Paul-Ambroise Duquenne and Robin Algayres and Ruslan Mavlyutov and Itai Gat and Gabriel Synnaeve and Juan Pino and Benoit Sagot and Emmanuel Dupoux},
      year={2024},
      eprint={2402.05755},
      archivePrefix={arXiv},
      primaryClass={cs.CL},
      url={https://arxiv.org/abs/2402.05755},
}

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
assets		assets
checkpoints		checkpoints
data/examples		data/examples
examples		examples
spiritlm		spiritlm
tests		tests
.gitignore		.gitignore
CODE_OF_CONDUCT.md		CODE_OF_CONDUCT.md
CONTRIBUTING.md		CONTRIBUTING.md
LICENSE		LICENSE
MODEL_CARD.md		MODEL_CARD.md
README.md		README.md
env.yml		env.yml
requirements.dev.txt		requirements.dev.txt
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Meta Spirit LM: Interleaved Spoken and Written Language Model

Spirit LM Model Overview

Installation Setup

Conda

Pip

Dev

Checkpoints Setup

Quick Start

Speech Tokenization

Spirit LM Generation

Speech-Text Sentiment Preservation benchmark (STSP)

Model Card

License

Citation

About

Releases

Packages

Languages

License

soulinuniverse/spiritlm

Folders and files

Latest commit

History

Repository files navigation

Meta Spirit LM: Interleaved Spoken and Written Language Model

Spirit LM Model Overview

Installation Setup

Conda

Pip

Dev

Checkpoints Setup

Quick Start

Speech Tokenization

Spirit LM Generation

Speech-Text Sentiment Preservation benchmark (STSP)

Model Card

License

Citation

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages