This is a PyTorch implementation of the Listen, Attend and Spell (LAS) paper:
```bibtex
@article{DBLP:journals/corr/ChanJLV15,
  author     = {William Chan and
                Navdeep Jaitly and
                Quoc V. Le and
                Oriol Vinyals},
  title      = {Listen, Attend and Spell},
  journal    = {CoRR},
  volume     = {abs/1508.01211},
  year       = {2015},
  url        = {http://arxiv.org/abs/1508.01211},
  eprinttype = {arXiv},
  eprint     = {1508.01211},
  timestamp  = {Mon, 13 Aug 2018 16:46:45 +0200},
  biburl     = {https://dblp.org/rec/journals/corr/ChanJLV15.bib},
  bibsource  = {dblp computer science bibliography, https://dblp.org}
}
```
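For orientation: the paper's encoder (the "listener") stacks pyramidal bidirectional LSTM layers, each of which concatenates adjacent frames to halve the time resolution before a BLSTM. Below is a minimal illustrative sketch of one such layer; it follows the paper and is not necessarily this repo's actual module:

```python
import torch
import torch.nn as nn

class PyramidalBLSTMLayer(nn.Module):
    """One pBLSTM layer from the LAS paper: concatenates pairs of
    consecutive frames, then runs a bidirectional LSTM, so each
    layer halves the sequence length."""

    def __init__(self, input_dim: int, hidden_dim: int):
        super().__init__()
        self.blstm = nn.LSTM(input_dim * 2, hidden_dim,
                             bidirectional=True, batch_first=True)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        batch, time, feat = x.shape
        if time % 2:                   # drop a trailing odd frame
            x = x[:, :-1, :]
        # (batch, time, feat) -> (batch, time // 2, feat * 2)
        x = x.reshape(batch, x.shape[1] // 2, feat * 2)
        out, _ = self.blstm(x)
        return out

feats = torch.randn(4, 200, 80)        # e.g. 80-dim filterbank features
layer = PyramidalBLSTMLayer(80, 256)
print(layer(feats).shape)              # torch.Size([4, 100, 512])
```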
To train the model on your own data, follow the steps below:
- prepare your data as a CSV manifest with the format below (a sketch for generating such a manifest follows the example)

  ```csv
  audio_path,text,duration
  path/to/file.wav,the text in that file,3.2
  ```
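  A minimal sketch for generating the manifest; `build_manifest`, its arguments, and the use of `torchaudio` to read durations are illustrative assumptions, not part of this repo:

  ```python
  import csv
  from pathlib import Path

  import torchaudio  # assumption: any library that reads wav metadata works

  def build_manifest(audio_dir: str, transcripts: dict, out_csv: str) -> None:
      """Write the audio_path,text,duration CSV expected by train.py.
      `transcripts` maps each wav path to its text (supplied by you)."""
      with open(out_csv, "w", newline="") as f:
          writer = csv.writer(f)
          writer.writerow(["audio_path", "text", "duration"])
          for wav in sorted(Path(audio_dir).glob("**/*.wav")):
              info = torchaudio.info(str(wav))
              duration = info.num_frames / info.sample_rate
              writer.writerow([str(wav), transcripts[str(wav)], round(duration, 2)])
  ```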
- make sure the audio files are mono; if they are not, convert them before training (a sketch follows)
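  One simple conversion is to average the channels. A minimal sketch using `torchaudio` (an assumption; sox or ffmpeg would work just as well):

  ```python
  import torchaudio

  def to_mono(in_path: str, out_path: str) -> None:
      """Collapse a multi-channel wav to mono by averaging its channels."""
      waveform, sample_rate = torchaudio.load(in_path)  # (channels, samples)
      if waveform.shape[0] > 1:
          waveform = waveform.mean(dim=0, keepdim=True)
      torchaudio.save(out_path, waveform, sample_rate)
  ```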
- create a virtual environment

  ```bash
  python -m venv env
  ```

- activate the environment

  ```bash
  source env/bin/activate
  ```

- install the required dependencies

  ```bash
  pip install -r requirements.txt
  ```
- update the config file if needed
- train the model
  - from scratch

    ```bash
    python train.py
    ```

  - from a checkpoint

    ```bash
    python train.py checkpoint=path/to/checkpoint tokenizer.tokenizer_file=path/to/tokenizer.json
    ```
Planned work:
- completing the inference module
- adding a demo