I am new to PyTorch. I created this repo to learn PyTorch and E2E ASR modeling.
For practical E2E ASR work, please use a more mature toolkit such as ESPnet.
- Very simple PyTorch implementation, without optimization
- LSTM+CTC, same configuration as Eesen
- Kaldi feature extraction (40 fbank + delta + CMVN)
- Eesen TLG FST decoding
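
As a rough illustration of the LSTM+CTC setup listed above, here is a minimal sketch in PyTorch. It is not the code from this repo: the class name `LstmCtcModel`, the layer sizes, the 120-dimensional input and the 43 output labels are all assumptions chosen for the example, and the input is random stand-in data rather than real Kaldi features.

```python
import torch
import torch.nn as nn


class LstmCtcModel(nn.Module):
    """Bidirectional LSTM plus a linear layer over CTC labels (hypothetical sizes)."""

    def __init__(self, feat_dim=120, hidden=320, layers=4, num_labels=43):
        super().__init__()
        self.lstm = nn.LSTM(feat_dim, hidden, num_layers=layers, bidirectional=True)
        # Label index 0 is reserved for the CTC blank in this sketch.
        self.proj = nn.Linear(2 * hidden, num_labels)

    def forward(self, feats):
        # feats: (T, N, feat_dim) -> log-probabilities: (T, N, num_labels)
        out, _ = self.lstm(feats)
        return self.proj(out).log_softmax(dim=-1)


torch.manual_seed(0)
model = LstmCtcModel()
feats = torch.randn(50, 2, 120)          # 50 frames, batch of 2 (random stand-in data)
targets = torch.randint(1, 43, (2, 10))  # 10 labels per utterance, never the blank
input_lengths = torch.full((2,), 50, dtype=torch.long)
target_lengths = torch.full((2,), 10, dtype=torch.long)

log_probs = model(feats)
loss = nn.CTCLoss(blank=0)(log_probs, targets, input_lengths, target_lengths)
loss.backward()  # gradients are now available for an optimizer step
```

`nn.CTCLoss` expects log-probabilities shaped (time, batch, labels), which is why the LSTM is left in its default time-major layout.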
- Download and install Eesen.
- In the path.sh file, set EESEN_ROOT to your Eesen installation path:
  export EESEN_ROOT='your/eesen/root/dir'
- Run run_ctc_phn.sh.
- CTC beam decoder with a phone/char/word LM
- Use RNN-T to incorporate an LM
- E2E structure
- Try other network structures
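
For the CTC beam decoder item above, one possible starting point is a plain CTC prefix beam search without any LM. The sketch below is pure Python and only illustrative: the function names, the default beam size and the toy input are assumptions, and LM scoring (phone, char or word) would still have to be added on top.

```python
import math
from collections import defaultdict

NEG_INF = float("-inf")


def log_add(a, b):
    """Return log(exp(a) + exp(b)) without overflow."""
    if a == NEG_INF:
        return b
    if b == NEG_INF:
        return a
    m = max(a, b)
    return m + math.log(math.exp(a - m) + math.exp(b - m))


def ctc_prefix_beam_search(log_probs, beam_size=4, blank=0):
    """Decode a list of per-frame log-probability lists into a label sequence."""
    # Each prefix maps to (log p of paths ending in blank, log p ending in non-blank).
    beams = {(): (0.0, NEG_INF)}
    for frame in log_probs:
        nxt = defaultdict(lambda: (NEG_INF, NEG_INF))
        for prefix, (p_b, p_nb) in beams.items():
            for label, lp in enumerate(frame):
                if label == blank:
                    # A blank keeps the prefix unchanged.
                    b, nb = nxt[prefix]
                    nxt[prefix] = (log_add(b, log_add(p_b, p_nb) + lp), nb)
                    continue
                new_prefix = prefix + (label,)
                b, nb = nxt[new_prefix]
                if prefix and prefix[-1] == label:
                    # A repeated label extends the prefix only after a blank ...
                    nxt[new_prefix] = (b, log_add(nb, p_b + lp))
                    # ... otherwise it collapses into the same prefix.
                    b2, nb2 = nxt[prefix]
                    nxt[prefix] = (b2, log_add(nb2, p_nb + lp))
                else:
                    nxt[new_prefix] = (b, log_add(nb, log_add(p_b, p_nb) + lp))
        ranked = sorted(nxt.items(), key=lambda kv: log_add(*kv[1]), reverse=True)
        beams = dict(ranked[:beam_size])
    best = max(beams.items(), key=lambda kv: log_add(*kv[1]))
    return list(best[0])


# Toy input: 5 frames over 3 labels (0 = blank), dominant labels 1, 1, blank, 1, 2.
dominant = [1, 1, 0, 1, 2]
frames = [[math.log(0.8 if k == d else 0.1) for k in range(3)] for d in dominant]
decoded = ctc_prefix_beam_search(frames)
print(decoded)
```

Tracking the blank and non-blank scores separately is what lets the search tell a genuine repeated label (separated by a blank) from one label held over several frames.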