A Zipformer recipe with Byte-level BPE for Aishell-1 (#1464)
* init commit

* Update train.py

* Update decode.py

* Update RESULTS.md

* added `vocab_size`

* removed unused softlinks

* added scripts for testing pretrained models

* set `bpe_model` as required

* re-org the bbpe recipe for aishell
JinZr authored Jan 16, 2024
1 parent 398401e commit 7bdde91
Showing 5 changed files with 2,518 additions and 2 deletions.
56 changes: 54 additions & 2 deletions egs/aishell/ASR/RESULTS.md
@@ -2,9 +2,61 @@

### Aishell training result (Stateless Transducer)

#### Zipformer (Byte-level BPE)

[./zipformer](./zipformer/)

It's a reworked Zipformer with Pruned RNNT loss, trained with Byte-level BPE and `vocab_size` set to 500.
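
As a rough illustration of what "byte-level" means here, the sketch below prints the UTF-8 bytes of a short Mandarin string. Byte-level BPE learns its merges over byte sequences like these, which is how a vocabulary as small as 500 can still cover every character. The helper name `utf8_bytes` is hypothetical and not part of the recipe, and the recipe's own tokenization code may represent bytes differently.

```bash
# Hypothetical helper (not from the recipe): print the UTF-8 bytes of a
# string as space-separated hex values.
utf8_bytes() {
  printf '%s' "$1" | od -An -tx1 | xargs
}

# Each of the two characters below occupies three bytes in UTF-8.
utf8_bytes "你好"   # prints: e4 bd a0 e5 a5 bd
```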

##### normal-scaled model, number of model parameters: 65549011, i.e., 65.55 M

| | test | dev | comment |
|------------------------|------|------|-----------------------------------------|
| greedy search | 4.54 | 4.31 | --epoch 40 --avg 10 |
| modified beam search | 4.37 | 4.11 | --epoch 40 --avg 10 |
| fast beam search | 4.43 | 4.17 | --epoch 40 --avg 10 |
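
To put the gaps between decoding methods in perspective, the relative reduction of modified beam search over greedy search can be computed from the table above. This is only a minimal sketch; the helper name `rel_reduction` is hypothetical.

```bash
# Hypothetical helper: relative reduction (in percent) from a baseline
# error rate ($1) to a new error rate ($2).
rel_reduction() {
  awk -v base="$1" -v new="$2" 'BEGIN { printf "%.2f\n", (base - new) / base * 100 }'
}

rel_reduction 4.54 4.37   # test: greedy search -> modified beam search, prints 3.74
rel_reduction 4.31 4.11   # dev:  greedy search -> modified beam search, prints 4.64
```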

```bash
./prepare.sh

export CUDA_VISIBLE_DEVICES="0,1"

./zipformer/train_bbpe.py \
--world-size 2 \
--num-epochs 40 \
--start-epoch 1 \
--use-fp16 1 \
--context-size 2 \
--enable-musan 0 \
--exp-dir zipformer/exp_bbpe \
--max-duration 1000 \
--base-lr 0.045 \
--lr-batches 7500 \
--lr-epochs 10 \
--spec-aug-time-warp-factor 20
```
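
The command above sets `--world-size 2` to match the two GPUs listed in `CUDA_VISIBLE_DEVICES`. A small sketch of deriving that value from the device list, so the two stay in sync, could look like the following. The helper name `count_devices` is hypothetical and not part of the recipe.

```bash
# Hypothetical helper: count the GPU ids in a comma-separated device list.
count_devices() {
  printf '%s\n' "$1" | tr ',' '\n' | grep -c .
}

export CUDA_VISIBLE_DEVICES="0,1"
world_size=$(count_devices "$CUDA_VISIBLE_DEVICES")
echo "--world-size ${world_size}"   # prints: --world-size 2
```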

The command for decoding is:
```bash
for m in greedy_search modified_beam_search fast_beam_search ; do
./zipformer/decode_bbpe.py \
--epoch 40 \
--avg 10 \
--exp-dir ./zipformer/exp_bbpe \
--bpe-model data/lang_bbpe_500/bbpe.model \
--context-size 2 \
--decoding-method $m
done
```
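
Before decoding with `--epoch 40 --avg 10`, it can help to confirm that the needed checkpoints are present. The sketch below assumes that checkpoints are saved as `epoch-N.pt` in the training `--exp-dir` and that the averaging window is the last 10 epochs ending at epoch 40. Both are assumptions, and `avg_window` is a hypothetical helper, not part of the recipe.

```bash
# Hypothetical helper: list checkpoint names for the last AVG epochs ending
# at EPOCH. The epoch-N.pt naming and the window are assumptions.
avg_window() {
  epoch=$1; avg=$2
  seq $((epoch - avg + 1)) "$epoch" | sed 's/.*/epoch-&.pt/'
}

exp_dir=zipformer/exp_bbpe   # the --exp-dir used in the training command
for f in $(avg_window 40 10); do
  # Report every checkpoint in the window that is not on disk.
  [ -f "$exp_dir/$f" ] || echo "missing: $exp_dir/$f"
done
```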
Pretrained models, training logs, decoding logs, tensorboard and decoding results
are available at
<https://huggingface.co/zrjin/icefall-asr-aishell-zipformer-bbpe-2024-01-16>


#### Zipformer (Non-streaming)

-[./zipformer](./zipformer)
+[./zipformer](./zipformer/)

It's reworked Zipformer with Pruned RNNT loss.
**Caution**: It uses `--context-size=1`.
@@ -260,7 +312,7 @@ done
Pretrained models, training logs, decoding logs, and decoding results
are available at
<https://huggingface.co/marcoyang/icefall-asr-aishell-zipformer-pruned-transducer-stateless7-2023-03-21>
-#### Pruned transducer stateless 7 (zipformer)
+#### Pruned transducer stateless 7 (Byte-level BPE)

See <https://github.com/k2-fsa/icefall/pull/986>
