Skip to content

Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/abs/1904.08779

License

Notifications You must be signed in to change notification settings

ludovicschwartz/sparse_image_warp_pytorch

This branch is up to date with bobchennan/sparse_image_warp_pytorch:master.

Folders and files

NameName
Last commit message
Last commit date

Latest commit

d05592d · Jul 12, 2019

History

18 Commits
Jun 29, 2019
Jun 29, 2019
Jul 5, 2019
Jun 29, 2019
Jun 30, 2019
Jul 12, 2019

Repository files navigation

sparse_image_warp_pytorch

Provides a pytorch implementation of sparse_image_warp and an example of spec_augment.

Original implementation comes from zcaceres.

But there are several errors in their implementation:

  • In their implementation they didn't add boundary points to source and destination. The idea is to sample a point and its destination after warpping but we want to keep all boundary points the same. Since they didn't do this they need to add small noise instead of zeros to lhs matrix.
  • cross_squared_distance_matrix

Some initial results of SpecAug on WSJ:

System(#iters) WER dev93 WER eval92
Transformer 8.2 4.8
Transformer+FM+TM 7.3 4.6
Transformer+FM+TM+TW 7.0 5.0
RNN 8.6 5.1
RNN+FM+TM 7.8 5.2
RNN+FM+TM+TW 7.3 4.6

Example of time warpping(W changes from 10 to 160, 81 dimensional features):

time warp

Example of time warpping(10 sources to 10 destinations which indicates that speed changes 10 times):

time warp

About

Pytorch implementation of sparse_image_warp and an example of GoogleBrain's SpecAugment is given: A Simple Data Augmentation Method for Automatic Speech Recognition https://arxiv.org/abs/1904.08779

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Python 100.0%