Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Data simulation and augmentation #5

Open
v-nhandt21 opened this issue Dec 23, 2021 · 8 comments
Open

Data simulation and augmentation #5

v-nhandt21 opened this issue Dec 23, 2021 · 8 comments

Comments

@v-nhandt21
Copy link

v-nhandt21 commented Dec 23, 2021

Can you detail the way you are using to make the noise audio for training?

Does it the same with described in the paper?

image

Are you using kaldi or any tool for this, and can you share your noise dataset !

Thank rishikksh !

@LXnn058
Copy link

LXnn058 commented Aug 15, 2022

Hi, have you solved this problem?

@v-nhandt21
Copy link
Author

Hi, have you solved this problem?

With my experiment, the data augmented with Kaldi give better results than some methods that augment on the fly

@LXnn058
Copy link

LXnn058 commented Aug 15, 2022

Are the parameters the same as those set in the paper? Can you provide me the corresponding processing script? thanks.

@v-nhandt21
Copy link
Author

I think the parameter depending on the test set / practical problem you want to solve.

My config is SNR between 10dB and 30dB, use MUSAN as additive noise

You can check out the public source code here: https://github.com/zhaoyi2/audio_augment

@LXnn058
Copy link

LXnn058 commented Aug 15, 2022

This helped me a lot, thanks!

@skol101
Copy link

skol101 commented Aug 20, 2022

@v-nhandt21 I simply added audiomentations (RoomSimulator, Backgroundnoise) and stored to separate directory, which I use in "input_wavs" params after step for postnet training (250k in my case, though it's 500k in the default config).

@v-nhandt21
Copy link
Author

@v-nhandt21 I simply added audiomentations (RoomSimulator, Backgroundnoise) and stored to separate directory, which I use in "input_wavs" params after step for postnet training (250k in my case, though it's 500k in the default config).

Yep, it seems hard for the model to converge on the fly augmentation

@velonica0
Copy link

Hi, have you solved this problem?

With my experiment, the data augmented with Kaldi give better results than some methods that augment on the fly

Hi, Bro
Can I get a copy of your processed training data about this code, I don't understand the specific definitions of the folders and file trees in the code, such as gt_noise, gt_clean, generated, etc.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants