This repository hosts the PyTorch code for the paper "On the robustness of non-intrusive speech quality model by adversarial examples" (ICASSP 2023) by Hsin-Yi Lin, Huan-Hsin Tseng, and Yu Tsao.
This work shows that deep speech quality predictors are vulnerable to adversarial perturbations: imperceptible changes to the input can drastically alter the predicted score. Beyond exposing this vulnerability, we explore and confirm the viability of adversarial training for strengthening model robustness.
Datasets:
- Voice Bank corpus (VCTK)
- TIMIT Acoustic-Phonetic Continuous Speech Corpus
- DNS Challenge speech corpus
Stage 1 (adversarial attack):
- Set the data paths, ONNX model path, saved-model path, and output path.
- Adjust the score transform if needed (in attack_modules.py).
- Run stage1_attack.py.
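The attack logic lives in attack_modules.py; as a rough illustration of the underlying idea (not the repository's actual implementation), the sketch below runs a standard projected-gradient attack that searches for a small perturbation delta, bounded in infinity norm, which drags a differentiable quality predictor's output toward an attacker-chosen target score. The function name, hyperparameters, and the stand-in predictor are all hypothetical.

```python
import torch

def pgd_attack(predictor, wav, target_score, eps=1e-3, alpha=2e-4, steps=40):
    """Hypothetical PGD sketch: find a small perturbation delta
    (||delta||_inf <= eps) that pushes the predicted quality score
    toward target_score while staying unnoticeable."""
    delta = torch.zeros_like(wav, requires_grad=True)
    for _ in range(steps):
        score = predictor(wav + delta)
        loss = (score - target_score) ** 2   # distance to the attacker's target
        loss.backward()
        with torch.no_grad():
            delta -= alpha * delta.grad.sign()  # signed-gradient descent step
            delta.clamp_(-eps, eps)             # project back into the eps-ball
        delta.grad.zero_()
    return delta.detach()
```

Because the update uses only the sign of the gradient and a final clamp, the perturbation is guaranteed to stay within the inaudibility budget eps regardless of the predictor's scale.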
Stage 2 (model enhancement via adversarial training):
- Set the data paths, ONNX model path, perturbation paths, and saved-model path.
- Run stage2_enhancement.py.
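Stage 2 fine-tunes the predictor on the perturbations produced in stage 1. As a minimal, hypothetical sketch (again, not the code in stage2_enhancement.py), one training step can expose the model to both the clean utterance and its adversarially perturbed copy and penalize deviation from the same target score for both:

```python
import torch

def adversarial_training_step(model, optimizer, wav, perturbation, target):
    """Hypothetical sketch of one adversarial-training step: the predictor
    is trained to assign the same target score to a clean utterance and
    to its adversarially perturbed copy."""
    optimizer.zero_grad()
    loss = (((model(wav) - target) ** 2).mean()
            + ((model(wav + perturbation) - target) ** 2).mean())
    loss.backward()
    optimizer.step()
    return loss.item()
```

Training on the clean/perturbed pair jointly is what discourages the model from reacting to the small perturbation at all, which is the robustness property the paper evaluates.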
Requirements:
- Python 3.7
- PyTorch 1.11
- librosa 0.9
- tensorboardX 2.5
- scikit-learn 1.0
- tqdm 4.64
- numpy 1.21
- torchaudio 0.11
- scipy 1.6
- audioread 2.1
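The list above can be captured in a requirements file; the pins below simply restate the versions given here (patch-level wildcards are an assumption, since only major.minor versions are stated):

```
torch==1.11.*
torchaudio==0.11.*
librosa==0.9.*
tensorboardX==2.5.*
scikit-learn==1.0.*
tqdm==4.64.*
numpy==1.21.*
scipy==1.6.*
audioread==2.1.*
```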
Hardware: experiments were run on an NVIDIA V100 GPU (32 GB CUDA memory) and 4 CPUs.