
Why are my reproduced WavLM results on Vox1-O 30% worse? #28

Closed
AIDman opened this issue Jun 18, 2022 · 3 comments

Comments

@AIDman

AIDman commented Jun 18, 2022

| model | EER (mine) | EER (official) |
| --- | --- | --- |
| wavlm_large_nofinetune.pth | 0.965 | 0.75 |
| wavlm_large_finetune.pth | 0.631 | 0.431 |

The above are the validation results of your shared WavLM models on the original Vox1-O trial list, without changing any code.
What might be the reason for this gap? Wrong settings?
Here is more background on my setup:

  1. Create a conda env:
conda create -n UniSpeech_py3p8 python=3.8
  2. Follow your guidance under https://github.com/microsoft/UniSpeech/tree/main/downstreams/speaker_verification and run:
pip install --require-hashes -r requirements.txt

The following error then appears:

Collecting numpy<1.23.0,>=1.16.5
ERROR: In --require-hashes mode, all requirements must have their versions pinned with ==. These do not:
    numpy<1.23.0,>=1.16.5 from https://files.pythonhosted.org/packages/2f/14/abc14a3f3663739e5d3c8fd980201d10788d75fea5b0685734227052c4f0/numpy-1.22.4-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl#sha256=64f56fc53a2d18b1924abd15745e30d82a5782b2cab3429aceecc6875bd5add0 (from scipy==1.7.1->-r requirements.txt (line 1))

Then I installed the environment manually (around 30-40 packages), just as in #26.

  3. Here are some related details:
    pip list | grep fairseq
    fairseq 0.12.1 /home/user1/tools/fairseq
    pip list | grep s3prl
    s3prl 0.3.1
    torch.version: 1.9.0+cu102
    python -V: 3.8.13

Thanks for your wonderful work, and I am looking forward to your help.

@YuzaChongyi

YuzaChongyi commented Jun 30, 2022

In my experiment, the wavlm_large_finetune EER is 0.574.

@Sanyuan-Chen
Contributor

Hi @AIDman ,

As for the environment error, could you replace this line

self.feature_extract = torch.hub.load('s3prl/s3prl', feat_type)

with self.feature_extract = torch.hub.load('s3prl/s3prl:e52439edaeb1a443e82960e6401ae6ab4241def6', feat_type) and try again? The fairseq library is not necessary for running inference with the WavLM model. Older versions of s3prl automatically skip the ImportError raised by fairseq, but the latest s3prl code accidentally raises an ImportError.
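
A minimal sketch (not the repo's code) of the suggested change, assuming feat_type is an s3prl upstream name such as wavlm_large; pinning the commit in the hub spec avoids the fairseq import path:

```python
import torch

feat_type = 'wavlm_large'  # assumed upstream name; other s3prl upstreams exposed at this commit also work

# Pin torch.hub to the specific s3prl commit suggested above ('owner/repo:ref').
feature_extract = torch.hub.load(
    's3prl/s3prl:e52439edaeb1a443e82960e6401ae6ab4241def6',
    feat_type,
)
feature_extract.eval()

# s3prl upstreams take a list of 1-D waveforms (16 kHz); the exact output structure
# (dict of hidden states vs. list of features) depends on the s3prl version in use.
dummy_wav = torch.randn(16000)
with torch.no_grad():
    out = feature_extract([dummy_wav])
```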

As for the fine-tuning results for speaker verification, we use adaptive s-norm to normalize the trial scores and further apply the quality-aware score calibration introduced in Section V.C-3 of our WavLM paper.
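
For reference, adaptive s-norm is a standard technique rather than anything repo-specific: the raw trial score is z-normalized against the top-K cohort scores of the enrollment side and of the test side, and the two normalized scores are averaged. A generic sketch (not the authors' scoring code), assuming cohort scores have already been computed against a set of imposter embeddings:

```python
import numpy as np

def adaptive_snorm(raw_score, enroll_cohort_scores, test_cohort_scores, top_k=300):
    """Adaptive s-norm of a single trial score.

    enroll_cohort_scores / test_cohort_scores: 1-D arrays of cosine scores between the
    enrollment / test embedding and a cohort of imposter embeddings."""
    e_top = np.sort(enroll_cohort_scores)[::-1][:top_k]  # K closest cohort entries
    t_top = np.sort(test_cohort_scores)[::-1][:top_k]
    z_e = (raw_score - e_top.mean()) / (e_top.std() + 1e-8)
    z_t = (raw_score - t_top.mean()) / (t_top.std() + 1e-8)
    return 0.5 * (z_e + z_t)
```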

@WhXmURandom

> As for the fine-tuning results for speaker verification, we use adaptive s-norm to normalize the trial scores and further apply the quality-aware score calibration introduced in Section V.C-3 of our WavLM paper.

Can you provide the code for the quality-aware score calibration? Thank you!
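
Not the authors' implementation, but quality-aware score calibration is commonly done as a linear (logistic-regression) calibration of the normalized score together with quality features such as the enrollment/test utterance durations, fitted on a labeled development trial list. A generic sketch under those assumptions:

```python
import numpy as np
from sklearn.linear_model import LogisticRegression

def fit_calibrator(dev_scores, dev_quality, dev_labels):
    """dev_scores: (N,) s-normed trial scores; dev_quality: (N, Q) quality features
    (e.g. log durations of enrollment and test utterances); dev_labels: (N,) 0/1 targets."""
    feats = np.column_stack([dev_scores, dev_quality])
    return LogisticRegression().fit(feats, dev_labels)

def calibrate(calibrator, scores, quality):
    # The linear decision function serves as the calibrated score for EER / minDCF.
    feats = np.column_stack([scores, quality])
    return calibrator.decision_function(feats)
```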
