Error Loading Fine-Tuned MSDD Model - Missing `speaker_model_cfg` in NeMo #12600
Labels: bug (Something isn't working)
I fine-tuned the NeMo MSDD (Multi-Scale Diarization Decoder) model on a 2-speaker subset of VoxConverse using NVIDIA's official fine-tuning notebook:
🔗 Notebook Used: [Speaker_Diarization_Training.ipynb](https://github.com/NVIDIA/NeMo/blob/main/tutorials/speaker_tasks/Speaker_Diarization_Training.ipynb)
After training, I attempted to load the fine-tuned MSDD model for inference using the `NeuralDiarizer` class in three different ways:
❌ All three methods resulted in the following error:
🚨 Issue Summary:
`speaker_model_cfg` is missing from the configuration, but it's required during inference.
Steps/Code to Reproduce Bug
`.nemo` format:
Expected Behavior
I expected the fine-tuned MSDD model to load correctly for inference after training without requiring `speaker_model_cfg`, as the speaker embedding extractor was frozen during fine-tuning.
Environment Overview
pip install nemo_toolkit['all']
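One way to confirm the diagnosis from the issue summary without loading the model is to look at the config stored inside the checkpoint. The following is a minimal sketch, not NeMo's own tooling. It assumes that the `.nemo` file is a tar archive containing a `model_config.yaml` member and that `speaker_model_cfg` is expected as a top-level key in that file. The helper names `top_level_keys` and `nemo_config_has_key` are hypothetical, and the key scan is a simple heuristic rather than a full YAML parser.

```python
import tarfile


def top_level_keys(yaml_text):
    """Collect top-level keys with a line scan (heuristic, not a YAML parser)."""
    keys = set()
    for line in yaml_text.splitlines():
        # Skip blank lines, indented (nested) lines, comments, and list items.
        if not line or line[0] in " \t#-":
            continue
        if ":" in line:
            keys.add(line.split(":", 1)[0].strip())
    return keys


def nemo_config_has_key(nemo_path, key="speaker_model_cfg"):
    """Return True if `key` is a top-level entry of the archived model config.

    Assumption: the checkpoint is a tar archive (optionally compressed)
    holding a member whose base name is `model_config.yaml`.
    """
    with tarfile.open(nemo_path, "r:*") as archive:
        for member in archive.getmembers():
            if member.name.split("/")[-1] == "model_config.yaml":
                text = archive.extractfile(member).read().decode("utf-8")
                return key in top_level_keys(text)
    raise FileNotFoundError("model_config.yaml not found in archive")
```

Calling `nemo_config_has_key` on both the fine-tuned checkpoint and the pretrained MSDD checkpoint would show whether the key was dropped when the fine-tuned model was saved.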