Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Citrinet CTC Decoder Alphabet size mismatch. #9554

Open
huks0 opened this issue Jun 27, 2024 · 0 comments
Open

Citrinet CTC Decoder Alphabet size mismatch. #9554

huks0 opened this issue Jun 27, 2024 · 0 comments
Assignees
Labels
bug Something isn't working

Comments

@huks0
Copy link

huks0 commented Jun 27, 2024

I trained a Citrinet and intended to use the CTC Decoder to retrieve a corrected output.

Using the CTC Beam Search Decoder of DeepSpeech I get the following error:

[ctc_beam_search_decoder.cpp:279] FATAL: "(alphabet.GetSize()+1) == (class_dim)" check failed. Number of output classes in acoustic model does not match number of labels in the alphabet file. Alphabet file must be the same one that was used to train the acoustic model.

I have controlled the alphabet and it has the size of 1023, even though I built it with 1024 characters. I built the tokenizer and alphabet with the nemo script setting the vocab size to 1024, spe and unigram. The output shape of the model is 1025. I believe the mismatch is 1 character. I thought of blank or unk token, but I aint sure if that is the cause of the error. Any idea why this happens or how to solve it? I appreciate your help!!

@huks0 huks0 added the bug Something isn't working label Jun 27, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

2 participants