Hey everyone!
I have a short question (maybe not so short, haha) about fine-tuning Whisper models on my own labelled data. Following the general Jupyter notebook on Hugging Face, there seem to be no preprocessing steps for the transcriptions that the model generates during training and that are used for evaluation over the course of training. Is it necessary to add this step to the training somehow?
What I mean:
For example, in the dataset I am using for fine-tuning, all letters are lowercase and there are no punctuation marks, only letter characters and whitespace. Now, during fine-tuning, the current model gets evaluated every 200 steps, for example. Isn't it possible that the model will generate output containing uppercase letters and punctuation characters, so the WER will be higher than it would be if the predictions were normalized after being generated?
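Concretely, here is a minimal sketch of the kind of normalization step I mean, applied to predictions before scoring. The function names, the WER implementation, and the example strings are illustrative, not taken from the notebook; in practice the normalization would go inside the metric-computation function used during evaluation:

```python
import re
import string

def normalize(text: str) -> str:
    """Lowercase, strip punctuation, collapse whitespace --
    mirrors labels that contain only lowercase letters and spaces."""
    text = text.lower()
    text = text.translate(str.maketrans("", "", string.punctuation))
    return re.sub(r"\s+", " ", text).strip()

def wer(reference: str, hypothesis: str) -> float:
    """Word error rate: word-level edit distance / reference length."""
    ref, hyp = reference.split(), hypothesis.split()
    # Dynamic-programming table for Levenshtein distance over words.
    d = [[0] * (len(hyp) + 1) for _ in range(len(ref) + 1)]
    for i in range(len(ref) + 1):
        d[i][0] = i
    for j in range(len(hyp) + 1):
        d[0][j] = j
    for i in range(1, len(ref) + 1):
        for j in range(1, len(hyp) + 1):
            cost = 0 if ref[i - 1] == hyp[j - 1] else 1
            d[i][j] = min(d[i - 1][j] + 1,      # deletion
                          d[i][j - 1] + 1,      # insertion
                          d[i - 1][j - 1] + cost)  # substitution
    return d[len(ref)][len(hyp)] / len(ref)

label = "hello world this is a test"          # lowercase, no punctuation
pred = "Hello, world! This is a test."        # what the model might emit

raw_wer = wer(label, pred)                    # case/punctuation counted as errors
norm_wer = wer(label, normalize(pred))        # after normalization
```

Here `raw_wer` is 4/6 (four of six words mismatch purely because of case and punctuation), while `norm_wer` is 0.0, which is exactly the inflation I am worried about.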
Opinions and suggestions would be very helpful, thank you very much!