distillation with original Whisper codebase #2134
afsara-ben started this conversation in General
Replies: 0 comments
I am unsure how distillation can be done with the openai-whisper code (not talking about the Hugging Face implementation). For distillation to happen, gradients have to flow through the forward function, but I always get grad_fn=None, even after setting the model to train(). Is there any script that does simple distillation using the original Whisper codebase (this repo)?
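
One likely cause of grad_fn=None is calling decode() or transcribe(), which in this repo run under @torch.no_grad(); calling the model's plain forward pass (model(mel, tokens)) instead lets the graph build. Below is a minimal single-example sketch of soft-target distillation against that forward pass, not an official recipe: the teacher/student sizes, the file name "audio.wav", the transcript text, the learning rate, and the temperature are all illustrative placeholders.

```python
import torch
import torch.nn.functional as F

import whisper
from whisper.tokenizer import get_tokenizer

device = "cuda" if torch.cuda.is_available() else "cpu"

# Frozen teacher, trainable student (sizes are arbitrary choices).
teacher = whisper.load_model("small", device=device).eval()
student = whisper.load_model("tiny", device=device).train()
for p in teacher.parameters():
    p.requires_grad_(False)

tokenizer = get_tokenizer(multilingual=True, language="en", task="transcribe")
optimizer = torch.optim.AdamW(student.parameters(), lr=1e-5)

# One (audio, transcript) pair; "audio.wav" and the text are placeholders.
audio = whisper.pad_or_trim(whisper.load_audio("audio.wav"))
mel = whisper.log_mel_spectrogram(audio).unsqueeze(0).to(device)
text_tokens = tokenizer.encode(" hello world")
tokens = torch.tensor(
    [list(tokenizer.sot_sequence_including_notimestamps) + text_tokens],
    device=device,
)

# Teacher forward pass: no_grad is fine here, the teacher is never updated.
with torch.no_grad():
    teacher_logits = teacher(mel, tokens)

# Student forward pass: call the model directly (model(mel, tokens)),
# not decode()/transcribe(), so the graph is built and grad_fn is set.
student_logits = student(mel, tokens)
assert student_logits.grad_fn is not None

# Soft-target KL distillation loss with temperature T (an arbitrary choice).
T = 2.0
loss = F.kl_div(
    F.log_softmax(student_logits / T, dim=-1),
    F.softmax(teacher_logits / T, dim=-1),
    reduction="batchmean",
) * (T * T)

optimizer.zero_grad()
loss.backward()
optimizer.step()
print(f"distillation loss: {loss.item():.4f}")
```

In a real setup you would presumably batch examples, pad token sequences, and mix in a cross-entropy term on the ground-truth tokens, but the key point is that the gradient only flows when you use the model's own forward method.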