Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

I have reduced the model size in half, but the RTF has doubled, what happend? #20

Open
Vuducbao913 opened this issue Jan 12, 2023 · 1 comment

Comments

@Vuducbao913
Copy link

I'm trying to reduce the model size to reduce the inference time since the RTF is pretty high right now. I am trying to reduce the model size by reducing the number of ResnetBlock, num resolutions in the default model are 7, so I have reduced it to 3. The current model is about 27M parameters
Then I trained from scratch with the WSJ data, amazingly it gave better results than the pre-trained model on some of our real datasets.
You can listen to the audio here to see some of the differences
https://drive.google.com/drive/folders/1aU_3btAczyzLeecAQwzNFrwiZWzotgaG?usp=share_link

More importantly, however, the RTF has doubled. The current model is much smaller in size than the pre-trained model (27M compared to 65M), I took a lot of time to this problem, what is the problem here, have you ever encountered a similar case?

@ksasso1028
Copy link

The size of the feature maps are larger (less blocks to down sample) so the attention block is running on larger inputs. The spatial resolution is a tradeoff

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants