This repository has been archived by the owner on Oct 26, 2022. It is now read-only.

Instructions for fine-tuning pre-trained models #139

Open
y3nk0 opened this issue Dec 13, 2019 · 1 comment

Comments

y3nk0 commented Dec 13, 2019

Is there any chance we could get a full description of how to fine-tune pre-trained models (for example, in machine translation)? I've managed to continue training on a much smaller dataset (by using the pre-trained dictionary), but the results are disappointing. The model gets kind of "broken" (it mostly outputs "unk"). Am I missing a step? Should we update the bpecodes as well? Thank you!
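
One way to narrow down where the "unk" outputs come from is to measure how many tokens of the new, BPE-encoded data fall outside the pre-trained dictionary. The following is only a minimal sketch, assuming the dictionary is a plain-text file with one `token count` pair per line and the corpus is already whitespace-tokenized after BPE; the helper names `load_vocab` and `oov_report` and the sample data are hypothetical, not part of any library.

```python
from collections import Counter
from typing import Iterable, List, Set, Tuple


def load_vocab(lines: Iterable[str]) -> Set[str]:
    """Collect tokens from dictionary lines of the assumed form 'token count'."""
    vocab = set()
    for line in lines:
        line = line.rstrip("\n")
        if not line:
            continue
        # Keep everything before the last space as the token.
        vocab.add(line.rsplit(" ", 1)[0])
    return vocab


def oov_report(
    corpus_lines: Iterable[str], vocab: Set[str], top_k: int = 5
) -> Tuple[float, List[Tuple[str, int]]]:
    """Return the out-of-vocabulary rate and the most frequent missing tokens."""
    total = 0
    missing = Counter()
    for line in corpus_lines:
        for token in line.split():
            total += 1
            if token not in vocab:
                missing[token] += 1
    rate = sum(missing.values()) / total if total else 0.0
    return rate, missing.most_common(top_k)


if __name__ == "__main__":
    # Hypothetical stand-ins for the pre-trained dictionary and the new data.
    dict_lines = ["the 100", "c@@ 40", "at 30"]
    corpus = ["the c@@ at", "the do@@ g"]
    rate, top = oov_report(corpus, load_vocab(dict_lines))
    print(f"OOV rate: {rate:.2%}")
    print("Most frequent missing tokens:", top)
```

A high rate would suggest that the new data was segmented with BPE codes that differ from the ones the pre-trained dictionary was built on; a low rate would point to something else in the training setup.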

@nikitacs16

I'm facing the same issue. @y3nk0, were you able to resolve it?
@myleott @michaelauli, could you advise on this?
