What is the right way to get an 8-bit model? #28
Comments
The fixed-quant branch is dead since it has been merged into master; @afaji needs to look at issue #24 and remove the remaining references to it. You can get a slower 8-bit model, without output matrix quantization, from https://github.com/marian-nmt/marian-dev, or a faster 8-bit model from https://github.com/browsermt/marian-dev. The teacher can be trained with anything.
So you mean I can get a fixed-quant (16-bit or 8-bit) model by adding the option "--quantize-bits 16" when using Marian master?
@XapaJIaMnu the 8-bit documentation is lacking.
@yandaowei sorry for the lack of documentation. Could you please check the steps described here: https://github.com/browsermt/students/tree/master/train-student#5-optional-8bit-quantization and see if everything is clear? The quantisation finetuning is completely optional and is described here: https://github.com/browsermt/students/tree/master/train-student/finetune
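For readers unfamiliar with what "8-bit" means here, the following is a generic sketch of symmetric linear quantization of a weight vector to signed integers. It is not Marian's implementation and does not reproduce the steps in the linked guide; the function names `quantize` and `dequantize` and the example weights are made up for illustration only.

```python
def quantize(weights, bits=8):
    """Map floats to signed integers in [-qmax, qmax] using one shared scale.

    Illustrative only: a generic scheme, not the one used by any Marian fork.
    """
    qmax = 2 ** (bits - 1) - 1                 # 127 for 8 bits, 32767 for 16 bits
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / qmax if max_abs > 0 else 1.0
    # Round to the nearest integer step and clamp to the representable range.
    q = [max(-qmax, min(qmax, round(w / scale))) for w in weights]
    return q, scale


def dequantize(q, scale):
    """Recover approximate float values from the integers and the scale."""
    return [v * scale for v in q]


weights = [0.5, -1.0, 0.25, 0.0]               # hypothetical weights
q, scale = quantize(weights, bits=8)
restored = dequantize(q, scale)
```

Each restored value differs from the original by at most half a quantization step (`scale / 2`), which is the kind of precision loss that an optional finetuning pass is generally meant to compensate for.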
So I need two steps to get the 8-bit model:
1. Which version or branch of Marian should I compile: marian-master, marian-dev, or https://github.com/afaji/Marian/tree/fixed-quant?
2. Can the teacher model be trained with marian-master?