Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training on logits rather than tokens? #271

Open
SinanAkkoyun opened this issue Oct 5, 2023 · 0 comments
Open

Training on logits rather than tokens? #271

SinanAkkoyun opened this issue Oct 5, 2023 · 0 comments

Comments

@SinanAkkoyun
Copy link

Hey, I would like to train a student model from my teacher model (knowledge distillation for specualtive decoding).
Commonly, the student model is being trained on the teachers logits (soft "labels") rather than tokens (hard "labels")

How can I do that with qLora? Thank you so much!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant