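System Info
Latest version of LLaMA-Factory

Reproduction
I fine-tuned the deepseek-qwen-7B model. The only target outputs are A, B, and C. Accuracy during training was very high, but at inference the model outputs a chain of thought, and sometimes even emits tokens that belong to the input, such as <|im_start|>user. Does training do something that stops the model from outputting a chain of thought? Also, why does inference output tokens from the input, and how can I fix this?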
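What do you mean by "accuracy was very high during training"? What does training accuracy refer to here? And what do you mean by inference: are you running it through the webui, transformers, or some other framework?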
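While those details are being gathered, the symptom described in the issue (reasoning text plus chat-template tokens in the output) can at least be inspected on the raw generated strings. Below is a minimal Python sketch, not part of LLaMA-Factory, of a hypothetical `extract_label` helper that cuts the text at the first turn marker, drops the reasoning span, and pulls out the A/B/C answer. It assumes ChatML-style markers (`<|im_start|>`, `<|im_end|>`) and R1-style `<think>...</think>` tags; adjust both to whatever the actual template emits. It only post-processes output and does not address the underlying cause.

```python
import re
from typing import Optional

# Assumptions: ChatML-style turn markers and R1-style think tags.
TURN_MARKERS = ("<|im_start|>", "<|im_end|>")
LABEL = re.compile(r"\b([ABC])\b")


def extract_label(raw: str) -> Optional[str]:
    """Return the first standalone A/B/C in the model's answer, or None."""
    # Keep only the text before the first turn marker, since anything
    # after it is the model continuing into a new turn.
    cut = len(raw)
    for marker in TURN_MARKERS:
        idx = raw.find(marker)
        if idx != -1:
            cut = min(cut, idx)
    text = raw[:cut]

    # Keep only what follows the last closing think tag, if there is one.
    if "</think>" in text:
        text = text.rsplit("</think>", 1)[1]
    # An unclosed <think> means the reasoning never finished: discard it.
    if "<think>" in text:
        text = text.split("<think>", 1)[0]

    match = LABEL.search(text)
    return match.group(1) if match else None


if __name__ == "__main__":
    sample = "<think>A looks wrong, B fits.</think>\nB<|im_end|>\n<|im_start|>user\n"
    print(extract_label(sample))
```

A return value of `None` on many samples would suggest that generation is not stopping where the training targets stopped, which is worth comparing against the template and stop tokens used at inference time. Note that the regex takes the first standalone capital A, B, or C, so an answer sentence that begins with the article "A" would be misread.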