WISE inference issue #461
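The evaluation uses teacher forcing, which is not the same as calling generate. You can try other editing methods on this problem, but the results will probably all be poor. I am inclined to think this is not a bug but a shortcoming of current model editing methods.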
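To make the teacher forcing versus generate distinction concrete, here is a minimal sketch in plain Python. The bigram lookup table `BIGRAM` and all function names are hypothetical stand-ins, not WISE or EasyEdit code. It only illustrates why a teacher-forced score can look good while free-running generation goes wrong after a single early mistake.

```python
from typing import Callable, List, Sequence

Token = str
NextTokenFn = Callable[[Sequence[Token]], Token]

# Hypothetical toy "model": predicts the next token from the last token only.
BIGRAM = {
    "is": "Old",     # the one wrong prediction (gold continuation is "New")
    "New": "York",
    "York": "City",
    "Old": "Town",
    "Town": "Hall",
}


def toy_next_token(context: Sequence[Token]) -> Token:
    return BIGRAM.get(context[-1], "<unk>")


def teacher_forced_accuracy(
    next_token: NextTokenFn, prompt: Sequence[Token], target: Sequence[Token]
) -> float:
    """Score each target position while feeding the gold prefix as context."""
    correct = 0
    for i, gold in enumerate(target):
        context = list(prompt) + list(target[:i])  # gold tokens, not predictions
        if next_token(context) == gold:
            correct += 1
    return correct / len(target)


def greedy_generate(
    next_token: NextTokenFn, prompt: Sequence[Token], max_new_tokens: int
) -> List[Token]:
    """Free-running decoding: each prediction is fed back as context."""
    context = list(prompt)
    output: List[Token] = []
    for _ in range(max_new_tokens):
        token = next_token(context)
        output.append(token)
        context.append(token)
    return output


prompt = ["The", "capital", "is"]
target = ["New", "York", "City"]

tf_acc = teacher_forced_accuracy(toy_next_token, prompt, target)
generated = greedy_generate(toy_next_token, prompt, len(target))

print(f"teacher-forced accuracy: {tf_acc:.2f}")
print("generated:", " ".join(generated))
```

In this toy setup two of the three target tokens are scored as correct under teacher forcing, yet the generated text shares no token with the target, because the first wrong token changes every later context.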
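WISE does have this problem. If you want to measure generate-style metrics, I suggest using something like BERTScore (the similarity between the generated text and target_new), or simple string matching, GPT-4 scoring, and so on. We would also very much welcome a fresh look at all current model editing methods from this perspective.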
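As a rough illustration of the simple string matching option, the sketch below compares a generated string against `target_new` using only the Python standard library. The function names, the normalization rules, and the choice of token-level F1 are assumptions made for this example and are not part of WISE or EasyEdit. It splits on whitespace, so it would need a different tokenizer for languages such as Chinese.

```python
import re
from collections import Counter
from typing import List


def normalize(text: str) -> List[str]:
    """Lowercase, replace punctuation with spaces, split on whitespace."""
    return re.sub(r"[^\w\s]", " ", text.lower()).split()


def contains_target(generated: str, target_new: str) -> bool:
    """True if the target tokens appear as a contiguous run in the output."""
    gen, tgt = normalize(generated), normalize(target_new)
    if not tgt:
        return False
    n = len(tgt)
    return any(gen[i:i + n] == tgt for i in range(len(gen) - n + 1))


def token_f1(generated: str, target_new: str) -> float:
    """Token-overlap F1 between the generated text and the target."""
    gen, tgt = normalize(generated), normalize(target_new)
    if not gen or not tgt:
        return 0.0
    overlap = sum((Counter(gen) & Counter(tgt)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(gen)
    recall = overlap / len(tgt)
    return 2 * precision * recall / (precision + recall)


if __name__ == "__main__":
    generated_text = "The capital is New York City."
    target_new = "New York City"
    print(contains_target(generated_text, target_new))
    print(round(token_f1(generated_text, target_new), 3))
```

Containment is strict and easy to interpret, while token F1 gives partial credit. Neither captures paraphrases, which is where an embedding-based score such as BERTScore or an LLM judge would come in.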
Do you have any other questions?
Since WISE currently has no generate method, does that mean it cannot be tested on OpenCompass?
It should be possible. If your goal is to test general capabilities, you can run OpenCompass directly.
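Hello, after using the WISE method to apply knowledge edits to Qwen2.5-7B, I ran into some problems at inference time. Here are my code and configuration file: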
The results are as follows: