add glm4 reward model tutorial & bugfix qwen2 dpo readme #101

Open
wants to merge 1 commit into
base: master
from

coder-yuzhiwei
Collaborator

@coder-yuzhiwei coder-yuzhiwei commented Nov 8, 2024

  1. Fix an error in the qwen2 DPO documentation.
  2. Add a reward model tutorial for the glm4 model.

@coder-yuzhiwei coder-yuzhiwei changed the title from "bugfix qwen2 dpo readme" to "add glm4 reward model tutorial & bugfix qwen2 dpo readme" Nov 14, 2024
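# model: model name
# input_path: path to the folder containing the downloaded HuggingFace weights; note the trailing /
# output_path: path where the converted MindSpore weight file is saved
# dtype: precision of the converted weights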
```

Collaborator

Please include the final eval results in the README.

Collaborator Author

This change only fixes a documentation error in the qwen2 docs. The eval results are already included in the reward model README.

2 participants