Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

The RFT data #19

Open
ZIKEYUAN opened this issue Nov 25, 2023 · 3 comments
Open

The RFT data #19

ZIKEYUAN opened this issue Nov 25, 2023 · 3 comments

Comments

@ZIKEYUAN
Copy link

Hi,after completing SFT and multipath reasoning, I have some doubts about the data under the data/rft path in your github code base. I would like to ask you how these data are generated from? I see that four data sets are generated after the Filter reasoning path process, and I would like to ask whether the data under data/rft are created from four datasets?

@GanjinZero
Copy link
Contributor

data/rft contains llama7b/13b/7b2/13b2 which means this dataset is generated by inferencing this size of SFT models with 100 times and filtered with correct and distinct reasoning paths.

@ZIKEYUAN
Copy link
Author

Thank you, but I have a question. After the Filter reasoning path process, it will generate four files. If I don’t want to use the RFT data you provided, how can I use the four data files to generate my RFT data?

@GanjinZero
Copy link
Contributor

I think one of the generated data is rft data which you can use directly. If you don't know use which one, you can copy some lines here and I will tell you.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants