You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Hi,after completing SFT and multipath reasoning, I have some doubts about the data under the data/rft path in your github code base. I would like to ask you how these data are generated from? I see that four data sets are generated after the Filter reasoning path process, and I would like to ask whether the data under data/rft are created from four datasets?
The text was updated successfully, but these errors were encountered:
data/rft contains llama7b/13b/7b2/13b2 which means this dataset is generated by inferencing this size of SFT models with 100 times and filtered with correct and distinct reasoning paths.
Thank you, but I have a question. After the Filter reasoning path process, it will generate four files. If I don’t want to use the RFT data you provided, how can I use the four data files to generate my RFT data?
I think one of the generated data is rft data which you can use directly. If you don't know use which one, you can copy some lines here and I will tell you.
Hi,after completing SFT and multipath reasoning, I have some doubts about the data under the data/rft path in your github code base. I would like to ask you how these data are generated from? I see that four data sets are generated after the Filter reasoning path process, and I would like to ask whether the data under data/rft are created from four datasets?
The text was updated successfully, but these errors were encountered: