Open ZIKEYUAN opened 11 months ago
data/rft contains llama7b/13b/7b2/13b2 which means this dataset is generated by inferencing this size of SFT models with 100 times and filtered with correct and distinct reasoning paths.
Thank you, but I have a question. After the Filter reasoning path process, it will generate four files. If I don’t want to use the RFT data you provided, how can I use the four data files to generate my RFT data?
I think one of the generated data is rft data which you can use directly. If you don't know use which one, you can copy some lines here and I will tell you.
Hi,after completing SFT and multipath reasoning, I have some doubts about the data under the data/rft path in your github code base. I would like to ask you how these data are generated from? I see that four data sets are generated after the Filter reasoning path process, and I would like to ask whether the data under data/rft are created from four datasets?