OFA-Sys / gsm8k-ScRel

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
https://arxiv.org/abs/2308.01825

The RFT data #19

Open ZIKEYUAN opened 11 months ago

ZIKEYUAN commented 11 months ago

Hi, after completing SFT and multi-path reasoning, I have some questions about the data under the data/rft path in your GitHub repository. How was this data generated? I see that four datasets are produced by the filter-reasoning-paths step; is the data under data/rft created from those four datasets?

GanjinZero commented 11 months ago

data/rft contains llama7b/13b/7b2/13b2. Each name indicates that the dataset was generated by sampling 100 times from the SFT model of that size and keeping only the correct and distinct reasoning paths.
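For readers following along, here is a rough sketch of what "filtered with correct and distinct reasoning paths" could look like. It is not the repository's actual script. It assumes GSM8K-style outputs, where calculations appear as `<<...>>` annotations and the final answer follows `####`, and it assumes two paths count as "distinct" when their equation sequences differ. The function names are hypothetical.

```python
import re


def final_answer(path):
    """Return the text after the last '####' marker, or None if it is missing."""
    if "####" not in path:
        return None
    return path.rsplit("####", 1)[1].strip().replace(",", "")


def equation_key(path):
    """Identify a path by its sequence of <<...>> annotations (assumed notion of 'distinct')."""
    return tuple(eq.replace(" ", "") for eq in re.findall(r"<<(.*?)>>", path))


def filter_paths(samples, gold):
    """Keep sampled paths whose final answer equals gold, one per distinct equation sequence."""
    seen = set()
    kept = []
    for path in samples:
        if final_answer(path) != gold:
            continue  # wrong or missing final answer
        key = equation_key(path)
        if key in seen:
            continue  # same calculation steps as an already kept path
        seen.add(key)
        kept.append(path)
    return kept
```

Calling `filter_paths(samples, gold)` once per question on its 100 sampled outputs and pooling the kept paths would give a dataset of the kind described above. Answers are compared as plain strings here, so "72" and "72.0" would not match; a real pipeline would likely normalize numbers first.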

ZIKEYUAN commented 11 months ago

Thank you, but I have a follow-up question. The filter-reasoning-paths step generates four files. If I don't want to use the RFT data you provided, how can I use those four files to generate my own RFT data?

GanjinZero commented 11 months ago

I think one of the generated files is the RFT data, which you can use directly. If you don't know which one to use, you can paste a few lines here and I will tell you.