OFA-Sys / gsm8k-ScRel

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
https://arxiv.org/abs/2308.01825

Release the RFT 7B model #2

Closed wenhuchen closed 1 year ago

wenhuchen commented 1 year ago

Hi there, is there any chance you could share your RFT-7B model with us?

GanjinZero commented 1 year ago

Sure, I will open-source RFT (k=100, U13B, U33B). The models are quite large, so I will need some time to upload them.

wenhuchen commented 1 year ago

Great. Would you mind uploading the 7B one to Hugging Face first?

GanjinZero commented 1 year ago

I will do it, but I also need an open-source license from the company, so the RFT 7B model may not be uploaded within a day.

GanjinZero commented 1 year ago

OFA-Sys/gsm8k-rft-llama7b-u13b

GanjinZero commented 1 year ago

The upload is very unstable; I will try to fix it...

wenhuchen commented 1 year ago

Thanks a lot!

GanjinZero commented 1 year ago

The upload is still failing; I will try to fix it tomorrow.

wenhuchen commented 1 year ago

Hello, thanks for sharing the model. I'm able to load it now. So this model is LLaMA-1 7B, trained with RFT on the 100 paths generated by the 13B SFT model. Is that correct? And the accuracy should be 49.3?

GanjinZero commented 1 year ago

The paths were generated by 7B, 7B2, 13B, and 13B2, with 100 paths each. Accuracy should be 49.3.

GanjinZero commented 1 year ago

Base model is llama1-7B.
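
For anyone following along, here is a minimal sketch of loading the checkpoint named above with the Hugging Face transformers library. The repository ID is copied from this thread; the helper names load_rft_model and generate_answer are hypothetical, and the prompt template the model expects is not stated here, so the prompt argument is left to the caller.

```python
# Repository ID as quoted earlier in this thread.
MODEL_ID = "OFA-Sys/gsm8k-rft-llama7b-u13b"


def load_rft_model(model_id: str = MODEL_ID):
    """Download and load the tokenizer and model (hypothetical helper).

    Requires the `transformers` and `torch` packages and network access.
    The import is kept inside the function so that this file can be
    imported without those packages installed.
    """
    from transformers import AutoModelForCausalLM, AutoTokenizer

    tokenizer = AutoTokenizer.from_pretrained(model_id)
    # torch_dtype="auto" uses the dtype stored in the checkpoint config.
    model = AutoModelForCausalLM.from_pretrained(model_id, torch_dtype="auto")
    return tokenizer, model


def generate_answer(tokenizer, model, prompt: str, max_new_tokens: int = 256) -> str:
    """Greedy-decode a completion and return only the newly generated text.

    The prompt format is an assumption left to the caller; use whatever
    template the model was trained with.
    """
    inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
    output_ids = model.generate(**inputs, max_new_tokens=max_new_tokens, do_sample=False)
    # Drop the prompt tokens so only the continuation is decoded.
    new_tokens = output_ids[0][inputs["input_ids"].shape[1]:]
    return tokenizer.decode(new_tokens, skip_special_tokens=True)
```

Calling load_rft_model() once and reusing the returned pair avoids downloading or loading the weights repeatedly.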

wenhuchen commented 1 year ago

Would you mind also sharing the 7B SFT model? Thanks a lot!

GanjinZero commented 1 year ago

I can do it, but my internet connection is very slow, so it will take some time.

wenhuchen commented 1 year ago

Never mind. I just reproduced your results and trained an SFT model on my own. Thanks a lot! The numbers are totally correct.
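
A rough sketch of how an accuracy figure like the 49.3 above can be checked, assuming both the model outputs and the references end with the GSM8K-style "#### <number>" marker. The function names are hypothetical, and the repository's own evaluation script may extract answers differently.

```python
import re
from typing import Optional, Sequence

# Matches an optionally negative number with optional thousands separators.
_NUMBER = re.compile(r"-?\d[\d,]*(?:\.\d+)?")


def extract_final_answer(text: str) -> Optional[str]:
    """Return the number after the last '####' marker, or None if absent."""
    if "####" not in text:
        return None
    tail = text.rsplit("####", 1)[1]
    match = _NUMBER.search(tail)
    if match is None:
        return None
    return match.group(0).replace(",", "")


def accuracy(predictions: Sequence[str], references: Sequence[str]) -> float:
    """Percentage of predictions whose final answer equals the reference."""
    if len(predictions) != len(references):
        raise ValueError("predictions and references must have the same length")
    if not references:
        return 0.0
    correct = 0
    for pred, ref in zip(predictions, references):
        p = extract_final_answer(pred)
        r = extract_final_answer(ref)
        # Compare numerically so that "18" and "18.0" count as equal.
        if p is not None and r is not None and float(p) == float(r):
            correct += 1
    return 100.0 * correct / len(references)
```

A prediction with no "####" marker is counted as wrong here, which is one possible convention rather than a confirmed detail of the original evaluation.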

GanjinZero commented 1 year ago

Happy you can reproduce my results!