OFA-Sys / gsm8k-ScRel

Codes and Data for Scaling Relationship on Learning Mathematical Reasoning with Large Language Models
https://arxiv.org/abs/2308.01825

When will the LLaMA 13B RFT model be released? #10

Closed by xingweiqu 1 year ago

xingweiqu commented 1 year ago

Hi, I want to reproduce the results from the RFT model for LLaMA 13B. Do you have any plans for that?

GanjinZero commented 1 year ago

There are some problems with reproducing the 7B RFT models, mostly due to tokenizer differences between LLaMA1 and LLaMA2. I will open-source the 13B models after solving those issues.

GanjinZero commented 1 year ago

Uploading

GanjinZero commented 1 year ago

Uploaded.