Joyyang158 opened this issue 4 months ago (status: Open)
@Joyyang158 To reproduce the results from our paper, you would need zephyr-7b-sft-full at revision ac6e600eefcce74f5e8bae1035d4f66019e93190 and lm-evaluation-harness at version v0.4.0.
For settings different from our original configuration, you may need to adjust the parameters. We are currently working on tuning the parameters for vLLM generation and will provide updates once we have results.
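For anyone following along, here is a rough sketch of how the pinned revision and harness version mentioned above could be wired into a setup. It is not the authors' script. The Hub org prefix (`alignment-handbook/`), the harness Git URL, and the task name in the usage note are assumptions, since the thread gives only the model name, the commit hash, and the version tag. The helpers just build strings and keyword arguments, so nothing is downloaded when you run them.

```python
import shlex

# Values stated in the thread.
MODEL_REVISION = "ac6e600eefcce74f5e8bae1035d4f66019e93190"
HARNESS_VERSION = "v0.4.0"

# Assumptions: the thread names only "zephyr-7b-sft-full" and
# "lm-evaluation-harness", so the org prefix and Git URL are guesses.
MODEL_REPO = "alignment-handbook/zephyr-7b-sft-full"
HARNESS_GIT_URL = "https://github.com/EleutherAI/lm-evaluation-harness"


def pip_install_command(version=HARNESS_VERSION):
    """Shell command that installs the harness at a fixed Git tag."""
    return f"pip install git+{HARNESS_GIT_URL}@{version}"


def model_load_kwargs(repo=MODEL_REPO, revision=MODEL_REVISION):
    """Keyword arguments for transformers' from_pretrained, pinned to one commit."""
    return {"pretrained_model_name_or_path": repo, "revision": revision}


def lm_eval_command(tasks, repo=MODEL_REPO, revision=MODEL_REVISION):
    """Shell command for the harness CLI with the model pinned to a commit.

    `tasks` is a list of harness task names; which tasks the paper used
    is not stated in the thread, so the caller has to supply them.
    """
    model_args = f"pretrained={repo},revision={revision}"
    argv = [
        "lm_eval",
        "--model", "hf",
        "--model_args", model_args,
        "--tasks", ",".join(tasks),
    ]
    return shlex.join(argv)
```

For example, `print(lm_eval_command(["arc_challenge"]))` prints a command you can paste into a shell; `arc_challenge` is only a placeholder task here. Pinning by commit hash rather than branch name means a later push to the model repo cannot silently change the checkpoint you evaluate.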
Thank you.
Thanks for your reply. I will try epoch = 6. There are also two things I want to check with you.
Thanks!
@Joyyang158
I see, thanks for your help and patience!
Hi authors,
I tried to reproduce the results reported in the paper using zephyr-7b-sft-full.
Following the algorithm described in the paper, performance improves only in the first iteration and then decreases over the next few iterations. I set epoch = 3.
Could you give me some guidance? Thanks!