Open platoonpluto opened 1 year ago
We don't have enough resources to run that experiment. In my experience, a slightly higher sampling temperature (perhaps above 0.7) helps self-consistency with a large SFT model.
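For anyone who wants to try this, here is a rough sketch of self-consistency as majority voting over sampled answers. It is only a toy: the `toy_logits` dictionary stands in for a real model's answer distribution, and `sample_answer` / `self_consistency` are hypothetical helper names, not part of any library. With a real SFT model you would replace `sample_answer` with a sampled generation call at the chosen temperature and extract the final answer from each completion.

```python
import math
import random
from collections import Counter


def sample_answer(logits, temperature, rng):
    """Draw one answer from a temperature-scaled softmax over toy logits."""
    scaled = [value / temperature for value in logits.values()]
    peak = max(scaled)  # subtract the max for numerical stability
    weights = [math.exp(s - peak) for s in scaled]
    return rng.choices(list(logits.keys()), weights=weights, k=1)[0]


def self_consistency(logits, temperature, n_samples, seed=0):
    """Sample n_samples answers and return the majority answer and the vote counts."""
    rng = random.Random(seed)
    answers = [sample_answer(logits, temperature, rng) for _ in range(n_samples)]
    votes = Counter(answers)
    return votes.most_common(1)[0][0], votes


# Assumed toy distribution: "42" is the most likely final answer.
toy_logits = {"42": 2.0, "41": 1.0, "40": 0.0}

majority, votes = self_consistency(toy_logits, temperature=0.8, n_samples=200)
print(majority, dict(votes))
```

The point of the higher temperature is diversity: if the temperature is very low, nearly every sample is the same answer and the vote adds nothing over a single greedy decode.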