When I finetune the model from checkpoint "N-Step-Checkpoint_3_30000.ckpt",
the reproduction results are much higher than them report in the paper.
For example, I got 1.4575 on u0 task, while the reported result on the same task is 0.3211.
I have checked the finetune code several times, I cannot find issues.
Is there anyone who get similar results to the reported ones on any regression task?
When I finetune the model from checkpoint "N-Step-Checkpoint_3_30000.ckpt", the reproduction results are much higher than them report in the paper.
For example, I got 1.4575 on u0 task, while the reported result on the same task is 0.3211. I have checked the finetune code several times, I cannot find issues.
Is there anyone who get similar results to the reported ones on any regression task?