Open shizhediao opened 4 years ago
Thanks to @kyleclo , this question has been solved. This is because I use the wrong metrics. The correct one should be (test_F1_I-OUT+test_F1_I-PAR+test_F1_I-INT)/3 = 0.7013232907. I am curious why it is still higher than the reported number, which is 68.30
Hi, I am reproducing the fine-tune results following your instruction. I work on your default code and my setting is as below. I work on ebmnlp and pico task without fine-tuning
However, I found my results are much better than the reported number in your paper. In scibert paper, the micro F1 is 68.30 (Frozen) and 72.28 (fine-tune), but in my experiment, the F1 is 75.48 (frozen).
I am really confused. Thanks