Open qiuwei opened 5 years ago
@qiuwei In their report, the F score on the development set is 0.964. The result of 0.932 is lower than them, but I also found that the results of their own MRPC codes lower than the values reported in their paper , so i think it must has some tricks on parameter adjustment .
Thanks for the information. I thought the number was on the test set. Did you use the large model or the base model?
@qiuwei
the result was on the dev, i used base model ,when i use large i got OOM.
hi, so the F1 of your system is 0.9326? Is it comparable with the numbers reported in the literature?