Open yyh030806 opened 1 week ago
Thank you for your interest in our work. I have noted the discrepancy between the number of generated instances and the ground truth. The prediction dataset size should align with the test set size of 5007.
Thanks for the explanation, that makes sense now.
Hello, I'm encountering an issue reproducing your experiment. I attempted to use the provided checkpoint for generation without pretraining or fine-tuning. However, the results are significantly worse than expected (see attached screenshots).
What additional steps are necessary to achieve state-of-the-art performance with the provided checkpoint? Are pretraining and/or fine-tuning required, and if so, could you provide guidance on the correct procedure?
Thank you for your assistance.