A question about the score result

yyh030806 commented 1 week ago

Hello, I'm encountering an issue reproducing your experiment. I attempted to use the provided checkpoint for generation without pretraining or fine-tuning. However, the results are significantly worse than expected (see attached screenshots).

What additional steps are necessary to achieve state-of-the-art performance with the provided checkpoint? Are pretraining and/or fine-tuning required, and if so, could you provide guidance on the correct procedure?

Thank you for your assistance.

yuqianghan commented 1 week ago

Thank you for your interest in our work. I have noted the discrepancy between the number of generated instances and the ground truth. The prediction dataset size should align with the test set size of 5007.

Please review the data preprocessing steps for the USPTO_50k dataset with the augmentation parameter set to 20.
If the dataset was preprocessed with different augmentation settings, the "-aug" parameter in the "2_generate_50k.sh" script should be adjusted accordingly.
I have also updated the "2_generate_50k.sh" script, where you can replace "ckpt_path" with the path to the provided checkpoint.

yyh030806 commented 1 week ago

Thanks for the explanation, that makes sense now.

yuqianghan / editretro

A question about the score result #16