Open Aadyant12 opened 2 years ago
Hi! Is it possible to share the specific implementation of ROUGE scores, METEOR, and pretrained metrics that are used to evaluate our blind test set performance? I was hoping to also report these metrics on the original dev and test sets in the paper. Having the exact same implementation will be super helpful.
Kindly provide a method to access the various metrics using which our results would be assessed.