GEM-benchmark / GEM-metrics

Automatic metrics for GEM tasks
https://gem-benchmark.com
MIT License

Issues generating Heavy metrics #103

Open SimonvdFliert opened 1 year ago

SimonvdFliert commented 1 year ago

Hi,

I have been trying to evaluate several models with the GEM-benchmark metrics. I followed the tutorial provided on both GitHub and the official website, and I was able to generate a submission file with generations and GEM-ID keys. However, when I generate the output scores, several of the scores I need are missing.

For example, the requirements.txt file includes the rouge-score package, but the output scores do not contain any ROUGE metric. I also tried several times to generate output scores with the --heavy-metric flag, but the heavy metrics are always skipped. The same metrics are returned whether I include the flag or leave it out.
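One way to confirm that the two runs really return the same metrics is to compare the metric names in both score outputs. The following is only a minimal sketch: it assumes each output is a JSON object keyed by metric name, and the function names and sample values are hypothetical, not taken from GEM-metrics.

```python
import json


def metric_names(scores):
    """Return the set of top-level keys (assumed to be metric names)."""
    return set(scores)


def diff_metrics(light, heavy):
    """Compare a run without the flag (light) to a run with it (heavy).

    Returns the metric names that appear only in the heavy run, and
    whether any metric name in the heavy run mentions ROUGE.
    """
    only_heavy = sorted(metric_names(heavy) - metric_names(light))
    has_rouge = any("rouge" in name.lower() for name in metric_names(heavy))
    return only_heavy, has_rouge


# Hypothetical score outputs; in practice these would be read from the
# two output files with json.load.
light = json.loads('{"bleu": 0.1, "meteor": 0.2}')
heavy = json.loads('{"bleu": 0.1, "meteor": 0.2}')

only_heavy, has_rouge = diff_metrics(light, heavy)
print("Metrics only in the heavy run:", only_heavy)
print("ROUGE present:", has_rouge)
```

An empty list together with `False` would match the behaviour described above: the flag adds no metrics and no ROUGE score is reported.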

I have attached an example of my output scores below: [image]

An example of the generations is shown below: [image]

More information:

Could someone help uncover what I am doing wrong?

Kind regards