lixnvege closed this issue 1 year ago
Hi @lixnvege,
For languages without a corresponding stemmer implementation, we simply calculated the ROUGE score without any stemming (i.e., on the raw tokenized n-grams).
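For illustration, here is a minimal sketch of what scoring "without stemming" amounts to. It is not the toolkit's code: the helper names `ngrams` and `rouge_n` are hypothetical, and lowercasing plus whitespace splitting is only an assumed stand-in for the real tokenizer. It shows ROUGE-N counted directly on surface-form n-grams, so inflected variants of a word do not match each other.

```python
from collections import Counter


def ngrams(tokens, n):
    """Return a multiset (Counter) of the n-grams in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def rouge_n(reference, candidate, n=1):
    """ROUGE-N precision, recall and F1 on raw tokens, with no stemming."""
    # Assumption: lowercasing + whitespace splitting stands in for the real tokenizer.
    ref = ngrams(reference.lower().split(), n)
    cand = ngrams(candidate.lower().split(), n)
    # Counter intersection keeps the minimum count per n-gram (clipped matches).
    overlap = sum((ref & cand).values())
    precision = overlap / max(sum(cand.values()), 1)
    recall = overlap / max(sum(ref.values()), 1)
    f1 = 0.0 if overlap == 0 else 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    reference = "the cat sat on the mat"
    candidate = "the cat lay on the mat"
    print("ROUGE-1:", rouge_n(reference, candidate, n=1))
    print("ROUGE-2:", rouge_n(reference, candidate, n=2))
```

Because no stemmer is applied, a pair such as "running" and "runs" counts as a mismatch here, whereas a stemmed comparison could treat them as the same token.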
@abhik1505040 Thank you!
Closing the issue. Please feel free to reopen if you have any further questions.
Hi,
I am writing to ask how the ROUGE scores were calculated for the various languages in the XL-Sum dataset. While reviewing the provided toolkit, I noticed that it appears to support only a limited list of languages. However, ROUGE scores have been reported for all languages, which confused me.
Could you explain how the ROUGE scores were computed for the languages that the toolkit does not support?
For example, for Igbo, the code looks like this:
and the output is:
Did I misunderstand something?
I really appreciate any clarification or guidance.