Closed volksen closed 2 days ago
Hi! thanks for your contribution!, great first issue!
Thanks for the report! Gonna check this weekend.
Thanks for the report! Gonna check this weekend.
@stancld, did you have a chance to have a look at it? :rabbit:
@Borda pls assign it to me
🐛 Bug
Hi,
when using the rouge_score with accumulate="best", the results are dependent on the order of the labels. As of my understanding, accumulate="best" should return the best f score over all references.
Minimal example:
gives different results:
Did I missread the documentation or is this a bug. Accumulate='avg' works as expected. Maybe the bug is in https://github.com/Lightning-AI/torchmetrics/blob/v1.1.0/src/torchmetrics/functional/text/rouge.py#L378 where there is a todo comment.
I compared the results to the rouge-score package:
which gives the same results in both cases:
Environment