Closed Runingtime closed 6 years ago
Yep that's intended as per the docs: "Metrics to omit. Omitting Bleu{i} will omit Bleu{j} for j>=i." See here: https://github.com/Maluuba/nlg-eval/blob/master/nlgeval/__init__.py#L161
I wouldn't mind changing the behavior but this feature is mainly meant as a performance gain but the way a lot of the BLEU calculations are done means that when calculating Bleu_4, you also compute the others. So there isn't much saved in returning Bleu_4 but not the others the way the code is currently written.
Hi, it seems that there is a bug in the load_scorers method of the NLGEval class. For example, when running the following code,
it gives the wrong results (Bleu_4 isn't printed):
So, is this a real bug or did I miss something?