-
I have two questions:
1. How do you compute the reported BLEU scores? I see BLEU imported in several models, but it is never added to the `self.metrics` collection of any of them.
2. When…
-
-
Why do I get a CIDEr score of only 0.065 on Flickr30k for bliva_vicuna7b? Even multiplied by 10, it is only 0.65. Could you tell me what might have gone wrong in this process? Thanks.
{"test": {"Bl…
-
The model was trained on Flickr8k, but the results reached only half the BLEU-4 score reported by the authors (about 0.14-0.15). I have not modified any parameters in train.py. May I ask why such a…
-
When I run inference with the trained model on the evaluation, train, and test sets, I get the following BLEU-4 scores:
Train set: 2.0
Dev set: 8.4
Test set: 9.3
How is that possible to…
-
I am running under Windows 10, but I found that the BLEU score computation does not run. Why?
-
`In [1]: from sacrebleu.metrics import BLEU, CHRF`
` ...: refs = [["请关掉灯光。"]]`
` ...: sys = ["请关闭灯光。"]`
`In [2]: bleu = BLEU(trg_lang="zh")`
`In [3]: bleu.corpus_score(sys, refs)`
`Out[3]: BL…
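For a sentence pair like this one, the score depends heavily on how the Chinese text is split into tokens. The following is a minimal pure-Python sketch, not sacrebleu's implementation: the helper names `ngrams` and `ngram_precision` are hypothetical, and per-character splitting is used only as a stand-in for a Chinese-aware tokenizer. It compares clipped n-gram matches under whitespace splitting and under per-character splitting for the same two strings.

```python
from collections import Counter


def ngrams(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def ngram_precision(hyp_tokens, ref_tokens, n):
    """Return clipped n-gram matches and hypothesis n-gram total as (matches, total)."""
    hyp = ngrams(hyp_tokens, n)
    ref = ngrams(ref_tokens, n)
    # Each hypothesis n-gram is credited at most as often as it occurs in the reference.
    matches = sum(min(count, ref[gram]) for gram, count in hyp.items())
    total = sum(hyp.values())
    return matches, total


ref = "请关掉灯光。"
hyp = "请关闭灯光。"

# Whitespace splitting: each sentence is a single token, so nothing matches.
whitespace_1 = ngram_precision(hyp.split(), ref.split(), 1)

# Per-character splitting (an assumption made for illustration only).
chars_1 = ngram_precision(list(hyp), list(ref), 1)
chars_2 = ngram_precision(list(hyp), list(ref), 2)

print("whitespace unigrams:", whitespace_1)
print("character unigrams: ", chars_1)
print("character bigrams:  ", chars_2)
```

With whitespace splitting there are no matching unigrams at all, while per-character splitting matches five of six unigrams and three of five bigrams, so the choice of tokenizer alone can move the score from zero to a non-trivial value.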
-
-
## 🐛 Bug
Bug when using the command-line tool "fairseq-score" with the argument "--sentence-bleu"
### To Reproduce
Steps to reproduce the behavior (**always include the command you ran**):
1. Run …
-
Thank you for the dataset.
I am new to NMT. The BLEU score from tensor2tensor differs from the one obtained using 'multi-bleu.perl'. Example: t2t-bleu reports 28.13 (uncased), 27.391 (cased…
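A gap between cased and uncased scores is expected, because lowercasing lets more n-grams match. The sketch below illustrates this with a small unsmoothed corpus-level BLEU written from scratch. It is neither tensor2tensor's nor `multi-bleu.perl`'s implementation: the function `corpus_bleu`, its `lowercase` flag, the whitespace tokenization, and the example sentences are all assumptions made for illustration. Real tools also differ in tokenization, which can change the score independently of casing.

```python
import math
from collections import Counter


def ngram_counts(tokens, n):
    """Count all n-grams of length n in a token list."""
    return Counter(tuple(tokens[i:i + n]) for i in range(len(tokens) - n + 1))


def corpus_bleu(hyps, refs, max_n=4, lowercase=False):
    """Unsmoothed corpus-level BLEU on a 0-100 scale.

    Assumes one reference per hypothesis and whitespace tokenization.
    """
    matches = [0] * max_n
    totals = [0] * max_n
    hyp_len = ref_len = 0
    for hyp, ref in zip(hyps, refs):
        if lowercase:
            hyp, ref = hyp.lower(), ref.lower()
        h, r = hyp.split(), ref.split()
        hyp_len += len(h)
        ref_len += len(r)
        for n in range(1, max_n + 1):
            hc, rc = ngram_counts(h, n), ngram_counts(r, n)
            # Clip each hypothesis n-gram count by its count in the reference.
            matches[n - 1] += sum(min(c, rc[g]) for g, c in hc.items())
            totals[n - 1] += sum(hc.values())
    # Without smoothing, any n-gram order with zero matches gives a score of zero.
    if hyp_len == 0 or min(matches) == 0:
        return 0.0
    log_precision = sum(math.log(m / t) for m, t in zip(matches, totals)) / max_n
    brevity_penalty = 1.0 if hyp_len >= ref_len else math.exp(1 - ref_len / hyp_len)
    return 100 * brevity_penalty * math.exp(log_precision)


# Hypothetical example: the hypothesis differs from the reference only in casing.
hyps = ["The cat sat on the mat today"]
refs = ["the cat sat on the mat today"]

cased = corpus_bleu(hyps, refs)
uncased = corpus_bleu(hyps, refs, lowercase=True)
print(f"cased:   {cased:.2f}")
print(f"uncased: {uncased:.2f}")
```

Here a single capitalized word removes one match at every n-gram order, so the cased score falls below the uncased one. Note that this sketch reports scores on a 0-100 scale; some tools report 0-1, which is worth checking before comparing numbers across tools.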