-
Within "evaluation metrics", talk about how ROUGE is not really intended for machine translation, and the pitfalls thereof.
https://stats.stackexchange.com/questions/301626/interpreting-rouge-score…
-
pip install rouge-score
-
Aujourd'hui, dans Trident, les préavis à vérifier apparaissent à part, dans un tableau spécifique : les opérateurs voient du 1er coup d'œil s'il y en à traiter ou non.
Il faudrait qu'ils puissent voi…
-
Why do I only get a CIDEr score of 0.065 on Flickr30k for bliva_vicuna7b, even if multiplied by 10 it's only 0.65? Could you tell me what might have gone wrong in this process?? Thanks.
{"test": {"Bl…
-
Mode idle
Idée avec rosalie:
- montre seulement le data du robot (pas de ligne verte et rouge)
- montre l'heure de la prochaine représentation (besoin d'un endroit ou entrer manuellement l'heur…
-
Hello,
I have had some ZeroDivisionErrors trying to get the Rouge-L summary level score for one of my data.
The problem was in the function _union_lcs of rouge.py where the "union longest commo…
-
-
My code is using a quote form Alexandre Dumas' _Les Trois Mousquetaires_ as a string to test some code with. The quote contains "à Amsterdam, chez Pierre Rouge." Running codespell, I get: ```Rouge ==…
-
r1 = rouge_score[0]["rouge-1"]['r']
r2 = rouge_score[0]["rouge-2"]['r']
rl = rouge_score[0]["rouge-l"]['r']
b1 = sentence_bleu(reference, candidate, weights=(1, 0, 0…
-
Hi,
thank you for the interesting work!
I am trying to reproduce the results for LLaMA-2-7b on LIMAEval for the discard method.
I ran the evaluation script after generating with the release mode…