-
Hi :wave:,
this is a feature request for an additional input format that also tackles the output format. I often have additional annotations like a document id or a sentence id for given sentences …
-
Hi! I tried running `generate` to evaluate `transformer.wmt14.en-fr` on the WMT'14 test set but was only able to get a BLEU score of 35.42. I ran `prepare-wmt14en2fr.sh` and `fairseq-preprocess` on th…
-
Source: [Masader Project](https://arbml.github.io/masader/)
- uid: uncorpus
- entry: https://arbml.github.io/masader/card.html?48
- Link: https://conferences.unite.un.org/uncorpus
- License : cust…
-
Hi, I've got UnicodeDecodeError when running 'nlg-eval --setup':
Downloading http://nlp.stanford.edu/data/glove.6B.zip to nlgeval/data.
Downloading https://raw.githubusercontent.com/robmsmt/glove-…
-
I just tried the detokenizer, and, while punctuation marks seems not to pose any problem, it erroneously puts a white space inside contractions:
```
import mosestokenizer …
-
The WMT14 include the following corpus:
1. commoncrawl
2. europarl-v7
3. giga
4. news-commentary
5. undoc
This entire corpus contains 40.8 M sentences, differ from 36M as paper reported. Is …
-
I'm trying to reproduce the BLEU score reported at http://matrix.statmt.org/matrix/output/1914?score_id=37605 and described here https://github.com/pytorch/fairseq/tree/master/examples/wmt19
Would …
-
In evaluator.py, the code now tries to download a `multi-bleu.perl` from `BLEU_SCRIPT_URL = 'https://raw.githubusercontent.com/facebookresearch/XLM/master/src/evaluation/multi-bleu.perl'`. However, th…
-
Hi, I have a few points in the research paper that I want to confirm and also a few questions to ask about fine-tuning procedure with JESC dataset.
From what I read:
- You use the big model to fin…
-
I am running my command to compile Moses ToolKit on Ubuntu 20.04 using the following command and get the below issue
./bjam --with-cmph=/home/namrata/smt/cmph-2.0 -j4
Error :
XMLRPC-C: USING …