-
Hi there,
I recently started going through the code in this repository after having read your paper, which I found very fascinating.
I would be very interested in trying to reproduce the results…
-
Hi, I notice that when you use `scripts/translate.sh` to evaluate model, you do not use option `--sacrebleu`. Does it mean that is not accounted in BLEU computation? Do all the values in the paper is…
-
In the paper, you mentioned, "We use whitespace tokenization for all of the MLQA languages other than Chinese". I am wondering is there any suggested way to get the sentence boundaries, so that, we ca…
-
hi, I tried to run your code but got the error of "no file named multi-bleu-detok.perl", could you refer to the link for downloading this file? Thanks!
-
Thanks for making this available. Trying to run this code on windows and getting the following error:
Missing $ on loop variable at preprocess.perl line 1.
When I run
for l in en de; do for f …
-
On running the following command for preprocessing
`for l in en de; do for f in data/multi30k/*.$l; do if [[ "$f" != *"test"* ]]; then sed -i "$ d" $f; fi; done; done;
`
I'm getting the following …
-
The original Moses perl scripts add a space between tokens that do not end with a CJK character and tokens that do:
https://github.com/moses-smt/mosesdecoder/blob/555829a771cd897bb807f495a95737953a7c…
-
Hello, as far as I could see there isn't trained en-zh model. Would you give me some advice for training that model?
-
There is another sentence splitting logic here in moses-smt/mosesdecoder repository at [this](https://github.com/moses-smt/mosesdecoder/blob/master/scripts/ems/support/split-sentences.perl#L6) link, b…
-
Your documentation says:
> The input file should contain one sentence by line, and they have to be tokenized. Otherwise, the tagger will perform poorly.
Simple Question: Which tokenizer did you…