-
Hello,
While reproducing an experiment on MuST-C, I realized that the reference file used for computing BLEU is different from the original reference file: tokens such as `(_Gelächter_)` and `(_App…
-
The schedule needs to be fixed as follows:
Time (PDT) | Event | Speakers
July 9, 9:00 AM-9:15 AM | Foreword/Introduction | Marcello Federico/Alex Waibel
July 9, 9:15 AM-10:15 AM | Keynote + Q&A | …
-
This is really just feedback for the authors of the M2M 100 model.
Based on the Venture Beat article I read about, it sounds like there is a need to expand the text corpus to include more content f…
-
I was wondering if you ever encountered nan-gradients during admin training.
I'm in torch 1.6/CUDA 10.1 with no modifications to the code:
#### Command
```bash
export dd=data-bin/wmt14_en_de_joi…
-
Hi,
I am trying to follow the **mbart training step** in [fairseq/examples/mbart/README.md](https://github.com/pytorch/fairseq/blob/5e79322b3a4a9e9a11525377d3dda7ac520b921c/examples/mbart/README.md…
-
Hi there,
I recently started going through the code in this repository after having read your paper, which I found very fascinating.
I would be very interested in trying to reproduce the results…
-
Dear authors,
May I ask for the hyper-parameters used in your paper for IWSLT (the smaller model) and WMT (the full model)? Such as learning rate, warmup steps, batch size, max learning rate, etc.
…
-
Hey there -- wanted to start by saying thanks for the excellent work!
I'm trying to run some of the preliminary experiments, but I'm not sure how I'd go about doing the following:
(1): Initializ…
-
Hi guys,
I was trying to train a transformer model with pipeline parallelism. Is this supposed to work already?
The command i tried (following the translation example):
`fairseq-train data…
-
作者您好!请问您的源码我怎么运行失败呢?尤其是预处理问题,存在很大的问题,这是怎么回事儿?