iwslt Search Results - Githubissues

430 results
for iwslt

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

espnet/espnet #2363

Bug: score_bleu.sh removes good tokens

Hello, While reproducing an experiment on MuST-C, I realized that the reference file used for computing BLEU is different from the original reference file: tokens such as `(_Gelächter_)` and `(_App…

formiel updated 4 years ago
15
acl-org/acl-2020-virtual-conference #491

W2: IWSLT - Schedule to be fixed

The schedule needs to be fixed as follows: Time (PDT) | Event | Speakers July 9, 9:00 AM-9:15 AM | Foreword/Introduction | Marcello Federico/Alex Waibel July 9, 9:15 AM-10:15 AM | Keynote + Q&A | …

marcellofederico updated 4 years ago
1
facebookresearch/fairseq #2766

M2M Corpus Inclusion?

This is really just feedback for the authors of the M2M 100 model. Based on the Venture Beat article I read about, it sounds like there is a need to expand the text corpus to include more content f…

normanhh3 updated 4 years ago
2
LiyuanLucasLiu/Transformer-Clinic #14

wmt_en_de admin: Function 'SoftmaxBackward' returned nan val…

I was wondering if you ever encountered nan-gradients during admin training. I'm in torch 1.6/CUDA 10.1 with no modifications to the code: #### Command ```bash export dd=data-bin/wmt14_en_de_joi…

sshleifer updated 3 years ago
8
facebookresearch/fairseq #2024

MBART Training: Missing mbart_large model architecture

Hi, I am trying to follow the **mbart training step** in [fairseq/examples/mbart/README.md](https://github.com/pytorch/fairseq/blob/5e79322b3a4a9e9a11525377d3dda7ac520b921c/examples/mbart/README.md…

shola-lawal updated 4 years ago
4
d-ataman/lmm #1

How to reproduce results from paper?

Hi there, I recently started going through the code in this repository after having read your paper, which I found very fascinating. I would be very interested in trying to reproduce the results…

j0ma updated 4 years ago
7
Edward-Sun/structured-nart #3

Optimization hyper-parameters used in the paper

Dear authors, May I ask for the hyper-parameters used in your paper for IWSLT (the smaller model) and WMT (the full model)? Such as learning rate, warmup steps, batch size, max learning rate, etc. …

da03 updated 4 years ago
4
bert-nmt/bert-nmt #37

Running Preliminary Explorations

Hey there -- wanted to start by saying thanks for the excellent work! I'm trying to run some of the preliminary experiments, but I'm not sure how I'd go about doing the following: (1): Initializ…

oshaikh13 updated 4 years ago
1
facebookresearch/fairseq #2782

Error when trying to train with pipeline parallelism

Hi guys, I was trying to train a transformer model with pipeline parallelism. Is this supposed to work already? The command i tried (following the translation example): `fairseq-train data…

thies1006 updated 4 years ago
2
dojoteef/synst #7

预处理出现问题，如何正常处理？

作者您好！请问您的源码我怎么运行失败呢？尤其是预处理问题，存在很大的问题，这是怎么回事儿？

Shajiu updated 4 years ago
12

上一页 1...24 25 26 27 28 29 30...43 下一页

430 results for iwslt

430 results
for iwslt