salesforce / nonauto-nmt

PyTorch Implementation of "Non-Autoregressive Neural Machine Translation"
BSD 3-Clause "New" or "Revised" License

Dataset #1

Open Maggione opened 6 years ago

Maggione commented 6 years ago

Thank you for sharing your code. I have a question about how to preprocess the data. For example, for the IWSLT en-de dataset, the script run_alignment_iwslt.sh uses a file named train.tags.en-de.bpe.dev.en2. It does not seem to belong to the original dataset. Where does it come from?

VahidC commented 5 years ago

Hi, thank you for sharing your code. I have the same problem, and I really need to know how to handle it. Any suggestions?

alphadl commented 5 years ago

I have the same question. The paper is ambiguous when it describes the validation and test sets for IWSLT en-de 2016.

zy8848 commented 4 years ago

I had the same question. I used the file /train/train.tags.en-de.bpe.dev.en from the preprocessed dataset iwslt.tar.gz, and it works.
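For anyone checking their own copy, here is a minimal sketch for listing which files in the archive match a name fragment. It assumes iwslt.tar.gz has already been downloaded to the working directory; `find_members` is a hypothetical helper name, not something from this repository.

```python
import tarfile


def find_members(archive_path, needle):
    """Return the names of archive members whose path contains `needle`."""
    # "r:*" lets tarfile detect the compression (gzip, bz2, ...) by itself.
    with tarfile.open(archive_path, "r:*") as tar:
        return [name for name in tar.getnames() if needle in name]


# Example call (assumes the archive is in the current directory):
#   for name in find_members("iwslt.tar.gz", "bpe.dev"):
#       print(name)
```

If the list comes back empty, the archive probably does not contain the BPE dev files, and the path passed to run_alignment_iwslt.sh would have to point somewhere else.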

wangwang110 commented 4 years ago

I have the same question. Could you release a data example?