facebookresearch / Mask-Predict

A masked language modeling objective to train a model to predict any subset of the target words, conditioned on both the input text and a partially masked target translation.
Other
240 stars 38 forks source link

Will you release the dataset of wmt17-enzh ? #5

Closed JackHorse closed 4 years ago

JackHorse commented 5 years ago

Hi,

I want reproduce your result of wmt17-enzh, but your released model doesn't contain the bpe codes. Will you release wmt17-enzh dataset ?

tks

yinhanliu commented 4 years ago

we released a dictionary. you can bpe based on our dictionary.