neulab / awesome-align

A neural word aligner based on multilingual BERT
https://arxiv.org/abs/2101.08231
BSD 3-Clause "New" or "Revised" License
325 stars 47 forks source link

De-En dataset is missing #27

Closed ruoyuxie closed 3 years ago

ruoyuxie commented 3 years ago

Hello,

I could not find the De-En dataset on the provided link. It is not on the examples folder either. Where can I find this dataset?

Thank you

zdou0830 commented 3 years ago

Hi, you can follow the steps in this repo. In particular, you'll need to download test data for German-English and move it into the folder test.