neulab / awesome-align

A neural word aligner based on multilingual BERT
https://arxiv.org/abs/2101.08231
BSD 3-Clause "New" or "Revised" License
321 stars 46 forks

Parallel corpus data format for fine-tuning on parallel data #38

Closed by shihabshahriar16 2 years ago

shihabshahriar16 commented 2 years ago

Is there any example dataset for parallel data fine-tuning?

zdou0830 commented 2 years ago

You can see the examples in examples/*.src-tgt.
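
If you want to build your own file in the same shape as those examples, here is a minimal sketch. It assumes the format described in the awesome-align README: one sentence pair per line, both sides already tokenized (whitespace-separated), with source and target separated by ` ||| `. The helper names `format_pairs` and `write_src_tgt` and the output path are hypothetical and not part of awesome-align.

```python
from pathlib import Path

# Assumed separator between source and target on each line (per the README).
SEPARATOR = " ||| "


def format_pairs(src_sentences, tgt_sentences):
    """Return one 'source ||| target' line per sentence pair.

    Both inputs are lists of already-tokenized sentences (tokens separated
    by whitespace). Pairs with an empty side are skipped in this sketch.
    """
    if len(src_sentences) != len(tgt_sentences):
        raise ValueError("source and target must have the same number of sentences")

    lines = []
    for src, tgt in zip(src_sentences, tgt_sentences):
        # Collapse repeated whitespace so tokens are separated by single spaces.
        src = " ".join(src.split())
        tgt = " ".join(tgt.split())
        if not src or not tgt:
            continue
        lines.append(f"{src}{SEPARATOR}{tgt}")
    return lines


def write_src_tgt(src_sentences, tgt_sentences, out_path):
    """Write the formatted pairs to out_path as UTF-8, one pair per line."""
    lines = format_pairs(src_sentences, tgt_sentences)
    Path(out_path).write_text("\n".join(lines) + "\n", encoding="utf-8")
    return len(lines)
```

For example, `format_pairs(["Hello world ."], ["Bonjour le monde ."])` returns `["Hello world . ||| Bonjour le monde ."]`. It is worth comparing a few lines of your output against the files in examples/ before fine-tuning.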