bitextor / bicleaner

Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
GNU General Public License v3.0
150 stars 22 forks source link

Reuse Bitextor dictionaries for Bicleaner training #11

Closed mbanon closed 5 years ago

mbanon commented 5 years ago

In order to achieve an user-friendlier pipeline, bicleaner-train should be able to use the bitextor dictionaries.

mbanon commented 5 years ago

@mespla and @lpla are working on it.