bitextor / bicleaner

Bicleaner is a parallel corpus classifier/cleaner that aims at detecting noisy sentence pairs in a parallel corpus.
GNU General Public License v3.0

Bug fixing on feature_extract #4

Closed mbanon closed 6 years ago

mbanon commented 6 years ago

The function "feature_extract" now takes the source sentence and the target sentence explicitly as input parameters, instead of the tab-separated row.
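
For readers who want to see the shape of this change, here is a minimal sketch. It is not the actual bicleaner code: the feature list (token and character counts) and the helper name `feature_extract_from_row` are hypothetical, and the assumption that source and target sit in the first two tab-separated columns is mine. Only the idea of passing the two sentences as separate parameters comes from this pull request.

```python
def feature_extract(source, target):
    """New-style interface: source and target are explicit parameters.

    The features computed here are illustrative placeholders only,
    not the features bicleaner actually extracts.
    """
    src_tokens = source.split()
    tgt_tokens = target.split()
    return [len(src_tokens), len(tgt_tokens), len(source), len(target)]


def feature_extract_from_row(row):
    """Hypothetical adapter for callers that still hold a tab-separated row.

    Assumes the first two columns are the source and target sentences.
    """
    parts = row.rstrip("\n").split("\t")
    source, target = parts[0], parts[1]
    return feature_extract(source, target)


if __name__ == "__main__":
    print(feature_extract("Hello world", "Hola mundo"))
    print(feature_extract_from_row("Hello world\tHola mundo\n"))
```

With explicit parameters, the caller decides how a row is split, so the function no longer depends on the column layout of the input file.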