paracrawl / Domain_Adaptation

InDomain detection is a tool designed to extract in-domain data from a large collections of data.
GNU General Public License v3.0
1 stars 1 forks source link

Reduce dependency from Moses to preprocess #2

Closed kpu closed 5 years ago

kpu commented 5 years ago

It looks like you're using the Moses preprocessor only. Here's a lighter version already in ParaCrawl projects: https://github.com/kpu/preprocess/

dionwiggins commented 5 years ago

Addressed in the latest code.