Helsinki-NLP / OpusFilter

OpusFilter - Parallel corpus processing toolkit
MIT License
101 stars 18 forks source link

add jieba tokenizer for Chinese #27

Closed svirpioj closed 2 years ago

svirpioj commented 2 years ago

Replaces https://github.com/Helsinki-NLP/OpusFilter/pull/23

BrightXiaoHan commented 2 years ago

Thanks for your supplement.