Helsinki-NLP / OpusFilter

OpusFilter - Parallel corpus processing toolkit
MIT License
101 stars 18 forks source link

Debug the configuration by export filtered corpus. #41

Closed BrightXiaoHan closed 2 years ago

BrightXiaoHan commented 2 years ago

Is it possible to export lines filtered by filters in order to debug my configuration?

svirpioj commented 2 years ago

Yes. Use opusfilter-test --removed FILE to get the removed lines from single or multiple filters. (In the latest develop branch there is also option to output the filter scores.) Another option is to replicate the filter step in your configuration with filterfalse: true.

BrightXiaoHan commented 2 years ago

Thanks a lot