Closed jairosg closed 1 year ago
I do agree. Unfortunately writing documentation is time-consuming, and time is always limited. Some quick hints:
opusfilter-test
script to see how a single filter works on (a part of) your corpus. Especially use --removed
to store the pairs filtered out.
I have just started using this tool for my master's thesis and I have noticed that some documentation could be more complete with more examples in tasks like CrossEntropy filter and similar ones, since you can't know what to put in some parameters if you are a beginner and it leads you to get too many errors that you don't know how to fix.