Garrafao / durel_system_annotators

3 stars 0 forks source link

Validation function for loaded data #45

Closed Garrafao closed 7 months ago

Garrafao commented 7 months ago

annotate.py now applies the function validate_dataframe() to every loaded input file. The function checks the correctness of target word indices. It can be called with different levels of strictness. The default is 'relaxed' which will let most errors pass, but count and print them. If it is called with 'strict' the function will not let any errors pass. This will likely be too strict for most datasets as they have occasional errors.

Garrafao commented 7 months ago

Solved with commit 2cb5d44.