Unstructured-IO / unstructured

Open source libraries and APIs to build custom preprocessing pipelines for labeling, training, or production machine learning pipelines.
https://www.unstructured.io/
Apache License 2.0
8.61k stars 704 forks source link

Fix not counting false negatives and false positives in table metrics #3300

Closed plutasnyy closed 2 months ago

plutasnyy commented 3 months ago

This pull request fixes counting tables metric for three cases:

In short we can say that previous metric calculation didn't consider OD mistakes

plutasnyy commented 2 months ago

Metrics were ran end2end and verified on fix-not-counting-false-negatives-in-table-metrics branch