LAHTeR / htr-quality-classifier

Detect quality of (digitized) text.
GNU General Public License v3.0

Add Pages with Annotator Disagreement to Training Data #15

Open carschno opened 9 months ago

carschno commented 9 months ago

The current model is trained only on non-empty pages on which both annotators agreed. Including pages with annotator disagreement in the training data (with a weighted and/or averaged score) could improve the results.
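
A minimal sketch of one way this could work, not the repository's actual code: each page gets the average of the two annotator scores as its label, plus a sample weight that shrinks as the annotators disagree more. The names `TrainingExample` and `build_example`, the parameters `max_diff` and `min_weight`, and the numeric score scale are all assumptions made for illustration.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class TrainingExample:
    page_id: str
    score: float   # averaged annotator score, used as the training label
    weight: float  # sample weight; lower when the annotators disagree


def build_example(
    page_id: str,
    score_a: float,
    score_b: float,
    max_diff: float = 2.0,    # assumed largest possible gap between two scores
    min_weight: float = 0.5,  # assumed weight for maximally disputed pages
) -> TrainingExample:
    """Combine two annotator scores into one weighted training example."""
    diff = abs(score_a - score_b)
    # Scale the weight linearly from 1.0 (full agreement) down to
    # min_weight (disagreement of max_diff or more).
    weight = 1.0 - (1.0 - min_weight) * min(diff / max_diff, 1.0)
    return TrainingExample(page_id, mean([score_a, score_b]), weight)


# Hypothetical pages: one with agreement, one with disagreement.
examples = [
    build_example("page-001", 2, 2),
    build_example("page-002", 1, 3),
]
```

The resulting weights could then be passed to the training step as per-sample weights, if the model in use supports them. Pages with full agreement keep weight 1.0, so the current training data would be treated exactly as before.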