ulb-sachsen-anhalt / digital-eval

Evaluate data from mass digitalization workflows
MIT License

Evaluation Error #13

Closed M3ssman closed 1 year ago

M3ssman commented 1 year ago

Description

The latest digital-eval reports the following warning for Persian data sets:

[WARN ][/data/ocr/groundtruth/rahbar/281792798/00000192.xml] _wrap invalid literal for int() with base 10: ''

It then creates no evaluation report data at all.
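For reference, the text of the warning matches the standard Python `ValueError` raised when `int()` receives an empty string. The sketch below only reproduces that message and shows one defensive way to handle it. `parse_int_or_none` is a hypothetical helper for illustration, not a function from digital-eval, and where the empty value comes from in the ground-truth file is not shown here.

```python
def parse_int_or_none(value):
    """Return int(value), or None if value is empty or not numeric.

    Hypothetical helper, not part of digital-eval.
    """
    try:
        return int(value)
    except (TypeError, ValueError):
        return None


# Reproduce the message seen in the warning.
try:
    int("")
except ValueError as err:
    message = str(err)

print(message)
print(parse_int_or_none(""))    # empty attribute value
print(parse_int_or_none("42"))  # regular numeric value
```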

M3ssman commented 1 year ago

This results from the current implementation, which wrongly assumes that text information is always available at word level. The GT data set in this case provides text at line level only.
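A minimal sketch of the fallback idea, not the actual fix in digital-eval: read text from word elements when they exist, otherwise use the text of the line itself. The element names (`TextLine`, `Word`, `TextEquiv`, `Unicode`) follow a simplified, namespace-free PAGE-like layout that is assumed here for illustration; the real ground-truth files may be structured differently. `SAMPLE` and `line_texts` are hypothetical names.

```python
import xml.etree.ElementTree as ET

# Hypothetical, simplified sample: the first line has word-level text,
# the second line carries text at line level only.
SAMPLE = """
<Page>
  <TextLine id="l1">
    <Word id="w1"><TextEquiv><Unicode>foo</Unicode></TextEquiv></Word>
    <Word id="w2"><TextEquiv><Unicode>bar</Unicode></TextEquiv></Word>
    <TextEquiv><Unicode>foo bar</Unicode></TextEquiv>
  </TextLine>
  <TextLine id="l2">
    <TextEquiv><Unicode>line only</Unicode></TextEquiv>
  </TextLine>
</Page>
"""


def line_texts(xml_string):
    """Return one text string per line, preferring word-level text."""
    root = ET.fromstring(xml_string.strip())
    texts = []
    for line in root.iter("TextLine"):
        # Collect non-empty text from direct Word children, if any.
        words = [
            word.findtext("TextEquiv/Unicode", default="")
            for word in line.findall("Word")
        ]
        words = [w for w in words if w]
        if words:
            texts.append(" ".join(words))
        else:
            # No usable words: fall back to the line's own text.
            texts.append(line.findtext("TextEquiv/Unicode", default=""))
    return texts


print(line_texts(SAMPLE))
```

With this approach, a file that contains lines only still yields text for evaluation instead of failing on missing word data.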