ulb-sachsen-anhalt / ocrd-odem

OCR Workflows based on OCR-D
MIT License
3 stars 1 forks source link

Bug: Filtering structs of Kitodo METS fails #16

Closed M3ssman closed 6 months ago

M3ssman commented 6 months ago

Description

Filtering from logical structs doesn't work as expected for digital object created once with Kitodo2. Physical pages can be linked several times, with the logical root being the first one.

Actually we only look for the first logical element that is linked to the page. This approach works well for unique links like those from Visual Library, but not with Kitodo multiple links.

Solution

Respect the fact, that the same "page" can be linked to many logical structs. If so, filter page if any matched structure has been black listed (aka no OCR)