Open stefanCCS opened 2 years ago
div = div.get_div()[0] IndexError: list index out of range
That's actually #64 (which would entail the ignore
strategy), but the additional issue here is indeed that mets:fptr/mets:area
references instead of mere mets:structLink
matches for the mets:div/@ID
should be supported.
Maximally, support for the ENMAP profile is desired.
Further reference: ENMAP examples
It looks like, that the METS parser does not allow structures like this in METS:
If I call mm2tei with this kind of METS I get an exception:
As a starting point an "ignore" of
<fptr><area>
in<div>
area would be good. In general it would be even better, if the OCR text from ALTO is taken from the link referenced there.