qurator-spk / dinglehopper

An OCR evaluation tool
Apache License 2.0
58 stars 12 forks source link

DingleHopper does not create results #71

Closed stefanCCS closed 1 year ago

stefanCCS commented 1 year ago

Using ocrd, version 2.38.0

I have tried out ocrd-dinglehopper like this:

ocrd-dinglehopper -l DEBUG -I $gtfileGrp,$ocrFileGrp -O $dinglefolder -P textequiv_level line

Somehow, from one of the input files no text is taken: image

{
    "gt": "OCR-D-SEG-KRAK/OCR-D-SEG-KRAK_4749_007817786_00157.xml",
    "ocr": "OCR-D-TESS-OCR-MOD-04/OCR-D-TESS-OCR-MOD-04-4749_007817786_00157.xml",

    "cer": 4.5,
    "wer": Infinity,

    "n_characters": 56,
    "n_words": 0
}

Any idea?

kba commented 1 year ago

Can you provide the GT and OCR for this file as PAGE-XML? Looks to me like the OCR only recognized . for every character?

stefanCCS commented 1 year ago

Well, this was hint enough - many thanks. My GT-PAGE does not contain any text (I do not know at this moment ...). Therefore, DingleHopper just shows, what is available. I will close this issue now.