OCR-D / ocrd_tesserocr

Run tesseract with the tesserocr bindings with @OCR-D's interfaces
MIT License
38 stars 11 forks source link

recognize: split polygon failure #209

Closed bertsky closed 4 months ago

bertsky commented 4 months ago
  File "/data/ocr-d/ocrd_all/ocrd_tesserocr/ocrd_tesserocr/recognize.py", line 439, in process
    self._process_regions_in_page(tessapi.GetIterator(), page, page_coords, pcgts_mapping, dpi)
  File "/data/ocr-d/ocrd_all/ocrd_tesserocr/ocrd_tesserocr/recognize.py", line 517, in _process_regions_in_page
    polygon = join_polygons([polygon_from_x0y0x1y1(
  File "/data/ocr-d/ocrd_all/ocrd_tesserocr/ocrd_tesserocr/recognize.py", line 1447, in join_polygons
    return make_join([Polygon(poly) for poly in polygons], scale=scale).exterior.coords[:-1]
  File "/data/ocr-d/ocrd_all/ocrd_tesserocr/ocrd_tesserocr/recognize.py", line 1477, in make_join
    assert jointp.geom_type == 'Polygon', jointp.wkt
AssertionError: MULTIPOLYGON (((2617 3298, 2587 3298, 2585.5 3299.5, 2584 3298, 2583 3299, 2582 3298, 2581.5 3298.5, 2581 3298, 2564 3298, 2560 3302, 2564 3306, 2581 3306, 2581.5 3305.5, 2582 3306, 2583 3305, 2584 3306, 2585.5 3304.5, 2587 3306, 2617 3306, 2620 3303, 2623 3306, 2653 3306, 2653.5 3305.5, 2654 3306, 2656 3304, 2658 3306, 2666 3306, 2670 3302, 2666 3298, 2658 3298, 2656 3300, 2654 3298, 2653.5 3298.5, 2653 3298, 2623 3298, 2620 3301, 2617 3298)), ((2508 3302, 2512 3306, 2512.5 3305.5, 2513 3306, 2517 3302, 2513 3298, 2512.5 3298.5, 2512 3298, 2508 3302)), ((2547 3298, 2543 3302, 2547 3306, 2548.5 3304.5, 2550 3306, 2554 3302, 2550 3298, 2548.5 3299.5, 2547 3298)), ((2758.5 3298.5, 2758 3298, 2751 3298, 2747 3302, 2751 3306, 2758 3306, 2758.5 3305.5, 2759 3306, 2761.5 3303.5, 2764 3306, 2777 3306, 2781 3302, 2777 3298, 2764 3298, 2761.5 3300.5, 2759 3298, 2758.5 3298.5)), ((2724 3299, 2723 3298, 2722 3299, 2721 3298, 2693 3298, 2691 3300, 2689 3298, 2688.5 3298.5, 2688 3298, 2687.5 3298.5, 2687 3298, 2686 3298, 2682 3302, 2686 3306, 2687 3306, 2687.5 3305.5, 2688 3306, 2688.5 3305.5, 2689 3306, 2691 3304, 2693 3306, 2721 3306, 2722 3305, 2723 3306, 2724 3305, 2725 3306, 2726.5 3304.5, 2728 3306, 2733 3306, 2737 3302, 2733 3298, 2728 3298, 2726.5 3299.5, 2725 3298, 2724 3299)))
bertsky commented 4 months ago

Fixed in e67ffe7