mittagessen / kraken

OCR engine for all the languages
http://kraken.re
Apache License 2.0
673 stars 125 forks source link

Speed up legacy polygon extraction #585

Closed anutkk closed 3 months ago

anutkk commented 3 months ago

Speed up legacy polygon extraction using light software optimizations:

I tested this segmentation on the BiblIA Italian subset (3748 lines). Results:

Method Accuracy Runtime (minutes:seconds)
Current legacy extraction 97.29% 56:27
Slightly optimized legacy extraction 97.33% 29:26

The speedup is more pronounced for straight baselines (closer to 4x).

The difference in accuracy is only 0.04%.

anutkk commented 3 months ago

Cancelling due to typo