wanghaisheng / awesome-ocr

A curated list of promising OCR resources
http://wanghaisheng.github.io/ocr-arxiv-daily/
MIT License
1.66k stars 351 forks source link

Segmentation Based Recovery of Arbitrarily Warped Document Images #94

Closed wanghaisheng closed 6 years ago

wanghaisheng commented 6 years ago

http://users.iit.demokritos.gr/~bgat/Icdar2007_SegmentationBasedRecovery.pdf https://github.com/srinathnaik/segmentation-based-dewarping

wanghaisheng commented 6 years ago

摘要: Non-linear warping appears in document images when captured by a digital camera or a scanner, especially in the case that these documents are digitized bounded volumes. Arbitrarily warped documents may have several slope changes along the text lines as well as along the words of the same text line. In this paper, a novel segmentation based technique for efficient restoration of arbitrarily warped document images is presented. The proposed technique recovers the documents relying upon (i) text lines and words detection using a novel segmentation technique appropriate for warped documents, (ii) a first draft binary image de-warping based on word rotation and translation according to upper and lower word baselines, and (iii) a recovery of the original warped image guided by the draft binary image de-warping result. Experimental results on several arbitrarily warped documents prove the effectiveness of the proposed technique.

wanghaisheng commented 6 years ago

Icdar2007_SegmentationBasedRecovery.pdf