wip binarization datasets comparison table

sparkfish / shabby-pages

ShabbyPages is a state-of-the-art corpus of born-digital document images with both ground-truth and distorted versions, suitable for training models to reverse distortions and recover the original denoised documents.

MIT License

wip binarization datasets comparison table #45

Closed gxlarson closed 1 year ago

gxlarson commented 1 year ago

this table is a WIP, but shows how we can compare ShabbyPages against other binarization and de-noising datasets. Most of these prior-work datasets are either way too small (the DIBCOs) or too naive (NO), so this table will highlight the strengths of SP