qurator-spk / eynollah

Document Layout Analysis
Apache License 2.0

Constrain number of columns #94

Open bertsky opened 1 year ago

bertsky commented 1 year ago

Sometimes one does have prior knowledge about the overall page layout of a document. For example, historical monographs often have no columns, but perhaps a few tables (which are, of course, very different: the internal reading order of paragraphs in columns is vertical-first, whereas for cells in tables it is horizontal-first).
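To illustrate the reading-order difference, here is a minimal sketch on a toy 2x2 grid. The `Cell` type and both function names are hypothetical and exist only for this example; they are not part of eynollah.

```python
from typing import List, NamedTuple


class Cell(NamedTuple):
    """A text block placed on a simple grid (hypothetical, for illustration)."""
    text: str
    row: int
    col: int


def column_order(cells: List[Cell]) -> List[str]:
    """Vertical-first: read each column top to bottom, then move right."""
    return [c.text for c in sorted(cells, key=lambda c: (c.col, c.row))]


def table_order(cells: List[Cell]) -> List[str]:
    """Horizontal-first: read each row left to right, then move down."""
    return [c.text for c in sorted(cells, key=lambda c: (c.row, c.col))]


cells = [Cell("A", 0, 0), Cell("B", 0, 1), Cell("C", 1, 0), Cell("D", 1, 1)]
print(column_order(cells))  # ['A', 'C', 'B', 'D']
print(table_order(cells))   # ['A', 'B', 'C', 'D']
```

The same four blocks come out in a different order depending on whether the region is treated as columns or as a table, which is why mistaking one for the other is costly.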

It would be nice to have a parameter (for the OCR-D wrapper) that constrains the results of the column detector, either to a fixed number or to a range.
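A rough sketch of what I have in mind, assuming the detector yields a single integer column count per page. The function name and the `min_columns` / `max_columns` parameters are made up for illustration and do not exist in eynollah or the OCR-D wrapper.

```python
from typing import Optional


def constrain_num_columns(
    detected: int,
    min_columns: Optional[int] = None,
    max_columns: Optional[int] = None,
) -> int:
    """Clamp a detected column count to an optional [min, max] range.

    Hypothetical helper: a fixed number is expressed by setting
    min_columns == max_columns; None leaves that side unconstrained.
    """
    if (
        min_columns is not None
        and max_columns is not None
        and min_columns > max_columns
    ):
        raise ValueError("min_columns must not exceed max_columns")
    if min_columns is not None:
        detected = max(detected, min_columns)
    if max_columns is not None:
        detected = min(detected, max_columns)
    return detected


# Monograph known to be single-column: force exactly one column.
print(constrain_num_columns(3, min_columns=1, max_columns=1))  # 1
# Newspaper known to have 2 to 4 columns.
print(constrain_num_columns(1, min_columns=2, max_columns=4))  # 2
# No prior knowledge: the detector's result passes through unchanged.
print(constrain_num_columns(3))  # 3
```

Clamping is only the simplest option. If the detector exposes per-class scores, picking the best-scoring column count within the allowed range would probably be preferable.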

vahidrezanezhad commented 1 year ago

Of course, as you said, this is a good idea. I will add this option as a parameter.