mgmeyers / pdfannots2json

GNU Affero General Public License v3.0
42 stars 5 forks source link

FR: Automatic recognition of headings #18

Open chrisgrieser opened 1 year ago

chrisgrieser commented 1 year ago

I was wondering whether it is possible to somehow detect headings in a PDF and add them as annotations automatically?

That way, you can automatically get headings for structuring your extracted annotations, for example. (Or even auto-generate ToCs from a PDF?)