cbrunet / python-poppler

Python binding to Poppler-cpp pdf library
GNU General Public License v2.0

Is it possible to know the order of the text blocks #85

Open rufimelo99 opened 8 months ago

rufimelo99 commented 8 months ago

Firstly, thanks for your contribution thus far.

I've been using Poppler for a while now, and it is not clear to me how to sort the text boxes that it returns.

I noticed that Poppler has a [PopplerStructureElement](https://poppler.freedesktop.org/api/glib/PopplerStructureElement.html), which, I believe, allows us to determine which box comes after which.

Is there a way to replicate this? Or is there an alternative way to know the order for certain?
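
To make "order" concrete: a purely geometric fallback would be to sort the boxes top to bottom, then left to right within a line. The sketch below is only an illustration of that heuristic. `Box`, `reading_order`, and `line_tolerance` are hypothetical names, not python-poppler API, and the result is not the logical order a structure tree would give (it breaks on multi-column layouts, for example).

```python
from typing import List, NamedTuple


class Box(NamedTuple):
    # Hypothetical stand-in for a text box; not a python-poppler type.
    text: str
    left: float
    top: float


def reading_order(boxes: List[Box], line_tolerance: float = 3.0) -> List[Box]:
    """Sort boxes top-to-bottom, then left-to-right within each line."""
    lines: List[List[Box]] = []
    for box in sorted(boxes, key=lambda b: b.top):
        # Join the current line if this box's top is close to the line's first box.
        if lines and abs(box.top - lines[-1][0].top) <= line_tolerance:
            lines[-1].append(box)
        else:
            lines.append([box])
    # Flatten, ordering each line by its left edge.
    return [b for line in lines for b in sorted(line, key=lambda b: b.left)]


boxes = [Box("world", 50, 10.5), Box("Hello", 10, 10), Box("Next", 10, 30)]
print([b.text for b in reading_order(boxes)])
```

The tolerance is an assumed value in page units and would need tuning per document.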

Thanks in advance