VikParuchuri / marker

Convert PDF to markdown quickly with high accuracy
https://www.datalab.to
GNU General Public License v3.0
17.98k stars 1.03k forks source link

Perpendicular headlines in tables fail #378

Open JeandeBalzac opened 5 days ago

JeandeBalzac commented 5 days ago

Hi Vik really a great job you did. I tried with different tables and it works fine. However, if the headline are perpendicular, it will fail. I provided here an example.

image

The result in markdown can be seen here: image

Basically it should be possible with pymupdf to extract also perpendicular text.