VikParuchuri / marker

Convert PDF to markdown quickly with high accuracy
https://www.datalab.to
GNU General Public License v3.0
14.65k stars 764 forks source link

If a PDF contains #s, they will become headers in markdown #77

Closed mrticker closed 2 months ago

mrticker commented 5 months ago

For example, I've seen a paper with a table header saying "# parameters", and it became a H1 header after conversion. I think there should be some form of escaping these.

VikParuchuri commented 2 months ago

Will fix this in the next version (will merge soon).