VikParuchuri / marker

Convert PDF to markdown quickly with high accuracy
https://www.datalab.to
GNU General Public License v3.0
16.82k stars 955 forks source link

Using Surya along with "marker" to get a formatted md file as output #88

Closed trivikramak closed 7 months ago

trivikramak commented 7 months ago

Hi, How can we pass the output of "Surya" to 'marker' to get a neatly formatted md file? Can you give some guidelines or direction to work with?

VikParuchuri commented 7 months ago

Hi there - I'm planning to integrate them soon (replace tesseract/ocrmypdf with surya)