issues
search
VikParuchuri
/
marker
Convert PDF to markdown quickly with high accuracy
https://www.datalab.to
GNU General Public License v3.0
14.14k
stars
720
forks
source link
Improve performance
#143
Closed
VikParuchuri
closed
1 month ago
VikParuchuri
commented
1 month ago
Fewer false positives (and true positives :( ) for OCR heuristics
Speed up OCR performance by pulling in new surya version
Fix pdftext bug causing heuristic false positives
Improve pdf extraction time marginally