opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
https://opendatalab.com/OpenSourceTools?tool=extract
GNU Affero General Public License v3.0
18.27k stars 1.31k forks source link

Will support DocLayout-YOLO_ft? Like PDF-Extract Kit #761

Closed Schumpeterx closed 3 weeks ago

myhloli commented 1 month ago

We'll need some time to adapt, which will be updated in the future.

myhloli commented 3 weeks ago

We have integrated doclayout_yolo, you can test it on hugging face demo or modelscope demo. Also you can local deployment with next MinerU release version, it will at end of this month.