opendatalab / MinerU

A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
https://opendatalab.com/OpenSourceTools?tool=extract
GNU Affero General Public License v3.0
17.79k stars 1.29k forks source link

more interactive web app #899

Open pJahad opened 1 week ago

pJahad commented 1 week ago

Is your feature request related to a problem? Please describe. Current webapp does not allow me to click on a highlight on a PDF to see the corresponding markdown string and vice versa. This will make it easier to check PDF parsing results.

Describe the solution you'd like Here's how I imagine it would look: https://papermage.org/reader/papermage

Focusshang commented 5 days ago

Thank you for the feedback. We will add the feature to link the blocks in the PDF with the corresponding markdown text in the web app in a future version.