opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
https://opendatalab.com/OpenSourceTools
GNU Affero General Public License v3.0
13.24k stars 989 forks source link

Wrong detection and missing some details in the detection. #722

Open Ab-AI-1 opened 1 week ago

Ab-AI-1 commented 1 week ago

img_1

Why is it not able to detect a space-based table? Is there any configuration I should change, or what should I do, so It can detect a space-based table as a table rather than text?

Also, it is missing some content in the detection.

myhloli commented 1 week ago

Since this table lacks any framing lines, the visual model may be more inclined to classify it as a text block rather than a table block.

v3nus-py commented 7 hours ago

to fix your trouble check this solution click maybe this will solve your problem.