opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
https://opendatalab.com/OpenSourceTools
GNU Affero General Public License v3.0
11.19k stars 835 forks source link

表格识别不出来 #485

Closed JacksonZhangHuaQuan closed 2 weeks ago

JacksonZhangHuaQuan commented 2 weeks ago

Description of the bug | 错误描述

{ "bucket_info":{ "bucket-name-1":["ak", "sk", "endpoint"], "bucket-name-2":["ak", "sk", "endpoint"] }, "models-dir":"/tmp/models", "temp-output-dir":"/tmp", "device-mode":"cpu", "table-config": { "is_table_recog_enable": true, "max_time": 400 } } 配置文件已经同步;

How to reproduce the bug | 如何复现

在项目路径下,直接跑项目: magic-pdf pdf-command --pdf "demo/demo2.pdf" --inside_model true

image

Operating system | 操作系统

Windows

Python version | Python 版本

3.10

Software version | 软件版本 (magic-pdf --version)

0.6.x

Device mode | 设备模式

cpu

myhloli commented 2 weeks ago

需要升级0.7.x版本

JacksonZhangHuaQuan commented 2 weeks ago

🆗 可以了

JacksonZhangHuaQuan commented 2 weeks ago

🆗 可以了