opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
https://opendatalab.com/OpenSourceTools
GNU Affero General Public License v3.0
11.19k stars 835 forks source link

File not found #482

Open yibie opened 2 weeks ago

yibie commented 2 weeks ago

Description of the bug | 错误描述

No such file or directory: '~/tools/PDF-Extract-Kit/models/MFD/weights.pt'

But the file is here:

image

My config file:

{
    "bucket_info":{
        "bucket-name-1":["ak", "sk", "endpoint"],
        "bucket-name-2":["ak", "sk", "endpoint"]
    },
    "models-dir":"~/tools/PDF-Extract-Kit/models/",
    "device-mode":"cpu",
    "table-config": {
        "is_table_recog_enable": false,
        "max_time": 400
    }
}

How to reproduce the bug | 如何复现

Just run the command:

magic-pdf -p /Volumes/Collect/ebook/xxxxx.pdf -o ~/Documents/emacs/org/ref/ -m auto

Operating system | 操作系统

MacOS

Python version | Python 版本

3.10

Software version | 软件版本 (magic-pdf --version)

0.7.x

Device mode | 设备模式

cpu

drunkpig commented 2 weeks ago

"models-dir":"~/tools/PDF-Extract-Kit/models/" ==> "models-dir":"/abs/path/to/tools/PDF-Extract-Kit/models/",