opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
https://opendatalab.com/OpenSourceTools
GNU Affero General Public License v3.0
11.28k stars 845 forks source link

magic-pdf-dev jsonl -j 本地jsonl 报错 #367

Open dt-yy opened 1 month ago

dt-yy commented 1 month ago

Description of the bug | 错误描述

image

How to reproduce the bug | 如何复现

magic-pdf-dev jsonl -j part-662de9cad021-000046.jsonl

Operating system | 操作系统

Linux

Python version | Python 版本

3.10

Software version | 软件版本 (magic-pdf --version)

0.6.x

Device mode | 设备模式

cuda

icecraft commented 1 month ago

please make sure you have s3 config in your magic-pdf.json (usually under the HOME director on linux platform)