opendatalab / MinerU

A one-stop, open-source, high-quality data extraction tool, supports PDF/webpage/e-book extraction.一站式开源高质量数据提取工具,支持PDF/网页/多格式电子书提取。
https://opendatalab.com/OpenSourceTools
GNU Affero General Public License v3.0
11.19k stars 835 forks source link

报错:symbol lookup error: /lib/x86_64-linux-gnu/libGLX.so.0: undefined symbol: pthread_mutex_lock #456

Open eeeease opened 3 weeks ago

eeeease commented 3 weeks ago

Description of the bug | 错误描述

正常配置环境,运行magic-pdf -p small_ocr.pdf 报错如下:

(MinerU) root@xxx-0:/xxx/Pro_Learn/Parse_PDF# magic-pdf -p small_ocr.pdf
2024-08-19 16:54:11.414 | INFO     | magic_pdf.libs.pdf_check:detect_invalid_chars:57 - cid_count: 0, text_len: 8, cid_chars_radio: 0.0
2024-08-19 16:54:11.416 | WARNING  | magic_pdf.filter.pdf_classify_by_type:classify:334 - pdf is not classified by area and text_len, by_image_area: False, by_text: False, by_avg_words: False, by_img_num: True, by_text_layout: False, by_img_narrow_strips: False, by_invalid_chars: True
/opt/conda/envs/MinerU/bin/python3.1: symbol lookup error: /lib/x86_64-linux-gnu/libGLX.so.0: undefined symbol: pthread_mutex_lock

How to reproduce the bug | 如何复现

通过google检索,没有发现同样的报错,可能是因为我的环境是在服务器上隔离出来的子环境。 配置文件如下:

{
    "bucket_info":{
        "bucket-name-1":["ak", "sk", "endpoint"],
        "bucket-name-2":["ak", "sk", "endpoint"]
    },
    "temp-output-dir":"/xxx/Pro_Learn/Parse_PDF/MinerU_tmp",
    "models-dir":"/root/.cache/modelscope/hub/wanderkid/PDF-Extract-Kit/models",
    "device-mode":"cpu"
}

相关信息如下:

(MinerU) root@xxx-0:/xxx/Pro_Learn/Parse_PDF# lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 20.04.3 LTS
Release:        20.04
Codename:       focal

(MinerU) root@xxx-0:/xxx/Pro_Learn/Parse_PDF# uname -r
3.10.0-1160.el7.x86_64

(MinerU) root@xxx-0:/xxx/Pro_Learn/Parse_PDF# python --version
Python 3.10.14
(MinerU) root@xxx-0:/xxx/Pro_Learn/Parse_PDF# magic-pdf -v
magic-pdf, version 0.7.0b1

尝试以下办法,无效:

apt-get install --reinstall libgl1-mesa-glx libgl1-mesa-dri

Operating system | 操作系统

Linux

Python version | Python 版本

3.10

Software version | 软件版本 (magic-pdf --version)

0.7.x

Device mode | 设备模式

cpu

snowyseason commented 1 day ago

我也遇到了这个问题,是启动layout模型出现的问题