DS4SD / docling-parse

Simple package to extract text with coordinates from programmatic PDFs
MIT License
29 stars 8 forks source link

feat: use system deps #33

Closed dolfim-ibm closed 1 month ago

dolfim-ibm commented 1 month ago

Resolve #26

dolfim-ibm commented 1 month ago

Tested on a RHEL VM

$ USE_SYSTEM_DEPS=ON poetry install
Installing the current project: docling-parse (1.3.1)

$ poetry run pytest
============================================================== test session starts ===============================================================
platform linux -- Python 3.9.18, pytest-7.4.4, pluggy-1.5.0
rootdir: /data/docling-dev/docling-parse
collected 4 items                                                                                                                                

tests/test_parse.py ....                                                                                                                   [100%]

=============================================================== 4 passed in 12.24s ===============================================================

$ ldd docling_parse/docling_parse.cpython-39-x86_64-linux-gnu.so 
    linux-vdso.so.1 (0x00007ffc2e3bc000)
    libqpdf.so.28 => /lib64/libqpdf.so.28 (0x00007f4d9ecb3000)
    libjpeg.so.62 => /lib64/libjpeg.so.62 (0x00007f4d9ec32000)
    libz.so.1 => /lib64/libz.so.1 (0x00007f4d9ec18000)
    libresolv.so.2 => /lib64/libresolv.so.2 (0x00007f4d9ec04000)
    libstdc++.so.6 => /lib64/libstdc++.so.6 (0x00007f4d9e800000)
    libm.so.6 => /lib64/libm.so.6 (0x00007f4d9eb29000)
    libgcc_s.so.1 => /lib64/libgcc_s.so.1 (0x00007f4d9eb0c000)
    libc.so.6 => /lib64/libc.so.6 (0x00007f4d9e400000)
    /lib64/ld-linux-x86-64.so.2 (0x00007f4d9ef4c000)
    libgnutls.so.30 => /lib64/libgnutls.so.30 (0x00007f4d9e000000)
    libp11-kit.so.0 => /lib64/libp11-kit.so.0 (0x00007f4d9e669000)
    libidn2.so.0 => /lib64/libidn2.so.0 (0x00007f4d9eaeb000)
    libunistring.so.2 => /lib64/libunistring.so.2 (0x00007f4d9e27b000)
    libtasn1.so.6 => /lib64/libtasn1.so.6 (0x00007f4d9ead1000)
    libnettle.so.8 => /lib64/libnettle.so.8 (0x00007f4d9ea7a000)
    libhogweed.so.6 => /lib64/libhogweed.so.6 (0x00007f4d9df68000)
    libffi.so.8 => /lib64/libffi.so.8 (0x00007f4d9ea6e000)