Closed heavenkiller2018 closed 11 months ago
and layoutparser[layoutmodels,tesseract]
can't be installed correctly
Unstructured File | π¦οΈπ Langchain
pip install layoutparser[layoutmodels,tesseract]
errors:
...
Collecting iopath (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Using cached iopath-0.1.10.tar.gz (42 kB)
Preparing metadata (setup.py) ... Collecting pdfplumber (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Downloading pdfplumber-0.9.0-py3-none-any.whl (46 kB)
ββββββββββββββββββββββββββββββββββββββββ 46.1/46.1 kB 9.4 MB/s eta 0:00:00
Requirement already satisfied: torch in /home/john/micromamba/envs/openai/lib/python3.11/site-packages (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference]) (2.0.1)
Requirement already satisfied: torchvision in /home/john/micromamba/envs/openai/lib/python3.11/site-packages (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference]) (0.15.2)
Collecting effdet (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Downloading effdet-0.4.1-py3-none-any.whl (112 kB)
ββββββββββββββββββββββββββββββββββββββ 112.5/112.5 kB 19.0 MB/s eta 0:00:00
Collecting pytesseract (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Downloading pytesseract-0.3.10-py3-none-any.whl (14 kB)
Requirement already satisfied: coloredlogs in /home/john/micromamba/envs/openai/lib/python3.11/site-packages (from onnxruntime->unstructured-inference==0.5.1->unstructured[local-inference]) (15.0.1)
Requirement already satisfied: flatbuffers in /home/john/micromamba/envs/openai/lib/python3.11/site-packages (from onnxruntime->unstructured-inference==0.5.1->unstructured[local-inference]) (23.5.26)
INFO: pip is looking at multiple versions of onnxruntime to determine which version is compatible with other requirements. This could take a while.
Collecting onnxruntime (from unstructured-inference==0.5.1->unstructured[local-inference])
Downloading onnxruntime-1.15.0-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (5.9 MB)
ββββββββββββββββββββββββββββββββββββββββ 5.9/5.9 MB 21.3 MB/s eta 0:00:0000:0100:01
Collecting layoutparser[layoutmodels,tesseract] (from unstructured-inference==0.5.1->unstructured[local-inference])
Downloading layoutparser-0.3.3-py3-none-any.whl (19.2 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.2/19.2 MB 17.8 MB/s eta 0:00:0000:0100:01
Downloading layoutparser-0.3.2-py3-none-any.whl (19.2 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.2/19.2 MB 12.8 MB/s eta 0:00:0000:0100:01
Downloading layoutparser-0.3.1-py3-none-any.whl (19.2 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.2/19.2 MB 17.1 MB/s eta 0:00:0000:0100:01
INFO: pip is looking at multiple versions of onnxruntime to determine which version is compatible with other requirements. This could take a while.
Downloading layoutparser-0.3.0-py3-none-any.whl (19.2 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.2/19.2 MB 21.1 MB/s eta 0:00:0000:0100:01
Downloading layoutparser-0.2.0-py3-none-any.whl (19.1 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.1/19.1 MB 18.1 MB/s eta 0:00:0000:0100:01
WARNING: layoutparser 0.2.0 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.2.0 does not provide the extra 'tesseract'
WARNING: layoutparser 0.2.0 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.2.0 does not provide the extra 'tesseract'
Downloading layoutparser-0.1.3-py3-none-any.whl (19.1 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.1/19.1 MB 21.8 MB/s eta 0:00:0000:0100:01
WARNING: layoutparser 0.1.3 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.1.3 does not provide the extra 'tesseract'
Collecting pycocotools==2.0.1 (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Downloading pycocotools-2.0.1.tar.gz (23 kB)
Preparing metadata (setup.py) ... Collecting fvcore==0.1.1.post20200623 (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Downloading fvcore-0.1.1.post20200623.tar.gz (32 kB)
Preparing metadata (setup.py) ... Collecting yacs>=0.1.6 (from fvcore==0.1.1.post20200623->layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Downloading yacs-0.1.8-py3-none-any.whl (14 kB)
Collecting portalocker (from fvcore==0.1.1.post20200623->layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Using cached portalocker-2.7.0-py2.py3-none-any.whl (15 kB)
Collecting termcolor>=1.1 (from fvcore==0.1.1.post20200623->layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Using cached termcolor-2.3.0-py3-none-any.whl (6.9 kB)
Requirement already satisfied: setuptools>=18.0 in /home/john/micromamba/envs/openai/lib/python3.11/site-packages (from pycocotools==2.0.1->layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference]) (67.8.0)
Collecting cython>=0.27.3 (from pycocotools==2.0.1->layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Using cached Cython-0.29.35-cp311-cp311-manylinux_2_17_x86_64.manylinux2014_x86_64.manylinux_2_24_x86_64.whl (1.9 MB)
Requirement already satisfied: matplotlib>=2.1.0 in /home/john/micromamba/envs/openai/lib/python3.11/site-packages (from pycocotools==2.0.1->layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference]) (3.6.3)
INFO: This is taking longer than usual. You might need to provide the dependency resolver with stricter constraints to reduce runtime. See https://pip.pypa.io/warnings/backtracking for guidance. If you want to abort this run, press Ctrl + C.
WARNING: layoutparser 0.1.3 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.1.3 does not provide the extra 'tesseract'
Collecting layoutparser[layoutmodels,tesseract] (from unstructured-inference==0.5.1->unstructured[local-inference])
Downloading layoutparser-0.1.2-py3-none-any.whl (19.1 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.1/19.1 MB 11.3 MB/s eta 0:00:0000:0100:01
WARNING: layoutparser 0.1.2 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.1.2 does not provide the extra 'tesseract'
Collecting pycocotools (from layoutparser[layoutmodels,tesseract]->unstructured-inference==0.5.1->unstructured[local-inference])
Using cached pycocotools-2.0.6.tar.gz (24 kB)
Installing build dependencies ... Getting requirements to build wheel ... Preparing metadata (pyproject.toml) ... WARNING: layoutparser 0.1.2 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.1.2 does not provide the extra 'tesseract'
Collecting layoutparser[layoutmodels,tesseract] (from unstructured-inference==0.5.1->unstructured[local-inference])
Downloading layoutparser-0.1.1-py3-none-any.whl (19.1 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.1/19.1 MB 12.4 MB/s eta 0:00:0000:0100:01
WARNING: layoutparser 0.1.1 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.1.1 does not provide the extra 'tesseract'
INFO: pip is looking at multiple versions of layoutparser[layoutmodels,tesseract] to determine which version is compatible with other requirements. This could take a while.
Downloading layoutparser-0.1.0-py3-none-any.whl (19.1 MB)
ββββββββββββββββββββββββββββββββββββββββ 19.1/19.1 MB 11.9 MB/s eta 0:00:0000:0100:01
WARNING: layoutparser 0.1.0 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.1.0 does not provide the extra 'tesseract'
Downloading layoutparser-0.0.1-py3-none-any.whl (10 kB)
WARNING: layoutparser 0.0.1 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.0.1 does not provide the extra 'tesseract'
WARNING: layoutparser 0.0.1 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.0.1 does not provide the extra 'tesseract'
WARNING: layoutparser 0.1.1 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.1.1 does not provide the extra 'tesseract'
WARNING: layoutparser 0.1.0 does not provide the extra 'layoutmodels'
WARNING: layoutparser 0.1.0 does not provide the extra 'tesseract'
ERROR: Cannot install layoutparser[layoutmodels,tesseract]==0.1.0 and layoutparser[layoutmodels,tesseract]==0.1.1 because these package versions have conflicting dependencies.
The conflict is caused by:
layoutparser[layoutmodels,tesseract] 0.1.1 depends on torch==1.4
layoutparser[layoutmodels,tesseract] 0.1.0 depends on torch==1.4
To fix this you could try to:
1. loosen the range of package versions you've specified
2. remove package versions to allow pip attempt to solve the dependency conflict
ERROR: ResolutionImpossible: for help visit https://pip.pypa.io/en/latest/topics/dependency-resolution/#dealing-with-dependency-conflicts
Note: you may need to restart the kernel to use updated packages.
zsh:1: no matches found: layoutparser[layoutmodels,tesseract]
Note: you may need to restart the kernel to use updated packages.
Hi, @heavenkiller2018! I'm Dosu, and I'm here to help the LangChain team manage their backlog. I wanted to let you know that we are marking this issue as stale.
From what I understand, you are experiencing a UnpicklingError
when using the UnstructuredFileLoader
from the langchain
library. The error message suggests that the pickle data is truncated. Additionally, you mentioned having trouble installing layoutparser[layoutmodels,tesseract]
and provided the error message.
Before we close this issue, we wanted to check with you if this issue is still relevant to the latest version of the LangChain repository. If it is, please let us know by commenting on the issue. Otherwise, feel free to close the issue yourself, or it will be automatically closed in 7 days.
Thank you for your understanding and cooperation!
System Info
β― pip list |grep unstructured unstructured 0.7.9 β― pip list |grep langchain langchain 0.0.215 langchainplus-sdk 0.0.17
Who can help?
No response
Information
Related Components
Reproduction
errors:
how to fix it
Expected behavior
no