Open timtensor opened 7 months ago
Bug Description
I am using the following code to install the data loader in a Google Colab environment:
from pathlib import Path
from llama_index import download_loader
from llama_index import SimpleDirectoryReader

UnstructuredReader = download_loader('UnstructuredReader')

dir_reader = SimpleDirectoryReader('./Data', file_extractor={
    ".pdf": UnstructuredReader(),
    ".html": UnstructuredReader(),
    ".eml": UnstructuredReader(),
})
documents = dir_reader.load_data()
However, I keep running into the following error:
ImportError: partition_pdf is not available.
Version
0.9.29
Steps to Reproduce
Follow the description above in a Google Colab environment.
Relevant Logs/Tracebacks
No response
I think you need to pip install "unstructured[pdf]"
The reqs for this loader should maybe be updated
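To confirm whether the extra actually landed in the Colab session, a quick check along these lines may help. This is only a sketch: `can_import` is a hypothetical helper name, and it assumes `partition_pdf` is exposed from the `unstructured.partition.pdf` module.

```python
import importlib


def can_import(module_name: str, attr: str) -> bool:
    """Return True if `module_name` imports cleanly and exposes `attr`."""
    try:
        module = importlib.import_module(module_name)
    except ImportError:
        # Covers ModuleNotFoundError too, e.g. when the PDF extra is missing.
        return False
    return hasattr(module, attr)


# Assumed location of partition_pdf inside the unstructured package.
if can_import("unstructured.partition.pdf", "partition_pdf"):
    print("partition_pdf is available")
else:
    print('partition_pdf is missing; try: pip install "unstructured[pdf]"')
```

In Colab the runtime may need a restart after installing before the import succeeds.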