aryn-ai / sycamore

🍁 Sycamore is an LLM-powered search and analytics platform for unstructured data.
https://sycamore.readthedocs.io
Apache License 2.0
300 stars 31 forks source link

Migrate from PyPDf2 to pypdf. #561

Closed bsowell closed 1 month ago

bsowell commented 1 month ago

PyPDF2 has been deprecated and renamed to pypdf. We have also found that there are documents that pypdf can process that PyPDF2 cannot.

HenryL27 commented 1 month ago

IT failed to import pypdf

bsowell commented 1 month ago

IT failed to import pypdf

Oops. I updated all the poetry.lock files, but I missed the one at the root, which is of course the one that matters for the integ tests. Fixed now.