accessibility-luxembourg / simplA11yPDFCrawler

This tool crawls a list of websites and downloads all PDF and office documents. Then it analyses the PDF documents and tries to detect accessibility issues.
MIT License

requirements.txt should be reviewed as the project only works with pikepdf<=6.2.9 #11

Open Max33260 opened 2 months ago

Max33260 commented 2 months ago

Hello,

When I first used this library, I could not get it to work because of the following error:

ImportError: cannot import name '_qpdf' from 'pikepdf' (/home/<user>/.local/lib/python3.10/site-packages/pikepdf/__init__.py)

I found out that pikepdf must not be installed at a version above 6.2.9.

requirements.txt should be changed from

scrapy
pikepdf
langcodes
pdfminer.six
bitstring
dateparser
typer

to

scrapy
pikepdf==6.2.9
langcodes
pdfminer.six
bitstring
dateparser
typer
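
For anyone who wants to confirm whether their environment is affected before editing requirements.txt, here is a minimal sketch of a version check. It is not part of the project. The helper names (`parse_version`, `is_supported`, `installed_pikepdf_ok`) are hypothetical, the 6.2.9 upper bound is simply the one reported in this issue, and the parser assumes plain numeric versions such as `6.2.9` (a pre-release suffix like `rc1` would raise a `ValueError`).

```python
from importlib.metadata import PackageNotFoundError, version

# Upper bound reported in this issue (an assumption, not an official constraint).
MAX_PIKEPDF = (6, 2, 9)


def parse_version(text):
    """Turn a plain 'X.Y.Z' release string into a tuple of ints."""
    return tuple(int(part) for part in text.split(".")[:3])


def is_supported(text, upper=MAX_PIKEPDF):
    """Return True when the version is at or below the reported upper bound."""
    return parse_version(text) <= upper


def installed_pikepdf_ok():
    """Check the installed pikepdf; return None when it is not installed."""
    try:
        return is_supported(version("pikepdf"))
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    print("pikepdf within reported bound:", installed_pikepdf_ok())
```

If the check reports `False`, reinstalling with `pip install "pikepdf<=6.2.9"` should bring the environment back within the bound described above.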

AlainVagner commented 2 months ago

Hello, thank you for this issue. We will indeed need to update the requirements.