This tool crawls a list of websites and download all PDF and office documents. Then it analyses the PDF documents and tries to detect accessibility issues.
MIT License
22
stars
3
forks
source link
requirements.txt should be reviewed as the project does only work with pikepdf<=6.2.9 #11
Hello,
When first using this lib, I could not manage to make it work as I had the following error :
ImportError: cannot import name '_qpdf' from 'pikepdf' (/home/<user>/.local/lib/python3.10/site-packages/pikepdf/__init__.py)
I found out that pikepdf should not be installed above 6.2.9
requirements.txt should be moved from
scrapy pikepdf
langcodes
pdfminer.six
bitstring
dateparser
typer
to
scrapy pikepdf==6.2.9 langcodes pdfminer.six bitstring dateparser typer