accessibility-luxembourg / simplA11yPDFCrawler

This tool crawls a list of websites and download all PDF and office documents. Then it analyses the PDF documents and tries to detect accessibility issues.
MIT License
22 stars 3 forks source link

Remove GET parameters in file names #1

Open AlainVagner opened 2 years ago

AlainVagner commented 2 years ago

In the crawler, remove GET parameters in the filename when saving. Find a solution for duplicates. The file extension is used for some statistics, this can lead to further issues.