cityofaustin / atd-data-tech

Austin Transportation Data & Technology Services
17 stars 2 forks source link

311 CSR reports prevent web scraping with python #13655

Closed Charlie-Henry closed 9 months ago

Charlie-Henry commented 10 months ago

Related to work here https://github.com/cityofaustin/atd-data-tech/issues/12923

Hichame created some CSR reports for us. They are stored as CSVs at a URL. We are unable to integrate with 311 due to these files having a web app firewall called incapsula which specifically prevents us scraping the data from these files using python. Is there a way to disable this web app firewall or to allow us to authenticate to get around this?

Here's more background:

Charlie-Henry commented 9 months ago

After meeting with Andy at Motorola, he whitelisted the city's IP ranges so they are no longer flagged by incapsula!