biglocalnews / warn-scraper

Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites
https://warn-scraper.readthedocs.io
Apache License 2.0
28 stars 10 forks source link

HI blocking access #600

Closed stucka closed 5 months ago

stucka commented 6 months ago

Hawaii appears to be blocking access to requests directly, throwing a "blocked" into the title tag. If requests offers up a page with a regular browser's User-Agent, Hawaii throws a challenge error to non-Javascript-enabled browsers.

stucka commented 5 months ago

HI seems to have re-enabled access. Code in #605 now allows a quick one-line change back to Google's cache.

But something else broke in the code -- I think problems with tracking location in the array that @Ash1R had set up -- and so I've replaced some of the parsing code to go line by line through the data, while building a list of dictionaries. Closing this.