biglocalnews / warn-scraper

Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites
https://warn-scraper.readthedocs.io
Apache License 2.0
29 stars 10 forks source link

Fix OH: Data source moved #532

Closed stucka closed 11 months ago

stucka commented 1 year ago

I don't know if the existing scraper works on the new URL. Looks like might work.

Ohio was here: https://jfs.ohio.gov/warn/index.stm

https://jfs.ohio.gov/job-services-and-unemployment/job-services/job-programs-and-services/submit-a-warn-notice/submit-a-warn-notice-sa-1/submit-a-warn-notice

stucka commented 1 year ago

New site 404s unless a User-Agent is offered in the GET headers.

Older years are not indexed in a predictable way, should be scraped from section#js-odx-content__body ...

Some older years are PDFs.

stucka commented 11 months ago

Done.