biglocalnews / warn-scraper

Command-line interface for downloading WARN Act notices of qualified plant closings and mass layoffs from state government websites
https://warn-scraper.readthedocs.io
Apache License 2.0

Patch CO scraper #615

Closed stucka closed 7 months ago

stucka commented 7 months ago

Testing with csvkit:

csvcut -n co-old.csv to get a list of columns:

workforce_region jobs location occupations permanent_job_losses dropdown letter naics end_date email at_the_location contact fein notice_date temporary_job_losses company received_date reason begin_date reduced_hours furloughs phone workforce_area

workforce_region,jobs,location,occupations,permanent_job_losses,dropdown,letter,naics,end_date,email,at_the_location,contact,fein,notice_date,temporary_job_losses,company,received_date,reason,begin_date,reduced_hours,furloughs,phone,workforce_area

csvsort co-old.csv >co-old-sorted.csv

csvcut -c workforce_region,jobs,location,occupations,permanent_job_losses,dropdown,letter,naics,end_date,email,at_the_location,contact,fein,notice_date,temporary_job_losses,company,received_date,reason,begin_date,reduced_hours,furloughs,phone,workforce_area co-new-original.csv | csvsort >co-new-sorted-trimmed.csv

diff co-new-sorted-trimmed.csv co-old-sorted.csv
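The same check can be sketched in Python with only the standard library, which may be handy where csvkit is not installed. This is a rough equivalent of the steps above, not part of the scraper: the function names `load_rows` and `compare_csvs` are made up for illustration. It takes the column list from the old file's header, trims the new file to those columns, and compares the rows as multisets so that row order does not matter.

```python
import csv
from collections import Counter


def load_rows(path, columns):
    """Return a Counter of the file's rows, restricted to `columns` in that order."""
    with open(path, newline="", encoding="utf-8") as f:
        reader = csv.DictReader(f)
        # Raises KeyError if the file lacks one of the requested columns.
        return Counter(tuple(row[c] for c in columns) for row in reader)


def compare_csvs(old_path, new_path):
    """Compare two CSVs on the old file's columns, ignoring row order.

    Returns (only_old, only_new): Counters of rows found in just one file.
    """
    # Use the old file's header as the column list (like csvcut -n, then csvcut -c).
    with open(old_path, newline="", encoding="utf-8") as f:
        columns = next(csv.reader(f))
    old_rows = load_rows(old_path, columns)
    new_rows = load_rows(new_path, columns)
    # Counter subtraction keeps only positive counts, so duplicates are respected.
    return old_rows - new_rows, new_rows - old_rows
```

Calling `compare_csvs("co-old.csv", "co-new-original.csv")` would return two empty Counters if the patched scraper reproduces every old row; anything left in either Counter is a row to inspect by hand.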