We need to get the EPA CEMS/CAMD to EIA crosswalk table integrated into our system in a durable way. The first step is scraping it and archiving it on Zenodo.
The files are currently published by EPA in this GitHub repo as a mix of CSV and XLSX files. We haven't scraped from GitHub before, so we'll need to work out a reliable way to list and download the files.
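One possible approach is to list the repo through GitHub's REST "contents" endpoint and keep only the CSV and XLSX entries. This is a minimal sketch, not a settled design: the `owner`, `repo`, and `path` arguments are placeholders because the repo is only linked, not named, here, and the function names are hypothetical.

```python
import json
import urllib.request

# GitHub REST endpoint that returns a JSON listing of one directory in a repo.
API_TEMPLATE = "https://api.github.com/repos/{owner}/{repo}/contents/{path}"
WANTED_SUFFIXES = (".csv", ".xlsx")


def select_data_files(entries):
    """Return (name, download_url) pairs for CSV/XLSX files in a listing."""
    return [
        (entry["name"], entry["download_url"])
        for entry in entries
        if entry.get("type") == "file"
        and entry["name"].lower().endswith(WANTED_SUFFIXES)
    ]


def list_repo_files(owner, repo, path=""):
    """Fetch one directory listing from the contents API (needs network).

    owner/repo/path are placeholders; fill them in from the linked repo.
    """
    url = API_TEMPLATE.format(owner=owner, repo=repo, path=path).rstrip("/")
    request = urllib.request.Request(
        url, headers={"Accept": "application/vnd.github+json"}
    )
    with urllib.request.urlopen(request) as response:
        return json.load(response)
```

Keeping the filter separate from the network call means the selection logic can be tested against a saved listing. Unauthenticated API requests are rate limited, so a real scraper would probably want a token and should also recurse into subdirectories (entries with `"type": "dir"`).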
Once we have Zenodo archives that store these files in a data package, we can work on integrating them into the ETL and database structure.
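To make the "data package" part concrete, here is a rough sketch of building a Frictionless-style descriptor (the content of a `datapackage.json`) for the downloaded files. It assumes the archive follows the Frictionless Data Package layout; the package name `epacamd-eia` and the function `build_descriptor` are made up for illustration, and our existing archiver may already handle this differently.

```python
import hashlib
from pathlib import Path


def build_descriptor(package_name, files):
    """Build a minimal data package descriptor.

    files maps each filename to its raw bytes. Resource names are derived
    from the file stem; real filenames may need extra cleanup to satisfy
    the spec's naming rules (lowercase letters, digits, '.', '_', '-').
    """
    resources = []
    for filename, content in sorted(files.items()):
        file_path = Path(filename)
        resources.append(
            {
                "name": file_path.stem.lower(),
                "path": filename,
                "format": file_path.suffix.lstrip(".").lower(),
                "bytes": len(content),
                # Non-MD5 hashes are prefixed with the algorithm name.
                "hash": "sha256:" + hashlib.sha256(content).hexdigest(),
            }
        )
    return {"name": package_name, "resources": resources}
```

Recording the size and checksum of each file lets the ETL verify that what it reads from Zenodo matches what was archived.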