catalyst-cooperative / pudl-archiver

A tool for capuring snapshots of public data sources and archiving them on Zenodo for programmatic use.
MIT License
4 stars 1 forks source link

Update FERC archivers to handle multiple taxonomies in one year #349

Closed jdangerx closed 2 months ago

jdangerx commented 4 months ago

Overview

FERC form 1 was reported using multiple taxonomies in 2023:

This means that in the extraction, we need to use different taxonomies per file. Which also means that we should track which taxonomy to use in the metadata.

Success Criteria

### Next steps
* [ ] #178 
* [ ] open the XML files, look for the taxonomy links, and parse out a version from them
* [ ] update taxonomy archive filenames
zaneselvans commented 3 months ago

I think this should probably get bumped to High or Urgent priority so that we can unblock work on the new year of FERC Form 1 data.