catalyst-cooperative / pudl

The Public Utility Data Liberation Project provides analysis-ready energy system data to climate advocates, researchers, policymakers, and journalists.
https://catalyst.coop/pudl
MIT License
456 stars 105 forks source link

`entity_types_eia` table missing from pudl.sqlite #3075

Open arengel opened 7 months ago

arengel commented 7 months ago

Describe the bug

entity_types_eia table is missing from pudl.sqlite and the entity_type column in denorm_utilities_eia only has 1-3 letter codes that require entity_types_eia to decode making the column difficult to use. entity_types_eia is also missing from datasette as noted in #2248.

Bug Severity

How badly is this bug affecting you?

Expected behavior

Including the entity_types_eia in pudl.sqlite would address the issue. But from a user perspective, it would be nice if in the denorm_utilities_eia table there was a column with this info like the 'Ownership Type' column in the original 861 Utility_Data spreadsheet that requires no decoding.

Software Environment?

zaneselvans commented 7 months ago

Unfortunately this table doesn't yet exist, because we didn't finish reconciling the minor differences between the entity types used in EIA-860 and EIA-861, as described in #1392.

There's inconsistency in how these entity / ownership types are reported across different years, across different tables, and between the two datasets.