Open zaneselvans opened 1 year ago
In the boilers_entity_eia table, there are only null values for the boiler_manufactuer and boiler_manufacturer_code columns.
boilers_entity_eia
boiler_manufactuer
boiler_manufacturer_code
This is true in the outputs, and also after the clean_boilers_eia860() has been run, where this is done:
clean_boilers_eia860()
# Add boiler manufacturer name to column b_df["boiler_manufacturer"] = b_df.boiler_manufacturer_code.map( pudl.helpers.label_map( CODE_METADATA["environmental_equipment_manufacturers_eia"]["df"], from_col="code", to_col="description", null_value=pd.NA, ) )
Not sure why this isn't being caught by our "no null columns" data validations.
Is this only happening in the fast ETL, or in the full as well? This is a column that only exists in 2009 & 2010 I believe.
Ahhh, interesting. Not sure if I checked both.
Describe the bug
In the
boilers_entity_eia
table, there are only null values for theboiler_manufactuer
andboiler_manufacturer_code
columns.This is true in the outputs, and also after the
clean_boilers_eia860()
has been run, where this is done:Not sure why this isn't being caught by our "no null columns" data validations.