Closed karldw closed 6 years ago
Hmm. This does not look familiar. I wonder if they might have changed the file layouts when they moved everything into the archive directory. Grr. @stevenbwinter has mostly worked on the mapping of the spreadsheet rows & columns. We will take a closer look.
@stevenbwinter is looking into this. It appears that they've retroactively added a tab to the 923 spreadsheets, containing information about oil stocks, which is throwing off the parsing, since the tabs are read in based on their order in the spreadsheets. If that's the only change, it should be easy to fix.
Okay, @karldw both @swinter2011 and I have been able to completely wipe out our datastores and re-initialize the PUDL DB, after changing the parser to accommodate the new tabs EIA added to the spreadsheets.
Works for me too!
Running
init_pudl.py
(with the default arguments) fails when the code tries to ingest the EIA 923 boiler data for 2009. Do you have any tips?When I print out
newdata.columns
, it looks like it's missing headers: