Closed aesharpe closed 2 years ago
I like this idea! The calculations sound pretty straightforward and clear to me. I think doing these calculations in the output layer is more in line with the overall pudl conventions of doing most of the (even minor) calculations post the normalized db tables.
(It'll also be easier to integrate/update w/o reloading the full db 😎 )
I think doing these calculations in the output layer is more in line with the overall pudl conventions of doing most of the (even minor) calculations post the normalized db tables.
That makes sense -- it is kind of like a teeny tiny analysis layer. In the fullness of time, however, I think it might make sense to put them in transform? I'm very ok doing it in the output layer for now though.
Idk! I think we do a ton of these little calculations in the output layer right now (so much calculating of capacity_factor
in pudl/output
). We've always said the db tables were the tidy, cleaned tables and the outputs were where we do calculations. This may change slightly if we move our denomalized output tables into db views or otherwise publish them as db tables.... but even in that design configuration we has talked about keeping the normalized/"raw"-ish pudl tables separate from the de-normalized tables w/ simple calculations in them
We've always said the db tables were the tidy, cleaned tables and the outputs were where we do calculations.
That's fair! I can get behind that. I just couldn't remember if we added things like report_year or something in transform or output.
Discussion
Right now, the
opex_nonfuel
column only comes from the steam table. This is the column that the RMI Optimus formatted table relies on for theTotal O&M Cost
column. In order for this column to represent values from all the FERC tables we need to do some simple calculations.Discussed in # 216 in the
rmi_ferc1_eia
repoThe stated calculations are:
Questions