catalyst-cooperative / rmi-ferc1-eia

A collaboration with RMI to integrate FERC Form 1 and EIA CapEx and OpEx reporting
MIT License
3 stars 3 forks source link

fine tune the comparison metric for fuel cost and mmbtu #22

Closed cmgosnell closed 1 year ago

cmgosnell commented 4 years ago

I'm not sure if the comparison metric for total_mmbtu and total_fuel_cost are low because there are so many null values or because the range is so large. I tweaked the comparison feature creation a bit but I'd like to fine tune it.. or determine whether or not this is actually a problem.

zaneselvans commented 4 years ago

I'm not sure if this would be the right thing to do, but might it make sense to apply a standard scaler to the numeric columns that have wildly different sizes? I don't feel like have a super clear grasp on when that's appropriate though. I could also imagine filling in a dummy value for especially the heat content consumed based on the primary fuel type, net generation, and an expected heat rate -- so at least you'd have some value in there, that's in the right ballpark, to compare with whatever is available from FERC, if anything.

cmgosnell commented 1 year ago

no longer relevant.