catalyst-cooperative / ferc-xbrl-extractor

A tool for converting FERC filings published in XBRL into SQLite databases
MIT License

(WIP!) Xbrl steam/fuel #16

Closed zschira closed 2 years ago

zschira commented 2 years ago

@cmgosnell commented on Mon Jun 27 2022

status:

the steam and fuel tables are both processed/concatenated and the ferc plant-id-er works!

I'd love some overall design input... I pretty aggressively moved cleaning details into global variables so that every function can take a df and/or a table name and/or a source of the ferc1 data (xbrl/dbf), in the hopes that this will be more dagster-friendly and standardized.
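A minimal sketch of that pattern, for illustration only: cleaning parameters live in a module-level dict keyed by table name and source, and one generic function takes the data, the table name, and the source. `RENAME_COLUMNS`, `rename_columns`, and every column name below are hypothetical, and plain lists of dicts stand in for DataFrames so the example has no dependencies.

```python
# Hypothetical module-level cleaning parameters, keyed by table name and
# then by data source. The column names are made up for illustration.
RENAME_COLUMNS = {
    "fuel_ferc1": {
        "dbf": {"fuel": "fuel_type", "plant_name": "plant_name_ferc1"},
        "xbrl": {"FuelKind": "fuel_type", "PlantName": "plant_name_ferc1"},
    },
}


def rename_columns(rows, table_name, source):
    """Rename columns using the parameters for one table and one source.

    ``rows`` is a list of dicts standing in for a DataFrame.
    """
    mapping = RENAME_COLUMNS[table_name][source]
    # Columns without an entry in the mapping keep their original name.
    return [{mapping.get(col, col): val for col, val in row.items()} for row in rows]


dbf_rows = [{"fuel": "coal", "plant_name": "Big Bend"}]
xbrl_rows = [{"FuelKind": "gas", "PlantName": "Little Bend"}]

# After renaming, both sources share one schema and can be concatenated.
fuel = rename_columns(dbf_rows, "fuel_ferc1", "dbf") + rename_columns(
    xbrl_rows, "fuel_ferc1", "xbrl"
)
```

Because the function body never mentions a specific table, the same signature could plausibly be reused for every table and source, which is the standardization this comment is after.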

questions:

ferc1_tfr_dfs = {}
# make all the non-steam tables
for table_name in ferc1_settings.tables:
    # the steam table needs the transformed fuel table, so it is built below
    if table_name == "plants_steam_ferc1":
        continue
    # look up the transform function that shares the table's name
    ferc1_tfr_dfs[table_name] = globals().get(table_name)(
        ferc1_dbf_raw_dfs.get(table_name),
        ferc1_xbrl_raw_dfs.get(table_name)
    )
# make the steam table using fuel
ferc1_tfr_dfs["plants_steam_ferc1"] = plants_steam_ferc1(
    steam_dbf_raw=ferc1_dbf_raw_dfs.get("plants_steam_ferc1"),
    steam_xbrl_raw=ferc1_xbrl_raw_dfs.get("plants_steam_ferc1"),
    fuel_transformed=ferc1_tfr_dfs.get("fuel_ferc1")
)
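One possible variation on the dispatch above, sketched under assumptions: an explicit registry dict in place of a lookup by name in the module namespace, so a misspelled table name fails with a `KeyError` instead of calling `None`. The `TRANSFORMS` registry, the stub function bodies, and the `plant`/`has_fuel` fields are hypothetical, and lists of dicts again stand in for DataFrames.

```python
# Stand-in transform functions. Real ones would clean and concatenate
# DataFrames; these only do enough to show the data flow.
def fuel_ferc1(dbf_raw, xbrl_raw):
    return dbf_raw + xbrl_raw


def plants_steam_ferc1(steam_dbf_raw, steam_xbrl_raw, fuel_transformed):
    # The steam table depends on the already-transformed fuel table.
    fuel_plants = {row["plant"] for row in fuel_transformed}
    return [
        dict(row, has_fuel=row["plant"] in fuel_plants)
        for row in steam_dbf_raw + steam_xbrl_raw
    ]


# Hypothetical registry of tables whose transforms share one signature.
TRANSFORMS = {"fuel_ferc1": fuel_ferc1}

ferc1_dbf_raw_dfs = {
    "fuel_ferc1": [{"plant": "a"}],
    "plants_steam_ferc1": [{"plant": "a"}],
}
ferc1_xbrl_raw_dfs = {
    "fuel_ferc1": [{"plant": "b"}],
    "plants_steam_ferc1": [{"plant": "c"}],
}

# Make all the tables that only need their own raw inputs.
ferc1_tfr_dfs = {
    name: func(ferc1_dbf_raw_dfs[name], ferc1_xbrl_raw_dfs[name])
    for name, func in TRANSFORMS.items()
}
# Make the steam table last, because it needs the transformed fuel table.
ferc1_tfr_dfs["plants_steam_ferc1"] = plants_steam_ferc1(
    steam_dbf_raw=ferc1_dbf_raw_dfs["plants_steam_ferc1"],
    steam_xbrl_raw=ferc1_xbrl_raw_dfs["plants_steam_ferc1"],
    fuel_transformed=ferc1_tfr_dfs["fuel_ferc1"],
)
```

Keeping the steam table out of the registry makes its extra dependency on the fuel table explicit, at the cost of one special case outside the loop.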

remaining tasks:

tasks to avoid for now:

See #1707 and #1722 for the table-specific task lists


@cmgosnell commented on Wed Jun 29 2022

a note about the extract step:


@zaneselvans commented on Fri Aug 12 2022

I asked some questions about the FERC1 transform refactor design in this comment on #1739


@zaneselvans commented on Wed Aug 17 2022

Also @bendnorman did you see this list of outstanding questions on the linked issue? https://github.com/catalyst-cooperative/pudl/issues/1739#issuecomment-1213323493

zschira commented 2 years ago

I accidentally moved the wrong issue... Closing here