Closed bendnorman closed 5 months ago
I think our deduplication methodology might be a little too aggressive which explains the slight decrease in MW. Otherwise I'm pretty comfortable with these changes.
@bendnorman I see a bunch of interesting investigation/data quality tasks left. Do we have a way of quantifying when this data is "good enough?" If we finish those tasks in the TODO (make sure capacities add up, reconcile NYISO/SPP capacity diffs, etc.) does that mean we're done with this, or do you think we'll find more data quality issues and keep chasing those down?
Create a change log of ISO queue projects where each record is a project at a particular status. Projects will either have one or two records: one for when the project enters the queue and one for when the project is withdrawn or becomes operational.
With this change log, we should create a table that contains the number of projects that become operational, enter the queue and are withdrawn for a given geography and time frame. This table should allow users to understand queue changes for a given geography over time.
TODOs
queue_date
.infra_petrochemicals_and_plastics_proposed_pm2_5_tonnes_per_yea
and ,n_tracts_risk_management_plan_proximity_low_income
in the database but not in the parquet files / bigquery?n_tracts_risk_management_plan_proximity_low_income
wasn't because it never existed in a dataframe but it existed in the metadata. I removed it from the metadata. The untruncatedinfra_petrochemicals_and_plastics_proposed_pm2_5_tonnes_per_yea**r**
is in the dataframe but gets truncated when loaded to postgres but not parquet.queue_status
values.Out of Scope tasks
queue_id
andentity
Options
Withdrawn and operational date availability
Withdrawn
Overall 61% of withdrawn projects have a withdrawn date.
Operational
Only 52% of projects have actual operational dates. However, 89% of projects have a proposed operational date. Of projects with both proposed and actual operational dates, 70% of the projects' dates were within a year of each other. We could use proposed as guestimate for operational projects without actual operational dates.
ERCOT Integration