Open katie-lamb opened 3 days ago
The EIA 861 merger's table (core_eia861__yearly_mergers
) has address information for the new parent company that can be joined on by utility_id_eia
, but there are only a small number of utilities in this table (219 rows). But the merger_company
doesn't have an ID and doesn't look super clean (it's a string).
Also, we aren't harvesting utilities from 861, probably we should do this in parallel with developing the match.
Overview
See record linkage design doc for diagram and more notes.
We want to conduct record linkage between the SEC filers we've extracted and the companies (owner and operator utilities) that file with EIA (Form 860 and 861). There are a few more steps to make sure we have the correct data from both sides, and then we can do a proof of concept to make sure that splink can effectively connect SEC to EIA.
Success Criteria