Closed KaylaCrush closed 1 year ago
One obvious solution would be to collapse the unique columns into Lobbyist or Authorizing Officer Name Agent Type or Title
Ultimately I'm going to want each page I process to have a unique identifier. Currently I'm using a combination of the date range and either the business name(for entities) or lobbyist name(for individuals). Do I have another option??
partially done in beautiful soup refactor, opening new issue for adding source_id as a foreign key rather than having all the source info copied into every row
This header table sucks actually. I NEED to figure out a way to reconcile the business name / lobbyist name / source name issue.
Unique Individual Lobbyist Columns: Lobbyist Name, Agent Type
Unique Lobbyist Entity Columns: Authorizing Officer Name, Title
Common Columns: Business Name(sometimes blank), Address, City state zip, Country, Phone
HONESTLY, I'm not sure I NEED to be saving this info. What does it get me??
I feel like 1 makes it worthwhile by itself. Gotta figure out how I want to do this.