KaylaCrush / advocacy-maps

The good governance project (GGP) is a non-partisan democracy reform group.
https://mapletestimony.org
MIT License

re-examine header issue #36

Closed by KaylaCrush 1 year ago

KaylaCrush commented 1 year ago

This header table sucks actually. I NEED to figure out a way to reconcile the business name / lobbyist name / source name issue.

Unique Individual Lobbyist Columns: Lobbyist Name, Agent Type

Unique Lobbyist Entity Columns: Authorizing Officer Name, Title

Common Columns: Business Name (sometimes blank), Address, City state zip, Country, Phone

HONESTLY, I'm not sure I NEED to be saving this info. What does it get me??

  1. It ties the name of an individual lobbyist to the name of their business
  2. It stores contact information, if that ever becomes a thing people wanna use

I feel like 1 makes it worthwhile by itself. Gotta figure out how I want to do this.

KaylaCrush commented 1 year ago

One obvious solution would be to collapse the unique columns into two shared ones: "Lobbyist or Authorizing Officer Name" and "Agent Type or Title".
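A minimal sketch of that collapse, assuming each parsed header arrives as a plain dict keyed by the column names listed above. The function name `collapse_header` and the dict representation are hypothetical, not what the repo actually does.

```python
# Columns shared by individual lobbyists and lobbyist entities (from the list above).
COMMON_COLUMNS = ["Business Name", "Address", "City state zip", "Country", "Phone"]


def collapse_header(header: dict) -> dict:
    """Merge the individual-only and entity-only columns into two shared ones.

    Hypothetical helper: assumes `header` is a dict of column name -> string.
    """
    # Individuals carry "Lobbyist Name"; entities carry "Authorizing Officer Name".
    name = header.get("Lobbyist Name") or header.get("Authorizing Officer Name") or ""
    # Individuals carry "Agent Type"; entities carry "Title".
    role = header.get("Agent Type") or header.get("Title") or ""

    # Missing common columns (e.g. a blank Business Name) become empty strings.
    collapsed = {column: header.get(column, "") for column in COMMON_COLUMNS}
    collapsed["Lobbyist or Authorizing Officer Name"] = name
    collapsed["Agent Type or Title"] = role
    return collapsed
```

Both kinds of header end up with the same seven keys, so they can live in one table.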

KaylaCrush commented 1 year ago

Ultimately I'm going to want each page I process to have a unique identifier. Currently I'm using a combination of the date range and either the business name (for entities) or the lobbyist name (for individuals). Do I have another option??
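For reference, the current date-range-plus-name scheme could be sketched like this. The function name `page_key`, the whitespace and case normalization, and the truncated SHA-256 digest are all assumptions made for illustration; none of them come from the repo.

```python
import hashlib


def page_key(date_range: str, name: str) -> str:
    """Build a deterministic identifier from the date range and source name.

    `name` is the business name for entities or the lobbyist name for
    individuals. Hypothetical helper, not the project's actual code.
    """
    if not name.strip():
        raise ValueError("a business or lobbyist name is required")
    # Normalize spacing and case so cosmetic differences do not create new keys.
    normalized_name = " ".join(name.split()).casefold()
    raw = f"{date_range.strip()}|{normalized_name}"
    # A short hex digest is easier to store and compare than the raw string.
    return hashlib.sha256(raw.encode("utf-8")).hexdigest()[:16]
```

The catch with this scheme is that two different sources sharing a name and a date range would still collide.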

KaylaCrush commented 1 year ago

Partially done in the Beautiful Soup refactor. Opening a new issue for adding source_id as a foreign key rather than having all the source info copied into every row.
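A rough sketch of what the source_id foreign key could look like, using the standard-library `sqlite3` module. The table names (`sources`, `activity`), their columns, and the `get_or_create_source` helper are hypothetical; the project's real schema may differ.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.execute("PRAGMA foreign_keys = ON")  # SQLite does not enforce foreign keys by default
conn.executescript(
    """
    CREATE TABLE sources (
        source_id     INTEGER PRIMARY KEY,
        name          TEXT NOT NULL,   -- lobbyist or authorizing officer name
        business_name TEXT,            -- sometimes blank
        date_range    TEXT NOT NULL,
        UNIQUE (name, date_range)
    );
    CREATE TABLE activity (
        activity_id INTEGER PRIMARY KEY,
        source_id   INTEGER NOT NULL REFERENCES sources (source_id),
        description TEXT
    );
    """
)


def get_or_create_source(conn, name, business_name, date_range):
    """Return the source_id for this header, inserting the source once if needed."""
    conn.execute(
        "INSERT OR IGNORE INTO sources (name, business_name, date_range) VALUES (?, ?, ?)",
        (name, business_name, date_range),
    )
    row = conn.execute(
        "SELECT source_id FROM sources WHERE name = ? AND date_range = ?",
        (name, date_range),
    ).fetchone()
    return row[0]
```

Each data row then stores only the integer `source_id`, and the header info lives in exactly one place.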