DataKind-DC / CARES

US CARES Act Payment Protection Program data, cleaned for analysis
GNU General Public License v3.0
6 stars 7 forks source link

Create consistent, smallest-possible-number of Lender names #6

Open kbmorales opened 4 years ago

kbmorales commented 4 years ago

posted by @johnmccambridge

process Lender across unified dataset and cross check against real bank names, and reduce to smallest possible set of consistent names

JohnMcCambridge commented 4 years ago

can validate against list of licensed lenders, however there are a lot of sources:

FDIC, NCUA, CDFI Fund, others?

Even without validation against list, we can also take elements of their name, and use that to often know what type of lender (e.g., Credit Unions often end with 'FCU' or 'CU')