feomike / slope

makes a bunch of data objects for time series line charts

acquire data #1

Closed feomike closed 8 years ago

feomike commented 8 years ago

acquire all public data from the national archives. use this catalog entry to find and download all the data files: https://catalog.archives.gov/id/2456161?q=2456161

feomike commented 8 years ago

data for HMDA has changed a couple of times over the years, but generally speaking the national archives has all historical data online for download. each year has a panel, a transmittal sheet, and the loan application register (LAR). for each of these there is a final and an ultimate version. the final is the data that is public. the ultimate includes changes from a 2 year look-back period, covering resubmissions from financial institutions because of audits or examinations. the national archives also has documentation on each data file.

while digital data goes back to 1981, the 1981 - 1989 LAR files (the loan level data actually needed for this project) are delivered in an ancient binary format for which i have yet to find a definition, so i am ignoring them. for what it's worth, my notes for downloading these files are in this google worksheet (https://docs.google.com/spreadsheets/d/1bGAzwTEUeg4oV7Y4nqHCfPZi_zA7KiVTmTVUUkU4FOI/edit?usp=sharing). for this project i am generally using final data, because that is the data that is actually published for a given year.
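as a rough illustration of the selection described above, here is a minimal python sketch that lists which (year, dataset, edition) combinations would be acquired. it is not the code used in this repo. the function name `build_manifest`, the dataset labels, and the `last_year` parameter are all hypothetical, and it assumes that only the LAR is unreadable for 1981 - 1989 while the panel and transmittal sheet are still wanted for those years. it deliberately builds no download URLs, since those have to come from the catalog itself.

```python
from itertools import product

# Labels are hypothetical; they only name the three file types per year.
DATASETS = ("panel", "transmittal_sheet", "lar")
EDITIONS = ("final", "ultimate")

FIRST_DIGITAL_YEAR = 1981       # digital data goes back to 1981
FIRST_READABLE_LAR_YEAR = 1990  # 1981 - 1989 LAR is in an undocumented binary format


def build_manifest(last_year, editions=("final",)):
    """List the (year, dataset, edition) combinations to acquire.

    `last_year` is an assumption supplied by the caller; the thread does
    not say which year is the most recent one available.
    """
    for edition in editions:
        if edition not in EDITIONS:
            raise ValueError(f"unknown edition: {edition!r}")

    manifest = []
    years = range(FIRST_DIGITAL_YEAR, last_year + 1)
    for year, dataset, edition in product(years, DATASETS, editions):
        # Skip the early LAR files that cannot be decoded.
        if dataset == "lar" and year < FIRST_READABLE_LAR_YEAR:
            continue
        manifest.append({"year": year, "dataset": dataset, "edition": edition})
    return manifest


if __name__ == "__main__":
    for entry in build_manifest(last_year=1991):
        print(entry["year"], entry["dataset"], entry["edition"])
```

the default of `editions=("final",)` mirrors the choice to work with final data; passing `editions=EDITIONS` would list the ultimate files as well.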

feomike commented 8 years ago

postscript - this repo purposefully works only with open public data. nothing in this repo or in the data acquired is confidential or closed in any way.