Closed bayoumi17m closed 4 years ago
The PDF files have the dates + version numbers in their file name which means we can actually "go back in time" For example, https://www1.nyc.gov/assets/doh/downloads/pdf/imm/covid-19-daily-data-summary-hospitalizations-04042020-1.pdf is for April 4th and to get April 3rd we can just go to https://www1.nyc.gov/assets/doh/downloads/pdf/imm/covid-19-daily-data-summary-hospitalizations-04032020-1.pdf
NYC Health has this website which contains PDFs that have stratified disease progression information! The goal here is to fetch all the PDFs (and new ones as they come), and extract the table