CEIDatUGA / COVID-19-DATA

GNU General Public License v3.0
4 stars 6 forks source link

World fatalities #8

Closed jdrakephd closed 4 years ago

jdrakephd commented 4 years ago

I don't see where the "cumulative fatalities reported by country" in the old timeline spreadsheet (https://docs.google.com/spreadsheets/d/1SC28cM52m6s1gTJutpvFxadT9GGu-li90tsqkGuaM48/edit#gid=219891324) got ported over to the github -- am I missing it?

Related, did the "key events" tab can ported over.

Fine for some of this to stay in the spreadsheet, but it needs to be clear what is the definitive source

renikaul commented 4 years ago

That sheet wasn't on the list to be ported over. I can add it to the list. I've been updating the wiki to help define were to find sources. https://docs.google.com/document/d/1VcOnl5xqNg_b42fThnQ7A25sTKF24LmLESQaGFsLEFM/edit?usp=sharing

renikaul commented 4 years ago

@jdrakephd I think I finally understand your question. Working with @rlrichards to make sure we aren't missing anything.

rlrichards commented 4 years ago

This was a dropped ball/communication when the cases got ported over. I can get it over in the same manner as the cases by afternoon of March 30th if not before. The manual scraping goes up through 3-15 and then I will begin wiki archive scraping.

rlrichards commented 4 years ago

If this is needed before tomorrow afternoon let me know and I can try to get it done first thing.

rlrichards commented 4 years ago

So I've written the code to do this back scraping for fatalities but can't get it to work. Does anyone @lsalvador @arw36 @mvevans89 @paigemiller have experience with failed scraping from wayback machine? The same code that scrapes fine from the current page doesn't work on the archived urls/xpaths (and it worked fine on the site live on those previous days).

lsalvador commented 4 years ago

Never tried scraping from the wayback machine, but it seems that there is a way of extracting the old url: https://exposureninja.com/blog/extract-urls-archive-org/

mvevans89 commented 4 years ago

Assuming this issue has been fixed since the world fatality data has been getting updated