CSSEGISandData / COVID-19

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
https://systems.jhu.edu/research/public-health/ncov/
29.14k stars 18.43k forks source link

* * * * * * USE THIS time_series_19-covid-Confirmed.csv UNTIL YOU FIX YOUR SCHEMA (US states included) * * * * * * * * #1496

Open rks125 opened 4 years ago

rks125 commented 4 years ago

I used PowerQuery to recreate the time series file for today (3/24/2020, which includes 3/23/2020 as well) for confirmed cases. It took about 1.5 hours (of which 30 minutes was because I ended up realizing I could not use the timestamp in the table, reliably).

I recreated this to hold you over until you adjust to the new schema or John Hopkins fixes. Please let me know if you have any problems.

I did not update the Recovered or Deaths. I could do them if desired and I have time.

https://www.soothsawyer.com/john-hopkins-time-series-data-confirmed-case-csv-after-march-22-2020/?github=1

therealcyberlord commented 4 years ago

Can you post it on Github? Thank you for your work.

hippodribble commented 4 years ago

Also see issue #1458 for some Python code to transform the files directly to the old format.

rks125 commented 4 years ago

Also see issue #1458 for some Python code to transform the files directly to the old format.

The Python script has no state data for the US. The CSV I created has state data

hippodribble commented 4 years ago

@rks125 - I just ran the function on data from the 24th Seems to work OK.

The file I used was: COVID-19/csse_covid_19_data/csse_covid_19_daily_reports/03-24-2020.csv

The output is attached as 03-24-2020b.txt (I renamed it to .txt as GitHub doesn't allow .csv).

State/Province is in column 1.

Cheers

G.

03-24-2020b.txt

rks125 commented 4 years ago

@rks125 - I just ran the function on data from the 24th Seems to work OK.

The file I used was: COVID-19/csse_covid_19_data/csse_covid_19_daily_reports/03-24-2020.csv

The output is attached as 03-24-2020b.txt (I renamed it to .txt as GitHub doesn't allow .csv).

State/Province is in column 1.

Cheers

G.

03-24-2020b.txt

Got it. It looks like you are doing single day conversion. The file I created is combining the old time series with the recent daily files in order to create a refreshed older schema time series.

jkellndorfer commented 4 years ago

@rks125 Thanks so much for bridging the gap with your data set.

If you can also provide the DEATHS data in the old format for now, that would be quite helpful.

Here is a website I put together. http://ebd-covid19.s3-website-us-east-1.amazonaws.com/. Gave you a shoutout in the data acknowledgement section already. Code for website available at my open source github repo: https://github.com/EarthBigData/covid19

rks125 commented 4 years ago

I added Deaths, Recovered, Confirmed CSV files, updated up to 3/24/2020 on my site. I also included the PowerQuery tool if you want to refresh the data yourself.

https://www.soothsawyer.com/john-hopkins-time-series-data-confirmed-case-csv-after-march-22-2020/

jkellndorfer commented 4 years ago

Thanks!

Are you planning to update the data daily while we wait for John Hopkins to provide them for the U.S.? Is your tool running automatically?

rks125 commented 4 years ago

I will update it daily unless John Hopkins keeps breaking it further. They update their data at 5pm PT so I will have the refreshed files within 15 minutes unless I'm outside. :)

I also included the PowerQuery tool that I'm using on my website, in the event you wanted to refresh it. You just need Excel (on Windows).

jkellndorfer commented 4 years ago

Great, Thanks!