CSSEGISandData / COVID-19

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
https://systems.jhu.edu/research/public-health/ncov/
29.12k stars 18.4k forks source link

Derived (and augmented) dataset available in JSON, TSV and SQL formats #1281

Open cipriancraciun opened 4 years ago

cipriancraciun commented 4 years ago

I have wrote some scripts that take both the series and daily reports files output the following two files:

If you want to automate the download (given how GitHub handles URL's to raw files), you can use the links listed on this page.

Also some plots for these available at:


What I've done:

I will update these files twice per day, say at 06 UTC and 12 UTC.

Moreover I have also added the in the same format also the NY Times US dataset and the ECDC one.


The scripts are available in the following repository and consist mainly of jq snippets.


If anyone has other ideas about what I can add to these augmented datasets please let me know.

cipriancraciun commented 4 years ago

I'll point here a few other issues that are solved by my derived dataset:

cipriancraciun commented 4 years ago

Additional issues that popped-up in the last day that would be helped:

dnalkram commented 4 years ago

Thanks cipriancraciun but app I'm using throws a validation error trying to use the JSON file and online validators too indicate problems with format.

cipriancraciun commented 4 years ago

@dnalkram What is the actual error you are getting?

You could open an issue on my repository (https://github.com/cipriancraciun/covid19-datasets) so that we don't clutter the JHU repository with this.

dnalkram commented 4 years ago

In Power BI I get
"We found extra characters at the end of JSON input.". I then used site: https://www.freeformatter.com/json-validator.html and got: "The JSON input is NOT valid in JavaScript, illegally formed XML syntax (At line #1164), (At position #1)" Hope this helps.

cipriancraciun commented 4 years ago

@dnalkram Could you paste the URL you've used to download the file?

The links in the description above are directly from GitHub, and you must use the raw URL from that page.

To keep things simple, at the following link you can find the files on my own site, which shouldn't give you any issues:

(You can use these second links as I usually update them as soon as I push to GitHub.)

If you still get errors, download that link and double check that you are actually getting a JSON and not an error HTML page.

dnalkram commented 4 years ago

That file worked fine, thanks for all the help!

sbw78 commented 4 years ago

THANK YOU! I have spent the past 3 days trying to wrangle the CCSSE daily reports into a parseable time series, but with the constant changes in data format and inconsistent location naming I was going insane. Looking forward to checking out your data.

cipriancraciun commented 4 years ago

@sbw78 I'm glad I could help you. If you encounter any issues, please open a ticket on my repository and describe the issue. (Given the "quarantine" I usually reply fairly quickly.) :)

BTW, in the interim I have integrated also the NY Times dataset and the ECDC one; thus if you are looking for alternative data you can choose one of these.

ChrisParkerWA commented 4 years ago

Hi Ciprian, Thought I would submit a request here if I may. Is there any way to create a simple JSON dataset along the lines of, for example "country": "Australia" "cases": 4860 "deaths": 20 "recovered": 244

cipriancraciun commented 4 years ago

@ChrisParkerWA I just opened an issue on my repository (https://github.com/cipriancraciun/covid19-datasets/issues/12), where I've proposed to add such a simple format, but perhaps with only a few more values. Please verify my proposal there and let me know if it works for you.

cipriancraciun commented 4 years ago

In the interim I've also added SQL and SQLite DB files for all the datasets.

AmauryVanEspen commented 4 years ago

Hello @cipriancraciun there is several API available on SwaggerHub It seems https://gravitee.io/ could also do the job or a simple flask app http://michal.karzynski.pl/blog/2016/06/19/building-beautiful-restful-apis-using-flask-swagger-ui-flask-restplus/ Is there a willingness to setup this kind of service ?

cipriancraciun commented 4 years ago

@AmauryVanEspen I've just opened an issue about this feature request on my repository:

I would propose moving the discussion there, as this isn't strictly JHU related.

(I would be open to such an API, however we must first understand exactly what its use-cases would be.)