CSSEGISandData / COVID-19

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
https://systems.jhu.edu/research/public-health/ncov/
29.11k stars 18.39k forks

Import Unfriendly? #752

Open raeldor opened 4 years ago

raeldor commented 4 years ago

CSV seems a little import unfriendly. Would a file with dates as a key and 'confirmed', 'deaths', 'recovered' as columns not be more import friendly?

Sitius86 commented 4 years ago

Hi @raeldor, you must be new here.

This is the land of the lost. A forsaken place that data management forgot. You'll end up driving yourself crazy trying to advocate for "best practices" and "optimization" and "logic" here...welcome to the grand ETL tournament!

yy commented 4 years ago

Hi! I've made https://github.com/covid19-data/covid19-data to make the data pipeline transparent, automatic, and more sensible. Feel free to come over and help us!

feldspar-terra commented 4 years ago

import pandas as pd

infectedurl = 'https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_19-covid-Confirmed.csv'
# Skip malformed rows; on pandas older than 1.3 use error_bad_lines=False instead.
infected = pd.read_csv(infectedurl, on_bad_lines='skip')
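Once the wide time series is loaded, it can be reshaped into the date-keyed layout asked for at the top of the thread. The following is only a sketch: it assumes the file has the identifier columns `Province/State`, `Country/Region`, `Lat`, `Long` followed by one column per date in `M/D/YY` form, and it runs on a tiny inline sample with made-up values rather than the real download. `wide_to_long` and `ID_COLS` are hypothetical names introduced for illustration.

```python
import io

import pandas as pd

# Inline sample in the assumed wide layout (values are made up for illustration).
SAMPLE_WIDE = """Province/State,Country/Region,Lat,Long,1/22/20,1/23/20,1/24/20
,Thailand,15.0,101.0,2,3,5
Hubei,China,30.97,112.27,444,444,549
"""

# Assumed identifier columns; every other column is treated as a date.
ID_COLS = ["Province/State", "Country/Region", "Lat", "Long"]


def wide_to_long(wide: pd.DataFrame, value_name: str) -> pd.DataFrame:
    """Turn one-column-per-date data into one row per (location, date)."""
    long = wide.melt(id_vars=ID_COLS, var_name="date", value_name=value_name)
    # Column headers such as "1/22/20" become real timestamps.
    long["date"] = pd.to_datetime(long["date"], format="%m/%d/%y")
    return long.sort_values(["Country/Region", "date"]).reset_index(drop=True)


confirmed_wide = pd.read_csv(io.StringIO(SAMPLE_WIDE))
confirmed_long = wide_to_long(confirmed_wide, "confirmed")
print(confirmed_long)
```

To get 'confirmed', 'deaths', and 'recovered' side by side, the same function could be applied to each of the three files and the results merged on the identifier columns plus `date`.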

peterdrier commented 4 years ago

Try the daily files instead of the time series. It already has what you want and has been there the whole time.
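A rough sketch of how the daily files could be stacked into one date-keyed table follows. It assumes each daily report is named `MM-DD-YYYY.csv` and carries `Country/Region`, `Confirmed`, `Deaths`, and `Recovered` columns; check the actual files, since the layout may differ. The two "files" here are inline strings with made-up numbers, and `load_daily` is a hypothetical helper.

```python
import io
from pathlib import Path

import pandas as pd

# Stand-ins for two daily report files, keyed by file name (numbers are made up).
DAILY_FILES = {
    "03-10-2020.csv": (
        "Province/State,Country/Region,Confirmed,Deaths,Recovered\n"
        "Hubei,China,100,10,50\n"
        ",Italy,20,2,1\n"
    ),
    "03-11-2020.csv": (
        "Province/State,Country/Region,Confirmed,Deaths,Recovered\n"
        "Hubei,China,110,11,60\n"
        ",Italy,30,3,2\n"
    ),
}


def load_daily(files: dict) -> pd.DataFrame:
    """Read each daily report and tag its rows with the date from the file name."""
    frames = []
    for name, text in files.items():
        frame = pd.read_csv(io.StringIO(text))
        # "03-10-2020.csv" -> Timestamp("2020-03-10")
        frame["date"] = pd.to_datetime(Path(name).stem, format="%m-%d-%Y")
        frames.append(frame)
    return pd.concat(frames, ignore_index=True)


daily = load_daily(DAILY_FILES)

# One row per (date, country) with the three metrics as columns.
totals = daily.groupby(["date", "Country/Region"])[
    ["Confirmed", "Deaths", "Recovered"]
].sum()
print(totals)
```

With real data, the inline strings would be replaced by the contents of the downloaded daily report files.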

StataAficionado commented 4 years ago

> CSV seems a little import unfriendly. Would a file with dates as a key and 'confirmed', 'deaths', 'recovered' as columns not be more import friendly?

If you use Stata, you can use this routine: The data since 12 March are choppy, though.