CSSEGISandData / COVID-19

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
https://systems.jhu.edu/research/public-health/ncov/
29.13k stars 18.43k forks source link

The first time ever, the last entry has been changed to a float ? #2566

Open JupyterJones opened 4 years ago

JupyterJones commented 4 years ago

All the death entries for months have been integers. This last entry has been changed to a float ? Why would a death be represented considered by a float? https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_deaths_US.csv

Also why does the population of all the New York counties, when added together, come to 26,161,672. That is much higher than the population of the State of New York

richardschris commented 4 years ago

The float thing broke an ETL I had this morning, that was fun. But the New York counties thing is easy to explain: the FIPS file has data for each NYC borough broken out, plus the total population of the city. Just leave out Bronx, Queens, Kings, New York, and Richmond counties.

JupyterJones commented 4 years ago

@richardschris The float broke several notebook projects. Thank you for the Bronx, Queens, Kings, New York, and Richmond counties info.

Right after I opened the file, I made a temporary cheat for the float thing:

if line[-2:] == ".0": line = line.replace(".0","")

I am hoping it was just an error and not a "from now on" thing.