simonw / covid-19-datasette

Deploys a Datasette instance of COVID-19 data from Johns Hopkins CSSE and the New York Times
https://covid-19.datasettes.com/

Wrong data for Argentina dataset #2

Closed edvm closed 4 years ago

edvm commented 4 years ago

If you take a look at https://covid-19.datasettes.com/covid.csv?sql=select+*+from+daily_reports+where+country_or_region+%3D+%27Argentina%27%3B&_size=max you will notice that there is a record for 2020-03-07. The Argentinian government did not publish any numbers for that date (you can confirm this at https://www.argentina.gob.ar/coronavirus/informe-diario).
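For anyone who wants to rerun the check for another country, here is a rough sketch of how the query URL above could be rebuilt. The function name `build_query_url` is made up for illustration, and the quote handling is deliberately naive. Note that `urlencode` escapes `*` as `%2A`, so the string differs slightly from the link above but decodes to the same SQL.

```python
from urllib.parse import urlencode, urlsplit, parse_qs

# Endpoint taken from the link in this issue.
BASE_URL = "https://covid-19.datasettes.com/covid.csv"


def build_query_url(country: str) -> str:
    """Hypothetical helper: build the CSV query URL for one country."""
    # Naive escaping for illustration only: double any single quotes.
    safe_country = country.replace("'", "''")
    sql = (
        "select * from daily_reports "
        f"where country_or_region = '{safe_country}';"
    )
    # urlencode percent-encodes the SQL and turns spaces into '+'.
    return f"{BASE_URL}?{urlencode({'sql': sql, '_size': 'max'})}"


url = build_query_url("Argentina")
print(url)

# Decode the query string again to confirm the SQL round-trips unchanged.
params = parse_qs(urlsplit(url).query)
print(params["sql"][0])
```

The resulting URL can be opened in a browser or fetched with any HTTP client to download the rows as CSV.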

I also found that some numbers are incorrect. For example, the dataset row for 2020-03-09 reports a total of 12 infected people: 2020-03-09,Argentina,,12,1,0,-38.4161,-63.6167,2020-03-08T22:03:11, while the official PDF report from the Argentinian government (file:///home/edvm/Downloads/9-03-2020-nuevo-coronavirus-covid-19-reporte_diario_0%20(1).pdf) gives 17.

Given this, we decided to read every PDF report published by the Argentinian government and maintain a new dataset, which you can find at: https://github.com/edvm/coronavirus/blob/master/dataset.csv

We created a small community which will be responsible for:

We aim to have the most accurate data possible.

simonw commented 4 years ago

Thanks for the report. I'm just re-publishing the data from https://github.com/CSSEGISandData/COVID-19 here, so I recommend reporting this issue upstream: https://github.com/CSSEGISandData/COVID-19/issues

simonw commented 4 years ago

I reported your issue over there, copying across your comment and crediting you: https://github.com/CSSEGISandData/COVID-19/issues/1453