covid19-eu-zh / covid19-eu-data

Automated Data Collection: COVID-19/SARS-COV-2 Cases in EU by Country, State/Province/Local Authorities, and Date
80 stars 20 forks source link

France cases should be renamed #48

Open hyperknot opened 4 years ago

hyperknot commented 4 years ago

France is a very strange country, as it doesn't publish cases data. It provides hospitalized and deaths but not cases. I don't know what column is the script scraping, but it should be renamed to something else.

I also cross-checked with JHU and it is showing a very different number, so definitely not cases.

emptymalei commented 4 years ago

@hyperknot

I also cross-checked with JHU and it is showing a very different number, so definitely not cases.

I cross-checked the numbers and it seems that we are roughly matching with the JHU data.

Our data Screen Shot 2020-04-14 at 09 15 49

JHU data Screen Shot 2020-04-14 at 09 16 07

Please note that FR stopped updating tables on webpage on 2020-03-25. So the most recent record in the csv file is for 2020-03-25. Raw data in PDF format are still being updated on a weekly basis.

France is a very strange country, as it doesn't publish cases data. It provides hospitalized and deaths but not cases.

Could you elaborate on this? Is there a web page we could scrape?

hyperknot commented 4 years ago

Yes, I only know this one: https://www.data.gouv.fr/fr/organizations/sante-publique-france/datasets-resources.csv

So it seems that both datasources need to be mixed to have the complete numbers.

hyperknot commented 4 years ago

I see, the numbers are indeed the same, just the table hasn't been updated for a while.

emptymalei commented 4 years ago

Yes, I only know this one: https://www.data.gouv.fr/fr/organizations/sante-publique-france/datasets-resources.csv

So it seems that both datasources need to be mixed to have the complete numbers.

Seems that it's the same as this one

https://github.com/orgs/covid19-eu-zh/projects/1#card-35273964

Alright, we will add this source ASAP. It's been sitting on the todo list for too long.