sergei-mironov / COVID-19_plus_Russia

COVID-19 data from JHU CSSE, updated with details on Russian regions.

06 May data is not correct. #4

Closed IVanDeryabin76 closed 4 years ago

IVanDeryabin76 commented 4 years ago

The confirmed count for Moscow should be 92 676 on May 6 according to стопкоронавирус.рф, but in your dataset it is 85 973 on May 6. Please fix.

MishaDav commented 4 years ago

Yandex Maps shows exactly 85973 for May 6. For May 7, however, it shows 92676, not 98522.

IVanDeryabin76 commented 4 years ago

So which source is the correct one, Yandex or стопкоронавирус.рф? стопкоронавирус.рф shows 98 522 (May 7) and 92 767 (May 6).

IVanDeryabin76 commented 4 years ago

The Yandex statistics page also shows 98 522 (May 7) and 92 767 (May 6). https://yandex.ru/covid19/stat Where did 85 973 come from?

MishaDav commented 4 years ago

> The Yandex statistics page also shows 98 522 (May 7) and 92 767 (May 6). https://yandex.ru/covid19/stat Where did 85 973 come from?

If you look at the chart, May 6 shows 85973. A separate issue is that the latest data point is for May 8, a day that is not over yet.

https://datalens.yandex/r8r2cldmwff6n?56667170-2453-459b-a0ae-635d2e7fcfd2=Москва

sergei-mironov commented 4 years ago

Until now I have tried to include Yandex data whose timestamps are close to upstream's. Upstream currently samples day X at 02:30 of day X+1 (±1 hour). For example, the file named '05-06-2020.csv' contains data with the timestamp 05-07-2020 02:32 UTC. I included the Russian data from 05-07-2020 03:22 UTC (the earliest timestamp I have that is >= upstream's).
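The selection rule described above (take the earliest Yandex dump whose timestamp is >= the upstream timestamp) can be sketched roughly as follows. This is only an illustration, not the repository's actual code: the function name `pick_dump` is hypothetical, and of the dump timestamps only 05-07-2020 03:22 UTC comes from this thread; the others are made up for the example.

```python
from bisect import bisect_left
from datetime import datetime, timezone


def pick_dump(dump_times, upstream_time):
    """Return the earliest dump timestamp that is >= upstream_time, or None."""
    ordered = sorted(dump_times)
    # bisect_left gives the index of the first element >= upstream_time.
    i = bisect_left(ordered, upstream_time)
    return ordered[i] if i < len(ordered) else None


utc = timezone.utc
# Upstream timestamp of '05-06-2020.csv' as quoted in the comment above.
upstream = datetime(2020, 5, 7, 2, 32, tzinfo=utc)
dumps = [
    datetime(2020, 5, 6, 21, 10, tzinfo=utc),  # hypothetical earlier dump
    datetime(2020, 5, 7, 3, 22, tzinfo=utc),   # the dump mentioned in the thread
    datetime(2020, 5, 7, 9, 0, tzinfo=utc),    # hypothetical later dump
]
selected = pick_dump(dumps, upstream)
print(selected)
```

With a rule like this, the value recorded for day X depends on when the dumps were taken relative to the upstream sample, which may explain why the dataset can differ from the per-day totals shown on the official pages.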

sergei-mironov commented 4 years ago

> The Yandex statistics page also shows 98 522 (May 7) and 92 767 (May 6). https://yandex.ru/covid19/stat Where did 85 973 come from?

I'll check it soon

IVanDeryabin76 commented 4 years ago

Hello! The data is incorrect again, this time for May 8. The dataset reports 98522 for 07/05 and the same figure for 08/05, but the figure reported for 08/05 was 10 189.

Please fix.

sergei-mironov commented 4 years ago

Please check the detailed plot. [image]

(*edit: I'll add some explanations: orange dots are the data that I actually included in the dataset, blue dots are all the Yandex dumps I have made. Green vertical lines are the timestamps of the CSSE daily updates. The original intention was to report the measurements closest to the green lines.)

My conclusions are:

  1. The 06 May data is actually correct
  2. The 07-08 May data is indeed incorrect and should be fixed

@IVanDeryabin76 Do you agree?

techoracle commented 4 years ago

Dear @grwlf,

I found that the data in COVID-19_plus_Russia/csse_covid_19_data/csse_covid_19_time_series/time_series_covid19_confirmed_RU.csv are duplicated for May 7 and May 8: both columns have the same values. It seems the May 7 data is wrong or missing. Could you please correct May 7?
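A duplicated-column symptom like the one described above could be detected with a small check along these lines. It is a minimal sketch under assumptions: the function name `duplicated_adjacent_columns`, the column layout, the date header format, and the "Region A" row are all hypothetical; only the Moscow figures come from this thread.

```python
import csv
import io


def duplicated_adjacent_columns(csv_text, first_date_col):
    """Return (previous, current) header pairs of adjacent date columns
    whose values are identical in every row."""
    rows = list(csv.reader(io.StringIO(csv_text)))
    header, body = rows[0], rows[1:]
    pairs = []
    for j in range(first_date_col + 1, len(header)):
        # A cumulative series that does not change for any region is suspicious.
        if body and all(row[j] == row[j - 1] for row in body):
            pairs.append((header[j - 1], header[j]))
    return pairs


# Illustrative sample; the real file has more columns and rows.
sample = """Province_State,5/6/20,5/7/20,5/8/20
Moscow,85973,98522,98522
Region A,100,120,120
"""
suspicious = duplicated_adjacent_columns(sample, first_date_col=1)
print(suspicious)
```

An identical pair of columns is not proof of an error, since small regions can report no new cases, so a hit from such a check is best treated as a prompt for manual review.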

Thanks in advance!

I am starting to use your data in https://covid-trend.info/#/ru/russia and appreciate your work, thanks for this!

sergei-mironov commented 4 years ago

@techoracle fixed 05-07-2020 and timelines, please check

techoracle commented 4 years ago

> @techoracle fixed 05-07-2020 and timelines, please check

Yes, it's running very well now, thanks a lot for your fast fix!