MoH-Malaysia / covid19-public

Official data on the COVID-19 epidemic in Malaysia. Powered by CPRC, CPRC Hospital System, MKAK, and MySejahtera.

Data issue: main/epidemic/linelist/linelist_deaths.csv #158

Closed wnarifin closed 3 years ago

wnarifin commented 3 years ago

Issues with https://github.com/MoH-Malaysia/covid19-public/blob/main/epidemic/linelist/linelist_deaths.csv dataset:

  1. Assuming date = the date the patient passed away and _datepositive = the date the patient was confirmed positive for COVID-19, there are 2536 cases that were confirmed positive after their death (negative days for date - _datepositive). Of these, 89 cases were only confirmed positive 30 days after their death. I am not sure whether these data are correct; I suspect wrong data entry.
  2. Of these cases, 9 have a gap of > 100 days between _datepositive and date (i.e. death). I am not sure whether the data for these cases were entered correctly.
  3. One case (no. 18087) has a date of death in the future (as of the date of this comment, 9/9/2021): date = 2021-09-30, _datepositive = 2021-09-01.
  4. 19 cases received their 2nd dose after their death.
  5. age contains one case equal to "10 months". Since the rest are numerical, I think this case can be re-coded as 1 y.o.

Details in the attached file. analysis_screen.pdf
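The date checks in the list above can be sketched roughly as follows. This is a minimal illustration, not the analysis from the attached PDF. The column names `date_positive` and `date_dose2` are assumptions (the comment writes `_datepositive`, and the second-dose column is not named in this thread), so they are passed as parameters, and the sample rows are made up for the demonstration.

```python
import csv
import io
from datetime import date


def parse_day(text):
    """Return a date for an ISO 'YYYY-MM-DD' string, or None if blank or invalid."""
    try:
        return date.fromisoformat(text.strip())
    except (ValueError, AttributeError):
        return None


def screen_rows(rows, today, death_col="date",
                positive_col="date_positive", dose2_col="date_dose2"):
    """Collect row indices for each suspicious pattern raised in the issue.

    The default column names are assumptions; adjust them to the real CSV header.
    """
    flags = {
        "positive_after_death": [],
        "gap_over_100_days": [],
        "death_in_future": [],
        "dose2_after_death": [],
    }
    for i, row in enumerate(rows):
        died = parse_day(row.get(death_col, ""))
        if died is None:
            continue  # nothing can be checked without a date of death
        positive = parse_day(row.get(positive_col, ""))
        dose2 = parse_day(row.get(dose2_col, ""))
        if positive is not None:
            gap = (died - positive).days  # negative: confirmed after death
            if gap < 0:
                flags["positive_after_death"].append(i)
            if gap > 100:
                flags["gap_over_100_days"].append(i)
        if died > today:
            flags["death_in_future"].append(i)
        if dose2 is not None and dose2 > died:
            flags["dose2_after_death"].append(i)
    return flags


# Made-up sample rows, one per pattern, plus one clean row.
sample = """date,date_positive,date_dose2
2021-08-01,2021-08-05,
2021-09-30,2021-09-01,
2021-08-20,2021-04-01,2021-08-25
2021-08-10,2021-08-02,2021-07-01
"""
rows = list(csv.DictReader(io.StringIO(sample)))
flags = screen_rows(rows, today=date(2021, 9, 9))
for name, indices in flags.items():
    print(name, indices)
```

To run it against the real file, replace the `io.StringIO(sample)` source with an opened `linelist_deaths.csv` and set the column arguments to match its header.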

MoH-Malaysia commented 3 years ago

Thank you! These were just bugs from pre-processing into the Git push - we've resolved them.

  1. Pre-processing error - fixed.
  2. Entirely possible if the patient is in hospital for an extended period of time.
  3. Typo
  4. Pre-processing error - fixed.
  5. Fixed to integer.

Many thanks again for the prompt feedback :)

moosekaka commented 3 years ago

Hi, just to let you know, the age column still contains an entry "6days".
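One way to catch entries like "10 months" or "6days" is to normalise every age value to whole years before analysis. The sketch below is only an illustration: the function name `normalize_age` is hypothetical, and rounding to the nearest year (so "10 months" becomes 1, as suggested earlier in this thread, and "6days" becomes 0) is an assumption, not the rule MoH applied.

```python
import math
import re

# A number, optionally followed by a unit word such as "months" or "days".
_AGE_PATTERN = re.compile(r"^\s*(\d+(?:\.\d+)?)\s*([A-Za-z]*)\s*$")

# Years per unit, keyed by the unit's first letter; no unit means years.
_YEARS_PER_UNIT = {"": 1.0, "y": 1.0, "m": 1.0 / 12.0, "d": 1.0 / 365.25}


def normalize_age(raw):
    """Convert an age entry to whole years, or return None if it cannot be parsed.

    Assumption: values are rounded to the nearest year, with halves rounded up.
    """
    match = _AGE_PATTERN.match(str(raw))
    if match is None:
        return None
    value, unit = float(match.group(1)), match.group(2).lower()
    factor = _YEARS_PER_UNIT.get(unit[:1])
    if factor is None:
        return None  # unrecognised unit, leave for manual review
    return math.floor(value * factor + 0.5)


for entry in ["42", "10 months", "6days", "unknown"]:
    print(repr(entry), "->", normalize_age(entry))
```

Entries that return None are the ones worth inspecting by hand rather than coercing silently.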

chez1978 commented 3 years ago

Thank you for the line listing! A lot can be done with the data, and I will wait for the additional line-list breakdown of positive cases by sex and age (and vaxtype) to calculate proper death rates. But this is an amazing start, and we can do some modelling already. Total deaths among vaccinated patients: 3532 with 1 dose and 1409 with 2 doses, 4941 in total. The Sinovac numbers are most likely affected by the fact that a majority of the elderly population and those in institutions received Sinovac.


010111001 commented 3 years ago

Please review the following files:

linelist_deaths.csv

deaths_malaysia.csv