CSSEGISandData / COVID-19

Novel Coronavirus (COVID-19) Cases, provided by JHU CSSE
https://systems.jhu.edu/research/public-health/ncov/

CSSEGISandData / COVID-19 data from 3/18/2020 file #1032

Open tcarey1981 opened 4 years ago

tcarey1981 commented 4 years ago

I am pulling the daily files from https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_daily_reports

I am looking at the state of Massachusetts on a daily basis. In the 3/17/2020 file, Massachusetts had 218 confirmed cases, 0 deaths, and 1 recovered. In the 3/18/2020 file, Massachusetts shows 218 cases, 0 deaths, and 0 recovered. Why is recovered now at 0 in the 3/18/2020 file? Massachusetts is now at 256 cases, so why does the 3/18/2020 file still show only 218 cases for Massachusetts?

https://www.mass.gov/doc/covid-19-cases-in-massachusetts-as-of-march-18-2020/download

Thank you,

Tim Carey tcarey@banecare.com
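
For anyone who wants to check for this kind of discrepancy automatically, here is a minimal sketch that compares two consecutive daily files and flags any cumulative count that went down. The column names (`Province/State`, `Confirmed`, `Deaths`, `Recovered`) are an assumption about the daily-report layout at the time, and the helper names are hypothetical. The two sample rows just restate the Massachusetts figures quoted above.

```python
import csv
import io

# Column names are assumed from the early daily-report layout; adjust if the files differ.
STATE_COL = "Province/State"
COUNT_COLS = ("Confirmed", "Deaths", "Recovered")


def load_state_counts(csv_text):
    """Map each state name to its counts, treating blank cells as 0."""
    counts = {}
    for row in csv.DictReader(io.StringIO(csv_text)):
        counts[row[STATE_COL]] = {c: int(row[c] or 0) for c in COUNT_COLS}
    return counts


def flag_suspect_changes(prev, curr):
    """Return (state, column, old, new) for each cumulative count that decreased."""
    flags = []
    for state, new_counts in curr.items():
        old_counts = prev.get(state)
        if old_counts is None:
            continue  # state not present in the earlier file
        for col in COUNT_COLS:
            if new_counts[col] < old_counts[col]:
                flags.append((state, col, old_counts[col], new_counts[col]))
    return flags


# Sample rows built from the Massachusetts figures quoted in this issue.
header = "Province/State,Country/Region,Confirmed,Deaths,Recovered\n"
day_0317 = header + "Massachusetts,US,218,0,1\n"
day_0318 = header + "Massachusetts,US,218,0,0\n"

flags = flag_suspect_changes(load_state_counts(day_0317), load_state_counts(day_0318))
print(flags)  # [('Massachusetts', 'Recovered', 1, 0)]
```

This only catches counts that decrease. A confirmed count that fails to increase (218 on both days) cannot be told apart from a genuinely quiet day without an outside source such as the state's own report.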

nielsenmarkus11 commented 4 years ago

I'm noticing the same for many other states and given all the recently logged issues it must be similar for all data for 3-18. Hopefully they get it corrected soon.

tcarey1981 commented 4 years ago

Do you think it will be corrected by tomorrow for today's data?

Thank you,

Timothy Carey, MSM | Director of Data and Performance Analytics | Bane Care Management LLC | 350 Granite Street, Building 2, Suite 2203 | Braintree, MA 02184 | 781-635-2415 (Cell) | www.banecare.com

My Why is to help all levels of staff use data analytics to continuously improve the patient experience and quality of care.


Confidentiality Notice:


The information transmitted in this e-mail message including any attachments, is for the sole use of the intended entity or recipient(s) and may contain confidential and/or privileged information. Any unauthorized review, retransmission, disclosure, distribution or other use of or taking of any action in reliance upon this information by persons or entities other than the intended recipient is prohibited. If you received this email in error, please contact the sender and delete the e-mail and any attached material immediately.


ballcoach12 commented 4 years ago

There seem to be lots of similar errors in this data set, including some heterogeneity in how it's reported. For example, there are State/Country entries for the US, and also County/State/Country for the US. It isn't clear whether these should be totaled for a particular state, or whether the County/State/Country is a subset of State/Country. They appear to be separate counts, though. The granularity of the data needs to be normalized.
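
One possible way to normalize the granularity, sketched under some assumptions: county-level rows are labeled `County, ST`, state-level rows carry the full state name, and the two kinds of rows hold separate (non-overlapping) counts, as the comment above suggests they appear to. The abbreviation map, the function names, and the sample numbers are all made up for illustration and are not taken from a real daily report.

```python
from collections import defaultdict

# Partial abbreviation map for illustration only; a real one would cover every state.
STATE_ABBREV = {"MA": "Massachusetts", "WA": "Washington"}


def state_of(province_state):
    """Return the state name for 'State' or 'County, ST' style entries."""
    if "," in province_state:
        abbrev = province_state.rsplit(",", 1)[1].strip()
        return STATE_ABBREV.get(abbrev, abbrev)
    return province_state.strip()


def totals_by_state(rows):
    """Sum confirmed counts per state, assuming county and state rows do not overlap."""
    totals = defaultdict(int)
    for province_state, confirmed in rows:
        totals[state_of(province_state)] += confirmed
    return dict(totals)


# Made-up rows mixing both granularities.
rows = [
    ("Washington", 10),
    ("King County, WA", 5),
    ("Norfolk County, MA", 2),
    ("Massachusetts", 3),
]
print(totals_by_state(rows))  # {'Washington': 15, 'Massachusetts': 5}
```

If the county rows turn out to be a subset of the state rows instead, summing them would double count, so that assumption needs to be confirmed against the data before relying on totals like these.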

tcarey1981 commented 4 years ago

Please see the attachment for how I am using the data files. There are a few tabs with different views.

[Attachment: image001.png]


nielsenmarkus11 commented 4 years ago

@tcarey1981 I'm not sure when it'll be corrected... I did notice that the author took a few days' break last month as well... maybe getting some well-deserved rest?

tcarey1981 commented 4 years ago

Okay, thank you for that information.


tcarey1981 commented 4 years ago

Hi all,

I noticed that the daily data file now includes more columns, such as a state's county or region (if known).

I am seeing a lot of cases marked as "Unassigned" in the County column (see the screenshot below for an example). Can this be cleaned up? I know the Massachusetts DPH publishes a complete breakdown by county for all confirmed cases in Massachusetts.

[Attachment: image001.png]
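
To put a number on how much of a state's total sits in "Unassigned" rows, something like the following could work. It is only a sketch: the column names `Admin2` (county) and `Province_State` are an assumption about the newer file layout, the function name is hypothetical, and the sample rows are invented rather than copied from a real file.

```python
from collections import defaultdict


def unassigned_share(rows):
    """Per state, return (unassigned confirmed, total confirmed) from county-level rows."""
    summary = defaultdict(lambda: [0, 0])
    for row in rows:
        confirmed = int(row["Confirmed"] or 0)  # blank cells count as 0
        entry = summary[row["Province_State"]]
        entry[1] += confirmed
        if row["Admin2"].strip().lower() == "unassigned":
            entry[0] += confirmed
    return {state: tuple(values) for state, values in summary.items()}


# Invented rows in the assumed newer layout.
rows = [
    {"Admin2": "Norfolk", "Province_State": "Massachusetts", "Confirmed": "40"},
    {"Admin2": "Unassigned", "Province_State": "Massachusetts", "Confirmed": "10"},
]
print(unassigned_share(rows))  # {'Massachusetts': (10, 50)}
```

The rows could come straight from `csv.DictReader` on a daily file, since that yields dictionaries keyed by column name.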


tcarey1981 commented 4 years ago

Good morning,

Can you please let me know when the 6/24/2020 CSV file will be posted at this link? I greatly appreciate your help. 😊

https://github.com/CSSEGISandData/COVID-19/tree/master/csse_covid_19_data/csse_covid_19_daily_reports

[Attachment: image001.png]
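
In the meantime, a script can check for a given day's file itself rather than relying on a manual look at the folder. The sketch below assumes the daily files are named `MM-DD-YYYY.csv` and are reachable through GitHub's raw-content host under the same path as the folder linked above. Both function names are hypothetical, and the network check is defined but not called here.

```python
import urllib.error
import urllib.request
from datetime import date

# Assumed raw-content base URL for the daily reports folder linked above.
BASE = (
    "https://raw.githubusercontent.com/CSSEGISandData/COVID-19/master/"
    "csse_covid_19_data/csse_covid_19_daily_reports/"
)


def daily_report_url(day):
    """Build the URL for one day's report, assuming MM-DD-YYYY.csv file names."""
    return BASE + day.strftime("%m-%d-%Y") + ".csv"


def is_posted(day, timeout=10):
    """Return True if the file for `day` answers a HEAD request with HTTP 200."""
    request = urllib.request.Request(daily_report_url(day), method="HEAD")
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status == 200
    except urllib.error.HTTPError:
        return False  # e.g. 404 while the file has not been posted yet


url = daily_report_url(date(2020, 6, 24))
print(url)
```

Calling `is_posted(date(2020, 6, 24))` on a schedule would show when the file appears, but it says nothing about whether the contents are final.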
