Closed aimeehan1 closed 1 year ago
On 2022-07-26, G.h observed a discrepancy in U.S. confirmed case counts between two pages on the CDC's website:
U.S. Map & Case Count page: 3,487
Global Map page: 3,846
Both pages reported data as of 2022-07-25.
https://www.cdc.gov/poxvirus/monkeypox/response/2022/us-map.html https://www.cdc.gov/poxvirus/monkeypox/response/2022/world-map.html
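The size of the gap between the two pages can be worked out with a quick check. The following is a minimal sketch that uses only the two counts quoted above; the function name `count_discrepancy` is hypothetical and not part of any G.h tooling.

```python
from __future__ import annotations


def count_discrepancy(count_a: int, count_b: int) -> tuple[int, float]:
    """Return the absolute gap and the gap relative to the larger count."""
    gap = abs(count_a - count_b)
    larger = max(count_a, count_b)
    relative = gap / larger if larger else 0.0
    return gap, relative


# Counts quoted above, both reported as of 2022-07-25.
us_map_count = 3487      # U.S. Map & Case Count page
global_map_count = 3846  # Global Map page

gap, relative = count_discrepancy(us_map_count, global_map_count)
print(f"Difference: {gap} cases ({relative:.1%} of the larger count)")
```

With these inputs the two pages differ by 359 cases.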
Thank you for providing the dataset!
Sorry for jumping in, but I tried to create a pandas.DataFrame
with the cumulative number of confirmed/recovered/fatal cases using your line list data.
https://gist.github.com/lisphilar/23d23f8692f70f2663a6c4890758a7ab
I assumed the following. Is my understanding correct?
numpy.nan: active or "Status = omit_error" cases
Is it possible to provide recovered/fatal data as well as confirmed? Total population and the cumulative number of confirmed/recovered/fatal cases are very useful for data analysis. I developed a Python library for COVID-19 data, named CovsirPhy, and analysed the data with mathematical models.
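For readers who want to try the same kind of aggregation, here is a minimal sketch of building a cumulative confirmed-case series from line-list rows. It is not the code from the gist above, and it uses the standard library rather than pandas so that it stays self-contained. The field names (`Status`, `Country`, `Date_confirmation`) and the sample rows are assumptions made for illustration; check the data dictionary for the actual schema.

```python
from collections import Counter
from itertools import accumulate

# Hypothetical line-list rows; field names and values are assumptions.
rows = [
    {"Status": "confirmed", "Country": "A", "Date_confirmation": "2022-07-01"},
    {"Status": "confirmed", "Country": "A", "Date_confirmation": "2022-07-01"},
    {"Status": "suspected", "Country": "A", "Date_confirmation": ""},
    {"Status": "confirmed", "Country": "A", "Date_confirmation": "2022-07-03"},
    {"Status": "omit_error", "Country": "A", "Date_confirmation": "2022-07-03"},
]


def cumulative_confirmed(rows):
    """Return [(date, cumulative confirmed count)] sorted by date."""
    # Count confirmed cases per confirmation date, skipping rows without a date.
    daily = Counter(
        row["Date_confirmation"]
        for row in rows
        if row["Status"] == "confirmed" and row["Date_confirmation"]
    )
    dates = sorted(daily)  # ISO 8601 date strings sort chronologically
    totals = accumulate(daily[d] for d in dates)
    return list(zip(dates, totals))


series = cumulative_confirmed(rows)
for date, total in series:
    print(date, total)
```

Rows marked `omit_error` or `suspected` are simply excluded here; how they should be treated in a real analysis is exactly the question raised above.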
@lisphilar You're welcome!
Please do not apologize for jumping in; we made our work open source because we want your input!
Would you kindly open a new issue in this repository for the problem you described? This allows us to keep all "epics", features, and bugfixes discrete.
I would also refer you to our data dictionary, which might help answer some of your questions.
@jim-sheldon Thank you for your quick response!
I have just created four issues (#177, #178, #179, #180), and I'm looking forward to discussing them with you and your team there.
Line list is discontinued as of 2022-09-22
Comments from discussion 2022-07-13
Errors.
Observed an increase in reporting errors. Examples: ECDC report (Argentina, Australia), Spain (computer reporting issue), Belgium (cases don't sum to total: https://epidemio.wiv-isp.be/ID/Documents/Monkeypox/MPX_Update_12072022_FR.pdf), U.S. CDC (Illinois reporting errors), etc. Sometimes errors are acknowledged; other times data is changed without notice.
Pattern of inconsistencies among global/regional reports (e.g. WHO, PAHO, ECDC) and between those reports and country-level MOH reporting. Currently, curators identify a change in cases from a global/regional report update and then look for secondary .gov (national/local) sources of information as verification. If we cannot find a secondary source, we default to the global/regional report numbers. Examples: Mexico (PAHO reporting 27 cases, could not verify through MOH site, defaulted to PAHO report #), Malta (WHO reporting 9 cases, could not verify through MOH site, defaulted to WHO #), etc. Reminder to curators that it's important to look for secondary sources of information.
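The verification workflow described above can be sketched as a small status check. This is only an illustration under stated assumptions, not the curators' actual tooling: the `SourceCheck` class and its status strings are hypothetical, and the "conflict" branch is an added assumption, since the notes do not say what happens when a secondary source is found but disagrees.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class SourceCheck:
    country: str
    regional_count: int            # count from a global/regional report (WHO, PAHO, ECDC)
    national_count: Optional[int]  # None when no MOH/.gov figure could be found

    @property
    def status(self) -> str:
        if self.national_count is None:
            # No secondary source: default to the global/regional number.
            return "unverified: default to regional report"
        if self.national_count == self.regional_count:
            return "verified"
        # Assumption: a mismatch is flagged for a curator rather than resolved automatically.
        return "conflict: needs curator review"


# The two examples from the notes: no MOH figure could be found for either.
checks = [
    SourceCheck("Mexico", regional_count=27, national_count=None),
    SourceCheck("Malta", regional_count=9, national_count=None),
]
for check in checks:
    print(check.country, check.regional_count, check.status)
```

Keeping the status alongside the count makes it easy to list later which entries still rest on a regional report alone.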
Changes in reporting formats.
Change in cumulative case calculations. Some countries now include probable counts in totals (confirmed + probable = total). Examples: Belgium, Australia.
Standard reporting format no longer supports tracking of confirmed and/or suspected cases. Example: Brazil’s heat map, which displayed suspected case counts, changed to aggregate numbers, so we now track only confirmed cases.
"Active" versus "recovered/inactive" case status (recovered/inactive cases no longer have the clinical symptoms of monkeypox; they have recovered from acute illness). Examples: Italy and Andalusia cases have been reported as active case totals, but we are tracking cumulative totals. Reminder to curators to check cumulative counts (active + inactive). Due to limited metadata, we are not currently able to update individual case status to "recovered/inactive." https://www.rtvsol.es/noticias/andalucia/salud-y-familias-informa-de-que-actualmente-en-andalucia-hay-193-casos-activos-de-viruela-del-mono
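The two arithmetic relationships mentioned in these notes (confirmed + probable = total, and cumulative = active + inactive) can be sketched as follows. The function names and all numbers are hypothetical and chosen only for illustration; they are not taken from any country report.

```python
def confirmed_from_total(total: int, probable: int) -> int:
    """Recover the confirmed count when a reported total includes probable cases."""
    if not 0 <= probable <= total:
        raise ValueError("probable must be between 0 and total")
    return total - probable


def cumulative_from_status(active: int, inactive: int) -> int:
    """Cumulative cases when a source reports active and recovered/inactive separately."""
    if active < 0 or inactive < 0:
        raise ValueError("counts must be non-negative")
    return active + inactive


# Illustrative numbers only; they are not taken from any report.
confirmed = confirmed_from_total(total=120, probable=15)
cumulative = cumulative_from_status(active=80, inactive=30)
print(confirmed, cumulative)
```

The range checks are a deliberate choice: a probable count larger than the total is the kind of "cases don't sum to total" inconsistency that should be surfaced to a curator rather than silently corrected.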