ecohealthalliance / One-Health-Database-Africa

Issues from Humdata Checklist #5

Open collinschwantes opened 2 years ago

collinschwantes commented 2 years ago

HDX quality assurance (QA) officers should check each new or updated dataset against this checklist to ensure that datasets shared on HDX meet the minimum HDX quality standards. The checklist is in three parts: a) a checklist for dataset resources, b) a data responsibility checklist and c) a metadata quality checklist. Each of these checklists is presented below.

A. Data Resource

B. Data responsibility

C. Metadata Quality

collinschwantes commented 2 years ago

@sebaum We need to make sure that all datasets have appropriate source information. Many of the manually updated datasets do not contain a source URL or DOI. Ideally, the source should point directly to the source data, not to an organization's homepage.
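A rough way to audit this could look like the sketch below. It is only an illustration: the function name `source_problem` and the rule that a URL with no path counts as a homepage are assumptions, not part of the HDX checklist or of this repo.

```python
from urllib.parse import urlparse


def source_problem(source):
    """Return a short description of the problem, or None if the source looks specific.

    Hypothetical helper: the heuristics here are assumptions for illustration.
    """
    if source is None or not source.strip():
        return "missing source"
    text = source.strip()
    # Treat bare DOIs (e.g. "10.1038/...") and doi.org links as specific enough.
    if text.lower().startswith("10.") or "doi.org/" in text.lower():
        return None
    parsed = urlparse(text)
    if parsed.scheme not in ("http", "https") or not parsed.netloc:
        return "not a URL or DOI"
    # A URL with no path and no query string is assumed to be a homepage.
    if parsed.path in ("", "/") and not parsed.query:
        return "points to a homepage"
    return None


if __name__ == "__main__":
    for candidate in ["", "https://www.who.int/", "https://extranet.who.int/sph/jee"]:
        print(repr(candidate), "->", source_problem(candidate))
```

A check like this only flags obvious gaps; whether a link really leads to the underlying data still needs a manual look.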

sebaum commented 2 years ago

JEE Scores - Came from each country's individual JEE report (https://extranet.who.int/sph/jee)

Rabies Management CDC - Came from country look up on CDC's National Rabies Assessment (https://www.cdc.gov/rabies/resources/countries-risk.html)

Political Stability and Absence of Violence/Terrorism - World Bank, I believe (https://databank.worldbank.org/source/worldwide-governance-indicators/Series/PV.EST)

Population growth, 2010-2019 (%) - Calculated from the original population data from the World Bank

FAO Protein - This captures "Average protein supply (g/cap/day) (3-year average)" and "Average supply of protein of animal origin (g/cap/day) (3-year average)" from FAO's Food Security Data.

Average protein supply (g/cap/day) (3-year average): http://data.un.org/Data.aspx?q=Economy&d=FAO&f=itemCode%3A21013

Average supply of protein of animal origin (g/cap/day) (3-year average): http://data.un.org/Data.aspx?q=Average+protein+supply+(g%2Fcap%2Fday)+(3-year+average)&d=FAO&f=itemCode%3A21014

Combined data sheet_African Countries: This is an Excel file that Catherine had put together for her dissertation, from which Sarah Hayes pulled in data. I think all the data here is accounted for except the "malaria incidence per 1000 people". I checked the World Bank data and it doesn't match what is in this dataset, so I will likely need to ask Catherine where she got it from.

OIE simulation -- (https://www.woah.org/en/what-we-do/animal-health-and-welfare/disease-data-collection/simulation-exercises/)

eid_risk - This is from the Allen et al. (2015) repo. Emma had aggregated the data to the country level for another project, and that aggregated data is included here.
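For the "Population growth, 2010-2019 (%)" entry above, the thread does not say which formula was used. A minimal sketch, assuming a simple percent change between the 2010 and 2019 population values (the function name and the example numbers are made up for illustration):

```python
def percent_growth(pop_start, pop_end):
    """Percent change from pop_start to pop_end.

    Assumption: growth is a simple percent change, not an annualized rate.
    """
    if pop_start <= 0:
        raise ValueError("starting population must be positive")
    return (pop_end - pop_start) / pop_start * 100.0


if __name__ == "__main__":
    # Made-up populations, not World Bank figures.
    print(percent_growth(1_000_000, 1_250_000))
```

If the original calculation used an annualized rate instead, the values would differ, so the metadata should state which definition applies.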

collinschwantes commented 2 years ago

@sebaum we could also cite Catherine's dissertation if that's the source of the data

sebaum commented 2 years ago

@collinschwantes But it's not her own data. Let me ask and see if she can get back to me quickly.

collinschwantes commented 2 years ago

@sebaum I can't find a paper where Toph Allen was lead author in 2015. Is this a different EHA Allen?

sebaum commented 2 years ago

@collinschwantes sorry Collin, it's 2017: https://www.nature.com/articles/s41467-017-00923-8 (the hotspots paper).

collinschwantes commented 2 years ago

Found the source with some help from Emma - https://github.com/ecohealthalliance/hotspots2/blob/master/inst/out/ncd-eid-link/hs2_country.csv

collinschwantes commented 2 years ago

@sebaum I requested permission to use IUCN Red List data and WOAH data from outside WAHIS.

collinschwantes commented 2 years ago

@sebaum see https://github.com/ecohealthalliance/gheri-africom-data/blob/feature/humdata_compliant/R/update_humdata_metadata.R - there are a couple of places where I need some help to fill out the metadata for HDX (wherever there is an "@Sarah" placeholder).
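To list the spots that still need input, something like the following sketch could be run over the script's text. The helper name `find_placeholders` and the sample snippet are hypothetical; only the "@Sarah" marker comes from the comment above.

```python
def find_placeholders(text, marker="@Sarah"):
    """Return (line_number, stripped_line) pairs for lines containing the marker."""
    hits = []
    for lineno, line in enumerate(text.splitlines(), start=1):
        if marker in line:
            hits.append((lineno, line.strip()))
    return hits


if __name__ == "__main__":
    # Hypothetical snippet standing in for the contents of the R script.
    sample = 'title <- "JEE Scores"\nmethodology <- "@Sarah"\ncaveats <- "@Sarah"\n'
    for lineno, line in find_placeholders(sample):
        print(lineno, line)
```

In practice the text would be read from a local checkout of the file rather than from an inline string.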

collinschwantes commented 1 year ago