rs-delve / covid19_datasets

Interfacing several COVID-19 related datasets
MIT License
45 stars 19 forks source link

Canadian Death Data Suspicious #22

Open sbodenstein opened 3 years ago

sbodenstein commented 3 years ago

This looks wrong:

import pandas as pd
deaths_path = 'https://raw.githubusercontent.com/rs-delve/covid19_datasets/master/age/data/all_deaths_by_age_sex.csv'
DEATHS_DF = pd.read_csv(deaths_path, parse_dates=['Date'])
DEATHS_DF[(DEATHS_DF.Age == '90+') & (DEATHS_DF.ISO == 'CAN')].groupby('Sex').sum()

gives {'b': 1830.0, 'f':1203.0, 'm': 46.0}. The category b!=m+f, and also the m category is way too low (there are only around double the number of females +90, not 20x).