Closed armsp closed 4 years ago
@armsp I believe the discrepancy is due to a duplication of counts in the aggregation in locations for which there is subnational data. For example, for the US we have population counts at the county, state, and national level. So if you aggregate by the Country_Region value, you will wind up tripling the count for the US. Canada similarly has national and provincial population figures, which is likely why you see roughly a doubling when aggregating.
@CSSEGISandData You are absolutely right. I should have investigated further. Got the correct results with just the following -
population_d[population_d['Country_Region'] == population_d['Combined_Key']]
I aggregated the population per
Country_Region
from the population file and compared it with the Worldometer's population dataAny idea why JUH's population data is often many times more than the actual population? For quite a few of the other countries the data matches. But some of the largest countries, its all over the place.