datasets / awesome-data

Curated list of quality open datasets
https://datahub.io/collections
788 stars 101 forks source link

Population growth rate #143

Open Snekili opened 8 years ago

Snekili commented 8 years ago

Info

Dataset from United Nation Population Division The 2015 Revision of World Population Prospects is the twenty-fourth round of official United Nations population estimates and projections that have been prepared by the Population Division of the Department of Economic and Social Affairs of the United Nations Secretariat. The main results are presented in a series of Excel files displaying key demographic indicators for each development group, income group, major area, region and country for selected periods or dates within 1950-2100. A publication labelled Key findings and advance tables, which provide insights on the results of this latest revision, is also made available here.

Data:

File

Licence:

Databases Permission is required to reuse content from any and all United Nations’ online platforms and databases (namely, legal and statistical databases).

Suggested citation: United Nations, Department of Economic and Social Affairs, Population Division (2015). World Population Prospects: The 2015 Revision, DVD Edition.

Snekili commented 8 years ago

@rgrp Data > http://data.un.org/Explorer.aspx?d=GHG&f=seriesID%3aCH4

https://drive.google.com/drive/folders/0B_KlKJBuuX4aSjFhRGJTaVVIcUU

Snekili commented 8 years ago

Un Population division > http://esa.un.org/unpd/wpp/Download/Standard/Population/

zelima commented 8 years ago

@Snekili I could start packaging this one following http://esa.un.org/unpd/wpp/Download/Standard/Population/ (second link) Population Growth Rate file has several sheets - which one do we need to package, estimates?

Snekili commented 8 years ago

@zelima I think you should use first sheet - ESTIMATES as that sheet is for Total population, both sexes combined, from 1950-2015

Snekili commented 8 years ago

@zelima I dont know why this sheet is called "Estimates" because it really Historic data, and all of the rest are in fact Estimates from 2015-2100.

zelima commented 8 years ago

@Snekili datapackage is ready you an view it here: http://data.okfn.org/tools/view?url=https%3A%2F%2Fgithub.com%2Fzelima%2Fpopulation-growth-rate please review and help with README.md :)

Snekili commented 8 years ago

@zelima I've added a comment that we can use in README. BUT Im not sure abut this licence thingy... What do you think?

zelima commented 8 years ago

@rgrp rights and permissions page is a little bit unclear about permissions - it says:

Permission is required to reuse content from any and all United Nations’ online platforms and databases (namely, legal and statistical databases).

but it also says:

Excerpts from a non-sales publication No permission is necessary to reproduce excerpts from a non-sales publication provided that proper credits are given.

Could you please help here?

gsilvapt commented 8 years ago

Sorry to interrupt: We have this package online that also contains population data: https://github.com/datasets/registry/issues/101

It is not automated, as the source is not an API or similar that could allow me to automate the process. But we can probably take care of that in this package.

zelima commented 8 years ago

@gsilvapt thanks for comment. @Snekili in case we deal with license part, do we need to have population growth rate separately? Please consider these differences between them (I'll call them data1[mine] and data2[Gsllvapt's], ):

Data1 contains growth rate from 1950 to 2015, for every 5 year Data2 contains growth rate from 1990 to 2009, for every 1 year

@gsilvapt this is what I saw with first glance, maybe you'd add something else?

gsilvapt commented 8 years ago

@zelima The source from data 2 package is not an API or something that will be updated frequently, AFAIK. Hence, it may make sense to use only the UN's because we can automate the process of maintaining and curating the data set.

We can also write a secondary Python script to see if the data match in both data sets and, if not, target the differences. We can keep it separated and just add a note saying data2 set is yearly growth rates from 1990 to 2009 and data1 is 5 year intervals of growth rates from 1950 to 2015, if that makes any sense.

rufuspollock commented 8 years ago

@Snekili @zelima may also be worth comparing this to the World Bank one. What is the difference between this and world bank population (growth) datasets?

zelima commented 8 years ago

@rgrp World bank data contains annual growth rate from 1960 to 2015. And I think it is cleaner and tidier then 2 previous

http://data.worldbank.org/indicator/SP.POP.GROW