datasets / world-cities

List of major cities of the world as a datapackage
https://datahub.io/core/world-cities
235 stars 201 forks source link

Improvements / clarifications on contents and fields (subcountry, geopoint etc) #5

Open rufuspollock opened 8 years ago

rufuspollock commented 8 years ago

Comments from https://github.com/openspending/cosmopolitan/issues/25#issuecomment-190203135

@lexman any thoughts? I know we already have #3 re native name. What about second two points?

@pwalsh @lexman re city population note that we have https://github.com/datasets/population-city

lexman commented 8 years ago

Hello @pwalsh,

At least should include standard name and the English variant Actually, the field name is the english variant, and you can consider it the standard name (at least for foreign people).

I assume that subcountry equals what is "region" in geonames, but "subcountry" is confusing terminology (for me)

I understand this is really confusing, because all countries don't have the same administrative clustering, so I relied on geoname's work. The documentation of the datapackage says :

Subcountry can be the name of a state (eg in United Kingdom or the United States of America) or the major administrative section (eg ''region'' in France''). See admin1 field on geonames website (http://www.geonames.org/) for further info about subcountry.

Is it understandable ? Would it be better if we reused the name admin1 from geonames for this column ?

We also take location (geopoint) and population from geonames, but they are not present here. I grant that there are likely better data sources, particularly for population.