cristan / improved-un-locodes

UN/LOCODE dataset, but with more and actually reliable coordinates
4 stars 1 forks source link
nodejs nominatim unlocode wikidata

UN/LOCODE list with improved coordinates

The coordinates in the original UN/LOCODE list have 2 major problems:

1. Only 80% of locations have coordinates

This doesn't just include tiny villages, but major transport hubs like Shanhai Port (CNSHG), Port of Shenzhen (CNSZP), Hong Kong (HKHKG) and Los Angeles (USLAX).

2. Many coordinates are just wrong

Problems like typos (AUMID), pointing to the wrong country (CKPZK) and just flat out being wrong (EGSCN)

This project aims to solve most of these cases by combining the data with data from OpenStreetMap's Nominatim and Wikidata.

CSV with improved locations

You can find the improved list as code-list-improved.csv. It has both corrected coordinates, as well as just way more of them (98.3%).

How the improved list is created

Other than that, all differences between the UN/LOCODE have been manually (quickly) tested and the correct ones are manually specified. Differences between Wikidata and this list are also tested and the correct ones manually specified, making this list as reliable as you can reasonably expect.

Extra columns

3 extra columns are created:

Extra scripts

This project also contain extra scripts to automatically detect problems with the UN/LOCODE dataset, like incorrect regions.

About UN/LOCODES

The United Nations Code for Trade and Transport Locations is a code list mantained by UNECE, United Nations agency, to facilitate trade. The list is comes from the UNECE page, released twice a year.

License

UN/LOCODE data

All unlocode data is licensed under the ODC Public Domain Dedication and Licence (PDDL).

Nominatim data

ODbL 1.0. http://osm.org/copyright

Wikidata

CC-0 (No rights reserved)

All other contents in this repo

CC-0 (No rights reserved)