opentraveldata / optd

Deprecated Git repository. Please move to
https://github.com/opentraveldata/opentraveldata

Wrong continent name for some POR entries #11

Closed da115115 closed 11 years ago

da115115 commented 11 years ago

The ori_por_public.csv file contains POR (point of reference) entries with non-standard continent names (e.g., America, Atlantic):

$ cut -d'^' -f20 ./ori_por_public.csv | sort -u
Africa * 
America 
Antarctica * 
Asia * 
Atlantic 
Australia 
continent_name 
Europe * 
Indian 
North America * 
Oceania * 
Pacific 
South America * 
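
To list only the offending values, the same `cut` call can be filtered against the seven standard continent names (the starred ones above). This is a minimal sketch: the function name `nonstandard_continents` is hypothetical, and it assumes the continent name is still field 20 and that the first line of the file is the header.

```shell
# Hypothetical helper: print the distinct continent names found in field 20
# of a '^'-separated POR file that are NOT one of the seven standard names.
# Assumes the first line of the file is the header row.
nonstandard_continents() {
  cut -d'^' -f20 "$1" \
    | tail -n +2 \
    | sort -u \
    | grep -vxF \
        -e 'Africa' -e 'Antarctica' -e 'Asia' -e 'Europe' \
        -e 'North America' -e 'Oceania' -e 'South America'
}
```

Calling `nonstandard_continents ./ori_por_public.csv` prints one non-standard name per line. `grep` exits with a non-zero status when nothing is left to print, so the function can also serve as a quick check in a script.
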
da115115 commented 11 years ago

The country-to-continent mapping must be extracted from Geonames and made available when generating the ORI-maintained POR file. A new AWK script may be necessary.

alexprengere commented 11 years ago

You can map the country_code to the continent using GeoBase:

$  GeoBase -b countries -s code continent -q 
#code^continent
BD^AS
BE^EU
BF^AF
BG^EU
BA^EU
...

Then you can finish by replacing the continent codes with their names (GeoBase can also provide this mapping).
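
The last step could look like the sketch below. It hard-codes the seven Geonames continent codes instead of querying GeoBase for them, and the function names `continent_name` and `codes_to_names` are hypothetical. The input is assumed to be in the `code^continent` format shown above, with an optional `#` header line.

```shell
# Hypothetical helper: translate a two-letter continent code into its name.
# Returns a non-zero status for an unknown code.
continent_name() {
  case "$1" in
    AF) echo "Africa" ;;
    AN) echo "Antarctica" ;;
    AS) echo "Asia" ;;
    EU) echo "Europe" ;;
    NA) echo "North America" ;;
    OC) echo "Oceania" ;;
    SA) echo "South America" ;;
    *)  return 1 ;;
  esac
}

# Hypothetical helper: read "country_code^continent_code" lines on stdin
# and write "country_code^continent_name" lines on stdout.
codes_to_names() {
  while IFS='^' read -r country code; do
    # Skip the '#code^continent' header and empty lines.
    case "$country" in '#'*|'') continue ;; esac
    if name=$(continent_name "$code"); then
      printf '%s^%s\n' "$country" "$name"
    else
      # Report unknown codes on stderr rather than guessing a name.
      printf 'unknown continent code: %s^%s\n' "$country" "$code" >&2
    fi
  done
}
```

Piping the output of the GeoBase command above into `codes_to_names` should then give lines such as `BD^Asia` and `BE^Europe`.
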