NRGI / resource-projects-etl

ETL processes for rp.org
GNU General Public License v2.0
3 stars 2 forks source link

Country Codes #30

Closed caprenter closed 8 years ago

caprenter commented 9 years ago

Some country codes are coming in as e.g. UK, Uk and so on. Are we testing against a code list on import. Is that a good way to deal with the issue below?

image

timgdavies commented 9 years ago

Two options here:

(1) Tighten up the import scripts (requires more custom handling in the tag-lifter script... so I'm cautious about that);

(2) Have a process to drop all country names, and re-import against ISO code from an authoritative list;

I suspect (2) might be the better option for the moment - but will review...

timgdavies commented 8 years ago

I've updated the way country codes are generated in TagLifter which should address the ID side of this, we might still need to look at queries and part of (2) to avoid the multiple labels problem.