VertNet / DwCVocabs

Real-world values for Darwin Core terms
GNU General Public License v2.0
13 stars 9 forks source link

Clean up continent for countries having more than one #56

Open tucotuco opened 4 years ago

tucotuco commented 4 years ago

See

https://en.wikipedia.org/wiki/List_of_transcontinental_countries https://en.wikipedia.org/wiki/Boundaries_between_the_continents_of_Earth#The_Americas,_Australia_and_Oceania https://en.wikipedia.org/wiki/List_of_transcontinental_countries#/media/File:VE-Dependencias_Federales_ubicacion.png https://en.wikipedia.org/wiki/Boundaries_between_the_continents_of_Earth#/media/File:Map_of_Sunda_and_Sahul.png https://en.wikipedia.org/wiki/Federal_districts_of_Russia https://en.wikipedia.org/wiki/European_Russia#Alignment_with_administrative_divisions

tucotuco commented 4 years ago

All but Russia done in https://github.com/VertNet/DwCVocabs/commit/2f4b001749a3b8143cc96c520df5bf49c24f8a0c.

Jegelewicz commented 4 years ago

What does "clean-up" mean? How did you handle them?

tucotuco commented 4 years ago

I assigned continent to all geography records where there was only one, based on the references in the first comment. Note that the list of country codes in that comment is just a checklist for countries for which to check. Distinct geography records with those country codes might differ in continent assignment. We standardize geography as an entity, not field by field within those that make up geography (continent, country, stateProvince, county, municipality, waterbody, islandGroup, island - all as defined by Darwin Core).