phoible / dev

PHOIBLE data and development.
https://phoible.org/
GNU General Public License v3.0

Canonical language names #305

Open bambooforest opened 4 years ago

bambooforest commented 4 years ago

@drammock -- I came across a language name "Awngi" in SPA and UPSID in this report:

https://github.com/bambooforest/phoible-scripts/blob/master/ethiopia/get_ethiopia_languages.md

It's an inappropriate ethnonym. In our aggregated data, I would like to rename the LanguageName field to SourceLanguageName, and then fill LanguageName with the canonical names from Glottolog, whose editors deal with bad names in general.

The canonical language name can be added to the InventoryID-LanguageCodes.csv mapping file by looking it up via the Glottolog codes.
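A rough sketch of what that lookup could look like, in case it helps the discussion. The column names (`InventoryID`, `LanguageName`, `Glottocode`, `Name`), the sample rows, and the helper `add_canonical_names` are all hypothetical placeholders, not the real layout of the mapping file or of any Glottolog export. Falling back to the source name when no Glottocode matches is also just an assumption about what we would want.

```python
import csv
import io

# Hypothetical stand-in for InventoryID-LanguageCodes.csv (column names assumed).
MAPPING_CSV = """InventoryID,LanguageName,Glottocode
1,Source Name A,aaaa1111
2,Source Name B,bbbb2222
3,Source Name C,
"""

# Hypothetical stand-in for a Glottolog table of codes and canonical names.
GLOTTOLOG_CSV = """Glottocode,Name
aaaa1111,Canonical Name A
bbbb2222,Canonical Name B
"""


def add_canonical_names(mapping_text, glottolog_text):
    """Move LanguageName to SourceLanguageName and refill it from Glottolog."""
    # Build a Glottocode -> canonical name lookup.
    canonical = {
        row["Glottocode"]: row["Name"]
        for row in csv.DictReader(io.StringIO(glottolog_text))
    }
    updated = []
    for row in csv.DictReader(io.StringIO(mapping_text)):
        source_name = row.pop("LanguageName")
        row["SourceLanguageName"] = source_name
        # Assumption: keep the source name when the code is missing or unknown.
        row["LanguageName"] = canonical.get(row["Glottocode"], source_name)
        updated.append(row)
    return updated


rows = add_canonical_names(MAPPING_CSV, GLOTTOLOG_CSV)
for row in rows:
    print(row["InventoryID"], row["SourceLanguageName"], "->", row["LanguageName"])
```

Keeping the original value in SourceLanguageName means nothing from the sources is lost, and any row that still shows its source name in LanguageName is easy to spot for manual review.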

Thoughts?

drammock commented 4 years ago

SGTM, go for it