unicode-org / icu4x

Solving i18n for client-side and resource-constrained environments.
https://icu4x.unicode.org

Deduplicate tz locations against root #5759

Closed robertbastian closed 2 weeks ago

robertbastian commented 3 weeks ago

Many location names are identical across a large number of languages, so they only need to be stored once in root.

For exemplar cities we have root data, but for territories we don't. For territories I'm resolving the und name by using a Latin-script endonym (i.e. for region XY, resolve und-Latn-XY), and if that fails, falling back to English. That's not perfect, but it's better than the raw region code for both display purposes and deduplication.
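
For illustration, the fallback and the deduplication step might look roughly like the sketch below. This is not the code in this PR: `NameTable`, `root_name`, and `dedup_against_root` are hypothetical names, the name tables are plain maps rather than the real datagen sources, and the step that turns und-Latn-XY into a concrete locale is assumed to happen elsewhere and is passed in as `latn_locale`.

```rust
use std::collections::BTreeMap;

/// Hypothetical stand-in for the source data:
/// locale -> (region code -> territory display name).
type NameTable = BTreeMap<String, BTreeMap<String, String>>;

/// Resolve the root (und) name for `region`.
///
/// `latn_locale` is the locale that und-Latn-XY resolved to, if any
/// (that resolution is assumed to be done by the caller). If it is missing,
/// or has no name for the region, fall back to English.
fn root_name(names: &NameTable, region: &str, latn_locale: Option<&str>) -> Option<String> {
    latn_locale
        .and_then(|loc| names.get(loc))
        .and_then(|table| table.get(region))
        .or_else(|| names.get("en").and_then(|table| table.get(region)))
        .cloned()
}

/// Remove every entry from `locale_names` whose value equals the root value
/// for the same region, so that only names differing from root are kept.
fn dedup_against_root(
    locale_names: &mut BTreeMap<String, String>,
    root: &BTreeMap<String, String>,
) {
    locale_names.retain(|region, name| root.get(region) != Some(&*name));
}
```

With a root table built this way, a locale whose territory name matches root drops that entry and picks it up again through fallback to root at lookup time, while locales with a different name keep their own entry.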