buda-base / public-digital-library

http://library.bdrc.io

cleanup of prefLabels and phonetics #944

Open eroux opened 1 month ago

eroux commented 1 month ago

To improve UX, I propose we do the following:

  1. Remove all the legacy phonetics (since we'll handle phonetics search differently)
  2. Remove all the English translations of person names
  3. Use culturally appropriate prefLabels: for instance, only a Tibetan prefLabel for Tibetans (not Chinese or English), and an English, Sanskrit or Chinese prefLabel for others. Some exceptions will be Indian masters, who can have Sanskrit + Tibetan + Chinese prefLabels
  4. Convert all the ALA-LC transliterated labels into Wylie, and remove duplicates

This will require a lot of manual review.
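
To make the prefLabel rule in point 3 concrete, here is a minimal Python sketch of how an automated first pass might pick which labels stay as prefLabels before manual review. Everything in it is an assumption for illustration: the `Label` class, the category names (`tibetan`, `indian_master`, `other`), the `ALLOWED_PREF_LANGS` table and the `filter_pref_labels` function are hypothetical and not part of the BDRC data model, and the language tags in the example are only illustrative. It matches on the primary language subtag of each tag and drops exact duplicates; it does not attempt the ALA-LC to Wylie conversion or the phonetics cleanup.

```python
from dataclasses import dataclass

# Hypothetical policy table derived from point 3 of the proposal.
# Category names are illustrative only, not BDRC's actual model.
ALLOWED_PREF_LANGS = {
    "tibetan": {"bo"},                    # Tibetans: Tibetan prefLabel only
    "indian_master": {"sa", "bo", "zh"},  # exception: Sanskrit + Tibetan + Chinese
    "other": {"en", "sa", "zh"},          # others: English, Sanskrit or Chinese
}


@dataclass(frozen=True)
class Label:
    text: str
    lang: str  # language tag, e.g. "bo-x-ewts" (assumed tag format)


def primary_lang(tag: str) -> str:
    """Return the primary language subtag, e.g. 'bo' for 'bo-x-ewts'."""
    return tag.split("-", 1)[0].lower()


def filter_pref_labels(labels, category):
    """Keep labels whose language is allowed for the category, dropping exact duplicates."""
    allowed = ALLOWED_PREF_LANGS[category]
    seen = set()
    kept = []
    for label in labels:
        if primary_lang(label.lang) not in allowed:
            continue  # candidate for removal or manual review
        key = (label.text.strip(), label.lang.lower())
        if key in seen:
            continue  # exact duplicate of a label already kept
        seen.add(key)
        kept.append(label)
    return kept


labels = [
    Label("tsong kha pa", "bo-x-ewts"),
    Label("Tsongkhapa", "en"),
    Label("tsong kha pa", "bo-x-ewts"),
]
kept = filter_pref_labels(labels, "tibetan")
```

In this sketch the labels that are filtered out are simply not returned; whether they should be deleted or kept in some other form is left to the manual review.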