lexibank / vanuatuvoices

Sound-Comparisons Vanuatu

update the orthoprofiles #17

Closed (LinguList closed 3 years ago)

LinguList commented 3 years ago

@xrotwang @maryewal, please check the orthography profile and my additional replacements. The lexibank code is hacky, as I could not assign the formspec to the normal dataset, but the more important part is the orthography profile. There are quite a few problems, but we can address them. I have left the profile unchanged, so that @maryewal can see the changes I made and correct them. Note that only the second column is important (the rest is mere information, so I didn't add it). But you will see that we face the typical problems: wrong Unicode (e.g., nasalization was wrongly marked), symbols that are not IPA, and other idiosyncrasies. In any case, this may also help to refine the transcriptions (although I'd recommend always segmenting them in the future, as there are otherwise quite a few ambiguities that are difficult to resolve with orthography profiles afterwards).

You can merge this whenever you want; I consider my work finished here for the time being ;)
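
To illustrate how a profile resolves (or fails to resolve) unsegmented forms, here is a minimal sketch of greedy longest-match segmentation. It is not the lexibank code from this PR; the `PROFILE` entries, the `segment` helper, and the example forms are all hypothetical, and the real orthography.csv may look quite different.

```python
import unicodedata

# Hypothetical profile: grapheme -> IPA (the "second column").
# These rows are made up for illustration, not taken from orthography.csv.
PROFILE = {
    "a": "a",
    "a\u0303": "a\u0303",  # a + COMBINING TILDE (U+0303), i.e. proper nasalization
    "ng": "ŋ",
    "n": "n",
    "g": "g",
    "t": "t",
}

UNKNOWN = "\ufffd"  # placeholder emitted for characters the profile does not cover


def segment(form, profile):
    """Greedy longest-match segmentation of an NFD-normalized form."""
    form = unicodedata.normalize("NFD", form)
    # Try longer graphemes first so that "ng" wins over "n" + "g".
    graphemes = sorted(profile, key=len, reverse=True)
    segments = []
    i = 0
    while i < len(form):
        for grapheme in graphemes:
            if form.startswith(grapheme, i):
                segments.append(profile[grapheme])
                i += len(grapheme)
                break
        else:
            # No profile entry matches here: flag the character for manual review.
            segments.append(UNKNOWN)
            i += 1
    return segments


print(segment("tanga", PROFILE))      # "ng" is always read as one segment
print(segment("t\u00e3", PROFILE))    # precomposed ã is decomposed by NFD and matched
print(segment("ta\u02dc", PROFILE))   # spacing SMALL TILDE (U+02DC) is not a combining mark
```

The first call shows the ambiguity: greedy matching always reads "ng" as a single segment, even where "n" followed by "g" was intended, which is why segmenting the transcriptions up front is safer. The last call shows one way nasalization could be "wrongly marked": a spacing tilde looks similar on screen but is a different code point, so it falls through as unknown.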

maryewal commented 3 years ago

@LinguList So, I just need to review the orthography.csv file? I don't need to do anything with the lexemes file, do I?

LinguList commented 3 years ago

No.

LinguList commented 3 years ago

The lexemes file shows some character problems which could not really be handled: the diacritic for whistled sounds is used on both vowel and consonant, or even twice on the consonant, which is difficult to see and difficult to modify. So I used direct replacements there.
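
A rough sketch of how such cases could be spotted and replaced, since doubled combining marks are nearly invisible when rendered. It assumes the whistled diacritic is COMBINING UPWARDS ARROW BELOW (U+034E), which may not be the code point actually used in the data; the vowel set, the helper names, and the replacement table are hypothetical.

```python
import unicodedata

WHISTLED = "\u034e"    # assumption: extIPA whistled articulation, U+034E
VOWELS = set("aeiou")  # hypothetical vowel inventory, for illustration only


def mark_problems(form):
    """Report combining marks that are doubled on one base, or whistled marks on vowels."""
    form = unicodedata.normalize("NFD", form)
    problems = []
    base, seen = None, []
    for ch in form:
        if unicodedata.combining(ch):
            if ch in seen:
                # The same mark already occurred on this base character.
                problems.append(("doubled", base, ch))
            elif ch == WHISTLED and base in VOWELS:
                problems.append(("on_vowel", base, ch))
            seen.append(ch)
        else:
            # A new base character starts a new cluster.
            base, seen = ch, []
    return problems


def apply_replacements(form, replacements):
    """Apply direct string replacements in the order given."""
    for old, new in replacements.items():
        form = form.replace(old, new)
    return form


# Hypothetical replacement: collapse a doubled whistled mark on "s" to a single one.
REPLACEMENTS = {"s\u034e\u034e": "s\u034e"}

print(mark_problems("s\u034e\u034ea"))
print(mark_problems("sa\u034e"))
print(mark_problems(apply_replacements("s\u034e\u034ea", REPLACEMENTS)))
```

Running the check before and after the replacements gives a quick confirmation that the direct replacements actually removed the doubled marks.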

Bibiko commented 3 years ago

Are there any issues still? Can we merge?