phoible / dev

PHOIBLE data and development.
https://phoible.org/
GNU General Public License v3.0
113 stars 30 forks source link

Updates / additions to Mande languages #40

Open bambooforest opened 9 years ago

bambooforest commented 9 years ago

See Vydrin 2007, which contains phonemic inventories for South Mande languages (known at that time).

In particular, on the p. 8, there is a vocalic inventory of Dan-Gweetaa. It should be modified: the semi-closed vowels (ɩ, ʋ, ʋ̈) are not separate phonemes but allophones of e, o, ɤ respectively under extra-high tone; the semi-closed nasal vowels (given in brackets) should be also eliminated (they are allophones, rather than phonemes). Phoneme ɒ is not necessary long, it can be short. A third modulated tone (extrahigh-extralow) has been discovered.

I'm told there are corrections to be done on other South Mande languages as well.

Further: there are two "Dan"'s in the database: Dan (GM) and Dan (UPSID). Dan (GM) is an early and pioneering study by Bearth and Zemp. The latter seems to refer to a Liberian variety (Vydrin, pc).

The former needs to be updated to reflect what has been learned about the language. The latter contains inexactitudes (ibid).

drammock commented 9 years ago

I've said this before, but I don't think we should update existing records based on new knowledge. Instead, we should add a new record for that language, tied to a particular resource, and assign it a higher trump number than the older and allegedly inaccurate records. It's more in keeping with the concept of a doculect, and I think it will serve us well in the long-term to hew closely to that concept. Consequently I'm labeling this as new inventory rather than inventory error.

drammock commented 4 years ago

@bambooforest can this issue be closed?

bambooforest commented 4 years ago

@drammock -- let me update a few things, e.g. we have two IDs in the bibtex file for the same reference here. Also, by now Vydrin has published the description (draft) that he sent us before:

So I can add this as a new inventory. Where should we start adding new inventories -- under which existing raw-data or do we start a new "source"? If we're going for semantically coherent sources, PH, UZ, and GM could be basically all be one, since they were collected and curated in the same manner and have no coherent genealogical or geographic basis (well, perhaps GM, which focused on Africa and later SE Asia).

I don't think we should add to UZ because that work is not being funded by UZ. We could add to PH and keep the existing structure, using InventoryIDs that start after the last current one (in ER), but then we should update the aggregation script to sort the output from within sources, if it doesn't already do so.

Let me know what you think. I'll make a PR for updating this current issue.

drammock commented 4 years ago

I'd be inclined to start a new source and name it after Neuchâtel... NE or UN?