lexibank / abvdoceanic

Creative Commons Attribution 4.0 International

Ra'ivavae is missing in the data: why? #13

Closed LinguList closed 3 years ago

LinguList commented 3 years ago

@SimonGreenhill, it seems that Ra'ivavae is not in the data (id=1213 in ABVD). I wonder how this happened: is it my code, or is it the data that you originally extracted? And is it possible that more languages are missing, or were they excluded for other reasons?

LinguList commented 3 years ago

@maryewal, I think that once the other issue in walworthpolynesian is dealt with and the data are checked again, this issue can be resolved. As far as I can see now, we don't have real flaws; we just had an older version of Glottolog.

SimonGreenhill commented 3 years ago

It's not in the filtered list of things to analyse.

LinguList commented 3 years ago

Ah, thanks! So this is deliberate, right?

LinguList commented 3 years ago

Then we can close this issue.

SimonGreenhill commented 3 years ago

Yes, the phylogenetic analysis only runs on a subset of the 'best' data, not all Oceanic languages in ABVD. I would have thought Ra'ivavae would be in the final list, but obviously not (perhaps @maryewal can double-check).

maryewal commented 3 years ago

We did some picking out of Vanuatu and NC, but Ra'ivavae should be in there.

SimonGreenhill commented 3 years ago

Hmm, this is Ben's list (i.e. it'll be left out of the phylogenies too). Can we double check this?

maryewal commented 3 years ago

Yep, will ask him at today's meeting.

LinguList commented 3 years ago

Is Ben in this repository, so we can discuss this with him? I think everyone has admin rights, so you can add more people to the repository if you want.

maryewal commented 3 years ago

We discussed this at our CoOL meeting today, and Simon is right that it is/was a glottocode issue: where a group of languages shared the same glottocode, Ben pulled out the one with the highest coverage. We are checking that there aren't other cases like Ra'ivavae/Rurutu in the Oceanic tree set, but I'm not sure it matters too much for the sound inventories? (Ben is @king-ben)
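
For readers following along, the filtering step described above could be sketched roughly as follows. This is only an illustration, not Ben's actual script: the record layout, the placeholder codes (`code0001`, `code0002`), and the coverage counts are all made up for the example and are not taken from ABVD or Glottolog.

```python
from collections import defaultdict

# Hypothetical records: (variety name, language-level glottocode, concepts covered).
# Codes and counts are placeholders, not real ABVD or Glottolog values.
varieties = [
    ("Ra'ivavae", "code0001", 180),
    ("Rurutuan", "code0001", 205),
    ("Tahitian", "code0002", 210),
]


def pick_best_per_glottocode(rows):
    """Keep, for each glottocode, the variety with the highest coverage."""
    best = {}
    for name, glottocode, coverage in rows:
        if glottocode not in best or coverage > best[glottocode][2]:
            best[glottocode] = (name, glottocode, coverage)
    return list(best.values())


def shared_glottocodes(rows):
    """Return glottocodes assigned to more than one variety, with their names."""
    groups = defaultdict(list)
    for name, glottocode, _ in rows:
        groups[glottocode].append(name)
    return {code: names for code, names in groups.items() if len(names) > 1}


kept = pick_best_per_glottocode(varieties)
kept_names = [name for name, _, _ in kept]
shared = shared_glottocodes(varieties)
```

With these placeholder values, `shared_glottocodes` is the kind of check that would flag other cases like Ra'ivavae/Rurutu, since it lists every code that more than one variety maps to.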

king-ben commented 3 years ago

Ra'ivavae has the same language-level Glottocode as Rurutuan in Glottolog

LinguList commented 3 years ago

Yes, @king-ben, one should use the dialect glottocodes instead. The language-dialect distinction is arbitrary in some sense anyway. Or does this cause a problem for any further comparisons?
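
A minimal sketch of what that suggestion could look like in practice, assuming each variety record carries an optional dialect-level code next to its language-level one. The field names and the placeholder codes (`code0001`, `dial0001`, and so on) are hypothetical and are not real Glottolog identifiers.

```python
# Hypothetical records; all codes are placeholders, not real glottocodes.
varieties = [
    {"name": "Ra'ivavae", "language_code": "code0001", "dialect_code": "dial0001"},
    {"name": "Rurutuan", "language_code": "code0001", "dialect_code": "dial0002"},
    {"name": "Tahitian", "language_code": "code0002", "dialect_code": None},
]


def grouping_key(variety):
    """Prefer the dialect-level code; fall back to the language-level code."""
    return variety["dialect_code"] or variety["language_code"]


keys = [grouping_key(v) for v in varieties]
unique_keys = set(keys)
```

Keyed this way, the two varieties that share a language-level code no longer collapse into one entry, so neither would be dropped by a one-per-code filter.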

maryewal commented 3 years ago

I think we can close this one? (Sorry, don't have rights to close in this repo or I'd just do it :))