tupian-language-resources / tuled

TuLeD: Tupían lexical database
Creative Commons Attribution 4.0 International

create full cognates from partial cognates #44

Open LinguList opened 3 years ago

LinguList commented 3 years ago

This is in fact easy with the function:

# Wordlist is provided by LingPy
from lingpy import Wordlist
from lingrex.cognates import common_morpheme_cognates

wordlist = Wordlist("wordlist.tsv")
common_morpheme_cognates(wordlist)

This can ideally just be added to the tuled export procedure, so we add full cognates in addition to partial cognates.
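To make the idea of deriving full cognates from partial ones concrete, here is a minimal plain-Python sketch. It is not the lingrex implementation: it assumes a greedy strategy in which, within each concept, the words sharing the most frequent partial cognate ID are grouped into one full cognate set, and the step is repeated on the remaining words. The function name `full_from_partial`, the input layout, and the toy data are hypothetical.

```python
from collections import Counter


def full_from_partial(words):
    """Assign full cognate IDs from partial cognate IDs (illustrative sketch).

    words maps a word ID to a (concept, list of partial cognate IDs) pair.
    Returns a dict mapping each word ID to a full cognate ID.
    """
    by_concept = {}
    for word_id, (concept, _) in words.items():
        by_concept.setdefault(concept, []).append(word_id)

    full, next_id = {}, 1
    for word_ids in by_concept.values():
        remaining = list(word_ids)
        while remaining:
            # Count in how many remaining words each partial cognate ID occurs.
            counts = Counter(c for w in remaining for c in set(words[w][1]))
            if not counts:
                # Words without any partial cognate ID each get their own set.
                for w in remaining:
                    full[w] = next_id
                    next_id += 1
                break
            # Most frequent ID first; ties go to the smallest ID for determinism.
            best = min(counts, key=lambda c: (-counts[c], c))
            members = [w for w in remaining if best in words[w][1]]
            for w in members:
                full[w] = next_id
            next_id += 1
            remaining = [w for w in remaining if w not in members]
    return full


# Hypothetical toy data: four words for one concept.
EXAMPLE = {
    1: ("hand", [1, 2]),
    2: ("hand", [1]),
    3: ("hand", [3, 2]),
    4: ("hand", [4]),
}
EXAMPLE_FULL = full_from_partial(EXAMPLE)
```

In this toy case, words 1 and 2 end up in one full cognate set because they share partial ID 1, while words 3 and 4 each form their own set. For the real export, the lingrex function should be used; its actual grouping rule may differ from this sketch.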

What is also possible, though the tuled folk would need to edit the data accordingly, is to code for salient cognates. Salient cognates are described here: https://doi.org/10.17613/0v48-aa64. The idea is to use the MORPHEMES column to indicate whether a given partial cognate ID counts toward the full cognate or not. That way, you can simply ignore prefixes. @LanguageStructure, maybe you can think about this, as this would be the best procedure. The paper linked above should give all the information needed, but I'll gladly answer questions.
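A rough sketch of how salient cognates could work, in plain Python rather than through any library call. It assumes that each word carries one morpheme gloss per partial cognate ID and that non-salient morphemes (such as prefixes) are flagged with a leading underscore in the gloss. That marking convention, the function name `salient_full_cognates`, and the toy data are assumptions made for illustration; the paper should be checked for the actual annotation rules.

```python
def salient_full_cognates(rows, skip_prefix="_"):
    """Derive full cognate IDs from the salient partial cognates only (sketch).

    rows maps a word ID to a (partial cognate IDs, morpheme glosses) pair.
    Glosses starting with skip_prefix are treated as non-salient (assumption).
    """
    lookup, full = {}, {}
    for word_id, (cogids, glosses) in rows.items():
        if len(cogids) != len(glosses):
            raise ValueError(f"word {word_id}: cognate IDs and glosses differ in length")
        # Keep only the partial cognate IDs whose morpheme is marked as salient.
        salient = tuple(
            c for c, g in zip(cogids, glosses) if not g.startswith(skip_prefix)
        )
        # Words with the same salient IDs share one full cognate ID.
        if salient not in lookup:
            lookup[salient] = len(lookup) + 1
        full[word_id] = lookup[salient]
    return full


# Hypothetical toy data: words 1 and 3 carry a prefix that should be ignored.
SALIENT_EXAMPLE = {
    1: ([10, 5], ["_prefix", "hand"]),
    2: ([5], ["hand"]),
    3: ([11, 6], ["_prefix", "arm"]),
}
SALIENT_FULL = salient_full_cognates(SALIENT_EXAMPLE)
```

Here words 1 and 2 fall into the same full cognate set even though word 1 has an extra prefix, which is the effect described above.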

LanguageStructure commented 3 years ago

Many thanks! This is interesting!
