Open Woseseltops opened 4 years ago
I wish it were this simple! In most cases, the -A -B -C variants are synonyms, and stem from dialectal variation. Hopefully we can find evidence for this from the regional frequencies. But I do like the idea, we'd have to think about an algorithm that takes this regional variation into account. We now mostly group items in a lemma if they're phonologically related, it seems. So perhaps like this: suggest a lemma relation iff there are no more than two phonology fields different and iff there is not strong evidence for dialectal variation from the regional frequencies info. Would that be doable?
Aha okay apparently a 'lemma' is not what I thought it was.
So perhaps like this: suggest a lemma relation iff there are no more than two phonology fields different and iff there is not strong evidence for dialectal variation from the regional frequencies info. Would that be doable?
Yes doable, but then we still don't have what we need for #354 : a way to find interesting cases like gloss a and gloss b mean the same thing, but in region 1 they use gloss a and in region 2 they use gloss b. Apparently, it's not just lemmas but mainly synonyms that we are interested in.
I added variants to the Frequency View tab. If there are variants (relations saved by the user, or otherwise pattern variants when there is no other relation between the glosses). Issue #602 needs to be done first to fill the tables.
It still won't make suggestions, but if there are variants, they will show up so the user doesn't need to keep searching for them.
Idea coming out of a conversation with @susanodd: much more cool results will come out of #354 if all glosses that belong to one lemma are also identified as such in Signbank. Perhaps we can help the sign linguists and create functionality that:
@ocrasborn what do you think?