gustafl / lexeme

A new take on language learning.

Consider disallowing compounds and expressions #112

Closed gustafl closed 8 years ago

gustafl commented 8 years ago

For

A multiword expression (MWE) can be a compound, a fragment of a sentence, or a whole sentence. The group of lexemes that make up an MWE can be continuous or discontinuous. It is not always possible to mark an MWE with a part of speech. Alongside disambiguation, multiword expressions are one of the two key problems for natural language processing (NLP), and especially for machine translation (MT).
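
To make the continuous/discontinuous distinction concrete, here is a minimal sketch of how an MWE could be modelled as a set of token positions in a sentence. The class and field names (`MultiwordExpression`, `positions`, `part_of_speech`) are hypothetical and chosen for illustration only; this is not lexeme's actual schema.

```python
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass(frozen=True)
class MultiwordExpression:
    """Hypothetical model: an MWE as token positions within one sentence."""
    lemma: str
    positions: Tuple[int, ...]            # indices of member tokens, ascending
    part_of_speech: Optional[str] = None  # None when no single POS applies

    def is_continuous(self) -> bool:
        # Continuous means the member tokens are adjacent, with no gaps.
        return all(b - a == 1 for a, b in zip(self.positions, self.positions[1:]))


# "she looked the word up": "looked ... up" is split by "the word".
look_up = MultiwordExpression("look up", (1, 4), "verb")

# "ice cream melts": the compound occupies adjacent tokens 0 and 1.
ice_cream = MultiwordExpression("ice cream", (0, 1), "noun")

# A sentence-length expression may have no sensible part of speech.
proverb = MultiwordExpression("the early bird catches the worm", (0, 1, 2, 3, 4, 5))
```

Storing positions rather than a start/end span is what lets one structure cover the discontinuous case; a span-only model would have to treat "looked the word up" as a single unit.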

Against

The number of MWEs in a speaker's lexicon is estimated to be of the same order of magnitude as the number of single words. Specialized domain vocabulary consists overwhelmingly of MWEs. The proportion of MWEs will therefore rise as a system adds vocabulary for new domains, because each domain adds more MWEs than simplex words.

gustafl commented 8 years ago

No way. Compounds and expressions are core features of languages! Include them or put this project to rest.