no-plagiarism / pymorphy3

Morphological analyzer / inflection engine for Russian and Ukrainian languages.
https://pymorphy2.readthedocs.io/
MIT License

Some adjectives such as апельсиновый lack their short forms #22

Open brigadier opened 10 months ago

brigadier commented 10 months ago

Some adjectives such as апельсиновый, ноготковый, настурциевый, etc. lack their short forms ({'ADJS', 'neut'}): апельсиново, ноготково, настурциево. These are fairly common, at least in compound color constructions such as апельсиново-красный or ноготково-желтый. The words have their other forms, so this exclusion seems deliberate. Is there a way to disable that limitation and generate these short adjectives?

insolor commented 10 months ago

ADJS is not a form, it's a part of speech. As far as I know, pymorphy2 (and pymorphy3, etc.) has no functionality for converting words between parts of speech.

Several years ago I needed to convert nouns to adjectives, and the original author said that the dictionary has no links between similar words in different parts of speech, because there are no such links in the source corpus (opencorpora.ru). See https://github.com/pymorphy2/pymorphy2/issues/64#issuecomment-113611151.

I suppose this is the same problem with ADJF and ADJS: they are both adjectives, of course, but in the dictionary they are different parts of speech and there are no links between them.

To add such functionality, such links would first have to be added to the dictionary.
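
For anyone who wants to check what their installed dictionary actually contains for a given word, here is a minimal sketch. It only uses `MorphAnalyzer.parse`, `Parse.inflect`, `Parse.lexeme` and `tag.POS`; the helper names (`short_neuter_form`, `lexeme_parts_of_speech`, `report`) are made up for illustration and are not part of pymorphy3.

```python
def short_neuter_form(parse):
    """Return the {'ADJS', 'neut'} form of `parse`, or None if inflect finds none.

    `parse` is expected to behave like a pymorphy3 Parse object:
    `inflect` takes a set of grammemes and returns a new parse or None.
    """
    inflected = parse.inflect({"ADJS", "neut"})
    return inflected.word if inflected is not None else None


def lexeme_parts_of_speech(parse):
    """Collect the part-of-speech values that occur in the lexeme of `parse`."""
    return {form.tag.POS for form in parse.lexeme}


def report(word):
    """Hypothetical helper: print what the dictionary knows about `word`."""
    import pymorphy3  # imported lazily so the helpers above work without it

    morph = pymorphy3.MorphAnalyzer()
    for parse in morph.parse(word):
        print(parse.tag, lexeme_parts_of_speech(parse), short_neuter_form(parse))
```

Calling `report("апельсиновый")` prints one line per parse; a `None` in the last column means `inflect` found no short neuter form for that parse. What it prints depends on the dictionary version installed.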

insolor commented 10 months ago

By the way, апельсиново, ноготково, etc. are not short adjectives (ADJS), they are adverbs (ADVB). And opencorpora.ru isn't even aware of such words.
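
Since the dictionary doesn't have these words, one possible workaround for the compound color case is to build the first part with plain string manipulation. This is only a naive sketch, not pymorphy3 functionality: the function names are hypothetical, and it assumes the adjective ends in -ый, which covers the three words mentioned in this issue and nothing more.

```python
def combining_form(adjective):
    """Naive heuristic: replace the final -ый with -о.

    Example: 'апельсиновый' -> 'апельсиново'.
    Other endings (-ий, -ой) are deliberately not handled.
    """
    if not adjective.endswith("ый"):
        raise ValueError(f"unsupported adjective ending: {adjective!r}")
    return adjective[:-2] + "о"


def compound_color(modifier, base):
    """Join the -о form of `modifier` and `base` with a hyphen."""
    return f"{combining_form(modifier)}-{base}"
```

With this, `compound_color("апельсиновый", "красный")` gives апельсиново-красный. It does no morphological checking at all, so the input has to be a nominative masculine adjective already.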
