Closed mansayk closed 5 years ago
@mansayk, what is the context for this word, and what command did you use to get the analysis?
I believe both analyses are possible, and the transducer should return both. If you're running it through the disambiguator (tagger) as well, then you need to provide some context for it to perform accurately—if there's no surrounding text, then it essentially just guesses.
echo "күләмдәге" | apertium-destxt -n | lt-proc -z -w 'apertium-tat/tat.automorf.bin' | cg-proc -z 'apertium-tat/tat.rlx.bin' | cg-proc -z -w -1 'apertium-tat/dev/mansur.bin' | apertium-retxt ^күләмдәге/күлә<n><sg><sg><px1sg><loc><subst><nom>$
I don't remember the context of that exact case, but for example, it might be: "Бик зур күләмдәге эш башкарылган!" ("A lot of work has been done.")
And there is no word "күлә" in Tatar language and I think it should not be returned at all.
@IlnarSelimcan, do you know why күлә is in the transducer?
I marked this word as Use/Arch, but there is no difference:
echo 'күләмдәге' | apertium-destxt -n | lt-proc -z -w 'apertium-tat/tat.automorf.bin' | cg-proc -z 'apertium-tat/tat.rlx.bin' | cg-proc -z -w -1 'apertium-tat/dev/mansur.bin' | apertium-retxt
^күләмдәге/күлә<n><sg><sg><px1sg><loc><subst><nom>$
Does this Use/Arch tag work? How can I remove stems with this tag from analysis?
I just saw your message in another issue about Use/Arch not being implemented yet. So my previous question has an answer now.
I disabled this word temporarily.
^күләмдәге/күлә<n><sg><sg><px1sg><loc><subst><nom>$