Open aarppe opened 3 years ago
In addition, there appear to be some extra-FST glitches:
12
to 21
- this may only apply to the object cases, i.e.+12PlO
-> 21PlO+
, e.g. Prt+4Sg/Pl+12PlO+ s/he raises s.o. in poverty; s/he raises s.o. as an orphan
which does not generate, in contrast to Prt+4Sg/Pl+21PlO+ s/he raises s.o. in poverty; s/he raises s.o. as an orphan
which does generate s/he/they raised you and us in poverty; s/he/they raised you and us as an orphan. We might want to check this for the possessors as well, which in generation should be Px21Pl+
instead of Px12Pl+
.Imm+2Pl+ he/she shines a light on it
or Prt+21Pl+4Sg/PlO+ he/she finishes (it/him) for him/her/them; he/she tans (it/him) for him/her/them
: Once the above matters are resolved, we go down from 512,210 non-generated forms to only some 43,493 missing ones, cf.
cat inc/phrases/verbs.phrases | grep '+?' | grep -v 'him/herself' | grep -v '(s.o. ' | egrep -v '\<it\>' | grep -v 12 | grep -v 'he/she' | grep -v 4 | wc -l
43493
For noun phrases, most/many are not properly constructed with an initial feature, e.g.
cat inc/phrases/nouns.phrases| grep '+?' | head -10
Piegan country, in the Piegan country +?
small piece of cloth, scrap +?
domestic animal +?
shorts; underwear +?
crab; lobster +?
birthday cake +?
my vagina, my vulva +?
cucumber; literally: our deceased grandmother +?
Shoal Lake Cree Nation, SK; Cree reserve +?
intestine
The rest appear to be cases with diacritic characters, which now ought to be fixed.
The English phrase generation of some forms does not work for some of the English definitions, which needs to be fixed in the generator FSTs:
Verbs:
it
, when that is not the actual object in the phrase (often parenthesized), e.g. s/he finishes (it/him) for s.o.; s/he tans (it/him) for s.o., or otherwise whenit
is the (implicit or explicit) object and thus neither an element that should be inflected, e.g. He shines a light on it.s.o.
in parentheses, indicating an implicit object that should not be inflected, e.g. s/he has (s.o. as) a mother-in-law1Sg
and2Sg
forms, e.g. s/he gives him/herself a difficult time, s/he makes things difficult for him/herself, s/he is very tough on him/herself-s
that could subsequently be analyzed verbs, if rule-based analysis is applied , e.g. powers in s/he is released, s/he is let go by the powers, s/he is set down by the powers; s/he is permitted by the powersNouns