Open chief opened 9 years ago
Diminutive forms are not stem by choice. I can't recollect the reason but if you check the stemming samples you can see which words will get stemmed wrongly.
@astathopoulos hmm I was afraid of that - ok I'll look into it, thanks!
@greenonion Try on skroutz "φουρνος" and "φουρνακι". You don't want to get the same results on this searches.
@astathopoulos Yep, I see. So you think it will lead to overstemming.
Yep! There are some cases where the stemming process is subjective.
So should we treat it on a per-case basis or ignore it in general? For example maybe we want «φανέλα» and «φανελάκι» to have the same stem, not sure.
@chief I think this is true for many (all?) diminutives. For example, «ΠΑΙΧΝΙΔΑΚΙ» is stemmed to «ΠΑΙΧΝΙΔΑΚ», while «ΠΑΙΧΝΙΔΙ» is stemmed to «ΠΑΙΧΝΙΔ». I will look into the handling of the suffix «ΑΚΙ» and try to fix it.