barzerman / barzer

barzer engine code
MIT License
2 stars 0 forks source link

critical stemming bug #577

Closed bodritto closed 11 years ago

bodritto commented 11 years ago
  1. ent#1 pattern: any(корсет корсеты)

2.ent#2 pattern: any(корсеты)

query: корсет -> ent#1 корсеты -> ent#2, ent#1

0xd34df00d commented 11 years ago

More like a clash for me. Initial investigation makes me think корсеты is stemmed to корсет so the first any() clause reduces to any(корсет).

barzerman commented 11 years ago

question is why isnt stemming activated?

On Mon, Jul 1, 2013 at 8:17 PM, Georg Rudoy notifications@github.comwrote:

More like a clash for me. Initial investigation makes me think корсеты is stemmed to корсет so the first any() clause reduces to any(корсет).

— Reply to this email directly or view it on GitHubhttps://github.com/barzerman/barzer/issues/577#issuecomment-20292496 .

www.barzer.net

barzerman commented 11 years ago
<stmset xmlns:xsi="http://www.w3.org/2000/10/XMLSchema-instance" xmlns="http://www.barzer.net/barzel/0.1">

<stmt n="13"> <pat> <t>планшет</t></pat> <tran> <mkent c="10" s="8" i="planshet"/></tran> </stmt>
<stmt n="11"> <pat> <t>планшеты</t></pat> <tran> <mkent c="10" s="8" i="planshety"/></tran> </stmt>
</stmset>
0xd34df00d commented 11 years ago

Oh, планшет and корсет and брикет and the likes are stemmed to планш and корс and брик and the likes.

barzerman commented 11 years ago

good find! i will update russian stemmer now