barzerman / barzer

barzer engine code
MIT License
2 stars 0 forks source link

wrong keyboard layout + product tranlsiteration #537

Closed nchepanov closed 11 years ago

nchepanov commented 11 years ago

this is still a problem

  1. it's about 3-5 % of queries
  2. autocomplete does not understand such queries as мультиварка редмонд and I'm pretty sure, that Юровицкий will try to do that.
0xd34df00d commented 11 years ago

мультиварка redmond doesn't work for me as well.

0xd34df00d commented 11 years ago

Done in issue_537_beni_translit branch:

cmd>q
редмонд
parsing: редмонд
parsed. printing
<barz u="1000106">
<bead n="1" c="2">
    <entity id="2758" n="REDMOND"  r="0" class="1000206" subclass="6" />
    <srctok origmarkup="0,7" bytemarkup="0,14">редмонд</srctok>
</bead>
<beni><entity id="38107" n="Пароварка REDMOND RST-1103"  r="0" class="1000206" subclass="7" lev="18" cov="1" rel="0.000202202" />
<entity id="25019" n="Мультиварка REDMOND RMC-M11"  r="0" class="1000206" subclass="7" lev="21" cov="1" rel="0.000202202" />
<entity id="19214" n="Пароварка REDMOND RST-M1104"  r="0" class="1000206" subclass="7" lev="20" cov="1" rel="0.000202202" />
<entity id="19215" n="Тостер REDMOND RT-M403"  r="0" class="1000206" subclass="7" lev="15" cov="1" rel="0.000202202" />
<entity id="38096" n="Весы кухонные REDMOND RS-724 черные"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38097" n="Весы кухонные REDMOND RS-725"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38099" n="Весы кухонные REDMOND RS-728B"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38100" n="Весы кухонные REDMOND RS-M718"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38101" n="Весы кухонные REDMOND RS-M723"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38102" n="Весы напольные REDMOND RS-710 черные"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38103" n="Весы напольные REDMOND RS-719 PK розовые"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38104" n="Весы напольные REDMOND RS-719 PK серые"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38105" n="Кофеварка капельная REDMOND RCM-1501"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38106" n="Кофемолка REDMOND RCG-1602"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="19213" n="Йогуртница REDMOND RYM-M5401"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
<entity id="38408" n="Блендер REDMOND RHB-2912 белый"  r="0" class="1000206" subclass="7" lev="0" cov="1" rel="0.000202202" />
</beni><traceinfo>
<match gram="2" file="site/alpha/1000106/rules.E.2719.entities.xml" stmt="253313" emit="1"/>
</traceinfo>
<query>редмонд</query>
<confidence>
</confidence>
</barz>

This stuff works by applying english sounds-like heuristic to the input.

Another issue is that some words are transliterations of japan-whatever brands themselves, so Panasonic, for example, is typically read as Панасоник, not Пэйнэйсоуник as SL would produce. This could only be solved by a dictionary of such brands IMO.

barzerman commented 11 years ago

transliteration has been deployed . @nikitos3000 please test asap - must be fully tested and integrated by monday meeting

barzerman commented 11 years ago

done