ogata0916 / mozc

Automatically exported from code.google.com/p/mozc
0 stars 0 forks source link

Mozc tries to use English words first #60

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
Mozc tries to use English words first.

Official Mozc-0.13.499.102
ばーべきゅーのじかんです =>
バーベキューの時間です

After you apply the patch:
--- dictionary04.txt    2010-10-09 09:28:54.000000000 +0900
+++ dictionary04.txt.new    2010-10-22 19:59:22.000000000 +0900
@@ -68797,6 +68797,7 @@
 ですくとっぷうぃじぇっと   2249    2249    8396    デスクトップウィジェット
 のりつが   1351    1351    5213    乗り継が
 ばーべきゅー 2249    2249    4974    バーベキュー
+ばーべきゅー 2249    2249    10020   BBQ
 はいいら   1349    1349    6560    はいいら
 はかばかしかろ  2829    2829    10763   捗ばかしかろ
 のおー  2249    2249    8385    ノオー

ばーべきゅーのじかんです =>
Bbqの自観です

Original issue reported on code.google.com by heathros...@gmail.com on 22 Oct 2010 at 11:15

GoogleCodeExporter commented 9 years ago
I'm not sure I understand your point. What is *exactly* the problem?

Original comment by yusu...@google.com on 23 Oct 2010 at 1:42

GoogleCodeExporter commented 9 years ago
I thought "Mozc uses small cost word first in the same 品詞 words".

ばーべきゅー    2249    2249    4974    バーベキュー
ばーべきゅー    2249    2249    10020    BBQ

2249 = 2249 (same 品詞),
4974 < 10020,
So I thought "Mozc will convert 
[ばーべきゅーのじかんです] to
[バーベキューの時間です]".

But Mozc doesn't do it.
Mozc returns "Bbqの自観です".

Points:
I don't understand the reason why
1. Mozc uses "Bbq" first.
2. "時間" is changed to "自観".
"自観" is a rare word.

Maybe Mozc doesn't follow the cost of words in some cases,
and that is natural for Mozc.

Please close this issue.

Thank you so much.

Original comment by heathros...@gmail.com on 23 Oct 2010 at 4:49

GoogleCodeExporter commented 9 years ago

Original comment by yusu...@google.com on 23 Oct 2010 at 5:25

GoogleCodeExporter commented 9 years ago
Not reproduced on my test machine.
Please try it after clearing preference learning data or turning on "incognito 
mode".

Original comment by t...@google.com on 27 Oct 2010 at 4:17

GoogleCodeExporter commented 9 years ago
Thank you for testing it.
I tested it again and I could reproduce it.

1. Apply "dictionary04.diff" and 
build deb packages (build 499) on Ubuntu 10.10.
2. Install the new deb packages.
3. Clear old Mozc files.
mv ~/.mozc ~/mozc_bak
killall ibus-daemon
ibus-setup
/usr/lib64/ibus-mozc/ibus-engine-mozc

4. Check "プライバシー" => "シークレットモード".

5. Type "ばーべきゅー".
Could you see the suggested words?
"バーベキュー" "BBQ"

6. Type "ばーべきゅーのじかんです" and press enter.
You will see "Bbqの自観です".

Yusukes-san, could you reproduce it?

Original comment by heathros...@gmail.com on 31 Oct 2010 at 10:03

Attachments:

GoogleCodeExporter commented 9 years ago

Original comment by yukawa@google.com on 1 Apr 2012 at 2:25

GoogleCodeExporter commented 9 years ago

Original comment by yukawa@google.com on 1 Apr 2012 at 2:29