ryanqq / paoding

Automatically exported from code.google.com/p/paoding

How can I limit token length? I'm seeing some extremely long tokens and want to filter them out #60

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
v8b-10jcatf22v10c-15pcatmega128-16auatmega128l-8au
 jcat27c256r-15jiat28c16-15piat28c17e-15piat28c256-
 jcat29c040a-12tuat29c040a-90tiat29c512-12pcat45db0
 jc44atf1502as-15jc44atf1504as-15ac100atf16v8b-10jc
 jc44atf1504as-15ac100atf16v8b-10jcatf22v10c-15pcat
 jcatf22v10c-15pcatmega128-16auatmega128l-8auatmega

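Heh, the tokenizer actually emitted all of these as tokens. I'd like to filter them out by setting an upper limit on token length. Is that possible?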

Original issue reported on code.google.com by yuweimin...@gmail.com on 19 Apr 2010 at 3:35

GoogleCodeExporter commented 9 years ago
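Yes, that's possible.

According to reno.gan's plan, a release should be out around May 1 (5.1). reno.gan, could you consider adding a maximum-length setting to the configuration file?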

Original comment by qieqie.wang on 20 Apr 2010 at 4:35

GoogleCodeExporter commented 9 years ago
Great, looking forward to it.

Original comment by yuweimin...@gmail.com on 21 Apr 2010 at 1:28

GoogleCodeExporter commented 9 years ago
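Has the new version that can limit token length been released yet?
When an overlong token is encountered, will it be truncated or filtered out? There are two possible strategies to choose from here.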

Original comment by yuweimin...@gmail.com on 16 May 2010 at 3:46
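
The two strategies raised in the last comment (dropping overlong tokens versus truncating them) can be sketched as a post-processing step over tokens that have already been produced. This is only an illustration under stated assumptions: `TokenLengthLimit`, `dropLong`, `truncateLong`, and the `maxLength` parameter are hypothetical names and are not part of Paoding or of any configuration option it is known to ship.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;

// Hypothetical helper, NOT part of Paoding: it post-processes tokens
// that a tokenizer has already emitted. Assumes maxLength >= 0.
final class TokenLengthLimit {

    // Strategy 1: filter. Tokens longer than maxLength are dropped.
    static List<String> dropLong(List<String> tokens, int maxLength) {
        List<String> kept = new ArrayList<>();
        for (String token : tokens) {
            if (token.length() <= maxLength) {
                kept.add(token);
            }
        }
        return kept;
    }

    // Strategy 2: truncate. Every token is kept, but tokens longer than
    // maxLength are cut down to their first maxLength chars.
    // Note: length() and substring() count UTF-16 code units, so a
    // character outside the BMP could be split at the cut point.
    static List<String> truncateLong(List<String> tokens, int maxLength) {
        List<String> result = new ArrayList<>();
        for (String token : tokens) {
            result.add(token.length() <= maxLength
                    ? token
                    : token.substring(0, maxLength));
        }
        return result;
    }

    public static void main(String[] args) {
        List<String> tokens = Arrays.asList(
                "paoding",
                "atmega128",
                "jcatf22v10c-15pcatmega128-16auatmega128l-8auatmega");
        System.out.println(dropLong(tokens, 16));
        System.out.println(truncateLong(tokens, 16));
    }
}
```

Dropping keeps the index free of junk terms like the ones quoted in the report, while truncating still lets a search match on the leading part of a long string. If Paoding is used as a Lucene analyzer, Lucene's own `LengthFilter` token filter may be worth checking for the drop strategy; its constructor differs between Lucene versions, so consult the documentation for the version in use.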