zzmjohn / mmseg4j

Automatically exported from code.google.com/p/mmseg4j
Apache License 2.0
0 stars 0 forks source link

MMSeg4j如何设置停用词库 排除干扰字的呢? #18

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
MMSeg4j如何设置停用词库 排除干扰字的呢?

如同paoding和 ik的停用词库。

Original issue reported on code.google.com by hideh4i@gmail.com on 9 Mar 2011 at 9:47

GoogleCodeExporter commented 9 years ago
只有使用Lucene自带的分词过滤器吗?

Original comment by hideh4i@gmail.com on 10 Mar 2011 at 1:59

GoogleCodeExporter commented 9 years ago
同问,如何加载停用词库 排除干扰字?

Original comment by uu...@qq.com on 3 May 2013 at 10:21

GoogleCodeExporter commented 9 years ago
同问,能像IK Analyzer那样设置停用词刺客么?

Original comment by ring...@gmail.com on 14 May 2014 at 8:04

GoogleCodeExporter commented 9 years ago
solr有自己的过滤器的啊,如下:
<filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" 
/>      

Original comment by wskb...@gmail.com on 3 Dec 2014 at 12:25