hellogithub2000 / ik-analyzer

Automatically exported from code.google.com/p/ik-analyzer
0 stars 0 forks source link

希望作者增加IKTokenizerFactory到ik分词器上 #104

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
这个是本人简单的实现方式,测试可用。感觉上缺了这一块��
�持,功能还是不够,望作者笑纳

Original issue reported on code.google.com by liuzihen...@gmail.com on 25 Dec 2012 at 5:05

Attachments:

GoogleCodeExporter commented 9 years ago
附带schema.xml调用方式 

<fieldType name="text_ik" class="solr.TextField" positionIncrementGap="100">
      <analyzer type="index">
        <tokenizer class="org.wltea.analyzer.henry.IKTokenizerFactory" useSmart="true"/>
        <!--
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />
        -->
        <!-- in this example, we will only use synonyms at query time
        <filter class="solr.SynonymFilterFactory" synonyms="index_synonyms.txt" ignoreCase="true" expand="false"/>
        -->
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
      <analyzer type="query">
        <tokenizer class="org.wltea.analyzer.henry.IKTokenizerFactory" useSmart="false"/>
        <!--
        <filter class="solr.StopFilterFactory" ignoreCase="true" words="stopwords.txt" enablePositionIncrements="true" />
        -->
        <filter class="solr.SynonymFilterFactory" synonyms="synonyms.txt" ignoreCase="true" expand="true"/>
        <filter class="solr.LowerCaseFilterFactory"/>
      </analyzer>
    </fieldType>

Original comment by liuzihen...@gmail.com on 25 Dec 2012 at 5:07

GoogleCodeExporter commented 9 years ago
看来太多这样的需求了,下个版本就加上把

Original comment by linliang...@gmail.com on 27 Dec 2012 at 2:12

GoogleCodeExporter commented 9 years ago
[deleted comment]
GoogleCodeExporter commented 9 years ago
[deleted comment]
GoogleCodeExporter commented 9 years ago
[deleted comment]
GoogleCodeExporter commented 9 years ago
太需要这个需求,能否来一个hotfix? 
文档上写到“官方取消了 BaseTokenizerFactory 接口, 而直接使用 
Lucene Analyzer 标准接口” 
但是官方好像说“As of Solr 4.0 BaseTokenFilterFactory has been renamed 
to TokenFilterFactory and moved to the package 
org.apache.lucene.analysis.util.”  只是改了名称,换了包名而已. 

谢谢

Original comment by my.name....@gmail.com on 16 Jan 2013 at 3:00

GoogleCodeExporter commented 9 years ago
现在有分发包吗?
我这正需要个IKTokenizerFactory
但我不会Java

Original comment by liujia0...@gmail.com on 18 Jan 2013 at 8:12

GoogleCodeExporter commented 9 years ago
[deleted comment]
GoogleCodeExporter commented 9 years ago
这样的做法在Solr的4.2上面是可行的,的确可以做出一个能同�
��输出同义词的分词器...谢谢分享。不过,在Solr的4.6和4.7上��
�却会出现问题。Solr可以正常启用,但是会出现no cores 
available的讯息。请问有人知道怎么办吗?谢谢。

Original comment by ccli....@gmail.com on 11 Mar 2014 at 9:55

GoogleCodeExporter commented 9 years ago
解法:http://thinkjet.me/solr-lucene-tokenizer-filter.html
在solr 4.7上面可以用。

Original comment by ccli....@gmail.com on 28 Mar 2014 at 6:11

GoogleCodeExporter commented 9 years ago
4.10  solr不行. 

Original comment by linzuxio...@gmail.com on 16 Sep 2014 at 1:21