atilika / kuromoji

Kuromoji is a self-contained and very easy to use Japanese morphological analyzer designed for search
Apache License 2.0
950 stars 131 forks source link

Integration with solr #87

Closed nisha-kajale closed 8 years ago

nisha-kajale commented 8 years ago

Hi, I am new to solr. I have downloaded kuromoji And placed it in solr-5.3.0\server\lib And added

in solr-5.3.0\server\solr\configsets\basic_configs\conf Now if i do search it should treat each of search term as japanese right? Or do i need to specify which text should be treated as japanese

cmoen commented 8 years ago

Apache Solr already is already bundled with JapaneseTokenizer (based on Kuromoji) since version 3.6. Simply use field type text_ja and you should be able to search Japanese just fine. There's no need to put any jars anywhere to use Japanese - it works out-of-the-box.

See https://speakerdeck.com/atilika/japanese-linguistics-in-lucene-and-solr for more information on features, etc.

nisha-kajale commented 8 years ago

hi ,

Thanks for quick reply. Can you tell me where should i write this text_ja in schema.xml or solrconfig.xml of my core And also if schema.xml which file should i use. As there are several schema.xml

cmoen commented 8 years ago

Please have a look at the Apache Solr documentation on https://cwiki.apache.org/confluence/display/solr/Getting+Started for further details. Thanks.