manticoresoftware / manticoresearch

Easy to use open source fast database for search | Good alternative to Elasticsearch now | Drop-in replacement for E in the ELK soon
https://manticoresearch.com
GNU General Public License v3.0
8.68k stars, 483 forks

custom Chinese dictionary #1479

Open lgl5240 opened 9 months ago

lgl5240 commented 9 months ago

Hello, I'd like to ask if you support the use of a custom Chinese dictionary for segmentation?

sanikolaev commented 9 months ago

It should be possible. We discussed it in this issue: https://github.com/manticoresoftware/manticoresearch/issues/371#issuecomment-654596874

If you manage to make it work, please let me know. We'd like to add it to the docs, write an article about it, etc.

lgl5240 commented 9 months ago

It should be possible. We discussed it in this issue #371 (comment)

If you manage to make it work, please let me know. We'd like to add it to the docs, write an article about it, etc.

Thank you.

forcemeter commented 9 months ago

@sanikolaev

https://www.amazonaws.cn/en/new/2022/amazon-opensearch-custom-dictionaries-ik-analysis-plugin/

https://github.com/soosinha/opensearch-analysis-ik

Here's a more elegant implementation to use as a reference: the IK plugin for OpenSearch (the same as the one for Elasticsearch), which supports loading the dictionary dynamically over an API through 'remote_ext_dict'.
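For illustration, here is a minimal sketch of how a remote dictionary endpoint of this kind might behave. It assumes the convention described in the IK plugin's README: the URL returns a plain-text, UTF-8 word list with one word per line, plus `Last-Modified` and `ETag` headers that the plugin polls to decide whether to reload. The function names (`build_dict_response`, `needs_reload`, `parse_dict`) are hypothetical, and none of this is existing Manticore functionality.

```python
import hashlib
from email.utils import formatdate


def build_dict_response(words, mtime):
    """Build (headers, body) for a hypothetical remote dictionary endpoint."""
    # One word per line, UTF-8 encoded, newline-terminated.
    body = "".join(w + "\n" for w in words).encode("utf-8")
    headers = {
        "Content-Type": "text/plain; charset=utf-8",
        # ETag is derived from the content, so any edit to the list changes it.
        "ETag": '"' + hashlib.sha256(body).hexdigest()[:16] + '"',
        # mtime is a Unix timestamp, rendered as an HTTP date.
        "Last-Modified": formatdate(mtime, usegmt=True),
    }
    return headers, body


def needs_reload(old_headers, new_headers):
    """Return True if either validator header changed since the last poll."""
    return any(
        old_headers.get(key) != new_headers.get(key)
        for key in ("ETag", "Last-Modified")
    )


def parse_dict(body):
    """Decode the word list, skipping blank lines."""
    text = body.decode("utf-8")
    return [line.strip() for line in text.splitlines() if line.strip()]
```

A client would keep the headers from its last poll, compare them against the current ones with `needs_reload`, and only download and parse the body again when they differ. That is what makes the dictionary updatable without a restart.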