apache / doris

Apache Doris is an easy-to-use, high performance and unified analytics database.
https://doris.apache.org
Apache License 2.0
12.34k stars 3.21k forks source link

Please Parser=English/Unicode, Parser_ When using mode to specify the mode for word segmentation, the following modes are supported: #25408

Open houzhiyou1 opened 11 months ago

houzhiyou1 commented 11 months ago

Search before asking

Description

Please Parser=English/Unicode, Parser When using mode to specify the mode for word segmentation, the following modes are supported: Fine Grained: fine-grained mode, Coarse_ Grained: Coarse grained pattern, tends to separate longer words

Solution

No response

Are you willing to submit PR?

Code of Conduct

airborne12 commented 11 months ago

I don't quite get your idea. Would you mind using Chinese?

houzhiyou1 commented 11 months ago

就是倒排索引现在只对中文做了分词,能不能提供对英文如:English,可以分为E,n,glish;EN,glish;E,n,g,l,i,s,h,数学的111,可分为1,1,1;11,1,test11可分为te,st11,test,11等。。。

发件人:airborne12 @.> 发送日期:2023-10-17 17:08:40 收件人:apache/doris @.> 抄送人:houzhiyou1 @.>,Author @.> 主题:Re: [apache/doris] Please Parser=English/Unicode, Parser_ When using mode to specify the mode for word segmentation, the following modes are supported: (Issue #25408)

I don't quite get your idea. Would you mind using Chinese? — Reply to this email directly, view it on GitHub, or unsubscribe. You are receiving this because you authored the thread.Message ID: @.***>