-
### 问题描述
记录中有刘德华,不想搜【柳】的时候,出现刘德华被命中的情况
### Action
- [ ] 自问自答,如果有更好的解决方法,大家可以写在评论区
- [ ] 我的方案
- [x] 可以做到搜索不同同音字
- [ ] 后续优化:高亮该高亮的部分
-
**Describe the bug**
A clear and concise description of what the bug is.
**To Reproduce**
Steps to reproduce the behavior:
1. Go to edi.wang
2. Click on search
3. See error
**Expected behav…
-
## My Environment
* __ArangoDB Version__: 3.8.3
## Component, Query & Data
We have a collection AbstractEvent with field 'text', which contains 1~30 Chinese characters and we want to p…
-
**Is your feature request related to a problem? Please describe.**
[There](https://github.com/lazyloong/obsidian-fuzzy-chinese) is another plugin for obsidian which introduces fuzzy search of pin…
-
版本:6.2.1
index:
{
"index" : {
"analysis" : {
"analyzer" : {
"pinyin_analyzer" : {
"tokenizer" : "my_pinyin"
…
-
我使用首字母搜索的时候发现翘舌音(z/c/s+h)会在一起导致搜索异常。
比如库中有“中华人民共和国”:
curl -XGET 'localhost:9200/news/_search' -d '{"query":{"match_phrase":{"name":"zhonghua"}}}'
curl -XGET 'localhost:9200/news/_search' -d '{"quer…
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
例如:
```
PUT /medcl/
{
"index" : {
"analysis" : {
"analyzer" : {
"pinyin_analyzer" : {
"tokenizer" : "my_pinyin"
…
-
Once I've removed stopwords using nltk or similar, I want to be able to see the original text snippets and not the ones without stopwords. How can I achieve that?
-
This is a "living issue". Editing is appreciated.
### Context:
- Most prominent benchmark for embedding models: https://huggingface.co/spaces/mteb/leaderboard
- We can choose to index the pdf dat…