-
For [linguistic processing](https://docs.vespa.ai/en/linguistics.html), such as tokenization and stemming, Vespa integrates with [Apache OpenNLP](https://opennlp.apache.org/models.html). The downside…
-
### Describe the current behavior
Google Search Console gives the error "Video is not the main content of the page" when indexing videos on our PeerTube site.
This is one of the video pages that G…
-
### Elasticsearch Version
8.10.4
### Installed Plugins
_No response_
### Java Version
_bundled_
### OS Version
Elastic Cloud - GCP - Iowa (us-central1)
### Problem Description
…
-
Interesting: https://forum.obsidian.md/t/make-a-better-pdf-annotation-in-canvas/61298
Currently, PDF++ does not support Canvas because internal links in canvas files are not indexed by Obsidian.
H…
-
I have the following query:
```
from index 'PersonDtoIndex'
where OwningContact_Id = "08dc49e4-3043-2c2b-30d0-42ebaadf0000"
```
And the following index:
```
using System;
using System.…
-
**Describe the bug**
The `language` annotation is applied only once even when several are provided; as a result, the search query is stemmed just once.
**To Reproduce**
Schema:
```sd
…
-
Finding terms within X positions of the start of a field is currently possible via `intervals` queries, although it only really works for single-valued fields. It is not at all possible to find a ter…
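
As a rough illustration of the `intervals` approach mentioned above, the request body can be built in Python. This is only a sketch under assumptions: the field name `body`, the term `vespa`, and the helper `first_positions_query` are hypothetical, and it relies on the script filter that `intervals` queries expose (`interval.end`), with token positions counted from 0.

```python
import json


def first_positions_query(field, term, max_positions):
    """Build an intervals query body that keeps only matches of `term`
    ending before token position `max_positions` (0-based positions)."""
    # The cutoff is inlined as an integer so the script source stays trivial.
    script_source = f"interval.end < {int(max_positions)}"
    return {
        "query": {
            "intervals": {
                field: {
                    "match": {
                        "query": term,
                        "filter": {"script": {"source": script_source}},
                    }
                }
            }
        }
    }


if __name__ == "__main__":
    # Hypothetical field and term, for illustration only.
    print(json.dumps(first_positions_query("body", "vespa", 5), indent=2))
```

The sketch only builds the JSON body; it does not address the multi-valued field limitation described above.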
-
### Self Checks
- [X] This is only for bug reports; if you would like to ask a question, please head to [Discussions](https://github.com/langgenius/dify/discussions/categories/general).
- [X] I have s…
-
I still don't know whether there is room for optimization there, but the performance drop is significant. On a dataset whose documents contain a large array of short texts, I observed a drop of 25% in indexi…
-
I think that it would probably make sense to index any `alt` text on `img`s.
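
A minimal sketch of what collecting `alt` text could look like, assuming the pages are available as HTML strings and using only Python's standard `html.parser`. The names `AltTextExtractor` and `extract_alt_text` are hypothetical and not part of any existing indexer.

```python
from html.parser import HTMLParser


class AltTextExtractor(HTMLParser):
    """Collects non-empty `alt` attribute values from `img` tags."""

    def __init__(self):
        super().__init__()
        self.alt_texts = []

    def handle_starttag(self, tag, attrs):
        # Also reached for self-closing tags such as <img ... />.
        if tag != "img":
            return
        for name, value in attrs:
            # Skip missing or empty alt text (often used for decorative images).
            if name == "alt" and value and value.strip():
                self.alt_texts.append(value.strip())


def extract_alt_text(html):
    """Return the alt texts found in `html`, in document order."""
    parser = AltTextExtractor()
    parser.feed(html)
    parser.close()
    return parser.alt_texts
```

The returned strings could then be appended to whatever text is already being indexed for the page. Whether empty `alt` values should be skipped is a design choice made here for illustration.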