-
How can I add a stopwords list for an additional language?
-
## Describe the bug
En testant le suggester des PCS dont les stopwords contiennent DANS, j'ai parmi les échos :
![image](https://github.com/user-attachments/assets/8d665ba8-25c1-4eaf-b80a-1e9c6170da…
-
I am trying to remove some frequently occurring words in my corpus using stm's built-in textProcessor. My code ran without any errors, but the words I specified were not removed. Does anyone know if I…
-
### Bug Description:
**Issue Summary:**
When performing a phrase search on a full-text field 'title' that includes stopwords, no results are returned even when an exact match exists in the data.
…
-
### Missing functionality
Word clouds contain the most common words, and for free text fields, these words are often: 'and', 'to', 'the', 'from' etc. Which provide no meaningful insight into the da…
-
*@kelson42 commented on Mar 12, 2017, 8:21 PM UTC:*
Stopwords are words which should not be indexed (during the FT index process) and also be ignore during the FT search. This stopwords are language …
ghost updated
1 month ago
-
Use NLTK English stopword list in initial implementation.
Then make it possible to choose from other stopword lists from a database of stopword lists.
-
## User Story
As a user it's distracting to see common stopwords (https://github.com/pulibrary/pul_solr/blob/main/solr_configs/catalog-production-v2/conf/stopwords.txt) highlighted in search results. …
-
### Bug Description:
Manticore removes all ngram characters from the stopwords file despite their collocation.
```sql
CREATE TABLE IF NOT EXISTS test (data text indexed) charset_table='non_cjk' s…
-
There seems to be an issue with stopwords not being properly excluded by the current search config. For example, if you search for [Demandes en obtention](http://portal.ehri-project.eu/search?q=Demand…