-
There seems to be an issue with stopwords not being properly excluded by the current search config. For example, if you search for [Demandes en obtention](http://portal.ehri-project.eu/search?q=Demand…
-
tbd if I have time
-
nltk_data/packages/corpora/stopwords.zip contains four wrong german stopwords:
```
unse
unsem
unsen
unses
```
-
At the top of the search results, a hatnote states that in and on have been excluded from the search, yet lower down those words are highlighted in selected content from returned pages.
-
I am trying to remove some frequently occurring words in my corpus using stm's built-in textProcessor. My code ran without any errors, but the words I specified were not removed. Does anyone know if I…
-
hello,
how can i config of stop words for arabic language and english language,
i did it like that but it doesn't work :
'analysis' => [
'analyzer' => [
'custom_analy…
-
While working on bodleian/fihrist-mss#28 I've realized that, while stopwords files exists (for English, Arabic, and other languages) in the standard Solr installation, they haven't been enabled in the…
-
When I try to access the swahili stopwords using the below feature, I'm getting a traceback that the swahili stopwords are missing in the documentation, I'm currently working on building a swahili lan…
-
this is for the future.
When the histogram is daemonized with -D then it will be possible to call ./histogram multiple times with various parameters. Each time ./histogram is called, the files passe…
-
I am passing the set of English stopwords which I create from `yake/StopwordsList/stopwords_en.txt`.
```python
text = "YAKE! is a light-weight unsupervised automatic keyword extraction method whic…