-
For some reason, locally on a mac I'm getting this error:
![screenshot_2024-05-08_at_14 12 50](https://github.com/amaccis/php-stemmer/assets/13278505/e8aba988-6a01-4bb5-97eb-9f25ca371f55)
```
…
-
Currently, '`porter`' stemmer is used by default for duckdb indexing here https://github.com/huggingface/datasets-server/pull/1296/files#diff-d9a2c828d7feca3b7f9e332e040ef861e842a16d18276b356461d2aa34…
-
Hello!
I'm experimenting with the default Brazilian Portuguese stemmer and I'm finding that while it does work with singular/plurals, most adjectives won't work. Searching for alternative stemmers le…
-
They are not added, which will definitely cause problems with recall in the case where users add punctuation to their queries.
Unfortunately, this is a bug-compatibility with lunr.js issue: https:/…
-
Info/implementation:
http://lexicalresearch.com/software.html
This is an English-language stemmer, so it can wait until ~~2.0~~ 0.4.
-
Please look at the code below. Is there a way to avoid transliteration of "sheet" substring to "шеет" one in the 2nd case?
Code:
from nltk.stem import SnowballStemmer
stemmer = SnowballStemmer(la…
-
- [ ] Clear prefixes first, clear suffixes second
- [ ] `al` `kal` `fal` `bal` `bb` should marked first, and set `is_noun`
- [ ] `aa` `ww` `ff` should marked first
- [ ] Greedy to choose betwe…
-
Hello,
I'm using Snowball via Elasticsearch, which is based on Lucene. The Snowball German stemming is not matching some common forms:
- "schlummert" should match "schlummern" (infinitive) but i…
-
### Problem:
The sumy module uses the nltk package for stemming and stop words, but nltk does not support e.g. the Polish language out of the box.
### Solution:
#### Stop words:
Download the Poli…
-
Currently from the generated code for C#, the stemmer uses an internal state to keep track of position etc. Unfortunately this prevents re-use of the same stemmer instance across multiple threads. Cre…