-
We should test to see if the EnglishTokenizer impl is sufficient for German, and if not, add an additional tokenizer. EnglishTokenizer is based on porter stemmer.
-
We should test to see if the EnglishTokenizer impl is sufficient for Spanish, and if not, add an additional tokenizer. EnglishTokenizer is based on porter stemmer.
-
You currently offer the widely used Porter Stemmer and also the Lancaster Stemmer. The former is less aggressive and the later is often times too aggressive. It would be nice to implement the Lancaste…
-
Hi all,
I'm working on a project with Pyserini and would appreciate some guidance on efficiently updating document indexes. My goal is to avoid reindexing the entire document list whenever a docume…
-
Hi,
First, I want to thank you: having snowball stemmer in javascript world is great.
I have two questions:
- how stable is this library. You just release it. is it stable enough?
- I'd like to use …
fk-hb updated
9 years ago
-
This package currently contains no license information.
The porter stemmer class implementation by Richard Heyes is derived from the original implementation by Jon Abernathy: http://www.chuggnutt.com…
-
This commit adds some failing unit tests: https://github.com/webis-de/ir_axioms/commit/4a747d4bd22f4aea1fb754ebef48dbb5febbcc8a
Should be simple to resolve this. We load the term-pipeline from the …
-
```
$ gem install ferret
Building native extensions. This could take a while...
ERROR: Error installing ferret:
ERROR: Failed to build gem native extension.
```
followed by many error…
-
- [x] Quellen, Forschungsgruppe Snowballstemmer
- [x] NER Implementierung (Stanford Server Socket)
- [ ] Chunking
-
Hi, I get this error using the library, I'm trying to use the natural Porter stemmer and the sentence tokeniser and in both of them I get the same error
```
Uncaught Error: ENOENT: no such file or…