-
The latest Travis builds fail with an error like this:
```
../../../virtualenv/python3.5.6/lib/python3.5/site-packages/gensim/__init__.py:5: in
from gensim import parsing, corpora, matutils,…
-
In #34 I resorted to a Snowball stemmer for Finnish because of difficulties installing libvoikko in a virtual environment (and Travis might be problematic too).
But it would be worth at least tryin…
-
According to the [Annif-fusion experiments](https://github.com/osma/Annif-fusion/blob/master/Annif-Fusion-LTR.ipynb), using PAV a.k.a. isotonic regression could provide a significant improvement in an…
-
After #136 it would be possible to use a DocumentDirectory (directory with txt + tsv files) as a training corpus. It would need to implement the DocumentCorpus interface (ABC). This in itself is very …
-
I get this error when running under mod_wsgi:
```
Traceback (most recent call last):
File "/var/www/Annif/annif.wsgi", line 6, in
application = create_app()
File "/opt/Annif/annif/…
-
We need a web UI for testing Annif, e.g. a simple HTML form similar to the one in the prototype (http://annif.org).
-
Right now the `annif.corpus` classes are a bit of a mess. They are trying to support different kinds of corpora:
* SubjectIndex: subjects only as a TSV file + lookup and save functionality
* Subje…
-
Document and implement an `optimize` CLI command (no REST API necessary) that works similarly to the `optimize.py` script in the prototype, i.e., given a document corpus with gold standard subjects, loo…
-
We could support a command for evaluating how well Annif performs when compared against a gold standard (one or more manually created subject sets per document).
The CLI command for a single docume…
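
As a rough illustration of what comparing against a gold standard could involve, here is a minimal sketch for one document and one gold standard subject set. The choice of metrics (precision, recall, F1) and the function name `evaluate_subjects` are assumptions made for this example and are not taken from Annif or the prototype. Handling several gold standard sets per document is left out.

```python
from typing import Dict, Iterable


def evaluate_subjects(suggested: Iterable[str], gold: Iterable[str]) -> Dict[str, float]:
    """Compare suggested subjects against one gold standard subject set.

    Hypothetical helper for illustration only; not part of Annif.
    """
    suggested_set = set(suggested)
    gold_set = set(gold)

    # Subjects that appear both in the suggestions and in the gold standard
    hits = len(suggested_set & gold_set)

    # Guard against division by zero when either set is empty
    precision = hits / len(suggested_set) if suggested_set else 0.0
    recall = hits / len(gold_set) if gold_set else 0.0

    # F1 is the harmonic mean of precision and recall
    if precision + recall > 0:
        f1 = 2 * precision * recall / (precision + recall)
    else:
        f1 = 0.0

    return {"precision": precision, "recall": recall, "f1": f1}


if __name__ == "__main__":
    # Made-up subject identifiers, purely for demonstration
    result = evaluate_subjects(["s1", "s2", "s3", "s4"], ["s1", "s2", "s5"])
    for name, value in result.items():
        print(f"{name}: {value:.3f}")
```

Working on sets keeps the sketch independent of how suggestions are ranked. A rank-aware evaluation would need a different measure.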
-
The same format used by the Annif prototype