-
There are some noticable inaccuracies in the output from the frog lemmatiser (such as `*heden` not being lemmatised to `*heid`), perhaps we can improve the lemmatisation.
One option is to add a dif…
-
Hey Jan, thanks for the awesome work. Been using the R package to handle lemmatisation on media corpora for multiple Central and Eastern European languages, however, I am wondering if there is a way t…
-
-
According to the [List_of_unsolved_problems_in_computer_science](https://en.wikipedia.org/wiki/List_of_unsolved_problems_in_computer_science)
> Is there any perfect stemming algorithm in the Englis…
-
Lemmatisation with ixa gave bad results. The worst thing is that ixa-pipes do some internal sentence splitting which is a nightmare for MT corpora. In order to be able to reconstruct the corpus, we sh…
-
with modification and cleanup, serve up Celano's work via an API (this is already partially being done in scaife.perseus.org for the token list widget) but put the information in the data plane for su…
-
This issue was created automatically with bugzilla2github
# Bugzilla Bug 2343
Date: 2017-02-28T17:38:57+01:00
From: Sjur Nørstebø Moshagen <>
To: Børre Gaup <>
CC: borre.gaup, ciprian.…
-
So hey, first of all thanks for the program. I found it very attractive. I have been toying around with it. I have been expanding it a little bit. I am NOT a skillful programmer. I am NOT an expert. I…
-
this basically involves porting over our existing search and ES infrastructure but dedicated to just Homer.
Reference Model: A3
-
Trying to run lemmatisation or validation of ATF files on a remote server occasionally, but ***not*** always, fails with error
```
[nisaba] error: Unexpected message from server: /home/oracc/tmp/sop…