-
reported by Tess Dejaeghere:
> thread 'main' panicked at 'computed lemma: OtherError("Rule does not match
> current word (unable to find and remove suffix o)")',
> /home/tessd/tessdlama/opt/cargo…
-
An example from aemw/amarna/P271181.
The editor, quite reasonably, types:
@label r 083 - r 088
@?Now, [I will dispatch] Mane during this year. He held back [... ...]?@ Because of this, etc.…
-
I was just training a udpipe lemmatiser myself on dutch and browsed a bit github for udpipe_train and saw this.
I'm the author of the udpipe R package and happen to work on some 18th-19th century tex…
-
Hello. Discovered your repo while looking for examples of udpipe_train
I'm the author of the udpipe R package and happen to work on some 18th-19th century texts myself (dutch / french).
I see you a…
-
There are some noticable inaccuracies in the output from the frog lemmatiser (such as `*heden` not being lemmatised to `*heid`), perhaps we can improve the lemmatisation.
One option is to add a dif…
-
Bonjour,
Depuis passage à Windows 11, dans la version PC, certains boutons ne sont plus visibles. Par exemple, le bouton "Lemmatiser".
- Est-ce un pb connu avec Windows11 ?
- Y a-t-il une solutio…
-
Hi, Thank you for making the code open-source.
Can we use the code-base for languages other than English, example German? I see you have used spaCy-en. Would installing spaCy-german help?
-
Do we also include words that have several meanings like bitc*? Also I think that for polish language is quite important to include different forms of particular words.
-
_corpkit_ is currently oriented toward English, but nothing stops at least some features from being extended to other languages. I should be able to get around to the basics (encodings, as well as mul…
-
Upon running `oracc harvest` in saa0/saa12, the harvester fails to harvest. The following error is encountered:
`ORACC::XML: /home/oracc/bld/saao/saa12/P285/P285577/P285577.xtf:1: parser error : In…