-
Hi,
I noticed a problem with the behaviour for the word 'schulen' in German (as a noun vs as a verb) when using all capitals:
```sh
>>> simplemma.lemmatize("Schulen", "de")
'Schule' # plural …
-
I have a suggestion to implement external morphology system in GD besides Hunspell.
The main goal: to replace word stemming. Spelling suggestion is out of my scope.
external morphology system is just …
-
I am a single user, and this is an amalgamation of the issues I've encountered and ideas for improvement I've had over the course of using your project these past few days.
I'd first, however, like…
-
Current version of Annotace does not work with Spark NLP profile in english (optionally investigate czech language as well), please investigate and describe it here. **It used to work in previous vers…
-
May be unnecessary for Release 0.1
-
Dear Authors,
I am happy to see your results on the insurance data set and tempted to re-produce on my side. But, I could not replicate on my test data. The reasons are that you applied stop-word r…
-
If you see the commands below, for any single word identified, the lemma is being shown as different which for me is a bug. Please, correct me if not and arabic follows another pattern different of th…
-
This is a follow-up of #953
Add a new feature to the Token to represent the "form" of a token. However, a tokenizer may choose to set this feature differently to establish a basic normalization wit…
-
Hi, nice project. Actually I began to do something similar when I encounter your project and doubt whether continue now on my own or join forces with you. Can you please explain how do I suppose to tr…
-
I have been having an ongoing issue with the website attributing extended senses to words lemmed with just the GW. This issue occurs when the word is written the same way as an attestation that does …