-
Post questions here for:
Manning and Schütze. 1999. Foundations of Statistical Natural Language Processing. MIT Press:
Chapter 3 (“Linguistic foundations”): 81-113.
-
We should test to see if the EnglishTokenizer impl is sufficient for German, and if not, add an additional tokenizer. EnglishTokenizer is based on porter stemmer.
-
Recent discussions have suggested that the UD documentation could benefit from a more detailed definition of "word". We can use this issue to discuss the existing definition and possible improvements.…
-
NB: It appears that annotators thought that 'aspect' verbs were to be marked as auxiliary verbs. The corresponding verbs in English are not considered AUX.
'**tornar-se** = become' (infinitive) sh…
-
Candidates for a next round of FastSpell, after inspecting HPLT data through HPLTAnalytics.
(Current config: https://github.com/mbanon/fastspell/blob/main/src/fastspell/config/similar.yaml ).
* …
-
When the packages will be retired, many popular online tutorials will become obsolete. For example, if we write: "*how to load vector data (or shapefile) in R*" in Google, the top 3 results (at least …
kadyb updated
11 months ago
-
## How to reproduce the behaviour
import spacy
nlp = spacy.load('de')
s1 = 'Der schöne Garten' …
-
### Terms
- [X] I have searched [open and closed feature requests](https://github.com/scribe-org/Scribe-Data/issues?q=is%3Aissue+label%3Afeature)
- [X] I agree to follow Scribe-Data's [Code of Conduc…
-
> He checado que las palabras son correctos en mis vocabularios. Veo que tengo que repetirlos.
User claims it must be "correctAs" and "repetirlos".
-
## Short Description
A Concept tries to capture the idea behind a classification of entities, allowing us to reason about everything in the class as if it were one thing. For example, the ideas (ment…