-
-
Hi all,
I started an NLP project where I needed high accuracy sentence segmentation, and therefore decided to use stanza.
I was thrilled to find this library, since Spacy is quite intuitive. How…
-
Hello,
As a follow up on #11298 #11347, I would like to report some lemmatization problems with the spaCy 3.6 models for Italian, Spanish and French. We did not have these issues with the 3.2 versi…
-
This is probably a rare case occurring only when adding a spacymoji step to the pipeline of a StanfordNLPLanguage instance. However, what happens is that the spacymoji constructor uses the StanfordNLP…
-
## Expected Behavior
It would be nice if [some of the features](https://radimrehurek.com/gensim/models/phrases.html) of the `gensim.models.Phrases()` tool could get implemented into the `doc.…
-
When running data through the en_core_web_trf model concurrently I am getting different results between runs.
I cannot find anywhere in the documentation or other github issues where this behaviour i…
-
The aim of this issue is to discuss and understand when `tokenize` should happen in the pipeline.
The current solution is to apply `tokenize` once the text has already been cleaned, either with `c…
-
I regularly follow the developments on this project, and I must say that I am very interested and pleased with the direction `curated-transformers` is taking. The code is very understandable and high-…
-
The first thing that I found is the experimental biaffine parser can parse sentences with multiple roots. (I thought about the reporting of this situation as an issue, but dependency trees can include…
-
I am using `rstcheck` in pre-commit hook. In sphinx configuration, I have added an extra dependency - `sphinx-autoapi`, which auto generates documentation from docstrings based on package list provide…