-
Hello,
I understand that we need to split documents into smaller pieces because OpenAI cannot take the whole text as input.
However, my challenge is to cut texts in a smart way so it does not cut t…
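
One way to approach the splitting described above is to pack whole sentences into chunks instead of cutting at a fixed offset. The following is only a minimal sketch under a few assumptions: `split_into_chunks` and `max_chars` are hypothetical names, the limit is counted in characters rather than model tokens, and the sentence boundary is a naive regex that a real sentence segmenter could replace.

```python
import re
from typing import List


def split_into_chunks(text: str, max_chars: int = 200) -> List[str]:
    """Group whole sentences into chunks of at most max_chars characters."""
    # Naive boundary (an assumption): whitespace that follows '.', '!' or '?'.
    sentences = [s for s in re.split(r"(?<=[.!?])\s+", text.strip()) if s]
    chunks: List[str] = []
    current = ""
    for sentence in sentences:
        candidate = f"{current} {sentence}" if current else sentence
        if len(candidate) <= max_chars or not current:
            # The sentence fits, or it is a single oversized sentence,
            # which is kept whole rather than cut in the middle.
            current = candidate
        else:
            # Close the current chunk and start a new one with this sentence.
            chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks


example = "One two. Three four! Five six? Seven."
chunks = split_into_chunks(example, max_chars=20)
print(chunks)
```

A sentence longer than `max_chars` is kept whole here, so a chunk can exceed the limit; whether that is acceptable depends on the model's actual input limit.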
-
I really like spacy-llm, but it is impossible for me to use it. I keep getting connection timeouts with a working OpenAI API key, and after spending a lot of time setting up the whole framework finding th…
-
If the fishing spaCy component encounters any HTTP error, `ent._` will just return nothing and the component moves on to the next `doc`.
It would be great if there were an option for the component to block the spac…
-
Love the project! Thanks for putting it together.
The README asks for ways this project could be improved.
It would be amazing if you included the [spacy-transformers](https://spacy.io/…
-
Most of the currently ingested treebanks are encoded from the [Perseids Treebank template](https://perseids-publications.github.io/treebank-template/instructions/getting-started/) and conform to the […
-
Hi! First and foremost - thank you for these notebooks!!!
In [your spancat notebook](https://github.com/nestauk/dap_medium_articles/blob/dev/spancat_tutorial/spancat_training.ipynb),
I have follow…
-
The `pyspelling` package provides an architecture for parsing and spellchecking a variety of document types (Markdown, Python, HTML, etc.) and filtering different objects (e.g. URLs). But it's a bit tric…
-
Hello,
I am using `spaCy` to split sentences after joining a set of words with whitespace. But to my dismay, this process has unpredictable and unexplainable behaviour. I have a custom segment…
-
Clean existing pipeline and move it into the new repository.
Convert lemmatizer rules to the proposed spaCy JSON format and update the PR.
-
**Describe the bug**
Hi, I use the example in the demo.
I'm facing this error:
ValueError: [E090] Extension 'similarity' already exists on Span. To overwrite the existing extension, set `force=…