-
KenLMScorer is fantastic. Just so useful. However, it isn't core to replaCy and should be a custom pipeline component (that we expect most people to use... think like `en_core_web_sm` is for spaCy - a…
-
Hi, I noticed from #12328 that spaCy has switched to `pymecab-ko` for the Korean tokenizer in the upcoming `spaCy` 4.0, but there seems to be some installation/import issues of this package on macOSes…
-
### Environment
* Operating System: Mac OS 14.6.1 (23G93)
* Python Version: Python 3.11.7
* Python packages:
```
absl-py==2.1.0
aiohttp==3.9.3
aiosignal==1.3.1
altgraph==0.17.4
annotated-ty…
-
In the implementation of the mistral LLM connection an attribute is created which checks if the Instruct model version was selected. This check fails, as it is checked if lowercase "instruct" is in th…
-
Ajouter le code pour l'extraction des entités nommées :
- via Spacy
- via AWS
- via NLTK
-
Running this EntityLinker example code from the github repo throws the following error
`OSError: [Errno 36] File name too long: '/home/username/.scispacy/datasets/e9f7327283e43f0482f7c0c71b71dec278…
-
Running 10-15K documents through the preprocessing chain taken a couple of hours (most of the time on spaCy processing). For the COVID data this goes up to almost 30 hours. The culprit seems to be tha…
-
error in google colab due to
TypeError Traceback (most recent call last)
[](https://localhost:8080/#) in ()
----> 1 qg=QGen()
[/usr/local/lib/python3.10/dist-pac…
-
I'm getting the following error when I execute `train.py`
`ValueError: [E024] Could not find an optimal move to supervise the parser. Usually, this means the GoldParse was not correct. For example,…
-
pkuseg is no longer compatible with python 3.9.
please use jieba or spacy_pkuseg instead.
Please refer to https://github.com/lancopku/pkuseg-python/issues/162