-
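Hello,
I have recently been reading the official rasa_nlu documentation, which says there are several ways to set up the pipeline. Besides the two you describe, one recommended option is:
Best for most: spaCy + sklearn
My question: can this combination, together with jieba, be used to process Chinese? I am not familiar with spaCy, so I would like to ask why you originally chose jieba + mitie + sklearn rather than the combination above. Thanks.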
-
Hello,
It takes too long to parse the doc object, i.e. to iterate over the sentences and the tokens in them. Is that expected?
```
snlp = stanfordnlp.Pipeline(processors='tokenize,pos', models_dir=model_d…
-
**Describe the bug**
The current implementation of the `MySqlContainer` check for whether the database is running looks like this:
```python
wait_for_logs(
self,
re.compile(".*: ready for connecti…
-
It seems the config isn't quite right to include token probabilities. I'm not 100% sure of the solution, but this issue should help (https://github.com/explosion/spaCy/discussions/6388#discussioncomme…
-
How can I use this for the Sinhala language?
-
I have found SpanMarker models such as tomaarsen/span-marker-mbert-base-multinerd to be very useful for NER. However, Presidio does not seem to support the class 'span_marker.configuration…
-
To incorporate auxiliary verb rules into the [USAS Rule Based Tagger](https://ucrel.github.io/pymusas/api/taggers/rule_based#usasrulebasedtagger).
# Definition of auxiliary verb rules
All POS ta…
-
Hi,
I am the maintainer of another spaCy pipeline sentiment library, and I am trying to figure out how to benchmark spaCy sentiment models fairly.
I have written something here: https://github.com/sl…
-
Hi,
How can I solve this error?
OSError: [E050] Can't find model 'en'. It doesn't seem to be a shortcut link, a Python package or a valid path to a data directory.
Can anybody help me?
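
E050 usually means that the name passed to `spacy.load` is neither an installed package nor a valid path. Recent spaCy versions no longer support shortcut names such as `'en'`, so the full package name is needed. Below is a minimal sketch of one way to handle this, assuming `en_core_web_sm` is the model wanted. The `LEGACY_SHORTCUTS` mapping and the helper names `resolve_model_name`, `is_model_installed` and `load_model` are hypothetical and exist only for illustration; they are not part of spaCy.

```python
import importlib.util

# Assumed mapping from old shortcut names to full package names.
# Illustrative only: pick whichever model package you actually need.
LEGACY_SHORTCUTS = {"en": "en_core_web_sm"}


def resolve_model_name(name):
    """Return the full package name for a shortcut, or the name unchanged."""
    return LEGACY_SHORTCUTS.get(name, name)


def is_model_installed(name):
    """Check whether the resolved model name is importable as a Python package."""
    return importlib.util.find_spec(resolve_model_name(name)) is not None


def load_model(name):
    """Load a spaCy model by name, failing early with an install hint."""
    full_name = resolve_model_name(name)
    if not is_model_installed(full_name):
        raise OSError(
            f"Model package '{full_name}' is not installed. "
            f"Try: python -m spacy download {full_name}"
        )
    import spacy  # imported lazily so the helpers above work without spaCy

    return spacy.load(full_name)
```

If the package is missing, running `python -m spacy download en_core_web_sm` and then calling `load_model("en")` (or `spacy.load("en_core_web_sm")` directly) would be the usual next step.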
-
It looks to me like the problem is the size of the dependencies. We should see what can be stripped out. Possibly it's the spaCy models?