-
- 版本、环境信息:
1)PaddlePaddle版本:2.2.2.post101
2)GPU:Driver Version 418.40.04.
CUDA Version 10.1
CDUATOOLKIT release 10.1,V10.1.105
…
-
not a bug report per se
I'm wondering how spacy/chinese models compares with the stanza project?
Stanza already provides chinese support with many features
https://stanfordnlp.github.io/stanza/mo…
dcsan updated
4 years ago
-
What's the expected behaviour if a web page contains multiple languages? For example, if a page contains Chinese and Japanese, the segmentation process and full-text indexes could be different. Even t…
-
It's so great that RediSearch could be another choice for full-text search since i am an Elasticsearch fans and a search engineer for almost ten years.
I am the author of [Friso](https://github.com…
-
### Bug Description
BM25Retriever cannot work on chinese.
### Version
main
### Steps to Reproduce
```python
from llama_index.retrievers.bm25 import BM25Retriever
from llama_index.core import Do…
-
> 因为我们目前发现在之前的工作其实是繁简混合的“古文”,存在不严谨之处,所以正在训练纯粹的繁体古文模型,工作正在进行中,如果有进展再行告知。
不知進展如何? 能否協助提供 “古文” 的 checkpoint 供測試? 謝謝。
-
I want to use https://github.com/fxsjy/jieba as a Chinese text segmentation to enhance Whoosh when I'm using Haystack.
But I don't find any native solution, so I hack `whoosh_backend.py`:
orig:
> …
-
Hello, my name is Guoao Wei. I am a Chinese student interested in NLP and I can help with the Chinese language support for this amazing repository.
## About me
I received a bachelor's degree of …
-
First, I’m sorry about to post issue in chinese in last issue.
for this issue, I have transfer my data to libffm format,
and traindata is 730M, validatedata is 290M
my device info: 16cores + 64GB…
-
## Your Environment
* Operating System:
* Python Version Used:
* Scattertext Version Used:
* Environment Information:
* Browser used (if an HTML error):
Hi,
It seems in your demo code…