-
**Describe the bug**
Hashtag entities not working in Chinese (zh).
**To Reproduce**
```javascript
const { NlpManager } = require('node-nlp');
const manager = new NlpManager({ languages: ['zh'] …
-
C++ 代码
lm::ngram::Config config;
model = new lm::ngram::Model(language_model_path);
初始话C++ 版本的 kenlm 模型报错
相同代码时而能成功 ,但常时间下是报错的
Segmentation fault (core dumped)
段错误 可能是哪儿指针存在问题,求大神救救我
-
Exception:
![image](https://github.com/user-attachments/assets/1851b880-9f3d-495a-8eae-3f0c36cc9c9e)
I never specified the path in the code: /home/john/extern_data/corenlp-segmenter/dict-chris6.se…
-
这里报错要怎么办呀?
File "/Users/lyl/Desktop/postgraduate/一/NLP/SpERT_chinese-main/spert/util.py", line 218, in check_version
state_dict = torch.load(model_path, map_location=torch.device('cpu'))
Fi…
-
python可以添加自定义词典吗
-
not a bug report per se
I'm wondering how spacy/chinese models compares with the stanza project?
Stanza already provides chinese support with many features
https://stanfordnlp.github.io/stanza/mo…
dcsan updated
4 years ago
-
I have tried to tokenize (using other NLP tools) the Chinese articles and pass it into `FingerPrint->hash` function.
I got two fingers:
01100101011010111111111001001101010111100000111010010000000…
-
**Background**
1. In Chinese text, characters (not spaces) provide an approximate solution for word tokenization.
That is, every Chinese character can be treated as if it is a token (or word). Wh…
-
目标场景是使用paddle识别ocr结果 在另外的流程中使用modelscope的模型进行nlp分析。但似乎只要添加了from modelscope import pipeline, Tasks就报错。 注释掉就没问题。是否包冲突、有无解决优化和解决方案
- 系统环境/System Environment:
```
conda install pytorch==2.0.1 torchvi…
-
**Hello.**
![image](https://user-images.githubusercontent.com/99230615/173962594-7d485359-9f01-4f84-8eb9-3ea3ec4ed992.png)
**How are you?**
![image](https://user-images.githubusercontent.com/9923…