-
On travis the NER tagger is erroring out, but because we are not looking at the errors via:
```
$tagger->getErrors();
```
check out the `rraub-ner-tagger-error-catching` branch for explanation
rraub updated
8 years ago
-
I am a Chinese student.
My professional direction is the information extraction. So i want to communicate with you!
-
我在您的博客了解到“项目链接里包含了用真实电信业务数据训练的total_word_feature_extractor.dat”,请问训练用的数据可以从哪里获取到吗?
-
尊敬的覃博士,您好。我在词性标记过程中遇到了麻烦,请求您的帮助。具体情况如下:
第一,环境信息
R version 3.5.1 (2018-07-02)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows >= 8 x64 (build 9200)
Matrix products: default
l…
-
It looks like seshat has support activating different tokenisers depending on the language set at index creation time:
https://github.com/matrix-org/seshat/blob/71e17fa9c53de4776ee34b8b68f3b783147a…
-
- [ ] Select an analyzer for our test project
https://github.com/NatLibFi/Annif/wiki/Analyzers
-
Hi there 👋
Let's translate the course to Traditional Chinese so that the whole community can benefit from this resource 🌎!
Below are the chapters and files that need translating - let us know he…
-
你好!请教一个小白问题,origin_data里的训练及预测语料是用什么工具整理成那种格式的,能提供下代码吗?谢谢!
-
**Is your feature request related to a problem? Please describe.**
[There](https://github.com/lazyloong/obsidian-fuzzy-chinese) is another plugin for obsidian which introduces fuzzy search of pin…
-
### #FIXME: whitespace tokenizing does not work on chinese and japanese
Implementing NLP whitespace tokenizer for japanese and chinese language can be a little hard.:radioactive: It can't be done lik…