-
2022-01-10 17:05:17,709 INFO: char embedding size: 6169
2022-01-10 17:05:17,918 INFO: word embedding size: 71222
2022-01-10 17:05:21,442 INFO: Model(
(token_embedder): ConvTokenEmbedder(
(wo…
-
单个的英文单词不能通过取关键词取到吗?类似于c这个单词,即使把c加入到自定义的词典也不行啊
-
Hi.
现在的工程需要对中文文章进行摘要的提取,我使用过了gensim的summarize,不过gensim对中文的支持并不是特别好,现在使用snownlp来进行摘要的提取。不过发现一个问题:利用snownlp提取的摘要是没有原始文章的段落顺序的。
比如:
`
from snownlp import SnowNLP
text = u"""我是一个中国人。我出生在辽宁。我喜欢编程。....…
-
I am passing the set of English stopwords which I create from `yake/StopwordsList/stopwords_en.txt`.
```python
text = "YAKE! is a light-weight unsupervised automatic keyword extraction method whic…
-
我已经仔细查看过本库的README和之前的Issues,没有发现解决方案。
**问题描述**
在安装的时候,不管是从源码编译requirements.txt,还是pip install HarvestText. 都会出现 mysql_config not found
**如何复现**
操作系统:MacOs
python版本:3.6
HarvestText版本:最…
-
我已经仔细查看过本库的README和之前的Issues,没有发现解决方案。
**问题描述**
**如何复现**
操作系统:
python版本:
HarvestText版本:
-
Hi, thanks for making the library so usable! Asking for suggestions.
**Context:**
- Running SetFit on a small set of documents (financial; ~40 docs for 4 classes; ~10 docs per class for testing…
-
相关名词:
- 短语提取
- 语块提取
- 新词发现
- 关键词提取
- keyword/keyphrase extraction/chunking
- [NLTK](http://www.nltk.org/howto/collocations.html) [collocation](https://zhiyu-chen.gitbooks.io/notes-of-nlp-with-…
-
I've tried upgrading the `v2` actions to `v3` but still get errors during `pre-commit-hooks` installation.
Could this be a transient error on GH?
how pre-commit/action@v2.0.0 fails:
Run pre…
-
I see that the parameter `deacc` is set to True for all languages when `tokenize` is called from `clean_text_by_word` and `tokenize_by_word`. This means that in Nordic languages, characters with umlau…