-
Is anyone else hitting the following fault when trying to run the example in the README locally?
INFO:sentence_transformers.SentenceTransformer:Load pretrained SentenceTransformer: all-MiniLM-L6-v2
Segmentati…
-
**Describe the bug**
Issue with feature annotation that is linked to a custom layer
**To Reproduce**
Here is a snippet of my typesystem file.
```
web…
-
When calling KyTea with a raw input file, KyTea stops processing at the first `newline` character.
Some sentences (particularly headlines) are delimited by a `newline` rather than any punctuation. If…
-
Hello! I want to fine-tune your model distiluse-base-multilingual-cased on a Chinese corpus such as LCQMC.
So, do Chinese sentences need word segmentation?
-
In [AG_Editable](http://libagar.org/man3/AG_Editable), we should implement [Unicode text segmentation](http://www.unicode.org/reports/tr29/) when performing word wrapping or selections.
Test cursor…
-
It seems our current word tokenizer is too dumb to see nbsp. That is not good. Not sure whether to replace that even before sentence segmentation or directly in the tokenizer.
ghost updated
8 years ago
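To make the nbsp problem concrete, here is a minimal sketch of the first option mentioned above: replacing the non-breaking space (U+00A0) before the text reaches the tokenizer. The `naive_tokenize` function is a hypothetical stand-in that splits only on ASCII spaces; it is not the project's actual tokenizer, and `normalize_nbsp` is likewise an illustrative name.

```python
NBSP = "\u00a0"  # non-breaking space, U+00A0


def naive_tokenize(text):
    """Hypothetical tokenizer that splits only on the ASCII space character."""
    return [tok for tok in text.split(" ") if tok]


def normalize_nbsp(text):
    """Replace every non-breaking space with a regular space."""
    return text.replace(NBSP, " ")


sample = "Hello" + NBSP + "world again"

# Without normalization, the nbsp keeps "Hello" and "world" glued together.
tokens_raw = naive_tokenize(sample)

# With normalization applied first, the words are separated as expected.
tokens_clean = naive_tokenize(normalize_nbsp(sample))

print(tokens_raw)
print(tokens_clean)
```

Because U+00A0 and the regular space are each a single code point, this replacement keeps character offsets unchanged, so it should be safe to apply before sentence segmentation as well. Handling nbsp inside the tokenizer instead would leave the original text untouched, at the cost of a more complex splitting rule.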
-
Hi Professor,
I'm trying to do screen recording on a server using the Python interface, which seems to rely on raisimUnity, but when I run raisimUnity, it reports a Segmentation fault (core dumped…
-
If I want to use my own text data to pre-train an ELECTRA model from scratch, what format should the text be in?
Is sentence segmentation enough, or is more preprocessing needed?
Please help.
-
It is in the Unicode UTS 35 spec, and there is a proposal to add it to ECMA-402. We should support it in ICU4X.
sffc updated
2 months ago
-
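Hello, following the example on the official site, I wrote a `get_nlp_result` function that iterates over the rows of `data_df` and passes the text data `row[text_col]` through the ws, pos, and ner functions in turn.
When running ner, I sometimes hit an AssertionError. What causes this?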
```
def get_nlp_result(d…