-
### Self Checks
- [X] I have searched [existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to…
-
Great job! I have a small question: I want to avoid catastrophic forgetting and preserve the ability to handle bilingual text, for example by training on both Chinese and English simultaneously. Can the language be set to …
-
Execute the following code (`tabooSegmentCustomDicList` contains more than 2,000 words):

```go
for _, tabooSegmentCustomDic := range tabooSegmentCustomDicList {
	lowerCaseWord := strings.ToLower(tabooSeg…
```
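Lowercasing each of 2,000+ dictionary entries on every pass gets expensive; a common alternative is to lowercase the dictionary once into a set and test membership in constant time. A minimal Python sketch of that idea (the entries shown are invented for illustration):

```python
# Build a lowercase lookup set once, instead of lowercasing
# every dictionary entry on each comparison pass.
taboo_dic_list = ["BadWord", "禁用词", "SPAM"]  # hypothetical entries
taboo_set = {w.lower() for w in taboo_dic_list}

def is_taboo(word: str) -> bool:
    # Case-insensitive membership test against the precomputed set
    return word.lower() in taboo_set

print(is_taboo("BADWORD"))
print(is_taboo("hello"))
```

The same precompute-once pattern applies directly in Go with a `map[string]struct{}` keyed by the lowercased words.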
-
The sentence is just split by character.
```python
# Import spaCy and create a blank Chinese nlp object
import spacy
nlp = spacy.blank("zh")
# Process the text ("I like tigers and lions.")
doc = nlp("我喜欢老虎和狮子。")
# Iterate over the doc and print each token
for i, token in enumerate(doc):
    print(i, token.text)
```
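The character-by-character output matches spaCy's default behavior for `spacy.blank("zh")`: unless a word segmenter such as jieba or pkuseg is configured, the Chinese tokenizer falls back to per-character segmentation. The effect can be reproduced in plain Python, no spaCy required:

```python
# spaCy's default "char" segmenter for Chinese effectively
# yields one token per character, equivalent to:
text = "我喜欢老虎和狮子。"
char_tokens = list(text)
print(char_tokens)
# Nine single-character tokens; multi-character words such as
# 老虎 ("tiger") and 狮子 ("lion") are split apart.
```

To get word-level tokens, spaCy's Chinese tokenizer must be configured with a real segmenter (see the `segmenter` option of `spacy.lang.zh.Chinese`).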
-
Unlike English, which relies on spaces to separate words, Chinese and Japanese use distinct punctuation marks such as the full stop (。), exclamation mark (!), and question mark (?) to denote the end of se…
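A sentence splitter built on that observation can be sketched with a regular expression that splits after CJK terminal punctuation (a simplification; it ignores closing quotes, ellipses, and similar edge cases):

```python
import re

def split_sentences(text: str):
    # Split immediately after sentence-ending punctuation,
    # keeping the punctuation attached to its sentence.
    parts = re.split(r"(?<=[。！？!?])", text)
    return [p for p in parts if p]

print(split_sentences("你好。你是谁？我很好!"))
```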
-
/chat: Will the LLM do word segmentation for Chinese, or does it simply read each Chinese character and run the process?
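As a rough answer to what the model actually sees: most LLMs do not run a Chinese word segmenter at all. The text is encoded by a subword tokenizer (typically BPE, often over UTF-8 bytes), so a "token" may be a byte fragment, a single character, or a multi-character chunk, depending on the model's vocabulary. A small illustration of the codepoint-vs-byte distinction (actual token counts vary by model):

```python
text = "我喜欢老虎"
# Five Unicode codepoints...
print(len(text))
# ...but fifteen UTF-8 bytes, since each of these CJK characters
# encodes to three bytes. A byte-level BPE tokenizer operates on
# this byte sequence, not on segmented words.
print(len(text.encode("utf-8")))
```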
-
**About Chinese word segmentation.**
All document splitters extend the HierarchicalDocumentSplitter class. When I set the overlap parameter, overlapFrom() is called, but there will force method invoc…
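For reference, the overlap behavior under discussion usually amounts to a sliding window: each chunk repeats the tail of the previous one. A language-agnostic sketch of that general idea (not the HierarchicalDocumentSplitter implementation):

```python
def chunk_with_overlap(text: str, size: int, overlap: int):
    # Each window advances by (size - overlap) characters,
    # so consecutive chunks share `overlap` characters.
    step = size - overlap
    return [text[i:i + size] for i in range(0, len(text), step)]

print(chunk_with_overlap("abcdefgh", 4, 2))
```

For Chinese this window is typically measured in characters or tokens rather than whitespace-delimited words.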
-
ICU is not a good choice in China. In addition, customizing the dictionary is very important for Chinese word segmentation, because the vocabulary used in different industries is completely d…
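One way to see why the custom dictionary matters: even a simple forward-maximum-matching segmenter produces entirely different tokens depending on the dictionary it is given. A minimal sketch (the dictionary entries are invented for illustration):

```python
def fmm_segment(text: str, dictionary: set, max_len: int = 4):
    # Forward maximum matching: at each position, take the longest
    # dictionary word starting there; fall back to one character.
    tokens, i = [], 0
    while i < len(text):
        for length in range(min(max_len, len(text) - i), 0, -1):
            candidate = text[i:i + length]
            if length == 1 or candidate in dictionary:
                tokens.append(candidate)
                i += length
                break
    return tokens

# A hypothetical domain-specific dictionary changes the result.
generic_dic = {"机器", "学习", "语言"}
domain_dic = {"机器学习", "自然语言", "处理"}
text = "机器学习和自然语言处理"
print(fmm_segment(text, generic_dic))
print(fmm_segment(text, domain_dic))
```

Production segmenters such as jieba support exactly this kind of user dictionary (e.g. `jieba.load_userdict`), which is why industry-specific term lists matter so much.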
-
![image](https://github.com/user-attachments/assets/9a919cd5-14e0-410a-aa7e-5916ee40ec27)
-
**Describe the bug**
The analyze module does not segment Chinese texts correctly. Because Chinese does not use whitespace to separate words, CATMA treats only punctuation symbols as w…