korean-tokenizer Search Results

371 results
for korean-tokenizer


mikemccand/stargazers-migration-test #814

Decouple Kuromoji's morphological analyser and its dictionar…

I was inspired by this mailing-list thread. As many Japanese already know, the default built-in dictionary bundled with Kuromoji (MeCab IPADIC) is a bit old and has not been maintained for many years. While i…

mikemccand updated 2 years ago
33
UKPLab/EasyNMT #44

It keeps on downloading the models again and again when star…

vishwas31 updated 2 years ago
17
elastic/elasticsearch #37751

Korean (nori) Analysis Synonym Filter build failed

Error When Index Setting "Synonym Filter" with "Korean (nori) Analysis" **Elasticsearch version** (`bin/elasticsearch --version`): 6.5.3 **Plugins installed**: [ analysis-nori ] **JVM v…

AnSungHyun updated 2 months ago
5
irthomasthomas/undecidability #643

I finally got perfect labels (classification task) via promp…

- [ ] [I finally got perfect labels (classification task) via prompting : r/LocalLLaMA](https://www.reddit.com/r/LocalLLaMA/comments/1amvfua/i_finally_got_perfect_labels_classification_task/) # TIT…

irthomasthomas updated 6 months ago
1
KimDoubleB/LAB #2

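Elasticsearch Practical Guide (엘라스틱서치 실무 가이드)

Notes from reading the book [Elasticsearch Practical Guide](https://www.yes24.com/Product/Goods/71893929) ## Chapter shortcuts [Chapter 1 - Understanding Search Systems](https://github.com/KimDoubleB/LAB/issues/2#issuecomment-1735292654) [Chapter 2 - Exploring Elasticsearch](htt…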

KimDoubleB updated 10 months ago
10
huggingface/datatrove #135

Enhancing word_tokenize (like nltk) Support for Multiple La…

Hello, I'm currently working on text processing that involves filtering (like gopher) in various languages. But now, the default word_tokenization in datatrove filters is based on English, as shown…

justHungryMan updated 3 months ago
5
irthomasthomas/undecidability #642

TabbyML: Self-hosted AI coding assistant.

- [ ] [tabby/README.md at main · TabbyML/tabby](https://github.com/TabbyML/tabby/blob/main/README.md?plain=1) # tabby/README.md at main · TabbyML/tabby # 🐾 Tabby [![latest release](https://shield…

irthomasthomas updated 6 months ago
1
NON906/sd-webui-chatgpt #8

How can I resolve this?

This model's maximum context length is 4097 tokens. However, your messages resulted in 4112 tokens (3992 in the messages, 120 in the functions). Please reduce the length of the messages or functions. …

onexzero updated 8 months ago
13
elastic/elasticsearch #27290

ICU Tokenizer: letter-space-number-letter tokenized inconsis…

**Elasticsearch version** (`bin/elasticsearch --version`): 5.3.2 **Plugins installed**: [analysis-hebrew, analysis-icu, analysis-smartcn, analysis-stconvert, analysis-stempel, analysis-ukrainian, e…

Trey314159 updated 2 months ago
6
snunlp/KR-BERT #5

Which vocabulary am I supposed to use?

Before asking for help, thank you for sharing your decent work with the public. Your idea of optimizing the tokenizer for Korean has deeply inspired me. So I really want to try your model in tand…

hamgcho updated 4 months ago
3
