korean-tokenizer Search Results

371 results
for korean-tokenizer

Best match

Best match Most commented Newest Recently updated Least commented Oldest Least recently updated

facebookresearch/fastText #224

What preprocessing steps were applied to the Wikipedias to t…

I would like to use fastText for languages that don't have clear word boundaries, such as Chinese, Japanese, Thai or Vietnamese. I have found various softwares to partition text from these languages …

ageron updated 5 years ago
12
huggingface/transformers #49

Multilingual Issue

Dear authors, I have two questions. First, how can I use multilingual pre-trained BERT in pytorch? Is it all download model to $BERT_BASE_DIR? Second is tokenization issue. For Chinese and Ja…

hahmyg updated 5 years ago
1
shivasiddharth/GassistPi #884

Language doesn't change

### Board and OS details: Raspberry Pi 3 - Raspbian Lite **CPU** ``` processor : 0 model name : ARMv7 Processor rev 4 (v7l) BogoMIPS : 38.40 Features : half thu…

oscardimanno updated 5 years ago
1
twitter/twitter-korean-text #114

java.util.regex.PatternSyntaxException: Look-behind pattern…

On running the Twitter Korean Text, I'm getting the error: ` Look-behind pattern matches must have a bounded maximum length near index 9 ((?

scorpionhiccup updated 5 years ago
7
MiKTeX/miktex #515

Add support for custom repositories in the local network

I need a way to provide a local repository for our users (because they cannot connect the online repositories). -"Local package repository (file system)" does not allow to choose a network path so …

christophvw updated 4 years ago
7
lovit/KR-WordRank #3

simple example

Thanks for great project ...! I'm using this kr-wordrank for data analytics ( especially for my application's user) I was using for big data log succesfully... But When I ran my simple code ag…

Ella77 updated 5 years ago
2
shivasiddharth/GassistPi #909

Assistant Crashed

**IMPORTANT NOTICE If you do not complete the template below it is likely that your issue will not be addressed. When providing information about your issue please be as extensive as possible so th…

Friday8229 updated 5 years ago
19
MiKTeX/miktex #383

miktexsetup x86 version "is not a valid Win32 application" e…

Good evening again! 1. There are no links to x86 versions of any software for Windows at https://miktex.org/download 2. I've downloaded x86 version of "Command-line installer" from http://mirr…

Wymaxep2011 updated 4 years ago
12
deeppavlov/DeepPavlov #158

Question: How to use the model for my own task?

I want to use the model go_bot for my own task. In particular, the model is similar to configs/go_bot/gobot_dstc2.json. But I want to use my own dialogue data and slots. My question is what files do I…

aCombray updated 5 years ago
4
mikemccand/stargazers-migration-test #231

Nori, a Korean analyzer based on mecab-ko-dic [LUCENE-8231]

There is a dictionary similar to IPADIC but for Korean called mecab-ko-dic: It is available under an Apache license here: https://bitbucket.org/eunjeon/mecab-ko-dic This dictionary was built with MeC…

mikemccand updated 6 years ago
62

上一页 1...30 31 32 33 34 35 36...38 下一页

371 results for korean-tokenizer

371 results
for korean-tokenizer