-
I would like to use fastText for languages that don't have clear word boundaries, such as Chinese, Japanese, Thai or Vietnamese. I have found various softwares to partition text from these languages …
-
Dear authors,
I have two questions.
First, how can I use multilingual pre-trained BERT in pytorch?
Is it all download model to $BERT_BASE_DIR?
Second is tokenization issue.
For Chinese and Ja…
-
### Board and OS details:
Raspberry Pi 3 - Raspbian Lite
**CPU**
```
processor : 0
model name : ARMv7 Processor rev 4 (v7l)
BogoMIPS : 38.40
Features : half thu…
-
On running the Twitter Korean Text, I'm getting the error:
` Look-behind pattern matches must have a bounded maximum length near index 9
((?
-
I need a way to provide a local repository for our users (because they cannot connect the online repositories).
-"Local package repository (file system)" does not allow to choose a network path so …
-
Thanks for great project ...! I'm using this kr-wordrank for data analytics ( especially for my application's user)
I was using for big data log succesfully... But When I ran my simple code ag…
-
**IMPORTANT NOTICE
If you do not complete the template below it is likely that your issue will not be addressed. When providing information about your issue please be as extensive as possible so th…
-
Good evening again!
1. There are no links to x86 versions of any software for Windows at
https://miktex.org/download
2. I've downloaded x86 version of "Command-line installer" from
http://mirr…
-
I want to use the model go_bot for my own task. In particular, the model is similar to configs/go_bot/gobot_dstc2.json. But I want to use my own dialogue data and slots. My question is what files do I…
-
There is a dictionary similar to IPADIC but for Korean called mecab-ko-dic:
It is available under an Apache license here:
https://bitbucket.org/eunjeon/mecab-ko-dic
This dictionary was built with MeC…