-
ICU allows for specifying arbitrary boundary rules based on a regex-like syntax. Moreover, it supports dictionary-based break iteration with dictionaries specified by users. I haven't created any inte…
-
This project is very interesting, i am wondering what i need to do to add an additional language to it, in my case i want to use it for Finnish.
Maybe, if we have a list of task needed for a new l…
-
## Description
I'm trying to use Typesense with my content in Thai. What's special is that Thai (and a few other languages) doesn't use spaces to separate words. Typesense seems to care about that.…
-
Hi, I try to use wordfreq on Japanese on Centos 7. I keep getting an error of `Couldn't find the MeCab dictionary named 'mecab-ipadic-utf8'`, however, there's no such package on Centos 7. It's called …
-
Hi Abigail .. I was trying to run the code using the already existing training model that was uploaded, as I do not have a powerful enough machine to train. I believe the vocab size is set to 50000 in…
-
### Describe the bug
` error: Microsoft Visual C++ 14.0 or greater is required. Get it with "Microsoft C++ Build Tools": https://visualstudio.microsoft.com/visual-cpp-build-tools/
[end of out…
li589 updated
1 month ago
-
unlike standard analyzer, nori analyzer removes the decimal point.
Elasticsearch version (bin/elasticsearch --version): 6.6.2
Plugins installed: [ analysis-nori ]
JVM version (java -version):…
-
Hi @monologg, thank's for your great work! I was trying to play around with your model on huggingface but I got this error `Can't load config for 'monologg/koelectra-base-finetuned-naver-ner'. Make su…
-
- [ ] [unsloth/README.md at main · unslothai/unsloth](https://github.com/unslothai/unsloth/blob/main/README.md?plain=1)
# unsloth/README.md at main · unslothai/unsloth
…
-
The readme makes it sound very simple: "Replace bert with xphonebert"
Looking a bit closer looks like it's quite a feat to make StyleTTS2 talk in non-english languages (https://github.com/yl4579/Styl…