-
The readme makes it sound very simple: "Replace bert with xphonebert"
Looking a bit closer looks like it's quite a feat to make StyleTTS2 talk in non-english languages (https://github.com/yl4579/Styl…
-
When using `case_markup` in `space`/`none` mode, unexpected behavior happens:
```python
>>> pyonmttok.Tokenizer("none", case_markup=True).tokenize("你好世界,这是一个Test。")
... (['⦅mrk_case_modifier_C⦆', …
-
## Requested feature
I'm working on a corpus of short documents. Recent developments in examining short texts like from twitter, etc, have been documented. I'm including two files.
[X Cheng 2014…
-
I trained a unigram model on ```botchan.txt``` following the documentation examples. I then reapplied this model to the training text and I evaluated new logprobs from it by counting the tokens.
Th…
-
#My System Configurations
**CUDA: 9.1**
**libCUDNN 7.1**
Tensorflow Version: '1.8.0-rc0'
The system works for default Vietnam to English dataset but while training with Bodo English dataset th…
-
Hi there I have completed training for Vosk language model adaptation for US English model and I have picked graph, G.fst, G.carpa, and rnnlm_out folder from the trained model and replaced those with …
-
@bheinzerling
Could you provide training script?I want to train with my own data.
-
We have to do a fair amount of text preprocessing of our data before feeding it into tensorflow. Since the text manipulation abilities of tensorflow and tensorflow tranform are still relatively immatu…
-
-
## Keyword: metric learning
### Sparse online relative similarity learning
- **Authors:** Dezhong Yao, Peilin Zhao, Chen Yu, Hai Jin, Bin Li
- **Subjects:** Machine Learning (cs.LG); Artificial Inte…