-
hello, I have two questions.
1. why choose librosa.stft() and magphase() as input features? why not MFCC?
2. why choose '257' as spectrum length? I want to train voxceleb dataset, can you give me s…
-
I have started to train models based on [this tutorial](https://huggingface.co/blog/fine-tune-wav2vec2-english) (thanks to @patrickvonplaten) and so far everything works.
> Note: The model I am fi…
-
I just tried the Text to Speech feature... Interesting, but how is one supposed to change the spoken language? So far, the Text to Speech feature is totally unusable with the French EPUBs I have...
-
Here you will find a long list of the articles thats need to be coded. They are divided into sections, one for each coder (TR = Timo, MR = Melanie, JC = Joseph, AB = Agata, LK = Liam). Each item in th…
-
I am a newcomer in the field of speech recognition, I want to finetune the chinese model: vosk-model-cn-0.1, but there are few related materials, I hope my seniors can give some guidance。
请问有大佬微调过…
-
I used `kaldifeat` to extract some features and stored them using the default storage type, which is `LilcomChunkyWriter`, but it seemed to be throwing some errors at the time of data loading:
```
…
-
Note, 13th of February 2023: Next de facto discussion place until further notice is at #779.
————————————
So today I learned that [GitHub threads max out at 2,500 comments](https://github.com/Da…
-
![image](https://user-images.githubusercontent.com/19343842/140937585-b10a2f68-fb97-44a3-920a-b308667f8466.png)
the
The microphone cannot say Chinese
what is ok
你好啊 have no voice
```
# Copyrig…
-
Hello,
i have lot of short Chinese audio wave files of 5 seconds or so in hand. When i transcribe them with Azure Speech-To-Text REST API and Java SDK respectively, i found REST API recognition acc…
-
If possible it would be great for the speech recognition to be able to accept different languages at the same time