-
Hi,
I am trying to fit my own model in this app. When I speak into the app, it extracts & prints the MFCC features, but crashes afterwards giving the following error:
04-03 12:56:16.754 24654-2481…
-
I ran your model on below mentioned filenames. Out of 60 files, the result was 21 times wrong. Am I doing anything wrong?
```
Prediction for file [ ../Audio_Speech_Actors_01-24/Actor_01/03-01…
ghost updated
5 years ago
-
there is apply-cmvn-online in kaldi,how to realize it in pykaldi.
i want to find something in doc.but i don't get it.
-
Thanks philipperemy for this work,
1) What does number of features do? Couldnot understand that
2) I just want to the number of layers to something like 4 or 5 but there is a dimension issue. Is…
-
sir,can you explain your x_train,y_trian,x_test,y_test?it seems that the y_* isn't the label for data
-
Sometimes we may wait a few seconds for ASR system to process a piece of audio.
How did you solve this situation? How to improve the speed of recognition?
The question is "How did you get t…
-
```
...
File "/u/zeyer/setups/librispeech/2018-02-26--att/returnn/TFEngine.py", line 1180, in train
line: self.train_epoch()
locals:
self =
self.train_epoch =
File "…
-
How could I fix it?I am trying to decode the chain model of aishell2.
ERROR (online2-wav-nnet3-latgen-faster[5.4]:OnlineTransform():online-feature.cc:421) Dimension mismatch: source features have dim…
-
Respected Sir,
Greetings of the day !!!
Sir first of all thank you so much for such amazing library you shared with us.
Sir I am using SpeechPy library for extracting the MFCC of audio signal.
…
ghost updated
5 years ago
-
The audio information:
Input File : 'aa.wav'
Channels : 1
Sample Rate : 16000
Precision : 16-bit
Duration : 00:00:00.64 = 10160 samples ~ 47.625 CDDA sectors
File Size …