-
Hi,
I saw you contributed the CTC loss function in [https://github.com/FluxML/Flux.jl/pull/1287]. Thanks for all that work :).
There you mentioned you had an example with a publicly available speec…
-
Hi, I just wanna share the results obtained so far when training wavegan with 'continuous' speech.
Description of the data: the dataset consists of 1000 wavs varying from 2 secs to 10 secs long, fro…
-
Hello, I don’t know much about the feature extraction mfcc part of the code. If I want to view the 650 feature values after feature extraction by mfcc, where are these 650 feature values? Is it in the…
-
![123123](https://user-images.githubusercontent.com/31679768/37318557-2042fbcc-26a6-11e8-9749-618329833819.png)
I face this problem for long time....
-
HI, guys:
I following the steps to collect audio samples, train the net, and when I exec "precise-listen", I got errors as below, I use the latest "dev" branch. Can anyone help me on this?
Tha…
-
问题详细描述:
**01 使用图1中PaddleSpeech的小数据集(自己录制的音频文件)微调方案得到微调后的模型(12句模型)目录结构如图2**
![image](https://user-images.githubusercontent.com/126248892/221166226-db6e51f7-d27a-4359-9f26-17749f031f2f.png)
**图1 小…
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
__Write your question or issue with as much detail as possible__
I notice in the code that mel filter banks are there, but are these used only for the mfcc-extraction or can I get these output dire…
-
-
Hello,
I'm trying to figure out what I need to do so to my numpy array can be vocoded by the UniversalVocoder.
Attached is a sample npy file.
The output is from a modified https://github.com/…