-
Implement feature extraction independently from the training part: Since this part takes some time, it should be done once so that multiple trainings can be done without computing them each time.
i…
-
When I use this model file: whisper_frontend_mesonet_mfcc.yaml, the following error occurs.
RuntimeError: Expected 3-dimensional tensor, but got 2-dimensional tensor for argument #1 'self' (while…
YMLLG updated
4 months ago
-
If there any function in this package support to compute the similarity of two audio file after MFCC- features extraction. Thanks a lot.
-
I tried the solution provided by lf0_lstm.py and so. When I tried to modify the parameters in tranning, a script in /data/training/compute_dtw.sh made me confused.
` ZASKA="Zaska -P $PRM_NAME $PRM…
-
Hi, Do you know how to implement this data enhancement method, using fbank features instead of mfcc features?
Looking forward to your reply, thank you very much
-
### Question
As your Important disclaimer, I accessed at flashlight (https://github.com/facebookresearch/flashlight) and build completely.
At the stage of python bindings, I built flashlight and wan…
-
**Debugging checklist**
[x] Have you read the troubleshooting page (https://montreal-forced-aligner.readthedocs.io/en/latest/user_guide/troubleshooting.html) and searched the documentation to ensur…
-
Hi,
I detected some features and algorithms used in Speech Processing unavailable in Essentia which might be interesting to implement. Here my proposals:
1. Mel Spectrogram
2. Delta MFCC
3. De…
-
__Write your question or issue with as much detail as possible__
I notice in the code that mel filter banks are there, but are these used only for the mfcc-extraction or can I get these output dire…
-
**Describe your question**
I want to train a CTC/Attention based acoustic model using MFCC features for ASR task. So, for that which config file and recipe should be used?