-
The mean and std I created are different from the values in mfcc_stats.pkl you provided.
Can you please check if I am doing something wrong?
I attached a simple code below.
thanks.
-------…
-
ERROR (nnet3-compute[5.5.1061~2-e4eb]:EnsureFrameIsComputed():nnet-am-decodable-simple.cc:101) Neural net expects 'input' features with dimension 43 but you provided 40
-
Hi,
from your [Python tutorial](https://essentia.upf.edu/essentia_python_tutorial.html) I was able to extract low level features like MFCCs, but I have no clue how to extract ("predict") high level f…
-
`torchaudio` is an extension library for PyTorch, designed to facilitate audio processing using the same PyTorch paradigms familiar to users of its tensor library. It provides powerful tools for audio…
-
### Details:
Create ANNs for classifying UrbanSound8K. MFCCs of the audio samples are present in the linked dataset below and are to be directly used in the classification input as features.
Experim…
-
In
https://github.com/lhotse-speech/lhotse/blob/d9c4141319adb39f64684c762aa541467d25f7fc/lhotse/kaldi.py#L144-L145
It uses `kaldiio` as the feature type.
However,
https://github.com/lhotse-sp…
-
`Traceback (most recent call last):
File "myExperiment.py", line 11, in
aT.extract_features_and_train(data, 1.0, 1.0, aT.shortTermWindow, aT.shortTermStep, "svm", "svmSMtemp", False)
File …
-
Yi Liu, Hello.
Thank you very much for your solution!
I trained with dataset voxcelev1&2 and xvector_nnet_tdnn_amsoftmax_m0.20_linear_bn_1e-2_tdnn4_att. Everything works as expected. Training, …
-
Attempting to cobble together code from https://github.com/MTG/essentia/issues/970 in order to experiment with Song Segmentation.
It would be great to have a working example as part of the Python e…
-
I am trying to use features of the parselmouth module and I'm finding that some methods lack enough documentation to know what their output is. Usually I can work backwards from the output of the same…