-
the provided link is not found.
-
Hi,
I am trying to use your audio model. Preprocessing run without any error, however, training (using main_FineTuneWav2Vec_CV.py) doesn't continue. I don't see any error on the terminal prompt, and…
-
Thank you for releasing the dataset, it is really valuable for deepfake researches. Could you release the labels of testset and the exact manipulation methods of 16cls_labels?
-
We need to enhance the voice frequency analysis model.
Currently, we are using Neural Network with some basic layers.
Requires knowledge of:
- Python
- Neural network
- Data science
- Machine le…
-
Hi Professor Bagus,I noticed that the y_egemaps.npy file extracted using the IEMOCAP database was used during VAD training, but I did not find the code to generate the file in the code.I am wondering …
-
I noticed that it mentioned that I was supposed to use and elder version of sklearn, but which version exactly should i use? I got an issue "No such file or directory: 'SVM_emotion_clf_24.pkl'", is th…
-
||link|
|----|---|
|paper| [Emotions Understanding Model from Spoken Language using Deep Neural Networks and Mel-Frequency Cepstral Coefficients](https://ieeexplore.ieee.org/abstract/document/912269…
-
I'm trying to run `synthesize` with the pretrained model, like such:
```bash
python3 synthesize.py --text "This sentence is a test" --speaker_id Actor_01 --emotion_id neutral --restore_step 450000…
-
Why input_shape equals to (100, 34)? 100 means time_steps? How to understand it? Thank you very much!
-
Please, how do I get the spectogram dataset?