-
# Task Name
Emoji-Grounded Speech Emotion Recognition
## Task Objective
The primary goal of the Emoji-Grounded Speech Emotion Recognition (EG-SER) task is to develop a system that can accurat…
-
Hi Zhifeng,
Thank you so much for your help!
This issue is related to https://github.com/NVIDIA/audio-flamingo/issues/5, https://github.com/NVIDIA/audio-flamingo/issues/6, https://github.com/NVI…
-
Using this method i want to get embeddings from the model for query image. So, can you please give me some instruction for how to get embeddings from model.
-
||link|
|----|---|
|paper| [Emotions Understanding Model from Spoken Language using Deep Neural Networks and Mel-Frequency Cepstral Coefficients](https://ieeexplore.ieee.org/abstract/document/912269…
-
Hi,
I am trying to use your audio model. Preprocessing run without any error, however, training (using main_FineTuneWav2Vec_CV.py) doesn't continue. I don't see any error on the terminal prompt, and…
-
Post a link for a "possibility" reading of your own on the topic of Sound and Image Learning [for week 7], accompanied by a 300-400 word reflection that: 1) briefly summarizes the article (e.g., as we…
lkcao updated
2 years ago
-
Hi, I found that codec_superb_data contains many datasets and does not give the code for data preprocessing, does it mean that I need to resynthesize each dataset separately by myself according to the…
-
The task is to perform pre-training on a ResNet model on a simple emotion detection classification task. The input to the ResNet should be MFCCs computed using 1second of audio with a sampling rate of…
-
Hi good day, Am currently working on data fusion and stumbled upon your code. It has been of great help but it seems the code isn't finished yet. Am really intrested in this and would like to know if …
-
When i run “python live_predictions.py” , there comes the axis error :
Traceback (most recent call last):
File "live_predictions.py", line 61, in
live_prediction.make_predictions()
File…