-
A Speech Emotion Recognition (SER) system is a combination of different frameworks that analyzes audio signals to identify emotions. We can use one or combine other parts to r…
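
To make the idea of "analyzing audio signals to identify emotions" concrete, here is a minimal sketch, not the pipeline this issue proposes. It assumes two hand-crafted features (mean RMS energy and mean zero-crossing rate per frame) and a nearest-centroid rule. The labels `calm` and `excited`, the helper names, and the synthetic sine tones standing in for speech are all hypothetical.

```python
import math


def frame_features(samples, frame_len=400):
    """Return (mean RMS energy, mean zero-crossing rate) over fixed-length frames."""
    rms_vals, zcr_vals = [], []
    for start in range(0, len(samples) - frame_len + 1, frame_len):
        frame = samples[start:start + frame_len]
        # Root-mean-square energy of the frame
        rms_vals.append(math.sqrt(sum(x * x for x in frame) / frame_len))
        # Fraction of adjacent sample pairs whose sign differs
        crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
        zcr_vals.append(crossings / (frame_len - 1))
    if not rms_vals:
        raise ValueError("signal shorter than one frame")
    return (sum(rms_vals) / len(rms_vals), sum(zcr_vals) / len(zcr_vals))


def nearest_centroid(features, centroids):
    """Pick the label whose centroid is closest in Euclidean distance."""
    return min(centroids, key=lambda label: math.dist(features, centroids[label]))


def sine(freq_hz, amplitude, n=1600, rate=16000):
    """Synthetic stand-in for a recording (assumption: 16 kHz mono samples)."""
    return [amplitude * math.sin(2 * math.pi * freq_hz * t / rate) for t in range(n)]


# Hypothetical two-class setup: each centroid comes from one synthetic tone.
centroids = {
    "calm": frame_features(sine(100, 0.1)),
    "excited": frame_features(sine(1000, 0.8)),
}
prediction = nearest_centroid(frame_features(sine(900, 0.7)), centroids)
print(prediction)
```

On these synthetic tones the query lands closer to the `excited` centroid. A real system would replace the toy features with learned or spectral representations and the centroid rule with a trained classifier.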
-
### Describe the feature
If you want to work on this issue, please ask the maintainers to assign it to you under the hacktoberfest label. @SUGAM-ARORA @Ojas-Arora
### Add ScreenShots
N/A
### Recor…
-
### 🐛 Describe the bug
Code to reproduce the problem
```python
import torch
from transformers import AutoModelForAudioClassification, AutoFeatureExtractor
model = AutoModelForAudioClassification.…
```
-
Hello,
I would like to inquire whether the training data for the qwen2-audio-instruction model includes the IEMOCAP dataset for fine-tuning in speech emotion recognition tasks. Any clarification on …
-
# Task Name
Emoji-Grounded Speech Emotion Recognition
## Task Objective
The primary goal of the Emoji-Grounded Speech Emotion Recognition (EG-SER) task is to develop a system that can accurat…
-
Apparently, and [according to its own creators](https://github.com/audeering/w2v2-how-to/issues/31#issuecomment-1662719286), the audEERING model was not the wisest of choices.
To address such shor…
-
This repo provides only 47 short audio files, with valence and arousal annotations in CSV files. Could someone suggest a larger, open-source dataset, preferably in English*, for training the regression m…
-
May I ask whether the cross-domain speech emotion recognition (SER) model will be open-sourced, and if so, when? Thanks!
-
# Speech Emotion Captioning
Speech emotion captioning describes the emotion in speech using natural language.
## Task Objective
Compared with traditional speech emotion recognition (wher…
-