-
What is the format for inputting data. I mean what data tree structure should we use, if we use it for any other speech problem ? Say a simple binary classification problem
-
Are there any plans to support torchaudio, such as StreamReader and StreamWriter classes
-
-
Please if possible get in touch sir I'm working as a research scholar at University of Hyderabad.
-
# Speech Emotion Captioning
Speech emotion captioning is to describe the emotion in speech using natural language.
## Task Objective
Compared with traditional speech emotion recognition(wher…
-
Hi, I found that codec_superb_data contains many datasets and does not give the code for data preprocessing, does it mean that I need to resynthesize each dataset separately by myself according to the…
-
### Feature Name
Llava-next -34B
### Feature Description
Research about Llava-next -34B
### Research Findings
### LLaVA-NeXT-34B
**LLaVA-NeXT-34B** is a model in the LLaVA-NeXT series, which e…
-
I have two main questions.
My first question is which task number in this link https://instructions.apps.allenai.org/ is similar to a multi-class classification?
My second question is, if we want to…
-
@Aryan-Chharia Hello, I would like to propose a new project for this repository: Real-Time Sign Language Detection Using OpenCV, Deep Learning and MediaPipe. This system will recognize sign language g…
-
### System Info
## setup with crash
- `transformers` version: 4.41.2
- Platform: macOS-14.5-arm64-arm-64bit
- Python version: 3.12.3
- Huggingface_hub version: 0.23.3
- Safetensors version: 0.4.…