-
Hello,
Thanks for your work! My question is: when the representation has size (T, F) and its corresponding labels have size (L,), how do you ensure that the first dimension is consistent be…
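One possible way to handle this, sketched under assumptions: if T and L differ by only a few frames (for example because of padding or rounding at the end of an utterance), both arrays can be trimmed to the shorter length. The function name `align_features_and_labels` and the `max_mismatch` threshold are hypothetical and not taken from this repository.

```python
import numpy as np


def align_features_and_labels(features, labels, max_mismatch=5):
    """Trim features (T, F) and labels (L,) to a common first dimension.

    Assumption: a small difference between T and L is an edge effect,
    so dropping the trailing frames or labels is acceptable. A large
    difference likely means the frame rates differ, so we raise instead.
    """
    num_frames = features.shape[0]
    num_labels = labels.shape[0]
    if abs(num_frames - num_labels) > max_mismatch:
        raise ValueError(
            f"Length mismatch too large: T={num_frames}, L={num_labels}"
        )
    common = min(num_frames, num_labels)
    return features[:common], labels[:common]


# Example: 100 feature frames but only 98 labels.
features = np.zeros((100, 768), dtype=np.float32)
labels = np.arange(98)
aligned_features, aligned_labels = align_features_and_labels(features, labels)
```

If the two lengths differ by a constant factor rather than a few frames, trimming is not appropriate and the labels would need to be resampled to the feature frame rate instead.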
-
Hello,
The provided vocoder checkpoint using mHubert does not support multiple speakers. Do you have a multi-speaker checkpoint?
`mhubert_vp_en_es_fr_it3_400k_layer11_km1000_lj`
-
Hi! I tried your pretrained checkpoint in Colab and got some extra values in the spectrogram in the first case and broken harmonics in the second case.
The first audio is 44100 Hz real speech (converted t…
-
**Description:-**
Incorrect LaTeX representation in Assessment.
**Steps to reproduce the issue:-**
Speech-signal-processing lab -> List of experiments -> Formant Synthesis experiment -> Ass…
-
A stockpile of any games that have an interesting or unique representation of dialogue. Include a screenshot of the dialogue taking place and a link to the game on Steam (or similar). Check out other ga…
-
# Task Name
Emoji-Grounded Speech Emotion Recognition
## Task Objective
The primary goal of the Emoji-Grounded Speech Emotion Recognition (EG-SER) task is to develop a system that can accurat…
-
Hi everyone!
I have just published this project on GitHub: https://github.com/davidmartinrius/speech-dataset-generator/
Now you can create datasets automatically with any audio or lists of audi…
-
## 🚀 Feature Request
The XLS-R [1] paper demonstrates the performance of the model on the LID task on the VoxLingua107 dataset. I am running some model comparisons for the LID task and would appreciate …
-
Dataloader name: `vihos/vihos.py`
DataCatalogue: http://seacrowd.github.io/seacrowd-catalogue/card.html?vihos
| Dataset | vihos |
|-------------|---|
| Description | This dataset consists of human…