-
Custom charts are great and it's hard to find metrics in the tables but somehow they show results only for 3 datasets and lack the most important ones, the ones without augmentations.
https://wandb…
-
It is a good job. I have a question about the Evaluation datasets objaverse-lvis. I have download the test_datasets.zip. However, it seems that its dim is 1280, not 1024. So I can not use it for zero-…
-
Scores updated:
-------
Acc_ground_truth: 93.85%
Acc_resync_audio: 16.10%
Cos_similarity: 36.48%
ACC: 16.10%
---------
Log results
--------------------------------------------------
F…
-
## Description
We need to choose a small number (1-3, depending on size) of open source RAG evaluation datasets. Having at least 1 open source dataset allows us to begin running basic evaluations tha…
-
Here is the result for [SpeechTokenizer](https://github.com/ZhangXInFD/SpeechTokenizer).
The bit rate is 2kbps, following are the results:
**Results in exps/results.txt**
Codec SUPERB applica…
-
# 16 kHz 2kbps
## parameter size:
encoder (including quantizer) : 29MB decoder: 40MB
### exps/results.txt
Codec SUPERB application evaluation
Stage 1: Run speech emotion recognition.
Acc: 74.…
-
Hi, thanks for sharing your work, it's very impressive. We'd like to evaluate the M2UGen model and other compared models with your evalution datasets. In `Evaluation/Image2Music/evaluate.py`, we notic…
-
Bit rate=8k
Downstream tasks (only 16khz model used)
```
Stage 1: Run speech emotion recognition.
Acc: 75.21%
Stage 2: Run speaker related evaluation.
Parsing the resyn_trial.txt for resyn w…
Slyne updated
2 weeks ago
-
for the 16kHz Codec model: the bitrate is 2kbps;
for the 44.1kHz Codec model: the bitrate is 6.89kbps;
for the 48kHz Codec model: the bitrate is 7.5kbps;
#1、Here is the exps/results.txt
Codec SU…
-
I followed the guide in ReadMe and compile the STFT using a RTX4090. It successfully compiled but, when I run the finetuning, it outputs the following error:
Traceback (most recent call last):
F…