-
I am trying to finetune models to support one more speaker, but it looks I am doing something wrong.
I want to use "dia_hard" pipeline, so I need to finetune models: {sad_dihard, scd_dihard, emb_vo…
-
Hello SpeechBrain creators,
Thanks for putting together a comprehensive speech library. It's been very fun to explore!
I am doing a project where I would like to train a SepFormer separation mo…
-
Hi, I was trying to train the model and it crashed at stage 6
```
Namespace(baseLR=0.001, batchSize=32, featDim=30, featDir='exp/xvector_nnet_1a/egs/', local_rank=0, logStepSize=200, maxLR=0.002, mo…
-
Hi, I would like to use your rawNet code for training and am testing it out on the voxCeleb dataset first. However, for VoxCeleb1 I do not see the veri_test.txt file. For VoxCeleb2, I only see list_te…
-
The code down below **does not compute** the max sequence length. Please check the `length` formula.
https://github.com/tyiannak/deep_audio_features/blob/71e0ba5cfc5fd43613886be807fff8e4205b51f4/util…
-
Which datasets are used for training and testing?
-
We're one year after the initial publication of this project. I've been busy with both exams and work since, and it's only last week that I passed my last exam. During that year, I have received SO ma…
-
Hi. Thanks for the good resource!!
In your paper, the performance of the metric learning-based model is said to be affected by the batch size. I have two questions.
1. I'm wondering if max_frame…
-
Hi,
Looks like current pre-push is also testing the .py code that is copied in the results directory whereas pre-commit is not.
So its creating errors while pushing.
`eg: ./recipes/VoxCeleb/Xvector…
-
Hi there, thanks for Kaldi :)
I want to perform speaker diarization on a set of audio recordings. I believe Kaldi recently added the speaker diarization feature. I have managed to find [this link](…