taylorlu Speaker-Diarization issues

taylorlu / Speaker-Diarization

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

Apache License 2.0

470 stars 121 forks source link

issues

Newest

Newest Most commented Recently updated Oldest Least commented Least recently updated

数据集

#64 woshizhishixuebao closed 1 year ago
0
spec_len = sr/hop_length/embedding_per_second

#63 xiangzai0115 opened 2 years ago
0
How to generate a dynamic diagram to show

#62 jiangxinxing opened 2 years ago
0
Hello, can you provide the papers published on this model?

#61 jiangxinxing opened 2 years ago
0
Predicted labels doesn't match with Ground truth labels but the accuracy of test results is 0.8%

#60 SanaullahOfficial opened 2 years ago
0
how to get the label of rmdmy.wav to calculate the DER？

#59 potatojoker opened 2 years ago
0
what about your parameters of embeddings_per_second and overlap_rate consistent with your results in readme？

#58 yangguangxiaoshuai opened 2 years ago
0
Diarization result varries as we run inference multiple time on same audio.

#57 alamnasim opened 3 years ago
0
How can I generate this training set file「./ghostvlad/training_data.npz」

#56 Zomun opened 3 years ago
0
Speaker-Diarization for 2 person conversation

#55 ArvindSharma18 opened 3 years ago
3
Which version of Keras, Tensorflow and Pytorch are compatible?

#54 asr-lord opened 3 years ago
7
How many utterances and iteration you use for pretrained uisrnn model

#53 Curisan opened 3 years ago
0
Is there a way to fine tune the pre-trained model on another language data?

#52 saumyaborwankar opened 3 years ago
0
Where can I find ghostvlad/training_data.npz ?

#51 yuanlorna opened 3 years ago
1
I want only 2 speakers as my output ,as my sample consists of only 2 speakers,what change in code should i do to achieve this

#50 sourav1122 opened 3 years ago
0
Cuda Out Of Memory when invoking train.py

#49 yelou-renni opened 3 years ago
3
What is the exact version of tensorflow and Keras needed if I want to run the code？

#48 vanellope666 opened 3 years ago
1
Innacurate start and till time of slices attained

#47 Gaurav470 opened 3 years ago
2
Can not reprodcut the cluster result

#46 WeiyueSu opened 3 years ago
1
How to save the Plot/Animation/Video with Audio

#45 vinayentc closed 3 years ago
1
How can i Train this model on my own Dataset and what should be the data set structure requirements

#44 nome2050 opened 4 years ago
0
What does the uisrnn pytorch model output exactly and which variable holds that output?

#43 Harry-Garrison opened 4 years ago
0
Question about using dvector created by VGG to train UISRNN

#42 mengjie-du opened 4 years ago
0
AlreadyExistsError: Another metric with the same name already exists.

#41 ishara133 opened 4 years ago
0
Very poor performance on my own wav file, is there anything wrong?

#40 Yunlong-He opened 4 years ago
2
Handling silent speech segments

#39 rahulranjan29 closed 4 years ago
0
Can speaker model be reproduce?

#38 zh794390558 opened 4 years ago
4
max( ) arg is an empty sequence

#37 shiwanglei closed 4 years ago
0
About the training data

#36 soliloquy1983 opened 4 years ago
0
How can I replace cluster id to speaker name

#35 vinayaksable2399 opened 4 years ago
7
The loss of uis-rnn model

#34 Naminwang opened 4 years ago
2
create new model to my new dataset

#33 AbdallahQoutbAli opened 4 years ago
3
Module Error

#32 VinuAbraham opened 4 years ago
1
Could you please also provide the online implementation of UISRNN?

#31 ashutosh14139 opened 4 years ago
3
feats = np.array(feats)[:,0,:] # [splits, embedding dim] IndexError: too many indices for array

#30 vnylp opened 4 years ago
1
creating a new model

#29 praveenssivam opened 4 years ago
1
can you tell me how to create new model to my new dataset which is english language and predict using that new created model.. Could u please tell me the steps to create new mdel

#28 praveenssivam opened 4 years ago
1
About final output

#27 xiaozhi2015 closed 4 years ago
3
How can I resolve 'killed' problem?

#26 nyongja closed 4 years ago
1
How to caculate DER, EER ?

#25 tranquangchung closed 4 years ago
2
Using speaker diarization on mobile devices

#24 MatthewWaller opened 4 years ago
5
In the prediction stage(speakerDiarization),how to determine the num_class in the spkModel.vggvox_resnet2d_icassp

#23 934453938 opened 4 years ago
0
MemoryError

#22 LavenderMP closed 5 years ago
0
Sliding for long audios

#21 ozcelikkale opened 5 years ago
2
Slow performance?

#20 chrisspen opened 5 years ago
6
Added missing dependencies. Updated tensorflow imports.

#19 chrisspen closed 2 years ago
4
How to save plot

#18 LavenderMP opened 5 years ago
1
Training loss converenge

#17 chienducnguyen opened 5 years ago
0
File size exceed Zip

#16 Arroosh opened 5 years ago
3
Unable to view figure

#15 Arroosh closed 5 years ago
0