issues
search
taylorlu
/
Speaker-Diarization
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
Apache License 2.0
470
stars
121
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
数据集
#64
woshizhishixuebao
closed
1 year ago
0
spec_len = sr/hop_length/embedding_per_second
#63
xiangzai0115
opened
2 years ago
0
How to generate a dynamic diagram to show
#62
jiangxinxing
opened
2 years ago
0
Hello, can you provide the papers published on this model?
#61
jiangxinxing
opened
2 years ago
0
Predicted labels doesn't match with Ground truth labels but the accuracy of test results is 0.8%
#60
SanaullahOfficial
opened
2 years ago
0
how to get the label of rmdmy.wav to calculate the DER?
#59
potatojoker
opened
2 years ago
0
what about your parameters of embeddings_per_second and overlap_rate consistent with your results in readme?
#58
yangguangxiaoshuai
opened
2 years ago
0
Diarization result varries as we run inference multiple time on same audio.
#57
alamnasim
opened
3 years ago
0
How can I generate this training set file「./ghostvlad/training_data.npz」
#56
Zomun
opened
3 years ago
0
Speaker-Diarization for 2 person conversation
#55
ArvindSharma18
opened
3 years ago
3
Which version of Keras, Tensorflow and Pytorch are compatible?
#54
asr-lord
opened
3 years ago
7
How many utterances and iteration you use for pretrained uisrnn model
#53
Curisan
opened
3 years ago
0
Is there a way to fine tune the pre-trained model on another language data?
#52
saumyaborwankar
opened
3 years ago
0
Where can I find ghostvlad/training_data.npz ?
#51
yuanlorna
opened
3 years ago
1
I want only 2 speakers as my output ,as my sample consists of only 2 speakers,what change in code should i do to achieve this
#50
sourav1122
opened
3 years ago
0
Cuda Out Of Memory when invoking train.py
#49
yelou-renni
opened
3 years ago
3
What is the exact version of tensorflow and Keras needed if I want to run the code?
#48
vanellope666
opened
3 years ago
1
Innacurate start and till time of slices attained
#47
Gaurav470
opened
3 years ago
2
Can not reprodcut the cluster result
#46
WeiyueSu
opened
3 years ago
1
How to save the Plot/Animation/Video with Audio
#45
vinayentc
closed
3 years ago
1
How can i Train this model on my own Dataset and what should be the data set structure requirements
#44
nome2050
opened
4 years ago
0
What does the uisrnn pytorch model output exactly and which variable holds that output?
#43
Harry-Garrison
opened
4 years ago
0
Question about using dvector created by VGG to train UISRNN
#42
mengjie-du
opened
4 years ago
0
AlreadyExistsError: Another metric with the same name already exists.
#41
ishara133
opened
4 years ago
0
Very poor performance on my own wav file, is there anything wrong?
#40
Yunlong-He
opened
4 years ago
2
Handling silent speech segments
#39
rahulranjan29
closed
4 years ago
0
Can speaker model be reproduce?
#38
zh794390558
opened
4 years ago
4
max( ) arg is an empty sequence
#37
shiwanglei
closed
4 years ago
0
About the training data
#36
soliloquy1983
opened
4 years ago
0
How can I replace cluster id to speaker name
#35
vinayaksable2399
opened
4 years ago
7
The loss of uis-rnn model
#34
Naminwang
opened
4 years ago
2
create new model to my new dataset
#33
AbdallahQoutbAli
opened
4 years ago
3
Module Error
#32
VinuAbraham
opened
4 years ago
1
Could you please also provide the online implementation of UISRNN?
#31
ashutosh14139
opened
4 years ago
3
feats = np.array(feats)[:,0,:] # [splits, embedding dim] IndexError: too many indices for array
#30
vnylp
opened
4 years ago
1
creating a new model
#29
praveenssivam
opened
4 years ago
1
can you tell me how to create new model to my new dataset which is english language and predict using that new created model.. Could u please tell me the steps to create new mdel
#28
praveenssivam
opened
4 years ago
1
About final output
#27
xiaozhi2015
closed
4 years ago
3
How can I resolve 'killed' problem?
#26
nyongja
closed
4 years ago
1
How to caculate DER, EER ?
#25
tranquangchung
closed
4 years ago
2
Using speaker diarization on mobile devices
#24
MatthewWaller
opened
4 years ago
5
In the prediction stage(speakerDiarization),how to determine the num_class in the spkModel.vggvox_resnet2d_icassp
#23
934453938
opened
4 years ago
0
MemoryError
#22
LavenderMP
closed
5 years ago
0
Sliding for long audios
#21
ozcelikkale
opened
5 years ago
2
Slow performance?
#20
chrisspen
opened
5 years ago
6
Added missing dependencies. Updated tensorflow imports.
#19
chrisspen
closed
2 years ago
4
How to save plot
#18
LavenderMP
opened
5 years ago
1
Training loss converenge
#17
chienducnguyen
opened
5 years ago
0
File size exceed Zip
#16
Arroosh
opened
5 years ago
3
Unable to view figure
#15
Arroosh
closed
5 years ago
0
Next