issues
search
google
/
uis-rnn
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
https://arxiv.org/abs/1810.04719
Apache License 2.0
1.56k
stars
319
forks
source link
issues
Newest
Newest
Most commented
Recently updated
Oldest
Least commented
Least recently updated
Request support of hessian-free optimization in rnn training.
#93
linjing-lab
opened
4 months ago
0
update requirements.txt for run_tests.sh dependencies
#92
siddheshtv
closed
4 months ago
3
assign gpu with arguments
#91
pthavarasa
opened
2 years ago
0
Predicted labels doesn't match with Ground truth labels but the accuracy of test results is 0.8%
#90
SanaullahOfficial
closed
2 years ago
1
[Bug] Making a prediction on CPU after training on GPU
#89
pthavarasa
opened
2 years ago
0
Any documentations on training from scratch using custom data in other languages ?
#88
thangld201
opened
2 years ago
1
rnn initial state trainable
#87
jardnzm
closed
3 years ago
1
Is there a way to fine tune an already existing pre-trained model?
#86
saumyaborwankar
closed
3 years ago
1
[Question] The dimension of toy test data [test_sequence] is (25, 95, 256) what does the first 2 dimension represent? Toy train data [train_sequence] has dimension (4627, 256) which is understandable.
#85
saumyaborwankar
closed
3 years ago
1
[Question] About num_non_zero
#84
shubhunegi30
closed
3 years ago
1
Is is possible to pre-load the model for multiple request?
#83
ghost
closed
3 years ago
1
No module named coverage
#82
yerzhan7orazayev
closed
3 years ago
1
Change input size
#81
surasakBoonkla
closed
3 years ago
1
[Question] Are input d-vectors for training assumed L2-normalized?
#80
vadimkantorov
closed
4 years ago
8
Unable to convert pytorch model to tensorflow in Diarization on mobile device.
#79
alamnasim
closed
4 years ago
2
how to control the number of different speaker when predicting?
#78
NewEricWang
closed
4 years ago
1
Fix typo: recommanded -> recommended
#77
mattmatters
closed
4 years ago
2
uis-rnn gives different result on broken audios and continuous audios
#76
ashu170292
closed
4 years ago
5
Question about custom data generator
#75
YanaHontarenko
opened
4 years ago
0
what is train data format?
#74
jkkj1630
closed
4 years ago
1
[Bug] Predict method does not finish
#73
ArlindKadra
opened
4 years ago
3
about model
#72
zyc1310517843
closed
4 years ago
1
Embedding Extraction Procedure
#71
divyeshrajpura4114
closed
4 years ago
1
add crp_alpha support
#70
aluminumbox
closed
4 years ago
8
I run the demo use toy data, found loss is negative number.
#69
Naminwang
closed
4 years ago
1
Confusion about predicted labels
#68
clabornd
closed
5 years ago
3
[Question]About UIS-RNN d-vector
#67
BarCodeReader
closed
5 years ago
7
Repo
#66
chrisspen
closed
5 years ago
1
Handling unknown speakers?
#65
chrisspen
closed
5 years ago
1
Handling unknown speakers?
#64
chrisspen
closed
5 years ago
3
uis-rnn error is too high.
#63
wychoi44
closed
5 years ago
1
How to create training data?
#62
chrisspen
closed
5 years ago
2
[Question] Online Speaker Decentralize
#61
hungnvk54
closed
5 years ago
0
question about the sigma2 prior loss increasing during the training
#60
maxandchen
closed
5 years ago
2
[Question] How can I get the text of each speaker after speaker diarization?
#59
rowhitswami
closed
5 years ago
3
Loss calculation in prediction
#58
gen35
closed
5 years ago
2
corrected transition_bias estimation, fixes #55
#57
DonkeyShot21
closed
5 years ago
6
[Issue #26] add arg 'num_speaker'
#56
bencq
closed
5 years ago
4
[Bug] Incorrect estimation of transition_bias
#55
DonkeyShot21
closed
5 years ago
13
Is UIS-RNN the technique used in Speaker Diarization in Google Speech-to-text API? [Question]
#54
blazejdolicki
closed
5 years ago
1
Understanding the use of resize_sequence() and the batch creation for RNN training
#53
vickianand
closed
5 years ago
2
[Question] how to relate sigma2_prior_loss() with the paper
#52
jprobichaud
closed
5 years ago
1
Understanding diarization labels
#51
zyc1310517843
closed
5 years ago
6
uis-rnn can't work for long utterances dataset?
#50
wrongbattery
opened
5 years ago
19
The number of speakers and whether the content of the speaker needs to be the same。
#49
zyc1310517843
closed
5 years ago
1
[Question] Which feature was used for VAD?
#48
seungwonpark
closed
5 years ago
1
Any plan on upgrading to pytorch 1.0+?
#47
tanvirfuad
closed
5 years ago
3
How to convert audio data into test data of algorithm for testing
#46
zyc1310517843
closed
5 years ago
1
[Question] Can I use speaker annotated datasets in other language rather than English?
#44
muntasir2000
closed
5 years ago
1
How to define a speaker per segments from overlapped widow of frames? Prediction on realtime data?
#43
alamnasim
closed
5 years ago
5
Next