Closed divyeshrajpura4114 closed 4 years ago
Now Its working Prefectily... I read description given in definition of DetectionPrecision function that I just need to give Speech segments as input, rather than giving both Speech and NonSpeech segments as Input.
Glad your problem is solved.
I have used AVASpeech Dataset and tries to apply energy based voive activity detection. However using pyannote.metrics, it gives me 100% Precison and 100% Recall, which is unexpected. You can compare line no 4 and 12 where both reference and hypothesis has mismatch for nearly 2 seconds. I tried on many files and it gives always 100% result. So can anyone please help me, If I am doing anything worng??
Code :
Below is the output of my program. Here, 0 represents Non-Speech Segmnets and 1 represents Speech Segments. Each audio is of 30s.