Closed Oscaarjs closed 7 months ago
Hi.
Let us test on the wav file you provided.
This is a new type of error we have never encountered.
It apprears p_value
value in this line is causing this error.
I will follow your settings and check what is causing this error.
@tango4j any updates on this issue?
This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.
This issue was closed because it has been inactive for 7 days since being marked as stale.
Describe the bug
When running diarization on a specific file I'm getting:
IndexError: shape mismatch: indexing tensors could not be broadcast together with shapes [7557001], [7559750]
The same pipeline/config/setup etc has worked for a couple of thousand other files but now on this (and a few more files) I'm getting this issue suddenly.
Steps/Code to reproduce bug
Shared a Google Colab of how one can reproduce the error, the steps are also listed below.
Shared Google Colab notebook ipynb
Steps:
Pre-steps: Upload
issue_file.wav
,config.yaml
andspeech_timestamps.rttm
to runtime (e.g. Colab) Above mentioned files can be obtained:issue_file.wav
: Audio fileconfig.yaml
: diarization configspeech_timestamps.rttm
: speech timestamps rttmLet me know if anything's missing or unclear.
3.
4.
This fails:
Expected behavior
Diarization shouldn't fail on a seemingly non-corrupt audio-file. This config has been tested multiple times before on other files without any issues.
Environment overview (please complete the following information)
Environment details