Diarization pipeline fails at end of audio file (RuntimeError: Sizes of tensors must match except in dimension 0.)

ccmilne commented 3 months ago

Ubuntu 22.04.4 LTS - pyannote.audio 3.3.1 - EC2 g5.4xlarge

Receiving this error when running the diarization pipeline on an mp3 file:

RuntimeError: Sizes of tensors must match except in dimension 0. Expected size 160000 but got size 147200 for tensor number 12 in the list.

Code to reproduce:

Full error:

qalabeabbas49 commented 2 months ago

Hi, I am not sure but try converting mp3 to wav and trying again.

ccmilne commented 2 months ago

Hi, I am not sure but try converting mp3 to wav and trying again.

Converting to a WAV file worked. Not sure why, but thanks!

qalabeabbas49 commented 2 months ago

Hi, I am not sure but try converting mp3 to wav and trying again.

Converting to a WAV file worked. Not sure why, but thanks!

It has something to do with torachaudio backend. Sometimes it doesn't work well with mp3 format.

pyannote / pyannote-audio