MahmoudAshraf97 / whisper-diarization

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
BSD 2-Clause "Simplified" License
3.28k stars, 272 forks

Kernel size error #29

Closed by RasmusBacklund 1 year ago

RasmusBacklund commented 1 year ago

Hi,

When I run the whisperx.align part, certain sound files give me this error:

RuntimeError: Calculated padded input size per channel: (1). Kernel size: (2). Kernel size can't be greater than actual input size.

Any idea on what causes this error and how to fix it?

MahmoudAshraf97 commented 1 year ago

Hello, can you give me more details? Also, posting your issue in the whisperX repo will get you better help.
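
For anyone hitting the same message: PyTorch raises it when a convolution layer receives an input shorter than its kernel. One plausible cause, which this thread does not confirm, is an audio segment too short to pass through the alignment model's convolutional feature encoder. The sketch below is pure Python and does not call whisperx or PyTorch. It assumes a wav2vec2-style encoder with the commonly used (kernel, stride) pairs and no padding. `CONV_LAYERS`, `first_failing_layer`, and `min_valid_samples` are hypothetical names introduced for illustration.

```python
# Assumption: a wav2vec2-style feature encoder, listed as (kernel_size, stride)
# for each Conv1d layer with no padding. These values are the common base
# configuration and are NOT taken from the issue; adjust them for your model.
CONV_LAYERS = [(10, 5), (3, 2), (3, 2), (3, 2), (3, 2), (2, 2), (2, 2)]


def first_failing_layer(num_samples, layers=CONV_LAYERS):
    """Return (layer_index, input_length, kernel_size) for the first layer
    whose input is shorter than its kernel, or None if every layer can run."""
    length = num_samples
    for index, (kernel, stride) in enumerate(layers):
        if length < kernel:
            # This is the condition behind "Kernel size can't be greater
            # than actual input size".
            return index, length, kernel
        # Output length of an unpadded 1-D convolution.
        length = (length - kernel) // stride + 1
    return None


def min_valid_samples(layers=CONV_LAYERS):
    """Smallest sample count that survives every layer with output length >= 1."""
    needed = 1
    for kernel, stride in reversed(layers):
        # Invert the output-length formula, working from the last layer back.
        needed = (needed - 1) * stride + kernel
    return needed


if __name__ == "__main__":
    print(min_valid_samples())        # 400
    print(first_failing_layer(399))   # (6, 1, 2)
    print(first_failing_layer(400))   # None
```

Under these assumptions, a 399-sample segment reaches the last layer with length 1 against a kernel of 2, which matches the numbers in the reported error. At a 16 kHz sample rate, 400 samples is 25 ms, so a possible workaround is to skip or pad segments shorter than that before alignment. Whether this applies to the files in question depends on the actual alignment model and segment lengths.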