WhisperX default is without_timestamps=True; while faster-whisper's default is without_timestamps=False. This affects transcript quality, whisperX output can have long (5-15s) continuous transcript drops. Attached an example of a small audio clip (~10s, so VADs in WhisperX and Faster-Whisper are off) that has worse transcript output when without_timestamps=True. whisperxWithoutTimestepsExample.zip
WhisperX default is
without_timestamps=True
; while faster-whisper's default iswithout_timestamps=False
. This affects transcript quality, whisperX output can have long (5-15s) continuous transcript drops. Attached an example of a small audio clip (~10s, so VADs in WhisperX and Faster-Whisper are off) that has worse transcript output whenwithout_timestamps=True
.whisperxWithoutTimestepsExample.zip