I use the same file to transcript in faster_whisper, it seems ok
from faster_whisper import WhisperModel
model_size = "large-v3"
model = WhisperModel(model_size, device="cuda")
segments, info = model.transcribe("test_02.mp3", beam_size=5)
print("Detected language '%s' with probability %f" % (info.language, info.language_probability))
for segment in segments:
print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
After a certain segment, all subsequent recognized texts are incorrect:
I use the same file to transcript in faster_whisper, it seems ok
Can anybody help me ?