To help prevent words from getting cut off at the boundaries of the 10-second audio chunks, prepend the final 1 second of each chunk's audio data to the next audio segment.
This should allow Whisper to better pick up words that get cut off.
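A rough sketch of the 1-second carry-over, assuming the chunks arrive as raw 16 kHz, 16-bit mono PCM bytes. The sample format and the name `chunks_with_overlap` are assumptions for illustration, not taken from the original code:

```python
SAMPLE_RATE = 16_000   # assumed: 16 kHz mono
BYTES_PER_SAMPLE = 2   # assumed: 16-bit PCM
OVERLAP_SECONDS = 1    # carry the final 1 second into the next chunk

OVERLAP_BYTES = SAMPLE_RATE * BYTES_PER_SAMPLE * OVERLAP_SECONDS


def chunks_with_overlap(chunks):
    """Yield each chunk prefixed with the last second of the previous chunk."""
    tail = b""  # the first chunk has nothing before it
    for chunk in chunks:
        yield tail + chunk
        # Keep the final second of the original chunk (not of the padded one),
        # so the overlap never grows beyond OVERLAP_SECONDS.
        tail = chunk[-OVERLAP_BYTES:]
```

Each yielded segment after the first is therefore about 11 seconds long, and the first second of it repeats audio that was already transcribed.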
To prevent duplicate words from appearing at the end of one transcription and the start of the next, check the prev_transcript value:
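A minimal sketch of one way to do that check, assuming `prev_transcript` holds the previous chunk's text as a plain string. The helper name `drop_repeated_prefix` and the `max_words` cap are hypothetical, and the matching rule (case-insensitive, ignoring surrounding punctuation) is a design choice made for this example:

```python
import string


def _norm(word):
    """Lowercase a word and strip surrounding punctuation for comparison."""
    return word.strip(string.punctuation).lower()


def drop_repeated_prefix(prev_transcript, transcript, max_words=5):
    """Remove leading words of `transcript` that repeat the end of `prev_transcript`."""
    prev_words = prev_transcript.split()
    words = transcript.split()
    limit = min(max_words, len(prev_words), len(words))
    # Try the longest candidate overlap first, then shrink.
    for n in range(limit, 0, -1):
        tail = [_norm(w) for w in prev_words[-n:]]
        head = [_norm(w) for w in words[:n]]
        if tail == head:
            return " ".join(words[n:])
    return transcript  # no overlap found; leave the text unchanged


print(drop_repeated_prefix("we went to the store", "the store and bought milk"))
# prints "and bought milk"
```

Capping the comparison at a few words keeps the check cheap and limits the risk of removing a phrase that the speaker genuinely repeated.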
Using the formatted output from #2:
Performance impact: Little to none