Closed aedocw closed 9 months ago
This uses whisper to make a transcript of each audio output chunk, and does a fuzzy comparison of the transcript to the original text. If it's below a threshold, it will try to encode again.
This uses whisper to make a transcript of each audio output chunk, and does a fuzzy comparison of the transcript to the original text. If it's below a threshold, it will try to encode again.