Closed: caiwuu closed this 4 weeks ago
When VAD detects multiple segments of speech in an audio clip, the timestamps from the second segment onward are incorrect, as shown in the images below.

This is incorrect: [image]

This is correct: [image]
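The issue does not state the root cause. One common way to get this symptom is for the recognizer to report timestamps relative to the start of each VAD segment, without the segment's own start time being added back. The sketch below illustrates that correction under this assumption. `Word`, `to_absolute`, and the `(segment_start_ms, words)` layout are hypothetical names made up for illustration, not this project's API.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Word:
    # Hypothetical container for one recognized token and its timing in milliseconds.
    text: str
    start_ms: int
    end_ms: int


def to_absolute(segments: List[Tuple[int, List[Word]]]) -> List[Word]:
    """Shift segment-relative word timestamps onto the clip's timeline.

    Assumes each entry is (segment_start_ms, words), where the word times
    are measured from the start of that VAD segment.
    """
    absolute = []
    for segment_start_ms, words in segments:
        for w in words:
            absolute.append(
                Word(w.text, w.start_ms + segment_start_ms, w.end_ms + segment_start_ms)
            )
    return absolute


# Two VAD segments: the first starts at 0 ms, the second at 5000 ms.
segments = [
    (0, [Word("hello", 100, 400)]),
    (5000, [Word("world", 200, 600)]),
]
result = to_absolute(segments)
```

With a first segment that starts at 0 ms, the missing offset would be invisible there and only show up from the second segment onward, which matches the behavior described above.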