WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
BSD 2-Clause "Simplified" License
How to align known text with audio and obtain timestamps for the text #839
Open
RichardQin1 opened 1 month ago
The text is already known and corresponds to a segment of the audio.
e.g.:
audio: test.mp3, input: (text, test.mp3), output:
How can I obtain the start and end timestamps of each sentence?
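To make the desired output concrete, here is a minimal sketch of the sentence-level step. It assumes word-level timestamps are already available (for example from a forced aligner) and that the known sentences, split on whitespace, match those words in order. The function name `sentence_timestamps` and the sample data are hypothetical and are not part of the WhisperX API.

```python
from typing import Dict, List


def sentence_timestamps(sentences: List[str], words: List[Dict]) -> List[Dict]:
    """Give each sentence the start of its first word and the end of its last word.

    Assumption: `words` is an ordered list of {"word", "start", "end"} dicts and
    the sentences consume exactly those words, in the same order.
    """
    results = []
    i = 0  # index of the next unused aligned word
    for sentence in sentences:
        n = len(sentence.split())
        if n == 0:
            continue  # skip empty sentences
        chunk = words[i:i + n]
        if len(chunk) < n:
            raise ValueError(f"not enough aligned words for sentence: {sentence!r}")
        results.append(
            {"text": sentence, "start": chunk[0]["start"], "end": chunk[-1]["end"]}
        )
        i += n
    return results


# Hypothetical word-level alignment, made up for illustration only.
words = [
    {"word": "Hello", "start": 0.50, "end": 0.82},
    {"word": "world.", "start": 0.90, "end": 1.30},
    {"word": "How", "start": 1.80, "end": 1.95},
    {"word": "are", "start": 2.00, "end": 2.10},
    {"word": "you?", "start": 2.15, "end": 2.40},
]
sentences = ["Hello world.", "How are you?"]

result = sentence_timestamps(sentences, words)
for row in result:
    print(row)
```

The sketch only covers grouping word timings into sentences. Producing the word timings from (text, test.mp3) is the alignment step this question is asking about.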