Closed saurabhvyas closed 5 years ago
I've done something to get sentence-level timestamps. It's quite straight-forward:
gentle
and get all word-level timestampsOne heurisic is to match only the first and last word, and disregard everything in between.
Thanks for your comment, I guess I''l close this now, because I am no longer using this, maybe its helpful to others.
Thanks for creating gentle, in the output json, I can see starting and ending times of words, and phonemes, however, I want to ask, if it's possible, to get starting and ending times of larger groups of text, like phrases or sentences, something, which is very useful for creating dataset for ASR system, from youtube subtitles and audio.