Closed VivianZ123 closed 11 months ago
output: 0.000 13.000 SPEAKER_01 xxx 13.000 17.000 SPEAKER_00 xxx xxxx 17.000 19.000 SPEAKER_01 xxxxxx
when it instead should be:
11.954 13.183 SPEAKER_01 xxx 13.677 16.681 SPEAKER_00 xxx xxxx 17.568 18.763 SPEAKER_01 xxxxxx
This Stabilizing Timestamps for Whisper works: https://github.com/jianfch/stable-ts
output: 0.000 13.000 SPEAKER_01 xxx 13.000 17.000 SPEAKER_00 xxx xxxx 17.000 19.000 SPEAKER_01 xxxxxx
when it instead should be:
11.954 13.183 SPEAKER_01 xxx 13.677 16.681 SPEAKER_00 xxx xxxx 17.568 18.763 SPEAKER_01 xxxxxx