I am attempting to run whisperx with word-level timestamps, but despite passing the relevant option, the output is of the form {'segments': [ ... ], 'language': 'en'} with no word_segments.
I dug around a bit but could not find out why this is happening. I have confirmed that model.options.word_timestamps is True, so I believe it is an internal issue with model.transcribe; perhaps the options are not being passed through to faster-whisper properly.
My code:
It should be noted that running via command line works fine:
This properly includes word_segments in the JSON output.
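To make the difference between the two outputs concrete, a small check along these lines could be run on each result. This is only a sketch working on plain dicts: the helper name summarize_word_level_output is hypothetical, the sample dict is made up to match the shape described above, and the per-segment "words" key is an assumption about where word timings might otherwise appear.

```python
def summarize_word_level_output(result: dict) -> dict:
    """Report which word-level fields are present in a transcription result dict."""
    segments = result.get("segments", [])
    return {
        "top_level_keys": sorted(result.keys()),
        "has_word_segments": "word_segments" in result,
        # Assumption: word timings, if present per segment, live under a "words" key.
        "segments_with_words": sum(1 for seg in segments if seg.get("words")),
        "segment_count": len(segments),
    }


# Hypothetical result shaped like the Python output described above.
python_result = {
    "segments": [{"start": 0.0, "end": 1.2, "text": "hello world"}],
    "language": "en",
}

summary = summarize_word_level_output(python_result)
print(summary)
```

Running the same check on the JSON written by the command line (loaded with json.load) should show whether word_segments is the only field that differs.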