While looking into this issue, I noticed that faster_whisper and openai_whisper now respond in different formats when output=json.
It looks like ASR_ENGINE=openai_whisper returns a structure that looks like this (notice the named fields in the segments array):
On the other hand, ASR_ENGINE=faster_whisper instead returns a structure that looks like this (notice the lack of named fields in segments).
I think it goes without saying, but the format used by openai_whisper offers a significantly improved DX. I'm not sure what is involved in getting faster_whisper to respond with similarly formatted JSON, but that strikes me as the path forward.
While looking into this issue, I noticed that
faster_whisper
andopenai_whisper
now respond in different formats whenoutput=json
.It looks like
ASR_ENGINE=openai_whisper
returns a structure that looks like this (notice the named fields in thesegments
array):On the other hand,
ASR_ENGINE=faster_whisper
instead returns a structure that looks like this (notice the lack of named fields insegments
).I think it goes without saying, but the format used by
openai_whisper
offers a significantly improved DX. I'm not sure what is involved in gettingfaster_whisper
to respond with similarly formatted JSON, but that strikes me as the path forward.