Open thomask-gh opened 1 year ago
Have you tried adding `options = whisper.DecodingOptions()`?
Regarding the transcription you ran, I have no suggestion yet.
Hi,
Same problem here and everything is lost after hours of work :-(
Did you find a way?
I made it work by adding the default dict `{"highlight_words": False, "max_line_count": None, "max_line_width": None}` as a third argument to `writer` here: https://github.com/ad-aures/mufidiwiwhi/blob/main/mufidiwiwhi/transcribe.py#L434
I also had to change `utils.py`: add `options: dict` to the signature of `write_result` and `write_all`. And in `write_all`, add `options` as the last argument of `writer`.
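The shape of that change can be sketched as follows. This is only an illustration: `fake_srt_writer` is a hypothetical stand-in for one of whisper's writers (it is not whisper's code), and this `write_all` is a simplified version written for the example, not the actual function in `utils.py`. It shows why an empty dict fails and how forwarding `options` fixes the call.

```python
from typing import Callable, List

# Default values quoted above; the concrete writers expect these keys to exist.
DEFAULT_WRITER_OPTIONS = {
    "highlight_words": False,
    "max_line_count": None,
    "max_line_width": None,
}


def fake_srt_writer(result: dict, audio_path: str, options: dict) -> str:
    """Hypothetical stand-in for a writer that reads keys from `options`.

    Indexing with options[...] raises KeyError when the dict is empty,
    which mirrors the failure described in this thread.
    """
    highlight = options["highlight_words"]
    max_count = options["max_line_count"]
    max_width = options["max_line_width"]
    return (
        f"{audio_path}: {len(result['segments'])} segment(s), "
        f"highlight={highlight}, count={max_count}, width={max_width}"
    )


def write_all(result: dict, audio_path: str,
              writers: List[Callable], options: dict) -> List[str]:
    """Simplified sketch: forward `options` as the last argument of each writer."""
    return [writer(result, audio_path, options) for writer in writers]


if __name__ == "__main__":
    demo_result = {"segments": [{"start": 0.0, "end": 1.0, "text": "hello"}]}
    print(write_all(demo_result, "episode.mp3", [fake_srt_writer],
                    DEFAULT_WRITER_OPTIONS))
```

With the real library, the equivalent step would be passing the same dict as the third argument when calling the writer, as described above.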
I also tried to set `"max_line_count": 2, "max_line_width": 32` to get a format that follows the Podcasting 2.0 specifications (https://github.com/Podcastindex-org/podcast-namespace/blob/main/transcripts/transcripts.md#properties), but it did not change anything.
I should probably clean that code. Will try to do that this week…
Hi, any update on this @benjaminbellamy? :blush:
This project is not dead, yet. 😉 I'm working on it now with the v3 models that were published last month by openai. Stay tuned. 📻️
Great to know, thanks a lot for your work on it, wishing you strength for this and a happy holiday season! :wink: :sparkles:
Hi @benjaminbellamy
Any news on this? :-)
The error goes away for me with a different version of whisper, however the final output seems incomplete.
$ pip install openai-whisper==20230314
Whenever I call mufidiwiwhi, the whole transcription runs and gets printed to the terminal, but at the very end, when it must be written to the files, it crashes with the following error:
Indeed, the writers provided by whisper require an `options` dict that also cannot just be empty (I tried passing an empty dict), because some of the concrete writers expect specific options to be present in it. This requirement has been added in this commit, and the options dict is generated in `transcribe.py` from the parsed command line arguments.
In the meantime, I have saved the transcription output that was printed to the terminal in a separate file, and I'd like to find a way to convert it to SRT without having to re-run the transcription, which took several hours. What suggestions would you have for this? Thanks!
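One possible way to recover an SRT file from the saved terminal output, without re-running the transcription, is to parse the timestamped lines directly. The sketch below assumes each printed segment looks like `[MM:SS.mmm --> MM:SS.mmm] text` (with an optional leading hours field), which is how whisper prints segments in verbose mode. If mufidiwiwhi prints a different layout, the regular expression needs adjusting. The function names and the file names in the usage note are hypothetical.

```python
import re

# One timestamp as printed in the log: optional hours, then MM:SS.mmm
TS = r"((?:\d+:)?\d{2}:\d{2}\.\d{3})"
LINE_RE = re.compile(rf"^\[{TS} --> {TS}\]\s*(.*)$")


def to_srt_timestamp(ts: str) -> str:
    """Convert 'MM:SS.mmm' or 'HH:MM:SS.mmm' to SRT's 'HH:MM:SS,mmm'."""
    parts = ts.split(":")
    if len(parts) == 2:  # no hours field was printed
        parts.insert(0, "00")
    hours, minutes, rest = parts
    seconds, millis = rest.split(".")
    return f"{int(hours):02d}:{minutes}:{seconds},{millis}"


def log_to_srt(log_text: str) -> str:
    """Build SRT text from log lines; lines that do not match are skipped."""
    blocks = []
    for line in log_text.splitlines():
        match = LINE_RE.match(line.strip())
        if not match:
            continue
        start, end, text = match.groups()
        index = len(blocks) + 1  # SRT cues are numbered from 1
        blocks.append(
            f"{index}\n"
            f"{to_srt_timestamp(start)} --> {to_srt_timestamp(end)}\n"
            f"{text.strip()}\n"
        )
    # Cues are separated by a blank line
    return "\n".join(blocks)
```

A usage sketch, with hypothetical file names: read the saved log with `open("transcript.log", encoding="utf-8").read()`, pass it to `log_to_srt`, and write the result to `transcript.srt`. Any speaker labels printed inside the text part of a line are kept as-is in the subtitle text.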