Highlight and Max line width

EtienneAb3d / WhisperTimeSync

Synchronize Whisper's timestamps over an existing accurate transcription

Hi @Linch1, I may add some parameters for this. In the meantime, you can change this file: https://github.com/EtienneAb3d/WhisperHallu/blob/main/transcribeHallu.py

Line 431: adjust the options to your needs

                if(transcribe_options["word_timestamps"]):
                    srtOpts = { "max_line_width" : 30, "max_line_count" : 1, "highlight_words" : transcribe_options["word_timestamps"]}

Line 441: remove this filtering

                if(transcribe_options["word_timestamps"]):
                    result["text"] = re.sub("(\n[^<\n]*<u>|</u>[^<\n]*\n)"#Remove lines without highlighted words
                                            ,"\n",re.sub(r"\n[^<\n]*\n\n","\n\n"#Keep only highlighted words
                                                         ,result["text"]))
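To see what removing this filtering changes, here is a minimal sketch that applies the same two substitutions to a small hand-written SRT string. The sample text and the helper name `keep_only_highlighted` are assumptions for illustration only; the real content of `result["text"]` in transcribeHallu.py may differ.

```python
import re

# Hypothetical SRT text with <u>...</u> word highlighting (not real Whisper output).
sample = (
    "1\n00:00:00,000 --> 00:00:00,500\n<u>Hello</u> world\n\n"
    "2\n00:00:00,500 --> 00:00:01,000\nHello world\n\n"
    "3\n00:00:01,000 --> 00:00:01,500\nHello <u>world</u>\n\n"
)

def keep_only_highlighted(text):
    # Inner pass: empty the text line of any cue that contains no tag at all.
    text = re.sub(r"\n[^<\n]*\n\n", "\n\n", text)
    # Outer pass: drop what precedes <u> and what follows </u> on a line,
    # which also removes the tags themselves.
    return re.sub("(\n[^<\n]*<u>|</u>[^<\n]*\n)", "\n", text)

filtered = keep_only_highlighted(sample)
print(filtered)
```

On this sample, each cue is reduced to its single highlighted word and the cue without any highlight loses its text. With the filtering removed, the full line is kept along with its `<u>` markup.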

PS: this only works with standard Whisper.

Highlight and Max line width #13