faster-whisper-GUI
A GUI for faster-whisper and whisperX, built with PySide6
-
Model download
-
Links
-
What's this
- This is a GUI for faster-whisper. With it you can:
- Transcribe audio or video files to srt/txt/smi/vtt/lrc files
- Set all parameters of the VAD model and the whisper model
- Use whisperX, which is now supported
- Use Demucs models
- Use the whisper large-v3 model
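
For readers curious what the transcription step looks like outside the GUI, here is a minimal sketch using the faster-whisper Python package directly. It is not taken from this project's source: the helper names `srt_timestamp`, `to_srt` and `transcribe_to_srt` are hypothetical, and the device and compute type are assumptions you would adjust for your machine.

```python
def srt_timestamp(seconds: float) -> str:
    """Format a time in seconds as an SRT timestamp (HH:MM:SS,mmm)."""
    total_ms = int(round(seconds * 1000))
    hours, rem = divmod(total_ms, 3_600_000)
    minutes, rem = divmod(rem, 60_000)
    secs, ms = divmod(rem, 1000)
    return f"{hours:02d}:{minutes:02d}:{secs:02d},{ms:03d}"


def to_srt(segments) -> str:
    """Build SRT text from an iterable of (start, end, text) tuples."""
    blocks = []
    for index, (start, end, text) in enumerate(segments, start=1):
        blocks.append(
            f"{index}\n{srt_timestamp(start)} --> {srt_timestamp(end)}\n{text.strip()}\n"
        )
    return "\n".join(blocks)


def transcribe_to_srt(media_path: str, model_size: str = "large-v3") -> str:
    """Hypothetical helper: transcribe a file and return SRT text."""
    # Imported lazily so the formatting helpers above work without faster-whisper.
    from faster_whisper import WhisperModel

    # Assumption: CPU with int8 quantization; use device="cuda" on a GPU machine.
    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # vad_filter=True enables the Silero VAD pre-filter built into faster-whisper.
    segments, _info = model.transcribe(media_path, vad_filter=True)
    return to_srt((s.start, s.end, s.text) for s in segments)
```

Calling `transcribe_to_srt("input.mp4")` would return the subtitle text, which you can then write to a `.srt` file; the GUI exposes the same kind of options through its parameter pages.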
-
Best wishes to the world that received this message
Star History
![Star History Chart](https://api.star-history.com/svg?repos=CheshireCC/faster-whisper-GUI&type=Timeline)
-
UI Language
![Screenshot 2024-03-11 183130](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/183130.png)
-
Theme Color
![Screenshot 2024-03-11 184459](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/184459.png)
![image-20240311184818398](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20240311184818398.png)
-
Load Model / Download Model / Convert Model
![image-20231118155123131](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20231118155123131.png)
-
Large-v3 model support
![image-20231118155209847](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20231118155209847.png)
-
Demucs AVE
![DemucsFunction](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/DemucsFunction.png)
-
Batch processing
![image-20231008150849827](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20231008150849827.png)
-
File List
![0.3.0_newFIleSystem](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/0.3.0_newFIleSystem.png)
-
FileFilter
![fileFilter](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/fileFilter.png)
-
WhisperX function
![0.3.0_whisperx](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/0.3.0_whisperx.png)
-
Parameters of the faster-whisper model
![image-20231113020210745](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20231113020210745.png)
-
Silero VAD
![image-20231113020407272](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20231113020407272.png)
-
Settings
![image-20231118155300816](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20231118155300816.png)
-
Show results and edit timestamps
![0.3.0_result](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/0.3.0_result.png)
![image-20231007191942864](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20231007191942864.png)
-
Word-level timestamps: karaoke lyrics (works in VTT/LRC/SMI formats)
- Play LRC-format lyrics in foobar2000 with the ESLyric plugin
![image-20230811130449688](https://github.com/CheshireCC/faster-whisper-GUI/raw/main/README.assets/image-20230811130449688.png)
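
To illustrate how word-level timestamps can become karaoke-style lyrics, the sketch below builds one line in the commonly used "enhanced LRC" layout, where each word is preceded by a `<mm:ss.xx>` tag. This is an assumption about the general format, not a description of the exact output this GUI writes; `lrc_tag` and `karaoke_lrc_line` are hypothetical names.

```python
def lrc_tag(seconds: float) -> str:
    """Format a time in seconds as an LRC time tag body (mm:ss.xx)."""
    total_cs = int(round(seconds * 100))  # LRC uses centisecond precision
    minutes, rem = divmod(total_cs, 6000)
    secs, cs = divmod(rem, 100)
    return f"{minutes:02d}:{secs:02d}.{cs:02d}"


def karaoke_lrc_line(words) -> str:
    """Build one enhanced-LRC line from a list of (start, end, word) tuples."""
    if not words:
        return ""
    # The line tag carries the start time of the first word.
    parts = [f"[{lrc_tag(words[0][0])}]"]
    for start, _end, word in words:
        parts.append(f"<{lrc_tag(start)}> {word.strip()}")
    # A closing tag marks where the last word ends.
    parts.append(f"<{lrc_tag(words[-1][1])}>")
    return " ".join(parts)


if __name__ == "__main__":
    sample = [(1.0, 1.5, " Hello"), (1.5, 2.0, " world")]
    print(karaoke_lrc_line(sample))
```

With faster-whisper, the `(start, end, word)` tuples would come from the `words` attribute of each segment when transcription runs with `word_timestamps=True`.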