Pikurrot / whisper-gui

A simple GUI to use Whisper.
MIT License

Whisper-GUI

A simple GUI for Whisper, built with Gradio.

[Image: whisper-gui]

Requirements

Optionally, to use an NVIDIA GPU on Windows:

Note: For AMD GPUs (ROCm), GPU support for Whisper is only available on Linux.

Set up

Otherwise, manual steps are:

Run the GUI

To start the program on later runs, run the same whisper-gui.bat or whisper-gui.sh script (depending on your OS). The script also checks automatically for updates to this repository.
By default, your transcriptions are saved in the outputs folder of the repository.

Otherwise, to run manually:
conda activate whisperx
python main.py --autolaunch
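
The two manual commands above can also be combined into a single step. The following is a minimal sketch, not part of this repository: it assumes conda is on your PATH, that your conda version supports `conda run` with `--no-capture-output`, and that it is called with the repository root as `repo_dir`. The function names `build_launch_command` and `launch` are hypothetical.

```python
import shutil
import subprocess
from pathlib import Path

ENV_NAME = "whisperx"  # conda environment name taken from the manual commands


def build_launch_command(conda_exe: str = "conda",
                         env_name: str = ENV_NAME,
                         autolaunch: bool = True) -> list[str]:
    """Build the argument list that runs main.py inside the conda environment."""
    cmd = [conda_exe, "run", "-n", env_name, "--no-capture-output",
           "python", "main.py"]
    if autolaunch:
        cmd.append("--autolaunch")
    return cmd


def launch(repo_dir: str = ".") -> int:
    """Start the GUI from repo_dir and return the exit code of the process."""
    repo = Path(repo_dir)
    if not (repo / "main.py").is_file():
        raise FileNotFoundError(f"main.py not found in {repo.resolve()}")
    conda_exe = shutil.which("conda")  # resolves conda.exe / conda.bat on Windows
    if conda_exe is None:
        raise RuntimeError("conda was not found on PATH")
    return subprocess.run(build_launch_command(conda_exe), cwd=repo).returncode
```

Calling `launch()` from the repository root is intended to have the same effect as the two commands above. Unlike whisper-gui.bat and whisper-gui.sh, this sketch does not check for updates.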

Docker container (CPU only for now)

To run this software in a Docker container, visit this Docker Hub project.
Thank you 3x3cut0r!

Licensing

This project is primarily distributed under the terms of the MIT License. See the LICENSE file for details.

Third-Party Code
Portions of this project incorporate code from WhisperX, which is licensed under the BSD-4-Clause license. This code is used in accordance with its license, and the full text of the license can be found within the relevant source files.