abus-aikorea / voice-pro

Comprehensive Gradio WebUI for audio processing, powered by Whisper engines (Whisper, Faster-Whisper, Whisper-Timestamped). Features Voice Changer, zero-shot Voice Cloning, YouTube downloading, vocal isolation, Text-to-Speech (Edge-TTS, F5-TTS), and multi-language translation. Perfect for content creators and developers.
MIT License
968 stars 98 forks
faster-whisper gradio podcast speech-recognition speech-synthesis speech-to-text stt subtitles text-to-speech transcription translate translator tts voice-cloning voice-conversion webui whisper yt-dlp

Voice-Pro: The best Gradio WebUI for transcription, translation and text-to-speech 🔊



Voice-Pro is a Gradio WebUI for transcription, translation, and text-to-speech. It installs with one click: the installer creates a Miniconda virtual environment that runs completely separate from the Windows system, so the installation is fully portable. Voice-Pro supports real-time transcription and translation as well as batch mode.

🚄 Run screen

⭐ Key Features

💻 Execution environment

📀 Installation

Voice-Pro can be installed with one click. Just run 🚀 configure.bat and then 🚀 start.bat.

step 1. Package preparation

step 2. Install and run the program

  1. 🚀 Run configure.bat
    • Installs git, ffmpeg, and CUDA (if you use an NVIDIA GPU) on Windows.
    • You only need to run it once, before the first start.
    • An internet connection is required, and installation may take over an hour depending on the system.
    • Do not close the Command Prompt window during installation.
  2. 🚀 Run start.bat
    • Starts Voice-Pro. The Web UI opens automatically.
    • On the first run, Voice-Pro is installed before it starts.
    • An internet connection is required, and installation may take over an hour depending on the system.
    • Do not close the Command Prompt window during installation.
    • If a problem occurs during installation, delete the installer_files folder and run start.bat again.
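
The recovery step in the last bullet (delete installer_files, then run start.bat again) can be sketched in Python. This is a minimal sketch, not part of Voice-Pro: the function names `reset_install` and `rerun_start` are hypothetical, and it assumes the installer_files folder sits directly inside the Voice-Pro folder next to start.bat.

```python
import shutil
import subprocess
from pathlib import Path


def reset_install(voice_pro_dir: Path) -> bool:
    """Delete the installer_files folder so the next start.bat run reinstalls.

    Returns True if a folder was removed, False if there was nothing to delete.
    Assumption: installer_files lives directly inside the Voice-Pro folder.
    """
    installer_dir = voice_pro_dir / "installer_files"
    if not installer_dir.is_dir():
        return False
    shutil.rmtree(installer_dir)
    return True


def rerun_start(voice_pro_dir: Path) -> int:
    """Run start.bat from the Voice-Pro folder and return its exit code (Windows only)."""
    return subprocess.run(["cmd", "/c", "start.bat"], cwd=voice_pro_dir).returncode
```

Calling `reset_install(Path(r"C:\Voice-Pro"))` followed by `rerun_start(Path(r"C:\Voice-Pro"))` would mirror the manual steps; the path here is only an example.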

step 3. Uninstall program

❓Tips & Tricks

If Browser does not run automatically

If a CUDA Out-Of-Memory error occurs

How to improve the quality of subtitles?

📢 Caution

Windows Defender SmartScreen may warn that Voice-Pro is an untrusted application and prevent it from running. If the SmartScreen security level is set to "Warn", click "More info" and then "Run anyway". If it is set to "Block", there is no button to continue. In that case, open the properties of the start.bat file, check "Unblock", apply the change, and run start.bat again.
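
For background, the "Unblock" checkbox clears the download marker that Windows stores in an NTFS alternate data stream named `Zone.Identifier`. The sketch below removes that stream from Python. It is an illustration under that assumption, not something Voice-Pro ships, and the function name `unblock` is hypothetical.

```python
import os
from pathlib import Path


def unblock(path: Path) -> bool:
    """Remove the Zone.Identifier marker attached to `path`, if present.

    Returns True if a marker was removed, False if the file was not marked.
    The file itself is left untouched.
    """
    marker = f"{path}:Zone.Identifier"  # NTFS alternate data stream name
    try:
        os.remove(marker)
    except FileNotFoundError:
        return False
    return True
```

Running `unblock(Path("start.bat"))` from the Voice-Pro folder should have the same effect as ticking "Unblock" in the file properties dialog.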

When Windows Defender mistakenly flags a batch file as a Trojan, this is called a false positive. To work around it, try the following:

  1. Add a file exclusion: Windows Defender can be told to skip specific files or folders during scanning. To do this:
    • Click the 'Start' button and go to 'Settings'.
    • Click 'Update & Security'.
    • Select 'Windows Security' and go to 'Virus & threat protection'.
    • Under 'Virus & threat protection settings', click 'Manage settings'.
    • Under 'Exclusions', click 'Add or remove exclusions', then 'Add an exclusion'.
    • Select 'File' or 'Folder', find the batch file in question, and add it as an exclusion.
  2. Temporarily disable Windows Defender: This is only a stopgap. Use it with care, because it exposes your computer to other threats while protection is off.
  3. Report the false positive to Microsoft: If you are sure that the file is not a Trojan, you can submit it to Microsoft as a false positive. Microsoft will review it and take any necessary action.
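
The file exception in step 1 can also be added from a script instead of the Settings app. The sketch below builds a call to the Windows Defender PowerShell cmdlet `Add-MpPreference` with its `-ExclusionPath` parameter. The helper names `exclusion_command` and `add_exclusion` are hypothetical, and the command only succeeds from an elevated (administrator) process.

```python
import subprocess
from pathlib import Path


def exclusion_command(path: Path) -> list[str]:
    """Build a PowerShell command line that adds `path` to Defender's exclusions."""
    # PowerShell escapes a single quote inside a single-quoted string by doubling it.
    quoted = str(path).replace("'", "''")
    return [
        "powershell",
        "-NoProfile",
        "-Command",
        f"Add-MpPreference -ExclusionPath '{quoted}'",
    ]


def add_exclusion(path: Path) -> None:
    """Run the command; requires an elevated (administrator) process on Windows."""
    subprocess.run(exclusion_command(path), check=True)
```

Keeping the command builder separate from the call that runs it makes it easy to print and review the exact command before executing it with administrator rights.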

📬 Contact us

👍 YouTube

🙏 Credits

©️ Copyright

by ABUS