ArkanDash / Multi-Model-RVC-Inference

RVC Inference with multiple model and huggingface support
MIT License
# Multi-Model RVC Inference

### Simplified RVC Inference for HuggingFace or Google Colab

[![License](https://img.shields.io/github/license/arkandash/Multi-Model-RVC-Inference?style=for-the-badge)](https://github.com/ArkanDash/Multi-Model-RVC-Inference/blob/master/LICENSE)
[![Repository](https://img.shields.io/badge/Github-Multi%20Model%20RVC%20Inference-blue?style=for-the-badge&logo=github)](https://github.com/ArkanDash/Multi-Model-RVC-Inference)

## Information

Please support the original RVC project; without it, this inference tool would not have been possible.
Original RVC Repository

## Features

## Automatic Installation

Install ffmpeg before running these commands.

## Manual Installation

  1. Install PyTorch

    • CPU only (any OS)
      pip install torch torchvision torchaudio
    • NVIDIA GPU (CUDA)
      # Windows (flash-attention v2 is not supported on Windows; see https://github.com/Dao-AILab/flash-attention/issues/345#issuecomment-1747473481)
      pip install torch==2.1.0 torchvision==0.16.0 torchaudio==2.1.0 --index-url https://download.pytorch.org/whl/cu121
      # Other platforms (Linux, etc.)
      pip install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cu121
  2. Install ffmpeg

  3. Install Dependencies

    pip install -r requirements.txt
  4. Download the pretrained models

    
    # Hubert Model
    https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/hubert_base.pt
    # Save it to /assets/hubert/hubert_base.pt

    # RMVPE (pitch extraction model, optional)
    https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.pt
    # Save it to /assets/rvmpe/rmvpe.pt

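The model links above are Hugging Face `blob` page URLs, so a script that fetches them as written would typically save a web page rather than the model file. The following is a minimal sketch of a download helper. It assumes the save paths are relative to the repository root and that each file is served from the matching `resolve` URL. `direct_url` and `download_models` are hypothetical names used for illustration; they are not part of this repository.

```python
from pathlib import Path
from urllib.request import urlretrieve

# Destination path (assumed relative to the repository root) -> link from the steps above.
MODELS = {
    "assets/hubert/hubert_base.pt":
        "https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/hubert_base.pt",
    # Optional: only needed for RMVPE pitch extraction.
    "assets/rvmpe/rmvpe.pt":
        "https://huggingface.co/lj1995/VoiceConversionWebUI/blob/main/rmvpe.pt",
}


def direct_url(blob_url):
    """Turn a Hugging Face '/blob/' page URL into a '/resolve/' file URL."""
    return blob_url.replace("/blob/", "/resolve/", 1)


def download_models(root="."):
    """Download every model in MODELS that is not already present under root."""
    for rel_path, blob_url in MODELS.items():
        target = Path(root) / rel_path
        if target.exists():
            continue  # keep an existing file instead of downloading it again
        target.parent.mkdir(parents=True, exist_ok=True)
        urlretrieve(direct_url(blob_url), str(target))
```

Calling `download_models()` from the repository root fetches any file that is still missing. Remove the `rmvpe.pt` entry if RMVPE pitch extraction is not needed.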

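Before launching the WebUI, the manual installation steps can be sanity-checked with a short script. This is only a sketch: `preflight` is a hypothetical helper, not something shipped with this repository, and it assumes the model paths given above are relative to the repository root.

```python
import importlib.util
import shutil
from pathlib import Path

# Required by the steps above; path assumed relative to the repository root.
REQUIRED_FILES = [Path("assets/hubert/hubert_base.pt")]


def preflight(root="."):
    """Return a list of problems found; an empty list means the checks passed."""
    problems = []
    # ffmpeg must be discoverable on PATH.
    if shutil.which("ffmpeg") is None:
        problems.append("ffmpeg not found on PATH")
    # Only checks that torch is importable, not which build (CPU or CUDA) it is.
    if importlib.util.find_spec("torch") is None:
        problems.append("torch is not installed")
    for rel in REQUIRED_FILES:
        if not (Path(root) / rel).is_file():
            problems.append(f"missing required file: {rel.as_posix()}")
    return problems
```

Printing the result of `preflight()` from the repository root lists anything that still needs attention. The optional RMVPE model is deliberately left out of the required files.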
  5. Run WebUI

    ```bash
    python app.py
    ```

## How to use

## Command Line Arguments

## Other Inference

Advanced RVC Inference