HallowSiddharth / VoiceCraftAI

VoiceCraftAI is a revolutionary AI tool to dub videos into multiple regional languages and lip-sync at the same time.
47 stars 13 forks source link

Voice Craft AI - Where the essence of your voice meets the diversity of languages.

VoiceCraft

Trying it Yourself

Pre-requisites

Setting up VoiceCraftAI in your system

  1. Clone this directory
  2. Run the setup.ipynb file (all the cells)
  3. Now open your AICoverGen folder and go to the src sub folder 3.1 Replace the main.py with the modifications/AICoverGen mods/main.py
  4. Enable GPU Acceleration and install Tensorflow, CUDA,etc. 4.1 Follow this link : https://docs.google.com/document/d/1MQ35ZeMZupJQCz4pUmI2Z0j6yD5VaVRWYBtosGNK0p8/edit?pli=1#heading=h.aczyuw2yex2w
  5. Make sure to change the absolute path in each of the following files to your similar path:
    • web.py
    • test.py
    • trimming.py
    • voice_cloning.py

Voice Craft AI is a cutting-edge dubbing software powered by artificial intelligence (AI), designed to seamlessly translate English audio into various Indian regional languages. Our innovative platform offers a simplified yet robust process to dub videos in multiple languages while retaining the nuances of the original speaker's voice.

The Dubbing Process Voice Craft AI follows a systematic approach to transform English audio into regional languages with a touch of naturalness. We follow a 8-step pipeline in order to achieve this end product.

sih

Setup

Input: Youtube video / User uploaded video

Step 1: Audio Extraction

Step 2: Speech Transcription

Step 3: Translation

Step 4: Voice Synthesis

Step 5: Voice Model Creation

Step 6: Final Audio Generation

Step 7: Video Integration and Subtitle Generation

Step 8: Lip-sync Integration (Working on this feature)

This marks the end of our pipeline.

Output: 20+ voice-cloned and lip-synced videos in various regional languages.

The End Result

Voice Craft AI's unique pipeline takes a single video as input and transforms it into multiple output videos. These output videos are not only professionally dubbed in various Indian regional languages but also feature the speaker's own voice and are synchronized with lip movements. The addition of subtitles further enhances the user experience, making the content more accessible to a wider audience. With Voice Craft AI, the power of AI is harnessed to break language barriers, making content more inclusive, engaging, and relatable to diverse audiences across India. Whether for entertainment, education, or information sharing, our solution ensures that your message is heard and understood by all. We're experimenting on more features actively, if you guys want to contribute, let us know.

Innovate, Create, Inspire