MLo7Ghinsan / DiffSinger_colab_notebook_MLo7

DiffSinger training colab notebook to make training easier hopefully
https://github.com/openvpi/DiffSinger
32 stars 9 forks source link
diffsinger svs

Custom Local Training GUI is moved to DiffTrainer


DiffSinger training notebook: Open In Colab

current supported data format:

NOTE:

Zip file format examples:

[NOTE] .ds training has the same zip organization as lab + wav, but with only .ds files- no wav needed
#single speaker (lab + wav)
your_zip.zip:
    |
    |
    your_speaker_folder:
        |
        |
        data_1.wav
        data_1.lab
        .
        data_2.wav
        data_2.lab
        .
        data_3.wav
        data_3.lab
        .
        ...
#single speaker (csv + wav)
your_zip.zip:
    |
    |
    your_speaker_folder:
        |
        |
        wavs (folder named "wavs" containing all the wavs)
        .
        transcriptions.csv
#multi speaker (lab + wav)
your_zip.zip:
    |
    |
    your_speaker_folder_1:
        |
        |
        data_1.wav
        data_1.lab
        .
        data_2.wav
        data_2.lab
        .
        data_3.wav
        data_3.lab
        .
        ...
    your_speaker_folder_2:
        |
        |
        data_1.wav
        data_1.lab
        .
        data_2.wav
        data_2.lab
        .
        data_3.wav
        data_3.lab
        .
        ...
#multi speaker (csv + wav)
your_zip.zip:
    |
    |
    your_speaker_folder_1:
        |
        |
        wavs (folder named "wavs" containing all the wavs)
        .
        transcriptions.csv
    your_speaker_folder_2:
        |
        |
        wavs (folder named "wavs" containing all the wavs)
        .
        transcriptions.csv


Vocoder finetuning notebook: Open In Colab

current supported data format:

NOTE:

SOFA training notebook (wip): Open In Colab

current supported data format:

NOTE:


Plans (update might not be in order):


Credits: