Unable to run training script of Natural Speech 2

open-mmlab / Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

https://openhlt.github.io/amphion/

MIT License

4.19k stars 357 forks source link

Unable to run training script of Natural Speech 2 #43

Open dongngm opened 6 months ago

dongngm commented 6 months ago

Hi,

I ran into multiple issues trying to run the training script: In ns2_dataset.py:

self.utt2phone[utt] = utt_info["phones"]: where phones comes from? I suspect we need to run the phonemizer first? but I don't see extract_phone=True in the config file
utt_info["num_frames"] is utt_info["Duration"], right?

In exp_config_base.json:

use_code=true, use_pitch=true, use_phone, should extract_acoustic_token=true, extract_pitch=true, extract_phone=true also?
There seems to be some mismatch between tts/preprocessing.py and the config file. For example: code_dir should be acoustic_token_dir?

HeCheng0625 commented 6 months ago

It has some differences for the data processing for NS2 between other TTS. We will update the data processing section as soon as possible.

vn09 commented 6 months ago

Hi @HeCheng0625 ,

I hope this message finds you well. I understand that these things take time and effort, and I appreciate the work you're putting into it.

If possible, could you please provide an estimated timeline for when we might expect the update?

HeCheng0625 commented 6 months ago

Hi, we will update a new checkpoint and data processing pipeline on a large dataset (> 1 w hours) in about two weeks. Now, we only use libritts to train the model. Now, we use our pretrained model on libritts: https://huggingface.co/amphion/naturalspeech2_libritts Or, try the toy demo: https://huggingface.co/spaces/amphion/NaturalSpeech2

vn09 commented 6 months ago

Thanks @HeCheng0625.

vn09 commented 6 months ago

Hi @HeCheng0625 , I just wanted to hear from you if there have been any updates on the data processing pipeline.

shreeshailgan commented 3 months ago

Any updates on the data preprocessing pipeline?

CreepJoye commented 1 month ago

Hello，@dongngm I encountered the same issue and have the same confusion. Do you have a solution to this problem? Any advice will be appreciated!

chazo1994 commented 3 weeks ago

@HeCheng0625 @RMSnow Do you have any updates on the preprocessing pipeline for neuralspeech2?