Closed mnabenas closed 1 year ago
I seem to have duplicated the generator for some reason. I have uploaded the correct discriminator/optimiser to the folder: https://drive.google.com/drive/u/1/folders/121e6UgV1qtKGTbdEoGmQhp4BdAQqJUrB
Thanks! With that is working just fine. Last question, during the conversion we use this command:
python convert_all.py \
--checkpoint_path out_ft_IEMOCAP_en_4class/logdir/checkpoint_13651 \
--input_list ../emotion/datasets/MSP-IMPROV/files_neutral.txt \
--output_dir ../augmentation/datasets/MSP-IMPROV_aug/IEMOCAP_evc/hifi-gan_v1_ft_mel_vocoded/ \
--wav \
--hifi_gan_path ../hifi-gan/cp/v1_cv_10lang_ft_cv_10lang/g_00496000 \
--hparams \""emo_list=[anger,happiness,neutral,sadness]"\",emo_embedding_dir=embeddings/IEMOCAP/,mel_mean_std=../data/IEMOCAP/mel_mean_std.npy,pretrain_n_speakers=1967,n_symbols=315
But in the args there is the --neutral argument. Is it which index of my list of emotions is the neutral one? I cannot find the documentation for that argument and looks like is a required one. Thanks again
Yes, it represents the index of the neutral emotion in the list of embeddings. However, I think that if spemb_input == False
it doesn't actually matter, which is the case for the trained models linked in the README.
Are the pre trained files for Hifi GAN correct? I was trying to fine tune them using ESD and after creating the forward outputs I tried to run the fine tune command
but it throws the following error:
After checking the files downloaded I noticed that both weigh the same, is that correct? Thanks for any help