New Dataset - Githubissues

jdbermeol commented 7 years ago

Hi, So everything worked perfectly with your pre-process Vctk. Now I want to test with Nancy data set. I'm using the script you suggested, but I have 2 questions:

When I run the script I get 2 files on the norm_info folder: label_norm_HTS_420.dat and norm_info_mgc_lf0_vuv_bap_63_MVN.dat. Based on the shape the correct file is norm_info_mgc_lf0_vuv_bap_63_MVN.dat, but I want to be sure.
In order to combine both datasets, should I have to run the script for each speaker and them combine somehow the norms file, or should I put all data in one folder and process it?

Thanks.

jdbermeol commented 7 years ago

Hi again, Also I check .npz files and they don't contain several files that are in your Vctk data:

audio_norminfo
code2char
text_features
code2phone
text_norminfo Do you know how to build it?

jayavanth commented 7 years ago

Uncomment the required arrays here

karandwivedi42 commented 7 years ago

@jdbermeol @jayavanth did you find an answer to the original question:

In order to combine both datasets, should I have to run the script for each speaker and them combine somehow the norms file, or should I put all data in one folder and process it?

jdbermeol commented 7 years ago

@karandwivedi42 No, I don't know the answer yet. Also, the script used to work but now I get this error:

Traceback (most recent call last): File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/run_merlin.py", line 1175, in main_function(cfg) File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/run_merlin.py", line 693, in main_function acoustic_worker.prepare_nn_data(in_file_list_dict, nn_cmp_file_list, cfg.in_dimension_dict, cfg.out_dimension_dict) File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/frontend/acoustic_base.py", line 122, in prepare_nn_data self.prepare_data(in_file_list_dict, out_file_list, in_dimension_dict, out_dimension_dict) File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/frontend/acoustic_composition.py", line 126, in prepare_data features, frame_number = io_funcs.load_binary_file_frame(in_file_name, in_feature_dim) File "/home/ubuntu/loop/preprocessing/latest_features/merlin/src/io_funcs/binary_io.py", line 64, in load_binary_file_frame fid_lab = open(file_name, 'rb') IOError: [Errno 2] No such file or directory: '/home/ubuntu/loop/preprocessing/latest_features/merlin/egs/build_your_own_voice/s1/experiments/my_new_voice/acoustic_model/data/mgc/*.mgc' + echo 'All successfull!! Your demo voice is ready :)' All successfull!! Your demo voice is ready :) Feature extraction complete! Traceback (most recent call last): File "extract_features.py", line 1411, in save_numpy_features() File "extract_features.py", line 853, in save_numpy_features shutil.copy2(audio_norm_source, audio_norm_dest) File "/home/ubuntu/miniconda2/envs/loop/lib/python2.7/shutil.py", line 130, in copy2 copyfile(src, dst) File "/home/ubuntu/miniconda2/envs/loop/lib/python2.7/shutil.py", line 82, in copyfile with open(src, 'rb') as fsrc: IOError: [Errno 2] No such file or directory: '/home/ubuntu/loop/preprocessing/latest_features/final_acoustic_data/norm_info_mgc_lf0_vuv_bap_63_MVN.dat'

jdbermeol commented 7 years ago

@adampolyak, @ytaigman. Hi, So I have been able to run the extract_features script for a speaker on the VCTK dataset. However, each run is going to create a norm_infor folder. So I go back to my original question. Is there a way to combine the output of each norm_info folder. Or should I create a big folder with all samples and run the script using that folder?

adampolyak commented 7 years ago

norm_info_mgc_lf0_vuv_bap_63_MVN.dat is indeed the correct norm file.
Both ways are valid. The norm file contains mean and std of the dataset - see the generation code. You can merge the statistics or just run the script on the merged folder.

jackchinor commented 7 years ago

@jdbermeol I want to train my own data set, but I found it too complicate to do it. Should I firstly run the install_tts.py? and then run the extract_features script? when I run the install_tts.py , it occurred an error as follow: Traceback (most recent call last): File "install_tts.py", line 174, in pe(untar_cmd) File "install_tts.py", line 114, in pe for line in execute(cmd, shell=shell): File "install_tts.py", line 107, in execute raise subprocess.CalledProcessError(return_code, cmd) subprocess.CalledProcessError: Command '['tar', 'xzf', '/tmp/kastner/speech_synthesis/speech_tools-2.4-release.tar.gz']' returned non-zero exit status 2

I don't know how to fix it . Could you help me with it? really appreciate

dengbingfeng commented 7 years ago

@jackchinor in the install_tts.py, you can find you need to download kk_all_deps.tar.gz or install some tools first

jackchinor commented 7 years ago

@dengbingfeng I created the kk_all_deps.tar.gz, but run the install_tts.py file, it doesn't work, an error occured

jackchinor commented 7 years ago

@jdbermeol finally ,I generate .npz files , but some of the .npy files are not contained, just the same with you. 1.audio_norminfo 2.code2char 3.text_features 4.code2phone 5.text_norminfo Can you tell me how to build them?Really appreciate...

jdbermeol commented 7 years ago

@jackchinor You will need to uncomment this line: https://gist.github.com/kastnerkyle/cc0ac48d34860c5bb3f9112f4d9a0300#file-extract_feats-py-L1034

You will see that the missing matrices are also commented, so you will need to uncommented too. The only one that needs to remain commented is the code2speaker.

jackchinor commented 7 years ago

@jdbermeol I see, thank you so much.

ankitmishra262 commented 7 years ago

I am trying to run extract_feats.py on the complete VCTK dataset by following the advice from this comment above, to put all the wav files and text in one big directory and run the script on them.

Before I get to the part of save_dict saving all features, I'm getting the following error.

Feature extraction complete!
Traceback (most recent call last):
    File "extract_feats.py", line 1440 in <module>
        save_numpy_features()
    File "extract_feats.py", line 1020 in save_numpy_features
        assert phonemes[0] == 'pau'
IndexError: tuple index out of range

My best guess is that the phonemes tuple is not being created properly. Any suggestions or am I making some common mistake?

jdbermeol commented 7 years ago

hi @ankitmishra262, great question, same happens to me, I could not solve it, I have to restrict my self to the subsample of speaker Facebook team use in the paper.