azraelkuan / tensorflow_wavenet_vocoder

wavenet vocoder using tensorflow
27 stars 10 forks source link

there are some question about my implementation #6

Open john0117 opened 6 years ago

john0117 commented 6 years ago

Hi azraelkuan, thanks for your work and sharing! I encounter three questions during implementation.

my environment: win10, python==3.6.7, tensorflow==1.11, anaconda==3

one is when I finished "preprocess.py", my file(LJSpeech-1-mel.npy) was only 176KB and file(LJSpeech-1-audio.npy) was only 281KB. I think it may be that something is repeatedly covered or the problem is caused by the difference between windows and Linux? i am not sure about this.

The second problem is encountered during the Synthesize step. I didn't find a file called "eavl.txt". There are only three files (LJSpeech-1-audio.npy, LJSpeech-1-mel.npy, train.txt) in output path after preprocess.

the final question is it seems that the input of the parameter is adjusted, unlike the command in the readme? about '--eval_txt' i just set the output folder for preprocess.

tensorflow_wavenet_vocoder>python mul_generate.py --eval_txt ./FeaPath/ --wav_out_path ./WavOut/ checkpoint ./log_ljspeech/train/2018-11-18T18-07-48/model.ckpt-99999 ---hparams gc_enable=False,global_channel=0,global_cardinality=0,NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050 usage: mul_generate.py [-h] [--logdir LOGDIR] [--temperature TEMPERATURE] [--save_every SAVE_EVERY] [--eval_txt EVAL_TXT] [--hparams HPARAMS] checkpoint mul_generate.py: error: unrecognized arguments: --wav_out_path checkpoint ./log_ljspeech/train/2018-11-18T18-07-48/model.ckpt-99999 ---hparams gc_enable=False,global_channel=0,global_cardinality=0,NPY_DATAROOT=/your_npy_datadir/,sample_rate=22050

or this code can't running on windows? Tell me if I'm wrong, thanks ^_^

bigdan12 commented 4 years ago

the eval_txt is extracted from the train_txt。。。。。。i see