chrisdonahue / wavegan

WaveGAN: Learn to synthesize raw audio with generative adversarial networks
MIT License
1.32k stars 283 forks source link

parameters #24

Closed pankajbadatia closed 5 years ago

pankajbadatia commented 5 years ago

Hi i am working on a audio 'wav' file which is 44100Hz. Can you please suggest me other settings needed to update. I have only two classes . I have made dataset dir and put wav files in it

firstly i changed sample rate = 44100hz works with --data_slice_len =16384 which is default I think. tensorboard shows output . training starts scalars and other things are visible

also tried to change --data_slice_len = 32768 or 65536 (its ok if output file is not exactly 1sec , it can be more) But tensorboard doesn't show output. no scalars and only graph and projector options available

Can anyone help me on this . Its urgent..

chrisdonahue commented 5 years ago

this might be happening because your wav files are too short. Try adding command line arg --data_pad_end. You might also want to add --data_first_slice which will only use the first "slice" from your waveforms, which I'm guessing you will want because you're using short waveforms.

pankajbadatia commented 5 years ago

Hi thanks it worked ...

Also can you please tell me what is incept_k=10 whether I need to change this since i have two output class