yangdongchao / Text-to-sound-Synthesis

The source code of our paper "Diffsound: discrete diffusion model for text-to-sound generation"
http://dongchaoyang.top/text-to-sound-synthesis-demo/
345 stars 36 forks source link

About CC_pretrained model #11

Closed yizhidamiaomiao closed 1 year ago

yizhidamiaomiao commented 1 year ago

Dear authors,

When I try to reproduce your code, it seems that in your "Diffsound/running_command/run_train_audioset.py" file, you trained your model by loading a model called "--load_path OUTPUT/pretrained_model/CC_pretrained.pth".

Do we have to find and download CC_pretrained.pth? Where can we find it?

Thanks a lot.

yangdongchao commented 1 year ago

Hi, this model is not needed, load_path means that you can load a pre-trained model, but if you train your model from scratch, you donot need this.