archinetai / audio-diffusion-pytorch

Audio generation using diffusion models, in PyTorch.
MIT License
1.92k stars 167 forks source link

CUDA OF Memory for 80GB A100 : follow the mousai paper setting of text condition #79

Open SuperiorDtj opened 11 months ago

SuperiorDtj commented 11 months ago

the test auido is 32-channel 2**15-length, for the batch 2 Besides, the num of trainable paras of the text condition generationis only 672M when follow the paper setting(text embding dim is 768 for t5-base) image