Training Soundstream on Single GPU

yangdongchao / AcademiCodec

AcademiCodec: An Open Source Audio Codec Model for Academic Research

574 stars 80 forks source link

Training Soundstream on Single GPU #2

Closed rishikksh20 closed 1 year ago

rishikksh20 commented 1 year ago

Hi @yangdongchao I am planning to training SoundStream codec from this repo to clean version of Libri light dataset + VCTK datasets and will open source the checkpoint, but I have single A100 for that, is it possible to train Soundstream on single A100 with lower batch size for longer time period?

yangdongchao commented 1 year ago

Hi @yangdongchao I am planning to training SoundStream codec from this repo to clean version of Libri light dataset + VCTK datasets and will open source the checkpoint, but I have single A100 for that, is it possible to train Soundstream on single A100 with lower batch size for longer time period?

Yes, it just costs more time. Our code code suports one GPU to train it. I believe it can be trained well. We expect your traning results.

rishikksh20 commented 1 year ago

Any idea how many days of training it will takes to generate good result on single 3090 or A100 ?

yangdongchao commented 1 year ago

Any idea how many days of training it will takes to generate good result on single 3090 or A100 ?

It relys on the number of your audio. If you use 1000 h data, I believe one 80G A100 only needs 1-2 weeks. I suggest you set 1 second segment input the model, and set larger batch size.

rishikksh20 commented 1 year ago

ok thanks

seastar105 commented 1 year ago

@rishikksh20 have successfully trained soundstream with default setting of this repo?