zelaki / DreamSound

Code for Investigating Personalization Methods in Text to Music Generation
https://zelaki.github.io/
29 stars 5 forks source link

questions about the hyper parameter of the DB setting #8

Open Everglow-ZJU opened 8 months ago

Everglow-ZJU commented 8 months ago

Dear authors, You claimed in the paper that "using a single NVIDIA RTX-3090 GPU with a training batch size of 4, employing learning rates of 4 × 10−6 for DB", is that correct? I found it slow on my V100GPU under this hyper parameter setting(two and an half hour for training a single concept),my gradient accumulation steps is set to 1, and max_train_steps is 1500, I would appreciate it if you could help me