Open Henry839 opened 4 months ago
Hi @Henry839, did you figure out the training time, or the time per 50 steps?
On a 40 GB A100, it takes about 10 s per 50 steps with batch_size = 16.
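For anyone who wants to extrapolate from that number, here is a rough sketch of the arithmetic. The helper name and the total step count (100,000) are placeholders of mine, not figures from the SEDD repo or paper. It also assumes throughput stays constant and that your hardware and batch size match the setup above.

```python
def estimate_training_hours(total_steps, seconds_per_chunk=10.0, steps_per_chunk=50):
    """Extrapolate wall-clock hours from a measured time per chunk of steps.

    Defaults use the figure reported in this thread (about 10 s per 50 steps).
    """
    seconds_per_step = seconds_per_chunk / steps_per_chunk  # 10 / 50 = 0.2 s per step
    return total_steps * seconds_per_step / 3600.0


# Placeholder step count, NOT taken from the SEDD training config.
hypothetical_total_steps = 100_000
hours = estimate_training_hours(hypothetical_total_steps)
print(f"~{hours:.1f} hours for {hypothetical_total_steps} steps")
```

Swap in the real step count from your config to get a usable estimate. Treat the result as a lower bound, since evaluation, checkpointing, and data loading stalls are not included.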
Dear authors, could you please provide the training time of SEDD on 8 A100 GPUs? Thanks so much.