Open zhangzaibin opened 1 year ago
I use one A100 GPU and set the batch size to 32. I need 11 hours for a single epoch. Is this normal?
Are you using matrixVT?
I use one A100 GPU and set the batch size to 32. I need 11 hours for a single epoch. Is this normal?