Closed Sunting78 closed 3 years ago
V100 32G run base model
Emm, you can try to reduce the batch size and reduce learning rate accordingly
V100 32G run base model