Closed yifan-bao closed 1 week ago
It seems accelerate is correctly configured to use 4 GPUs, maybe the model fits in one gpu but it's not enough for a batch size of 10? can you try lowering it and using mixed precision via --precision bf16
or --precision fp16
?
I configure acclerate config correctly but it gives me out-of-memory issue. I examine the GPU usage and can see that all the 4 processes are using the first gpu. I'm sure that my model fits into one card. I test one card evaluation and the process runs correctly. The following is my accelerate config:
The following is my running script: