Open nahidalam opened 3 months ago
What is the minimum GPU setup needed for training? 8xA100 (40GB) or 8xA100 (80GB)?
Our experiments used 8xA100 80GB. To train on GPUs with less memory, you may consider lowering the parameter --model.finetune_per_device_batch_size while keeping the overall batch size unchanged.
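To illustrate the arithmetic behind that suggestion, here is a minimal sketch. It assumes the overall batch size is kept constant through gradient accumulation, which is the usual approach but is not confirmed in this thread. The function name `grad_accum_steps` and the batch size of 128 are hypothetical and are not taken from this repository.

```python
def grad_accum_steps(global_batch_size: int,
                     num_devices: int,
                     per_device_batch_size: int) -> int:
    """Return how many micro-batches each device must accumulate per
    optimizer step so the effective batch size stays at global_batch_size.

    Hypothetical helper for illustration only; not part of the repository.
    """
    samples_per_micro_step = num_devices * per_device_batch_size
    if global_batch_size % samples_per_micro_step != 0:
        raise ValueError(
            "global_batch_size must be divisible by "
            "num_devices * per_device_batch_size"
        )
    return global_batch_size // samples_per_micro_step


# Illustrative numbers only (128 is an assumed overall batch size).
print(grad_accum_steps(128, 8, 16))  # 16 samples per GPU per micro-step
print(grad_accum_steps(128, 8, 4))   # 4 samples per GPU per micro-step
```

A smaller per-device batch size reduces activation memory per GPU, at the cost of more accumulation steps and therefore longer wall-clock time per optimizer step.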