Fix multi-gpu training with huggingface accelerate

MedARC-AI / fMRI-reconstruction-NSD

fMRI-to-image reconstruction on the NSD dataset.

MIT License

280 stars 39 forks source link

Fix multi-gpu training with huggingface accelerate #15

Closed PaulScotti closed 10 months ago

PaulScotti commented 1 year ago

Right now our training notebook supports multi-gpu training with huggingface accelerate! But when we actually try to train with multi-gpu the results never perform as well as when we train with single-gpu.

I tried adjusting learning rate and batch size but this didn't seem to fix the discrepancy in results between multi-gpu and single-gpu.

Figure out what's going on here and I will be very grateful :)

PaulScotti commented 10 months ago

Working in MindEyeV2