wutong16 / DistributionBalancedLoss

[ ECCV 2020 Spotlight ] Pytorch implementation for "Distribution-Balanced Loss for Multi-Label Classification in Long-Tailed Datasets"

Support multi-GPU training? #4

Closed chen-judge closed 4 years ago

chen-judge commented 4 years ago

Hi, I tried to run `python tools/train.py configs/coco/LT_resnet50_pfc_DB.py --gpus 2`

and ran into the following error:

    File "/data2/cjq/DistributionBalancedLoss/mllt/models/losses/resample_loss.py", line 164, in rebalance_weight
        repeat_rate = torch.sum( gt_labels.float() * self.freq_inv, dim=1, keepdim=True)
    RuntimeError: expected device cuda:1 and dtype Float but got device cuda:0 and dtype Float

Does this code support multi-GPU training? Have you tried it, or should I fix these bugs myself?

Thanks for your code!

wutong16 commented 4 years ago

Hi @chen-judge!

Thank you for asking! Sorry, it is indeed a bug caused by the `.cuda()` call, which places the tensor on GPU 0 by default. Currently, this code does not support multi-GPU training, mainly because of the use of `ClassAwareSampler`. It would be possible to write a `DistributedClassAwareSampler` that distributes the samples across devices while keeping the class-aware sampling strategy, but it hasn't been necessary for these datasets since they are fairly small and fast to train on.