Distributed training issue

wusize / ovdet

[CVPR2023] Code Release of Aligning Bag of Regions for Open-Vocabulary Object Detection

https://openaccess.thecvf.com/content/CVPR2023/papers/Wu_Aligning_Bag_of_Regions_for_Open-Vocabulary_Object_Detection_CVPR_2023_paper.pdf

Other


Distributed training issue #19

Open yyyyyyfs opened 1 year ago

yyyyyyfs commented 1 year ago

Hi, size. I'm using my own dataset to reproduce your work. I noticed that you used Slurm for training, but I can only run distributed training with dist_train.sh for my own project. This causes several problems, for example the val dataloader is None. I also noticed that you use mmdet as an installed dependency package rather than importing it from source files, which makes it hard for us to debug when using dist_train.sh. If possible, could you test this method of distributed training? I would be deeply grateful!

wusize commented 1 year ago

Hi! Thanks for raising this issue. I will check it soon.

yyyyyyfs commented 1 year ago

> Hi! Thanks for raising this issue. I will check it soon.

That's great! Looking forward to your good news. Thanks!