Hi, size. I'm using my own dataset to reproduce your work. I noticed that you used Slurm for training, but I can only use distributed training with dist_train.sh for my own project. I've run into several problems, for example the val dataloader is None. I also noticed that you use mmdet as an installed dependency package rather than importing it from source files, which makes it hard for us to debug when using dist_train.sh. If possible, could you test distributed training with dist_train.sh? I'd be deeply grateful!