[x] I have searched related issues but cannot get the expected help.
[x] I have read the FAQ documentation but cannot get the expected help.
[x] The bug has not been fixed in the latest version.
Describe the bug
I ran several common detection models (Deformable DETR, DINO, etc.) on 8x AMD MI250 GPUs. Both training and inference are extremely slow: each training iteration takes about 30 minutes, and GPU utilization is low and unstable.
However, when these models are run with a larger backbone, such as Swin-L or ViT-L, the speed is normal.
A log from training Deformable DETR's encoder is attached as an example: 20240612_113140.log
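For anyone trying to confirm the per-iteration slowdown independently of the training logs, here is a minimal timing sketch. It is not part of mmdet: `time_iterations`, `step`, and `sync` are hypothetical names used only for illustration. On a PyTorch ROCm build, `sync` would typically be `torch.cuda.synchronize`, so that queued GPU work is included in the measurement.

```python
import time
from statistics import mean


def time_iterations(step, num_iters=5, sync=None):
    """Return the wall-clock duration (seconds) of each call to `step`.

    `step` is a hypothetical callable that runs one iteration.
    `sync` is an optional callable that blocks until the device is idle
    (assumption: torch.cuda.synchronize on a PyTorch ROCm build).
    """
    durations = []
    for _ in range(num_iters):
        if sync is not None:
            sync()  # flush pending device work before starting the clock
        start = time.perf_counter()
        step()
        if sync is not None:
            sync()  # wait for this iteration's device work to finish
        durations.append(time.perf_counter() - start)
    return durations


if __name__ == "__main__":
    # Stand-in CPU workload; replace with one real forward/backward pass.
    durations = time_iterations(lambda: sum(range(100_000)), num_iters=3)
    print(f"mean {mean(durations):.4f}s, max {max(durations):.4f}s")
```

Comparing the mean and max across a small and a large backbone should show whether the slowdown is steady or comes in spikes.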
Reproduction
1. What command or script did you run?
Run any model, such as Deformable DETR, with the default settings in a ROCm environment. For example:
2. Did you make any modifications on the code or config? Did you understand what you have modified?
No, I ran the unmodified mmdet code.
3. What dataset did you use?
COCO2017
Environment
Here is my running env: