Reproduced CenterNet-S4_DLA_8x with RTX 3090, but the FPS is only 26.4, far from 71

xingyizhou / CenterNet2

Two-stage CenterNet

Apache License 2.0

1.19k stars 189 forks

Reproduced CenterNet-S4_DLA_8x with RTX 3090, but the FPS is only 26.4, far from 71 #83

Open FH-JAVA opened 2 years ago

FH-JAVA commented 2 years ago

xingyizhou commented 2 years ago

Can you show your PyTorch, CUDA, cuDNN, and detectron2 versions? You can find them at the very beginning of the output log. The output log should also show the runtime breakdown if you use a recent version of detectron2. Can you report that as well?

Best, Xingyi
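If the start of the output log is no longer at hand, the same version information can be gathered directly. The following is a minimal sketch, not part of the CenterNet2 repository: `collect_versions` is a hypothetical helper name, and it assumes only that PyTorch exposes its CUDA and cuDNN versions through `torch.version.cuda` and `torch.backends.cudnn.version()`. Packages that fail to import are reported as "not installed" rather than raising.

```python
import importlib


def collect_versions(packages=("torch", "torchvision", "detectron2")):
    """Return a dict mapping component name to a version string.

    Hypothetical helper for illustration; missing packages are reported
    as 'not installed' instead of raising.
    """
    versions = {}
    for name in packages:
        try:
            module = importlib.import_module(name)
            versions[name] = str(getattr(module, "__version__", "unknown"))
        except Exception:
            # Covers a missing package as well as a broken build.
            versions[name] = "not installed"

    # CUDA, cuDNN, and GPU information are reported by PyTorch, if available.
    try:
        import torch

        versions["cuda"] = str(torch.version.cuda)
        versions["cudnn"] = str(torch.backends.cudnn.version())
        if torch.cuda.is_available():
            versions["gpu"] = torch.cuda.get_device_name(0)
        else:
            versions["gpu"] = "no CUDA device"
    except Exception:
        versions["cuda"] = "unknown"
        versions["cudnn"] = "unknown"
        versions["gpu"] = "unknown"
    return versions


if __name__ == "__main__":
    for key, value in collect_versions().items():
        print(f"{key}: {value}")
```

detectron2 also ships its own environment report, which can be printed with `python -m detectron2.utils.collect_env`.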

FH-JAVA commented 2 years ago

Thank you for your reply. The GPU I rented previously was released, so I rented another RTX 3090 and reproduced CenterNet-S4_DLA_8x, but the FPS is now 14.4, lower than before. The log is available here.

MoaazAbdulrahman commented 2 years ago

@xingyizhou @FH-JAVA I am facing exactly the same issue on an RTX 3090: the inference time is far from the reported numbers. Were you able to solve this issue?
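When comparing a measured FPS against a reported one, it can help to time only the model's forward pass, after a warm-up phase and with explicit GPU synchronization. The sketch below shows one way to do that; it is an assumption about a reasonable measurement procedure, not the method used to obtain the reported 71 FPS. `measure_fps`, `run_once`, and `sync` are hypothetical names introduced here for illustration.

```python
import time


def measure_fps(run_once, warmup=10, iters=50, sync=None):
    """Time `run_once` and return (frames_per_second, milliseconds_per_frame).

    `run_once` runs inference on a single image. `sync` is an optional
    callable that blocks until queued GPU work has finished; without it,
    asynchronous CUDA launches can make the timing look faster than it is.
    """
    if sync is None:
        sync = lambda: None  # CPU-only case: nothing to wait for

    # Warm-up iterations are excluded from timing (cuDNN autotuning, caches).
    for _ in range(warmup):
        run_once()
    sync()

    start = time.perf_counter()
    for _ in range(iters):
        run_once()
    sync()  # wait for all timed work before stopping the clock
    elapsed = time.perf_counter() - start

    return iters / elapsed, 1000.0 * elapsed / iters
```

With PyTorch on a GPU, `sync` would typically be `torch.cuda.synchronize`, and `run_once` would call the model inside `torch.no_grad()`. Data loading and post-processing are deliberately left out of the timed region, so the result is not directly comparable to an end-to-end evaluation time.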