pytorch / vision

Datasets, Transforms and Models specific to Computer Vision
https://pytorch.org/vision
BSD 3-Clause "New" or "Revised" License
16.32k stars 6.97k forks source link

Adding CenterNet to Torchvision/detection #4595

Open chang4869 opened 3 years ago

chang4869 commented 3 years ago

🚀 The feature

Add CenterNet to torchvision

Motivation, pitch

centernet is a better one-stage object detector than SSD and it is also very practical.This anchor-free detector is currently relatively popular.

Alternatives

No response

Additional context

No response

cc @datumbox

mikeseven commented 2 years ago

There are 2 centernet in the literature. The most used I think is centernet objects as point that is the basis for many applications.

The nice thing about this model is that it is a generic architecture and many other models already in torchvision can be used for backbone, neck, and head.

adamsau commented 2 years ago

@datumbox is there any plan on adding this model?

datumbox commented 2 years ago

@adamsau Thanks for checking.

Our intention is to prioritize the model on the future but not yet. The main issue at the moment is that we don't want to add more Keypoint detection models (I'm aware CenterNet supports normal bboxes too) until we have completed the Transforms v2 API. This is because in order to train the models properly, we need to have better support on the transforms side for Keypoints.

adamsau commented 2 years ago

@datumbox thanks for the update, definitely need some fast model to detect keypoints since the keypoint rcnn is too slow for real world applications. Hope there will be new models coming soon!

datumbox commented 2 years ago

Makes sense. Can certainly ping you once we are ready in case you are willing to contribute it. :)