AlexeyAB / darknet

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )
http://pjreddie.com/darknet/
Other
21.77k stars 7.96k forks source link

Any plan to support pp-yolo? #6350

Open toplinuxsir opened 4 years ago

toplinuxsir commented 4 years ago

Any plan to support pp-yolo? Thanks! https://arxiv.org/abs/2007.12099

AlexeyAB commented 4 years ago


image


image

FilipLangr commented 4 years ago

@AlexeyAB I believe there's a minor mistake in:

  • Coord Conv – should be used only for conv-1x1 in backbone

PP-YOLO paper says (at page 4): "In order to reduce the loss of efficiency as much as possible, we do not change convolutional layers in backbone, and only replace the 1x1 convolution layer in FPN and the first convolution layer in detection head with CoordConv."

AlexeyAB commented 4 years ago

Fixed it.

toplinuxsir commented 4 years ago

@AlexeyAB Great work, Where to find the pp yolo config file to train custom detection? Thanks !

scamianbas commented 4 years ago

@AlexeyAB Great work, Where to find the pp yolo config file to train custom detection? Thanks !

Hi @AlexeyAB and congrats for the great work. Is this still in the todo list ? Thanks !

Goru1890 commented 4 years ago

Great work! @scamianbas Yes, it's seems to be still in progress https://github.com/AlexeyAB/darknet/projects/1#card-42538662.

matankley commented 3 years ago

Hi, The new_coords=1 parameter in yolo_layer implements the calculation process in the [yolo_layer].

But from what I understand it doesn't implement CoordConv the same way as pp-yolo. No extra 2 channels are added and there is no possibility to add new_coords toi [conv] layer.

Am i correct ? Any plans to support this ?

Thanks

tamir12312 commented 3 years ago

Hi,

I need help to understand how can I implement deformable convolution layer instead of convolution. I can't find any parameter that suggests to replace the simple convolution operation with the deformable one.

In your comment, you wrote that "we replace the 3×3 convolutional layer in the last stage of ResNet with deformable convolution layer" - How did you do that and can I replicate it?

Thanks alot in advance! Tamir :)