AlexeyAB / darknet

YOLOv4 / Scaled-YOLOv4 / YOLO - Neural Networks for Object Detection (Windows and Linux version of Darknet )
http://pjreddie.com/darknet/
Other
21.57k stars 7.95k forks

YOLOR - You Only Learn One Representation: Unified Network for Multiple Tasks #7717

Open AlexeyAB opened 3 years ago

AlexeyAB commented 3 years ago

YOLOR - You Only Learn One Representation: Unified Network for Multiple Tasks:

YOLOR is better than YOLOv5 (u5r5), PP-YOLOv2, CenterNet2, EfficientDet, and many others.

It is an improved Scaled-YOLOv4-P6 (+0.9% AP).


Hwijune commented 3 years ago

@AlexeyAB hi!

I want to test the YOLOR model in Darknet.

  1. Can I train the YOLOR model in Darknet, or can I convert the PyTorch model to Darknet?

  2. Can the four layers below be replaced with existing Darknet layers? [implicit_add] [implicit_mul] [control_channels] [shift_channels]

IgnacioTsai commented 3 years ago

As far as I know, the four new layers are [implicit_add], [implicit_mul], [control_channels], and [shift_channels].

[shift_channels] takes a tensor with the same shape as the input tensor, initialized from a normal distribution with mean 0 and standard deviation 0.2 by default, and adds it to the input element-wise using implicit_add.

[control_channels] takes a tensor with the same shape as the input tensor, initialized from a normal distribution with mean 0 and standard deviation 0.2 by default, and multiplies it with the input element-wise using implicit_mul.

But I do not see the difference between ImplicitA, ImplicitC, and ImplicitM. They look the same to me and only give different results because they are applied through [shift_channels] or [control_channels].
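To make the add-versus-multiply distinction concrete, here is a minimal pure-Python sketch of my understanding, not the YOLOR or Darknet code. The function names `make_implicit`, `shift_channels`, and `control_channels` are only borrowed from the layer names for illustration. It also assumes one implicit value per channel, broadcast over the spatial positions, and it takes the default standard deviation of 0.2 from the description above; both should be checked against the reference implementation.

```python
import random


def make_implicit(channels, mean=0.0, std=0.2, seed=None):
    """Draw one implicit value per channel from a normal distribution.

    The default std of 0.2 follows the description in this thread
    (an assumption, not verified against the YOLOR code).
    """
    rng = random.Random(seed)
    return [rng.gauss(mean, std) for _ in range(channels)]


def shift_channels(x, implicit):
    """Add implicit[c] to every element of channel c; x is indexed [C][H][W]."""
    return [[[v + a for v in row] for row in plane]
            for plane, a in zip(x, implicit)]


def control_channels(x, implicit):
    """Multiply every element of channel c by implicit[c]; x is [C][H][W]."""
    return [[[v * a for v in row] for row in plane]
            for plane, a in zip(x, implicit)]


# A 2-channel, 1x2 feature map and a hand-picked implicit tensor,
# so the two results are easy to compare by eye.
x = [[[1.0, 2.0]], [[3.0, 4.0]]]
implicit = [0.5, -1.0]

shifted = shift_channels(x, implicit)    # element-wise add
scaled = control_channels(x, implicit)   # element-wise multiply
```

The same implicit tensor produces different outputs only because of the operation applied to it, which matches the observation above that the Implicit modules look alike and differ in how they are used.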

In the paper, should both methods be called "feature alignment"?

By the way, YOLOR uses the SiLU activation function.
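For reference, SiLU is defined as x * sigmoid(x). Below is a small scalar sketch in plain Python; the function name `silu` and the branch on the sign of x (used to avoid overflow in `math.exp`) are my own choices for illustration, not code taken from YOLOR or Darknet.

```python
import math


def silu(x):
    """SiLU activation: x * sigmoid(x), written to avoid exp overflow."""
    if x >= 0:
        # exp(-x) is at most 1 here, so this cannot overflow.
        return x / (1.0 + math.exp(-x))
    # For negative x, use the equivalent form with exp(x), which is below 1.
    e = math.exp(x)
    return x * e / (1.0 + e)


values = [silu(v) for v in (-2.0, 0.0, 1.0)]
```

For large positive inputs SiLU approaches the identity, and for large negative inputs it approaches zero.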