becauseofAI opened 4 years ago
https://arxiv.org/ftp/arxiv/papers/1908/1908.08681.pdf and https://arxiv.org/pdf/2004.10934.pdf
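For a quick comparison of the two activations, here is a minimal sketch in plain Python. It assumes a leaky slope of 0.1 (the value darknet uses for its leaky activation) and takes the Mish definition from the first paper linked above, x * tanh(softplus(x)). The function names and the overflow cutoff of 20 are illustrative choices, not darknet code.

```python
import math

def leaky(x, slope=0.1):
    # Leaky ReLU: pass positives through, scale negatives by a small slope.
    # slope=0.1 is an assumption matching darknet's leaky activation.
    return x if x > 0 else slope * x

def mish(x):
    # Mish: x * tanh(softplus(x)), where softplus(x) = ln(1 + e^x).
    # For large x, softplus(x) is approximately x; the cutoff avoids overflow in exp.
    softplus = x if x > 20 else math.log1p(math.exp(x))
    return x * math.tanh(softplus)

for v in (-2.0, 0.0, 1.0):
    print(v, leaky(v), mish(v))
```

Both keep a small negative response instead of zeroing it out; Mish is smooth everywhere, while leaky has a kink at zero.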
max_delta is gradient clipping: https://machinelearningmastery.com/how-to-avoid-exploding-gradients-in-neural-networks-with-gradient-clipping/
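As a rough illustration of clipping by value, the sketch below clamps each gradient (delta) element to the range [-max_delta, max_delta]. It assumes that is how the limit is applied; `clip_delta` is a hypothetical helper written for this example, not a darknet function.

```python
def clip_delta(delta, max_delta):
    # Clamp every element to [-max_delta, max_delta] (clipping by value).
    # Hypothetical helper for illustration; not taken from darknet.
    return [max(-max_delta, min(max_delta, d)) for d in delta]

deltas = [0.3, -7.2, 12.0, -0.1]
print(clip_delta(deltas, max_delta=5.0))  # [0.3, -5.0, 5.0, -0.1]
```

Under this assumption, max_delta=5 would leave small deltas unchanged and cap any delta whose magnitude exceeds 5.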
Because random affects the whole network, maybe it should be set in the [net] section instead of the [yolo] layer, but in the original repo it was implemented in the [yolo] layer: https://github.com/pjreddie/darknet
leaky and mish?
max_delta in yolov4-custom, and what does max_delta=5 mean?
random=1 only used in the last yolo layer instead of all three yolo layers?