DefTruth / yolov5face-toolkit

🍅 YOLO5Face 2021 with MNN/NCNN/TNN/ONNXRuntime
GNU General Public License v3.0
54 stars 8 forks source link

what's the output dimensions means in face detection only model? #2

Open lucasjinreal opened 2 years ago

lucasjinreal commented 2 years ago

output is type: float32[1,25200,16]

what does 16 mean here?

DefTruth commented 2 years ago

my blog about this model: https://zhuanlan.zhihu.com/p/461878005
arxiv paper: https://arxiv.org/abs/2105.12931
official repo: https://github.com/deepcam-cn/yolov5-face

16 = 4(bbox offsets) + 1 (object prob, foreground or not) + 10(5 landmarks) + 1(face prob)