Open ghost opened 6 years ago
Each anchor in each cell can detect one object. Also Yolo v3 can detect multilabeled object, for example, dog, sofa, tv
in one box, it dog, sofa and tv
are placed close to each other.
Anchors are inital sizes of objects (size + aspect ratio). Initial size will square, but output bounded boxes (after logistic regression) may not be square.
Thanks reply.
Each anchor in each cell can detect one object. Also Yolo v3 can detect multilabeled object, for >example, dog, sofa, tv in one box, it dog, sofa and tv > are placed close to each other.
So, in predictions.png, only YOLO v3 can draw bounding boxes of several class in one cell??
So, in predictions.png, only YOLO v3 can draw bounding boxes of several class in one cell??
Yolo v3 just do it much better.
In the new repository, theoretically Yolo v2 can draw bounding boxes of several class in one cell, but it can be very rare due to the use of softmax.
Or if is used low -thresh 0.001
for Yolo v2 - look at diningtable, dog
:
Thanx very much!
I have roughly read the paper of YOLO v1, v2. I have two questions.
Question 1: Does the output image of YOLO v1, v2, v3 only recognize one object per grid?
Question 2: Input Image Size = [4160, 4160] YOLO Image Size = [416, 416] anchors = 10, 10
In the example above, is the rectangular area always square? I heard that the anchors are aspect ratios.