zigvu / khajuri

Video Pipeline

Recombine scores across scales/strides #91

Closed eacharya closed 9 years ago

eacharya commented 9 years ago

Find a better way to get tighter localization bboxes. The basic ideas are:

  1. Never lose a caffe above-threshold detection. This implies that a simple average, like we have used up to now, won't work.
  2. Pixels that have multiple above-threshold detections across sliding windows should get a higher score.
  3. Pixels at edges and corners don't get visited as often, so an above-threshold detection there should count for more.
  4. Each logo appearance should be analyzed independently. Thus, if a frame has two logos detected at a particular scale, the bbox of one should NOT be influenced by the bbox of the other. Similarly, if the frame has the same logo detected at different scales, the bbox made at one scale should not influence the bbox made at another.

Rather than write a single formula that satisfies all of these conditions at once, we used multiple filters (sigmoids, in our case) to approximate what we wanted. The filters need to be re-tuned whenever the scales/strides change.
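As a rough illustration of ideas 1 to 3, here is a minimal sketch of one way a sigmoid filter could be applied per pixel. It is not the code from this repository: the function names (`sigmoid`, `recombine`), the window tuple layout `(x, y, w, h, score)`, the 0.8 threshold, and the sigmoid midpoint/steepness are all assumptions made up for the example. It handles a single class at a single scale; separating individual logo appearances (idea 4) is left out.

```python
import math


def sigmoid(x, midpoint=0.5, steepness=10.0):
    """Logistic filter; midpoint and steepness are hypothetical tuning knobs."""
    return 1.0 / (1.0 + math.exp(-steepness * (x - midpoint)))


def recombine(windows, width, height, threshold=0.8):
    """Combine sliding-window scores into a per-pixel score map.

    windows: list of (x, y, w, h, score) for ONE class at ONE scale
             (assumed layout, not the repository's data structure).
    """
    visits = [[0] * width for _ in range(height)]   # windows covering the pixel
    hits = [[0] * width for _ in range(height)]     # above-threshold windows
    best = [[0.0] * width for _ in range(height)]   # max above-threshold score

    for x, y, w, h, score in windows:
        for row in range(y, min(y + h, height)):
            for col in range(x, min(x + w, width)):
                visits[row][col] += 1
                if score >= threshold:
                    hits[row][col] += 1
                    best[row][col] = max(best[row][col], score)

    combined = [[0.0] * width for _ in range(height)]
    for row in range(height):
        for col in range(width):
            if hits[row][col] == 0:
                continue  # no above-threshold detection touched this pixel
            # Using hits / visits means a single hit on a rarely visited
            # edge pixel weighs as much as several hits on an interior pixel.
            boost = sigmoid(hits[row][col] / visits[row][col])
            # Only ever push the score upward from the max, so an
            # above-threshold detection is never averaged away.
            combined[row][col] = best[row][col] + (1.0 - best[row][col]) * boost
    return combined
```

Starting from the maximum above-threshold score rather than the mean is what keeps a detection from being lost, and dividing hits by visits is one simple way to compensate for edge and corner pixels being covered by fewer windows. The sigmoid parameters would have to be re-tuned for each scale/stride configuration.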

regmiz commented 9 years ago

Updated the calculations to match the cellMap style in commit 442cb324b08d66b1bf31cb24e0d620f4d58b1bc2

regmiz commented 9 years ago

Closing this as Issue93 is now closed