Open lix4 opened 3 years ago
Based on your code, do you expect at most 50 ground truth bouncing boxes for each video clip? And it is consistent with 7x7 grids. So, each grid is responsible for predict one box?
Yes, you are completely true!
Based on your code, do you expect at most 50 ground truth bouncing boxes for each video clip? And it is consistent with 7x7 grids. So, each grid is responsible for predict one box?