Evaluation for multiple vanishing points

zhou13 / neurvps

Neural Vanishing Point Scanning via Conic Convolution

MIT License

178 stars 22 forks source link

Evaluation for multiple vanishing points #10

Closed yanconglin closed 4 years ago

yanconglin commented 4 years ago

HI, Yichao,

Thanks for the nice work!

Currently I have two concerns regarding the evaluation.

Could you please explain " if dst < np.pi / n:" ? It is not always the case that two vps are this far (cosine distance) from each other, at least in the toy dataset I created by hand. Sometimes, the angular distance between two vps is smaller than this threshold. I would assume there are extra constraints on the datasets (camera matrix) used in the paper?
Was wondering if Neurvps is possible to detect vps (>3) in non-Manhatten world.

Thanks in advance!

Kind regards, Yancong

zhou13 commented 4 years ago

Yes. The datasets used in the paper either contain one dominant vanishing point or three orthogonal vanishing points as assumed in the Manhatten world.
To detect vanishing points in the non-Manhatten world, the simplest method is probably to change the line you mentioned to another heuristics expression or constant value. A better implementation might combine the minimal consine distance constraints with non-maximal suppression on the vanishing point heat map.

yanconglin commented 4 years ago

Thanks for the reply. Regarding answer 1: Even though in the Manhatten world (3 othorgonal vps), " if dst < np.pi / n" does not hold all the time. From my observation (on a toy dataset, with randomly generated camera matrix), the cos_distance between two orthogonal vps can be smaller than this PI/3 in some cases. So I was wondering in which case " if dst < np.pi / n" holds? Guess the intrinsic (extrinsic) matrices have some sort of influence on this threshold. But so far it is not clear to me. Or am I completely wrong?

Rgarding answer 2: Was thinkning if it is possible figure out this minimal consine distance constraints, then Neurvps would be able to detect more than 3 vps.

zhou13 commented 4 years ago

Each vanishing point corresponds to a set of parallel lines. Therefore, we can represent each VP with its direction vector. The cosine constraints basically say that the angle between two sets of parallel lines (of the VP) should be larger than PI/3. In the cases say n = 3. np.pi / 3 = 60 degree < 90 degree. Therefore, "dst < np.pi / n" should hold for the Manhatten world.

I think you may not use the correct camera intrinsics if "dst < np.pi / n" does not hold in your Manhatten settings.

zhou13 commented 4 years ago

I will close this issue for now. Let me know if you still have any questions.