Confusion about `yaw` and `pitch`.

Ahmednull / L2CS-Net

The official PyTorch implementation of L2CS-Net for gaze estimation and tracking

MIT License

318 stars 78 forks source link

Confusion about `yaw` and `pitch`. #32

Open shaform opened 5 months ago

shaform commented 5 months ago

In your model, the forward function returns pre_yaw_gaze and pre_pitch_gaze:

https://github.com/Ahmednull/L2CS-Net/blob/a4d8f7fa5436a2b2b9f088471623b552a85811bd/l2cs/model.py#L70

However, in the pipeline, the two variables are assigned as gaze_pitch and gaze_yaw:

https://github.com/Ahmednull/L2CS-Net/blob/a4d8f7fa5436a2b2b9f088471623b552a85811bd/l2cs/pipeline.py#L122

It seems yaw and pitch are reversed. Why would this be the case?

tiamo405 commented 3 weeks ago

Have you found the reason why? If left as is, can it be trained?

tiamo405 commented 3 weeks ago

line 207 train.py: pitch, yaw = model(images_gaze) line 22,23: self.fc_yaw_gaze = nn.Linear(512 block.expansion, num_bins) self.fc_pitch_gaze = nn.Linear(512 block.expansion, num_bins) line 68 69 70: pre_yaw_gaze = self.fc_yaw_gaze(x) pre_pitch_gaze = self.fc_pitch_gaze(x) return pre_yaw_gaze, pre_pitch_gaze Because both have the same calculation structure, I think changing the return part is fine, the result when training will not affect anything.

shaform commented 3 weeks ago

It turns out the pipeline was not written by the authors, but it was created in https://github.com/Ahmednull/L2CS-Net/pull/18. So perhaps the PR author would know the answer.