Closed jxncyym closed 2 years ago
@erkil1452 thank you for your reeply.
@erkil1452 I'm very sorry I still not fully understood what you said. For I'm newer to this, could you describe detail about the process? I guess: first we fix the camera and the screen,then we calibrate the camera to get the rotation and translation matrix. do you mean use the camera position as the origin of the world coordinate system,and use the distance of the camera position and the gaze target to compute the gaze target world coordinate,compute the head position world coordinate in the same way, then subtract the gaze target and head position, we can get the gaze vector,and use the rotation and translation matrix to convert the gaze vector to camera coordinate system, Is that rigtht?If what I said is not right, could you describe the process detail? for I am a newer to this.
Yes, it is as you say.
thank you very much
@erkil1452 hello, I have some questions:
in the article, you said"We compute the gaze vector in the Ladybug coordinate system as a simple difference gL =pt − pe. " so what the pe represent, is the right eye 3d coordinate or the left eye 3d coordinate?
you describe the process to get target 3d coordinate as that "We use the original AprilTag library to detect the marker in each of the camera views and estimate its 3D pose using the known camera calibration parameters and marker size. We then use the pose and known board geometry to find the 3D location of the target cross pt." I understand the AprilTag can get the 2d coordinate of the marker, then how to get the target 3d coordinate, could you describe the process detail? or you can give an example to describe the process, such that : the detected marker position is (20,50), then maker size is 20 pixel, ......
in the paper, you use 7 pictures to estimate the gaze of the middle picture, do you evaluate the performance of using 5 pictures or 3 pictures?
I notice a new gaze dataset:ETHX-Gaze,they collect the dataset use 2d camera, but I don't find the way to get the ground truth, do you know how they get the gaze label?