the data format [frame_id, pedestrian_id, x, y] , how are the x and y obtained? I want to know this because I want to get train data from new video, but I don't know how to get this coordinate.
the above data [frame_id, pedestrian_id, x, y], how can I map x,y back to the frame coordinates? is this formular right:
pixel_coordinate = (x, y) * inverse(H) ?
Hi, great work! I have some questions:
Hope to get your help. Thanks!