Hi, thanks for your significant work. I am trying to use the itoa_side model to predict human joint keypoints on our own data (e.g., 320*240 depth images from a Kinect), but I am unsure how to convert the model output into readable pixel coordinates and depth values.