zxz267 / HaMuCo

[ICCV 2023] HaMuCo: Hand Pose Estimation via Multiview Collaborative Self-Supervised Learning
https://zxz267.github.io/HaMuCo/
MIT License

Thank you for your reply and hard work! #4

Closed qwer904 closed 7 months ago

qwer904 commented 8 months ago

Thank you for your reply and hard work! I have two more questions. 1. How can I input a picture and then display the hand skeleton, as shown in the paper? 2. Can the input be changed from a picture to real-time gesture recognition using a depth camera?

zxz267 commented 8 months ago


  1. Format your input image to match what the `process_single_view_input` function expects, pass it through the model, and then run the output through the `process_output` function. You can display the 2D skeleton using the first two dimensions of `output["coord_uvd"]`.
  2. I believe HaMuCo's input could be adapted to depth images by converting them into 3 channels. However, HaMuCo was not designed with depth images in mind, so it would likely be suboptimal for this input format and may require further adjustments.
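For point 1, the uvd-slicing and drawing step might look like the sketch below. `process_single_view_input`, the model call, and `process_output` are the repo's functions and are not reproduced here; the 21-joint layout and the bone connectivity list are my assumptions for illustration, so check them against the repo's joint ordering.

```python
import numpy as np

def uvd_to_2d(coord_uvd):
    """Keep only the first two dimensions (u, v) of per-joint uvd coordinates."""
    coord_uvd = np.asarray(coord_uvd)
    return coord_uvd[..., :2]

# Assumed 21-joint hand layout: wrist (0) plus 4 joints per finger,
# fingers starting at indices 1, 5, 9, 13, 17. Verify against the repo.
HAND_BONES = [(0, f) for f in (1, 5, 9, 13, 17)] + \
             [(i, i + 1) for f in (1, 5, 9, 13, 17) for i in (f, f + 1, f + 2)]

def skeleton_segments(coord_2d, bones=HAND_BONES):
    """Return (start, end) 2D point pairs, ready to draw, e.g. with matplotlib."""
    return [(coord_2d[a], coord_2d[b]) for a, b in bones]

# Usage: after obtaining `output` from the model and `process_output`,
#   uv = uvd_to_2d(output["coord_uvd"])
#   segments = skeleton_segments(uv)
# then plot each segment as a line and the uv points as a scatter.
```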
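For point 2, one simple way to turn a single-channel depth map into a 3-channel input is to normalize it and tile it across channels. This is only a sketch of the conversion idea mentioned above, not something HaMuCo provides; as noted, the model's RGB-trained weights may still transfer poorly to such input.

```python
import numpy as np

def depth_to_3ch(depth, d_min=None, d_max=None):
    """Normalize a single-channel depth map to [0, 1] and tile it to 3 channels."""
    depth = np.asarray(depth, dtype=np.float32)
    d_min = depth.min() if d_min is None else d_min
    d_max = depth.max() if d_max is None else d_max
    # Avoid division by zero on a constant depth map.
    norm = (depth - d_min) / max(d_max - d_min, 1e-6)
    return np.repeat(norm[..., None], 3, axis=-1)  # shape (H, W, 3)
```

Fixing `d_min`/`d_max` to the camera's working range (instead of per-frame min/max) keeps the normalization consistent across a real-time stream.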