fabro66 / GAST-Net-3DPoseEstimation

A Graph Attention Spatio-temporal Convolutional Networks for 3D Human Pose Estimation in Video (GAST-Net)
MIT License
312 stars 70 forks source link

Integrate a new Pose Estimation model along with the existing tracking #67

Open hoangtnm opened 1 year ago

hoangtnm commented 1 year ago

Hi, I'm trying to replace HRNet with yolov7-pose, which returns bounding boxes and poses at the same time. However, I had a problem aligning the poses to the track_bboxs. For example, if two persons are in the video, and I run python gen_skes.py -v baseball.mp4 -np 1 --animation, I will get one pose for first person ID util that ID disappears, which is okay. In my case, if I run with -np 1 and my yolov7-pose returns two bounding boxes and two poses, how to have the same behaviour with your original code? Can you give some suggestions?