fabro66 / GAST-Net-3DPoseEstimation

A Graph Attention Spatio-temporal Convolutional Network for 3D Human Pose Estimation in Video (GAST-Net)
MIT License
311 stars 70 forks

A Suggestion for 2D pose detection #46

Open hujb48 opened 3 years ago

hujb48 commented 3 years ago

Hi, this is great work. I replaced the YOLOv3+HRNet 2D detector with MediaPipe when testing the model on in-the-wild video. The results are pretty good, and per-frame processing speeds up from 13 fps to about 45-50 fps on an RTX 3070 under Windows 10, and to 50-55 fps under Ubuntu. This might help improve the performance and specifications of your model. Hope it is useful.

vicentowang commented 2 years ago

@hujb48 Does the 50-55 fps include real-time 3D visualization? With YOLOv5s+HRNet I get 26 fps, but it drops to 10 fps when I turn on 3D visualization.

hujb48 commented 2 years ago

@vicentowang No, the speed I reported is for 2D pose detection only. I haven't visualized the 3D result in real time. Instead of real-time 3D rendering, I output the 3D coordinates, and the whole pipeline (the pre-trained model plus real-time 3D coordinate output) runs at 25 fps.
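
The frame rates in this exchange refer to different parts of the pipeline (2D detection alone, the full model with coordinate output, and the full model with rendering). One way to compare them like for like is to time each stage separately. The following is a minimal sketch using only the Python standard library; the `StageTimer` class and the stage names in the comments are hypothetical and not part of GAST-Net or MediaPipe.

```python
import time
from collections import defaultdict


class StageTimer:
    """Accumulates wall-clock time per named pipeline stage (illustrative helper)."""

    def __init__(self):
        self.seconds = defaultdict(float)  # total time spent in each stage
        self.calls = defaultdict(int)      # number of frames seen by each stage

    def run(self, stage, fn, *args, **kwargs):
        """Call fn(*args, **kwargs), record its duration under `stage`, return its result."""
        start = time.perf_counter()
        result = fn(*args, **kwargs)
        self.seconds[stage] += time.perf_counter() - start
        self.calls[stage] += 1
        return result

    def fps(self, stage):
        """Frames per second this stage would reach if it ran on its own."""
        calls = self.calls.get(stage, 0)
        elapsed = self.seconds.get(stage, 0.0)
        if calls == 0:
            return 0.0
        return calls / elapsed if elapsed > 0 else float("inf")


# Hypothetical usage inside a per-frame loop:
#   kpts_2d = timer.run("detect_2d", detect_2d, frame)
#   kpts_3d = timer.run("lift_3d", lift_3d, kpts_2d)
#   timer.run("render_3d", render_3d, kpts_3d)
```

Comparing `timer.fps("detect_2d")` with `timer.fps("render_3d")` would show whether rendering, rather than detection, is what pulls the overall rate down.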

gao123qiang commented 2 years ago

@hujb48 1. How do I modify MediaPipe? I can't find the Python code in solutions, only C++. 2. Once that is solved, how do I convert the MediaPipe keypoints to the 17 keypoints the model needs?

hujb48 commented 2 years ago

Hi, you might find the answers here:

  1. https://google.github.io/mediapipe/solutions/pose.html - "Python Solution API"
  2. Work out which keypoints COCO/Human3.6M and MediaPipe have in common, and select them by their corresponding indices. See also https://google.github.io/mediapipe/solutions/pose.html - "Pose Landmark Model (BlazePose GHUM 3D)"
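
As a rough illustration of the second point, the sketch below picks the 17 COCO keypoints out of MediaPipe's 33 pose landmarks by index. The index table follows the published MediaPipe Pose and COCO keypoint orderings. The function name `mediapipe_to_coco` and the `(x, y)` input format are assumptions made for this example, not part of GAST-Net or MediaPipe. Human3.6M joints with no MediaPipe counterpart (pelvis, spine, thorax) are not covered here and would have to be derived separately.

```python
# MediaPipe Pose landmark index for each COCO keypoint, listed in COCO order.
MEDIAPIPE_TO_COCO = [
    0,   # nose
    2,   # left eye
    5,   # right eye
    7,   # left ear
    8,   # right ear
    11,  # left shoulder
    12,  # right shoulder
    13,  # left elbow
    14,  # right elbow
    15,  # left wrist
    16,  # right wrist
    23,  # left hip
    24,  # right hip
    25,  # left knee
    26,  # right knee
    27,  # left ankle
    28,  # right ankle
]


def mediapipe_to_coco(landmarks, width, height):
    """Select the 17 COCO keypoints from 33 MediaPipe landmarks.

    landmarks: sequence of 33 (x, y) pairs, normalized by image width and height
               (assumed input format for this sketch).
    Returns a list of 17 (x, y) pairs in pixel coordinates, in COCO order.
    """
    if len(landmarks) != 33:
        raise ValueError(f"expected 33 landmarks, got {len(landmarks)}")
    return [
        (landmarks[i][0] * width, landmarks[i][1] * height)
        for i in MEDIAPIPE_TO_COCO
    ]
```

With the Python Solution API, the input could be built as `[(lm.x, lm.y) for lm in results.pose_landmarks.landmark]`, where `results` is the value returned by `Pose.process(...)`. MediaPipe reports `x` and `y` normalized to the image width and height, which is why the function rescales them to pixels.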