Closed antithing closed 4 years ago
Hi, thank you for your question. Sorry for making you feel confused. The speed is not included in the motivations of our paper, and 38 fps only refers to the network forward inference and it is the upper bound. So is the keypoint assignment part. The repo is only a research prototype and I didn't do code acceleration and rebuilding as OpenPose did. Our rough code using single scale inference without flipping can run at about 2 fps.
Ah I see. Thank you for explaining it. :) Stay healthy in these crazy times.
It's really nice of you :) Wish you healthy and happy ALL the time!
@hellojialee Thanks for this great work!
I have refactored post-processing in a more intuitive way and added C++ acceleration, now it can run up to 7~8 fps using single scale with flipping.
Besides, I changed score calculation 1-1.0/score
to score / count
for per human in evaluation.py
, which increases AP by 0.3 % in COCO minival set 2017.
You can check the results in my forked repo.
FYI.
@sokunmin Awesome work! Could I recommend your Repo in the README? My respect.
@hellojialee Yes and welcome if you don't mind. :) I've learnt a lot on your great work. Hope it also helps to those who are interested in it.
@sokunmin Many thanks! I feel excited that the prototype Repo may help others. Best wishes to you.
@sokunmin I have tried the code you posted but is takes around 10 secs per frame, way slower than stated. Also when I set the parameter --run_cpp
I get the following error:
UnboundLocalError: local variable 'person_to_joint_assoc' referenced before assignment
Also, related to apex I get the following error:
Warning: multi_tensor_applier fused unscale kernel is unavailable, possibly because apex was installed without --cuda_ext --cpp_ext. Using Python fallback. Original ImportError was: ModuleNotFoundError("No module named 'amp_C'")
@sokunmin I have tried the code you posted but is takes around 10 secs per frame, way slower than stated. Also when I set the parameter
--run_cpp
I get the following error:UnboundLocalError: local variable 'person_to_joint_assoc' referenced before assignment
Also, related to apex I get the following error:
Warning: multi_tensor_applier fused unscale kernel is unavailable, possibly because apex was installed without --cuda_ext --cpp_ext. Using Python fallback. Original ImportError was: ModuleNotFoundError("No module named 'amp_C'")
@nicolasugrinovic
Hi
person_to_joint_assoc
was not clean or still inferenced and then following processed data from next image assign to it. But I still don't know how to fix it yet. What I did was cleaning built folder and build cpp files again.
BTW, the size of IMHN backbone is huge. If you want a faster model, you can figure out some other ways to down size model(e.g.following Shufflenetv2 MobileNet design guidelines). This is gonna help speed up inference.
Hi, and thank you for making this code available.
I am running it in windows, on a GTX 1080, and using the demo_image.py file with the model from google drive and the time it takes to detect keypoints is more than 6 seconds.
What am i doing wrong? How can i get close to the 38 fps that you mention on the readme?
Thank you again!