Fang-Haoshu / RMPE

RMPE: Regional Multi-person Pose Estimation, forked from Caffe. Research purpose only.
Other
95 stars 21 forks source link

demo.py cannot show result #3

Closed ujsyehao closed 6 years ago

ujsyehao commented 6 years ago

when I run python examples/demo.py, the program can work normally: 38 but cannot show the result, Figure 1 is empty: 39

I have some extra information to provide:

Thank you in advance @Fang-Haoshu

Fang-Haoshu commented 6 years ago

It's weird...Can you try to store the image instead of drawing it? Like using plt.savefig()

ujsyehao commented 6 years ago

Hi, when I add plt.savefig(path) after plt.draw(), it can show the correct result, but I do not know the reason. Can you tell me the reason? Thank you in advance.

I have another confusion, I test the speed use sample images(demo.py provide): 40 average speed is 0.74 second but in Read.md

1.5s/frame

So I am confused it.

Is there any problem with my test method? Below is my test method code: start = time.time() image = caffe.io.load_image(filename) ... for i in xrange(len(preds)): __... end = time.time() print end - start plt.draw() plt.savefig('mypath') time.sleep(1)

ujsyehao commented 6 years ago

Hi, @Fang-Haoshu Do you have time to solve the problem? Thank you in advance!

Fang-Haoshu commented 6 years ago

Hi, this is not problem. The time varies according to the number of people per image.

ujsyehao commented 6 years ago

I am sorry to bother you again, I am a newbie to pose estimation field and cannot figure out the time cost in RMPE/AlphaPose program.

As far as I know, use one image to test speed(only discuss batchsie=1 situation), time cost = detection time + pose estimation time No matter how many people in a picture, it brings slight time cost difference in object detection(faster RCNN RPN network always output 300 proposals, then classify network operates on these 300 proposals), However, you say

The inference time depends on the number of persons in the image. On a Titan X GPU card, the inference time is about 0.2s per person.

So 0.2s = per person pose estimation time without people detection or 0.2s = per person pose estimation time plus people detection time?

Fang-Haoshu commented 6 years ago

human detection is fast and thus the average time for each person can be ignored.

ujsyehao commented 6 years ago

@Fang-Haoshu So 0.2s = per person pose estimation time without people detection?