microsoft / human-pose-estimation.pytorch

The project is an official implement of our ECCV2018 paper "Simple Baselines for Human Pose Estimation and Tracking(https://arxiv.org/abs/1804.06208)"
MIT License
2.95k stars 605 forks source link

Which Faster RCNN repo do you use during testing and validation #52

Open HuAndrew opened 6 years ago

HuAndrew commented 6 years ago

Hi, @leoxiaobin Thanks for sharing your excellent work! It have very good results.I am curious about which your bounding box detector.

I evaluate your valided bbox results, it's get 56.8 performance in detection task, and I get 49.5 with maskrcnn detector. Would you give your refered codes or repo, and have you train faster RCNN codes?

Thanks a lot!

MaskRCNN

bearpaw commented 5 years ago

Hi @HuAndrew , same question here.

I am playing with this code recently and was also wondering about how did you generate the detection part.

To be more specific, I am detecting humans from the COCO val 2017 keypoints images (5000 images) from the person_keypoints_val2017.json. I try to use Yolo v3 detector and keep only the bounding boxes regarding humans. Then I dump the JSON file which is similar to this repo's.

However, the size of the generated JSON is quite small compared with theirs (~1.3MB vs 16.4MB). Also, when I run cocoEval and use person_keypoints_val2017.json as groundtruth, I can only get about 40 AP.

Any suggestions? Thank you in advance :)

yurymalkov commented 5 years ago

I have the same question. Can you please share your detector or give a link to a similar one?

namheegordonkim commented 5 years ago

👍 . Related papers keep mentioning of the "person detector used in Simple Baseline..." but it's nowhere to be found

Odaimoko commented 5 years ago

Hi @HuAndrew , same question here.

I am playing with this code recently and was also wondering about how did you generate the detection part.

To be more specific, I am detecting humans from the COCO val 2017 keypoints images (5000 images) from the person_keypoints_val2017.json. I try to use Yolo v3 detector and keep only the bounding boxes regarding humans. Then I dump the JSON file which is similar to this repo's.

However, the size of the generated JSON is quite small compared with theirs (~1.3MB vs 16.4MB). Also, when I run cocoEval and use person_keypoints_val2017.json as groundtruth, I can only get about 40 AP.

Any suggestions? Thank you in advance :)

Well the author said 56.4 AP on person category. I have used Detectron's model . In End-to-End Faster & Mask R-CNN Baselines, the entry X-101-64x4d-FPN with 42.4 box AP can get 55.7 AP on person cat. I think this is competitive.

HuAndrew commented 5 years ago

@bearpaw @Odaimoko Hello, I test multi detector, like mask, cascade_RCNN , and the detector vis and other preds' results are as follows:

vis samples

image

preds samples

256x192_pose_resnet_50_d256d256d256 total person detect AP keypoint
ground truth 11004 XXXXX 72.4
faster author 104125 56.4 70.5
mask rcnn_0.7 13167 48.6 68.1
mask rcnn_0.5 15530 49.5 68.6
mask rcnn_0.3 15796 49.6 68.7
Cascade_RCNN 73597 53.0 70.0

Then From the test results, something can be found:

Welcome to Join pose forum www.ilovepose.com

wmcnally commented 2 years ago

Evaluated using the Detectron2 repo:

https://github.com/facebookresearch/detectron2/blob/main/MODEL_ZOO.md