akanazawa / hmr

Project page for End-to-end Recovery of Human Shape and Pose
Other
1.53k stars 395 forks source link

why not use segmentation information? #16

Closed lyupei closed 5 years ago

lyupei commented 5 years ago

How about adding human instance segmentation and projection area loss?

akanazawa commented 5 years ago

Hi I tried this as well and this is a good idea as tried in several other papers:

However, the problem with segmentation loss is that you're making a big assumption that the person is not occluded. But in cluttered natural settings (like COCO) people are often occluded, and the segmentation maps are missing for the occluded parts. Then you can't rely segmentation loss and it becomes a noisy signal. It's fine for images like LSP where it's sports and you mostly see the entire full body, but not for images like COCO. It's an interesting research problem to deal with incomplete segmentation (due to occlusion) as a training signal.

Best,