benquick123 / C-VTON

C-VTON: Context-Driven Image-Based Virtual Try-On Network
Eclipse Public License 2.0
139 stars 29 forks source link

image_body_parse and image_parse_with_hands #2

Closed arhamlet closed 2 years ago

arhamlet commented 2 years ago

Can you kindly mention the two models that you have used to create the two folders named 'image_body_parse' and 'image_parse_with_hands' to obtain the labels for the images in the preprocessed-viton dataset. Which models have you used to obtain the labels as shown in these two images: 000001_0 000001_0

benquick123 commented 2 years ago

Hi, sorry for a late reply. In the paper, we've used a DensePose body parser, as well as models and code from this and this repositories for generation of above images, respectively. Note, that the latter example you posted has an additional labels for neck area (light green on the image) and hands, which we heuristically obtained by subtracting results from body and clothing parser outputs.

Since a significant impairment of the current works lies in the incorrect garment parsing, you might be able to easily obtain better results by finding newer models aiming to solve body and clothing segmentations problems.