PaddlePaddle / PaddleSeg

Easy-to-use image segmentation library with awesome pre-trained model zoo, supporting wide-range of practical tasks in Semantic Segmentation, Interactive Segmentation, Panoptic Segmentation, Image Matting, 3D Segmentation, etc.
https://arxiv.org/abs/2101.06175
Apache License 2.0

implementation of pp-matting v2 #3455

Closed: AmberzzZZ closed this issue 3 months ago

AmberzzZZ commented 1 year ago

Search before asking

Please ask your question

Hi, this is how PP-MattingV2 is described on the front page: "PP-MattingV2 is a lite matting SOTA model developed by PaddleSeg. It extracts high-level semantic information by double-pyramid pool and spatial attention, and uses multi-level feature fusion mechanism for both semantic and detail prediction." But in the code I see only a global alpha prediction head. How exactly are the semantic task and the detail task integrated into this network at training time?

Asthestarsfalll commented 1 year ago

I think the answer is in your question

It extracts high-level semantic information by double-pyramid pool and spatial attention, and uses multi-level feature fusion mechanism for both semantic and detail prediction.

Which corresponds to code

AmberzzZZ commented 1 year ago

Is the auxiliary semantic & detail prediction task no longer needed in this version of the matting network?

Asthestarsfalll commented 1 year ago

Is the auxiliary semantic & detail prediction task no longer needed in this version of the matting network?

I think the double-pyramid pool, spatial attention, and multi-level feature fusion give PP-MattingV2 the ability to extract high-level semantic information, so it doesn't need an auxiliary semantic map to supervise training.

You may read the paper to obtain more details.
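
To make the difference discussed in this thread concrete, here is a minimal sketch of the two training setups: one that supervises separate semantic and detail heads alongside the alpha matte, and one that supervises only the final alpha prediction. It is based only on the description above, not on the PaddleSeg source. The function names, the dictionary keys, the plain L1 loss, and the toy values are all assumptions for illustration; the real model's losses may differ.

```python
# Hypothetical sketch: names, keys, and the L1 loss are illustrative
# assumptions, not PaddleSeg's actual API or loss configuration.

def l1_loss(pred, target):
    """Mean absolute error between two equally sized flat lists."""
    return sum(abs(p - t) for p, t in zip(pred, target)) / len(pred)


def loss_with_aux_heads(outputs, targets, aux_weight=1.0):
    """Multi-task style: alpha loss plus separately supervised
    semantic and detail heads (the setup the question asks about)."""
    return (
        l1_loss(outputs["alpha"], targets["alpha"])
        + aux_weight * l1_loss(outputs["semantic"], targets["semantic"])
        + aux_weight * l1_loss(outputs["detail"], targets["detail"])
    )


def loss_alpha_only(outputs, targets):
    """Single-head style: only the final alpha matte is supervised,
    so semantic and detail cues must be learned implicitly."""
    return l1_loss(outputs["alpha"], targets["alpha"])


# Toy 4-pixel example (made-up values).
outputs = {
    "alpha": [0.0, 0.5, 1.0, 1.0],
    "semantic": [0.0, 1.0, 1.0, 1.0],
    "detail": [0.0, 0.5, 0.0, 0.0],
}
targets = {
    "alpha": [0.0, 1.0, 1.0, 1.0],
    "semantic": [0.0, 0.0, 1.0, 1.0],
    "detail": [0.0, 1.0, 0.0, 0.0],
}

print(loss_with_aux_heads(outputs, targets))  # 0.5
print(loss_alpha_only(outputs, targets))      # 0.125
```

In the alpha-only setup, the extra `semantic` and `detail` entries are simply ignored: the gradient reaches the feature extractor and fusion layers only through the alpha prediction, which matches the explanation that no auxiliary semantic map is used for supervision.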