kijai / ComfyUI-Florence2

Inference Microsoft Florence2 VLM
MIT License
682 stars 44 forks source link

The segmentation results for some images are incorrect. #69

Closed SofiaRin closed 1 month ago

SofiaRin commented 1 month ago

Hello, I found that when executing the referring expression segmentation task, the output masks for some images of people and animals appear to be completely incorrect. Could this be due to incorrect node parameter settings?

By the way, is there any documentation that explains the meanings of the node parameters?

Snipaste_2024-09-09_20-49-47 Snipaste_2024-09-09_20-45-31 Snipaste_2024-09-09_20-43-24 Snipaste_2024-09-09_20-53-00

kijai commented 1 month ago

The model in general is just not very good at segmentation, would recommend checking out Florence+SAM2 instead:

https://github.com/kijai/ComfyUI-segment-anything-2

SofiaRin commented 1 month ago

Thank you, I'll take a look at that.