
This is the PyTorch implementation adding new features to the Segment-Anything codebase. The features support batched inputs on the full-grid prompt (automatic mask generation), with post-processing that removes duplicated or small regions and holes, under a flexible input image size.

Full-Segment-Anything

This code originates from the Segment Anything Model below; all of the original code comes from Meta AI Research, FAIR.

Affiliation: Meta AI Research, FAIR

Authors: Alexander Kirillov, Eric Mintun, Nikhila Ravi, Hanzi Mao, Chloe Rolland, Laura Gustafson, Tete Xiao, Spencer Whitehead, Alex Berg, Wan-Yen Lo, Piotr Dollar, Ross Girshick

Explanation: The Segment Anything Model (SAM) produces high quality object masks from input prompts such as points or boxes, and it can be used to generate masks for all objects in an image. It has been trained on a dataset of 11 million images and 1.1 billion masks, and has strong zero-shot performance on a variety of segmentation tasks.
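
For reference, the original prompt-based workflow looks roughly like the sketch below. This is a minimal example assuming the official `segment-anything` package and a downloaded ViT-H checkpoint; the image path and point coordinates are placeholders.

```python
# Minimal sketch of the original SAM prompt-based workflow (official API).
# Assumes `segment-anything` is installed and the ViT-H checkpoint is downloaded.
import numpy as np
from PIL import Image
from segment_anything import sam_model_registry, SamPredictor

sam = sam_model_registry["vit_h"](checkpoint="ckpt/sam_vit_h_4b8939.pth").cuda()
predictor = SamPredictor(sam)

image = np.array(Image.open("figure/sam1.png"))[..., :3]  # HxWx3 uint8 RGB
predictor.set_image(image)  # computes the image embedding once

# One foreground point prompt (label 1) at placeholder pixel coordinates (x, y)
masks, scores, logits = predictor.predict(
    point_coords=np.array([[500, 375]]),
    point_labels=np.array([1]),
    multimask_output=True,  # returns three candidate masks with quality scores
)
```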


Why is Full-Segment-Anything needed?

The original Segment-Anything code has the following critical issues for further research (as sketched below):

- Automatic mask generation (the full-grid prompt) does not support batched inputs; images must be processed one at a time.
- The input image size is fixed at 1024, rather than flexible.

Therefore, Full-Segment-Anything addresses the above issues:

- Batched inputs on the full-grid prompt, with post-processing that removes duplicated or small regions and holes.
- Flexible input image size via the `custom_img_size` argument.

(We did not re-train the model; all modifications are at the code level.)
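
For comparison, automatic (full-grid) mask generation in the original code runs one image at a time at the fixed internal 1024 resolution, roughly as sketched below (official `segment-anything` API; paths are placeholders):

```python
# Original SAM: automatic mask generation is per image; batching requires a
# Python loop, and each input is internally resized/padded to the fixed 1024 size.
import numpy as np
from PIL import Image
from segment_anything import sam_model_registry, SamAutomaticMaskGenerator

sam = sam_model_registry["vit_h"](checkpoint="ckpt/sam_vit_h_4b8939.pth").cuda()
mask_generator = SamAutomaticMaskGenerator(sam)

images = [np.array(Image.open(f"figure/sam{i}.png"))[..., :3] for i in range(1, 5)]
all_masks = [mask_generator.generate(img) for img in images]  # one pass per image
```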


Version Update


Visualization of Full-Segment-Anything

Figure 1. Full-Segment-Anything on Image Resolution *128*
Figure 2. Full-Segment-Anything on Image Resolution *256*
Figure 3. Full-Segment-Anything on Image Resolution *512*
Figure 4. Full-Segment-Anything on Image Resolution *1024*
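
Each figure is produced by rebuilding SAM with a different `custom_img_size`; a minimal sketch, assuming the registry signature used in example.py (excerpted in the next section):

```python
# Build Full-Segment-Anything at each resolution shown in Figures 1-4.
from build_sam import sam_model_registry

for img_resolution in (128, 256, 512, 1024):
    sam = sam_model_registry['vit_h'](
        checkpoint='ckpt/sam_vit_h_4b8939.pth',
        custom_img_size=img_resolution,  # flexible input size added by this repo
    ).cuda()
    # ... then run the full-grid pipeline from example.py at this resolution
```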
## How to use Full-Segment-Anything?

In example.py, there is the code of Example 6, shown below. You can take this part and modify it to fit your own purpose. If you want to track the changed parts compared with the original SAM code, search for the keywords "by LBK EDIT" or "LBK", which mark the exact positions of the code changes. (Examples 1-5 walk through the trial and error used to investigate the problems of the original SAM code.)

```python
"""
Example 6: [LBK SAM] Batched Inputs -> Full Grid Prompts -> Multiple Mask Generation
with filtering of small and duplicated regions or holes [Very Hard]
"""
import os; os.environ["CUDA_VISIBLE_DEVICES"] = "0"

import numpy as np
from PIL import Image
import torch
import torchvision
import matplotlib.pyplot as plt

from mask_generator import SamMaskGenerator
from utils.utils import show_mask, show_points, show_lbk_masks
from utils.amg import build_all_layer_point_grids
from build_sam import sam_model_registry

# img resolution
img_resolution = 1024

# Select the SAM size you want
sam = sam_model_registry['vit_h'](checkpoint='ckpt/sam_vit_h_4b8939.pth', custom_img_size=img_resolution).cuda()  # SAM ViT-H
# sam = sam_model_registry['vit_l'](checkpoint='ckpt/sam_vit_l_0b3195.pth', custom_img_size=img_resolution).cuda()  # SAM ViT-L
# sam = sam_model_registry['vit_b'](checkpoint='ckpt/sam_vit_b_01ec64.pth', custom_img_size=img_resolution).cuda()  # SAM ViT-B
# sam = sam_model_registry['vit_t'](checkpoint='ckpt/mobile_sam.pt', custom_img_size=img_resolution).cuda()  # Mobile-SAM

# prompt: a 16x16 full grid of foreground points, scaled to the image resolution
input_point = torch.as_tensor(
    build_all_layer_point_grids(16, 0, 1)[0] * img_resolution, dtype=torch.int64
).cuda()
input_label = torch.tensor([1 for _ in range(input_point.shape[0])]).cuda()

def prepare_image(image, img_resolution=img_resolution):
    # HWC numpy image -> CHW CUDA tensor, resized to the working resolution
    trans = torchvision.transforms.Compose(
        [torchvision.transforms.Resize((img_resolution, img_resolution))]
    )
    image = torch.as_tensor(image).cuda()
    return trans(image.permute(2, 0, 1))

# image upload
img1 = np.array(Image.open("figure/sam1.png"))[..., :3]
img2 = np.array(Image.open("figure/sam2.png"))[..., :3]
img3 = np.array(Image.open("figure/sam3.png"))[..., :3]
img4 = np.array(Image.open("figure/sam4.png"))[..., :3]

img_tensors = [prepare_image(img) for img in [img1, img2, img3, img4]]

# show the resized input images
for img_tensor in img_tensors:
    plt.figure(figsize=(5, 5))
    plt.imshow(img_tensor.permute(1, 2, 0).cpu().numpy())
    plt.axis('on')
    plt.show()

# batchify
batched_input = [
    {
        'image': x,
        'point_coords': input_point,
        'point_labels': input_label,
        'original_size': x.shape[1:],
    }
    for x in img_tensors
]

# LBK propagation
refined_masks = sam.individual_forward(batched_input, multimask_output=True)

# image mask generation visualization
for img_tensor, refined_mask in zip(img_tensors, refined_masks):
    plt.figure(figsize=(5, 5))
    plt.imshow(img_tensor.permute(1, 2, 0).cpu().numpy())
    show_lbk_masks(refined_mask.cpu().numpy(), plt)
    show_points(input_point.cpu().numpy(), input_label.cpu().numpy(), plt.gca())
    plt.title("[Full Grid] LBK Refined Mask", fontsize=18)
    plt.axis('on')
    plt.show()
```
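
As the example shows, every image in the batch shares the same full grid of 16 x 16 foreground points (built by `build_all_layer_point_grids(16, 0, 1)` and scaled to the image resolution), and a single call to `sam.individual_forward` performs the batched prediction along with the post-processing that removes duplicated or small regions and holes. The returned `refined_masks` contains one set of refined masks per input image, ready for visualization as above.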