Yuqifan1117 / Labal-Anything-Pipeline

Baby-DALL3: Annotation anything in visual tasks and Generate anything just all in one-pipeline with GPT-4 (a small baby of DALL·E 3).
82 stars 3 forks source link

Suggestion - Integrate MobileSAM into the pipeline for lightweight and faster inference #6

Closed mdimtiazh closed 1 year ago

mdimtiazh commented 1 year ago

Reference: https://github.com/ChaoningZhang/MobileSAM

Our project performs on par with the original SAM and keeps exactly the same pipeline as the original SAM except for a change on the image encode, therefore, it is easy to Integrate into any project.

MobileSAM is around 60 times smaller and around 50 times faster than original SAM, and it is around 7 times smaller and around 5 times faster than the concurrent FastSAM. The comparison of the whole pipeline is summarzed as follows:

image

image

Best Wishes,

Qiao

Yuqifan1117 commented 1 year ago

Great job! Can greatly speed up the whole process, but can its performance be quantitatively verified?