Open QingSuML opened 1 year ago
Since SAM uses a ViT model for image encoding, it is arguably already a 'transfer result' in a sense (i.e., it's an example of how ViT models perform on segmentation tasks).
That being said, there is a neat repo, awesome-segment-anything, which someone has put together to showcase SAM projects applied to different use cases.
The paper provides results only in segmentation domains. As it is a model targeting dense prediction tasks, it would be interesting to see its transfer performance on other dense prediction tasks, such as object detection and depth prediction.
Would it be possible to share object detection and depth prediction results from your study? They would provide a useful baseline for future research and enable meaningful comparisons.
Thanks