long8v / PTIR

Paper Today I Read
[73] Simple Open-Vocabulary Object Detection with Vision Transformers #81

Open long8v opened 1 year ago

image

paper

TL;DR

Details

Architecture

image

training details

zero-shot performance

image

one-shot image-conditioned result

image

one-/few-shot performance

image