Few shot vs Open-vocabulary

mlzxy / devit

MIT License

306 stars 45 forks

Hi @theodu, thanks for your feedback. The short answer is that the pipeline is the same for open-vocabulary and few-shot detection.

In the paper, I treat open-vocabulary and few-shot detection as two versions of the same objective: open-set object detection beyond a fixed category set. They differ only in how categories are represented, with text (open-vocabulary) or with example images (few-shot).
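To make the "same pipeline, different category representation" point concrete, here is a minimal sketch. It is not the code from this repository: the feature vectors are made-up toy numbers, and `cosine`, `mean_vector`, and `classify` are hypothetical helper names. It assumes a region feature is matched against one prototype vector per category, where the prototype comes either from a text encoder (open-vocabulary) or from the average of a few support-image features (few-shot).

```python
import math


def cosine(a, b):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(column) / n for column in zip(*vectors)]


def classify(region_feature, prototypes):
    """Return the category whose prototype is most similar to the region."""
    return max(prototypes, key=lambda name: cosine(region_feature, prototypes[name]))


# Open-vocabulary: one prototype per category, assumed to come from a text encoder.
text_prototypes = {"cat": [0.9, 0.1, 0.0], "dog": [0.1, 0.9, 0.0]}

# Few-shot: each prototype is the mean feature of a few support images (toy values).
support_features = {
    "cat": [[1.0, 0.0, 0.1], [0.8, 0.2, 0.0]],
    "dog": [[0.0, 1.0, 0.1], [0.2, 0.8, 0.0]],
}
image_prototypes = {
    name: mean_vector(features) for name, features in support_features.items()
}

# The classification step is the same in both settings; only the prototypes differ.
region = [0.7, 0.2, 0.1]
print(classify(region, text_prototypes))
print(classify(region, image_prototypes))
```

In this sketch, `classify` never needs to know where the prototypes came from, which is the sense in which the two settings share one pipeline.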

Under this general objective, I evaluate on both open-vocabulary and few-shot benchmarks rather than only the latter. Honestly, the dataset formats of the two are almost identical; the main practical difference is that open-vocabulary models perform much better, because that setting has received far more recent research attention. I hope this answer helps.

Few shot vs Open-vocabulary #13