-
I don't manage to understand the difference between your few-shot and open-vocabulary models.
Your approach is based on image-only models and Open-vocabulary approach relies on a text based embedding…
-
Hello, i am following your paper
your reference `[16] X. Gu, T.-Y. Lin, W. Kuo, and Y. Cui, “Zero-shot detection via vision and language knowledge distillation,” arXiv preprint arXiv:2104.13921,` i…
-
About 'K-Means Clustering of Frozen Diffusion Features', how do you perform on the dataset? Because the LDM model accept the text input to generate the new image samples, and what do you input to obta…
-
[DOWNLOAD.md](https://github.com/pzzhang/VinVL/blob/main/DOWNLOAD.md) says `We also provide the X152-C4 objecte detection config file and pretrained model on the merged four datasets (COCO with stuff,…
-
Hi, @wondervictor, a huge shoutout for your remarkable contributions!
I've seamlessly integrated YOLO-World into [X-AnyLabeling](https://github.com/CVHub520/X-AnyLabeling), marking a significant ad…
-
# Nonce word detection
Wug testing is a linguistic experiment in which a speaker (often children) is asked to morphologically inflect a word that is nonexistent in a language (a nonce word) such as…
-
**This issue will be kept open and pinned for a long time, as we hope to hear everyone's opinions, suggestions, and needs!**
We want to make YOLO-World stronger and encourage more diverse application…
-
Hi!
Thanks for your interesting work on open vocabulary detection.
I read the paper and tried to run the code, but had some trouble. Hope for your help!
1. How can I get this file "clip_feat.pkl…
-
I've been exploring how we can work with BODS data as a Linked Data graph.
At least at a basic level it is possible to convert BODS JSON -> JSON-LD -> RDF with the addition of a short '@context' e…
-
Hello there! I'm interested in your work, but I'm having some differences when reproducing the results of the paper. So, I'd like to consult with you.
1. In the QA task, is it true that the result of…