-
Great job! How i can run Grounding DINO 1.5 Edge, is there any inference or demo code?
-
Dear authors:
Thank you for your great work! I wonder why you used DINOV1 instead DINOV2, which is more suitable for dense prediction task. Thank you!
-
Submit a dino PR ~between the start & end date~ to get access to exclusive slack emojis for public messages.
Each PR needs to include a new dino + add it to the README. Include your Slack username …
-
hi,
I am working on a custom medical imaging binary classification task. I am comparing performances between DINO(VIT base) and DINOv2(VIT large) after training DINO and DINOv2 models, with eval_l…
-
HI!
I would like to thank you first for such a good and updated repo regarding Vision Transformers.
I want to know if I can use 3d medical images to pretrain the ViT using 3D medical images?. D…
-
Thanks for the awesome GLIP, I share our recent work 🦖OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion.
* OV-DINO is a novel unified open vocabulary detection approac…
-
Hi!
why is the dinov2 num_patches set to 256?
the image size is 336, and the kernel size is 14. the num patches should be the same to clip, which is 576.
-
In the code of making one-hot targets for class prediction, the `no-object` class prediction seems not to be supervised:
https://github.com/IDEA-Research/DINO/blob/3ffda400b0f1d4a919fbe2a9cf567e79d21…
-
![image](https://user-images.githubusercontent.com/104975373/167124672-60b6e5ba-ab90-4bb1-903c-e82e03c78635.png)
Dinos gо throuht others dinos, trees and water !
![image](https://user-images.gith…
-
How to improve the execution speed of OCR, grounding-dino, and chatgpt-4o models to transition mobile-agent from laboratory research to engineering use?
1. I replaced the original grounding-dino mo…