-
Hello,
I'm currently facing a validation error in my Label Studio project, which seems to stem from a discrepancy between the labels detected by the Grounding DINO model and those defined in my Lab…
-
It seems some evaluation code for the RefL4Dataset class is missing. Could you please update it?
-
Multi-modal 3D dataset encompasses 1.4M meta-annotated captions on 109k objects and 7.7k regions as well as over 3.04M diverse samples for 3D visual grounding and question-answering benchmarks. The ov…
-
The grounding-dino documentation seems to state that it can take a batch of images, but when I try to do so, I get an error, as described here: https://discuss.huggingface.co/t/how-to-perfor…
-
If I want to locate a specific target, such as "a person wearing a yellow hat", what user_query can I use in inference?
```
python -m groma.eval.run_groma \
--model-name {path_to_groma_7b_finet…
```
-
### System Info
- `transformers` version: 4.41.2
- Platform: Linux-5.4.0-182-generic-x86_64-with-glibc2.31
- Python version: 3.11.8
- Huggingface_hub version: 0.23.0
- Safetensors version: 0.4.…
-
### Model description
I know the transformers library has not included object tracking models in the past, but this one can either plug into any object detection model or be an end-to-end open world …
-
I am wondering if Moondream can be used for grounding tasks such as object localization? Something similar to what cogagent does with GUIs, but I would like to train on my custom dataset. If I fine-tune mo…
-
## 🚀 Feature
Currently, the project uses `GroundingDINO` as the visual grounding model, which is the best-performing model on some benchmark datasets.
![current benchmarks for zero-shot object dete…
-
The current code only supports adding new objects at the time of tracking initialization, and there is no way to track newly appeared objects while preserving the original tracking state. Could you pr…