-
Hi Team,
Thanks for your help.
```python
boxes, logits, phrases = predict(
    model=model,
    image=image,
    caption=TEXT_PROMPT,
    …
```
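For context, `predict` in the GroundingDINO repo returns normalized boxes, per-box confidence logits, and the matched phrases. A minimal sketch of filtering those outputs by score, assuming NumPy arrays (`filter_detections` is a hypothetical helper, not part of the library):

```python
import numpy as np

def filter_detections(boxes, scores, phrases, score_threshold=0.35):
    """Keep only detections whose confidence exceeds the threshold.

    boxes:   (N, 4) array of predicted boxes.
    scores:  (N,) array of confidence scores.
    phrases: list of N matched text phrases.
    Illustrative helper, not part of the GroundingDINO API.
    """
    keep = scores > score_threshold
    return boxes[keep], scores[keep], [p for p, k in zip(phrases, keep) if k]

boxes = np.array([[0.5, 0.5, 0.2, 0.2], [0.1, 0.1, 0.05, 0.05]])
scores = np.array([0.9, 0.2])
phrases = ["banana", "orange"]
b, s, p = filter_detections(boxes, scores, phrases)  # keeps only "banana"
```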
-
Can the YOLO-World model be trained on visual grounding datasets such as RefCOCO, RefCOCO+, RefCOCOg, or Flickr30K Entities to learn spatial relations between objects, i.e. the ability to reason about instructions like "grab the thing in the middle" or "pick the banana to the left of the orange"?
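A relation like "the banana to the left of the orange" can in principle also be resolved with plain geometry on detector outputs, independent of whether the model itself learns spatial reasoning. A hypothetical sketch (all names are illustrative, not YOLO-World API):

```python
def center_x(box):
    """Horizontal center of a pixel box (x1, y1, x2, y2)."""
    return (box[0] + box[2]) / 2.0

def left_of(target_boxes, anchor_box):
    """Return the target boxes whose center lies left of the anchor's center."""
    ax = center_x(anchor_box)
    return [b for b in target_boxes if center_x(b) < ax]

# Two detected bananas and one orange (pixel boxes):
bananas = [(10, 0, 30, 20), (200, 0, 220, 20)]
orange = (100, 0, 120, 20)
left_of(bananas, orange)  # → [(10, 0, 30, 20)]
```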
-
Hi folks!
Grounding DINO is now available in the Transformers library, enabling easy inference in a few lines of code.
Here's how to use it:
```python
from transformers import AutoProcessor,…
```
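Leaving the truncated snippet aside, the boxes such grounding models predict are typically normalized (cx, cy, w, h); the Transformers post-processing utilities convert them for you, but a minimal sketch of the conversion to pixel (x1, y1, x2, y2) coordinates (illustrative, not the library's internal code):

```python
import numpy as np

def cxcywh_to_xyxy_pixels(boxes, img_w, img_h):
    """Convert normalized (cx, cy, w, h) boxes to pixel (x1, y1, x2, y2)."""
    boxes = np.asarray(boxes, dtype=float)
    cx, cy, w, h = boxes.T
    x1 = (cx - w / 2) * img_w
    y1 = (cy - h / 2) * img_h
    x2 = (cx + w / 2) * img_w
    y2 = (cy + h / 2) * img_h
    return np.stack([x1, y1, x2, y2], axis=1)

# A box covering the central half of a 640x480 image:
cxcywh_to_xyxy_pixels([[0.5, 0.5, 0.5, 0.5]], 640, 480)
# → [[160., 120., 480., 360.]]
```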
-
Hello, authors. I would like to ask two questions. 1. How do you handle the box query features and point query features after deformable cross-attention: are they concatenated? 2. How to get the corresponding text prompts…
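On question 1, one common reading of "concat" is concatenating the two query feature sets along the channel dimension and projecting back to the original width. A hedged NumPy sketch of that interpretation (a guess, not the authors' implementation):

```python
import numpy as np

def fuse_queries(box_feat, point_feat, proj):
    """Concatenate box and point query features along the channel axis,
    then project back to the original width with a linear map.

    box_feat, point_feat: (num_queries, d) arrays; proj: (2*d, d).
    One plausible fusion, not the authors' actual code.
    """
    fused = np.concatenate([box_feat, point_feat], axis=-1)  # (num_queries, 2*d)
    return fused @ proj                                      # (num_queries, d)

d = 4
out = fuse_queries(np.ones((3, d)), np.zeros((3, d)), np.eye(2 * d, d))
out.shape  # (3, 4)
```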
-
The full lineage is: DETR -> DINO -> GLIP -> Grounding DINO -> Grounding SAM.
Here, DINO refers to a DETR-based object detection model.
-
Thanks for sharing your work. Is there code for the "Training-Free Confidence Scoring Mechanism"? I cloned the repository and only found `eval/run_llava.py` for running a demo. And are there the evalu…
-
**I had to re-create this repository because of some issues with the git history so I'm re-posting this issue.**
_JackWhite-rwx commented:
Excuse me, your paper "Employing the Scene Graph for Phras…
-
### Model description
Kosmos-2 is a grounded multimodal large language model that adds grounding and referring capabilities on top of Kosmos-1. The model can accept image regions select…
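Kosmos-2 represents a region by quantizing the image into a 32x32 grid of patches and encoding a box's top-left and bottom-right corners as special location tokens. A minimal sketch of that mapping (the grid size and token format follow the paper; the function itself is hypothetical, not the model's tokenizer code):

```python
def box_to_patch_tokens(box, img_w, img_h, grid=32):
    """Map a pixel box (x1, y1, x2, y2) to Kosmos-2-style location tokens:
    the patch indices of its top-left and bottom-right corners on a
    grid x grid quantization of the image. Illustrative sketch only."""
    x1, y1, x2, y2 = box

    def idx(x, y):
        col = min(int(x / img_w * grid), grid - 1)
        row = min(int(y / img_h * grid), grid - 1)
        return row * grid + col

    return f"<patch_index_{idx(x1, y1):04d}>", f"<patch_index_{idx(x2, y2):04d}>"

# A box covering the whole 640x480 image spans patch 0 to patch 1023:
box_to_patch_tokens((0, 0, 640, 480), 640, 480)
# → ('<patch_index_0000>', '<patch_index_1023>')
```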
-
Thanks for sharing the wonderful work. The paper differentiates GLIP from Grounding DINO and FIBER: the former is classified as open-vocabulary object detection, while the latter are termed bi-functional m…
-
To the Authors
This is very interesting and solid work on visual grounding tasks with a query-based detector. The paper is also well written and clear. Super interesting results with GLIGEN as we…