-
Hello. Could you please advise me on how to properly train a model for 3D visual grounding (VG) on ScanRefer: model, losses, dataset, metrics?
Your current model can predict bounding boxes only as text and only with …
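For reference, ScanRefer-style grounding is usually evaluated with Acc@0.25 and Acc@0.5: the fraction of descriptions whose predicted 3D box has IoU above the threshold with the ground-truth box. A minimal sketch of that metric, assuming axis-aligned boxes in (center, size) format (function names are illustrative, not from any particular codebase):

```python
import numpy as np

def box3d_iou_aabb(box_a: np.ndarray, box_b: np.ndarray) -> float:
    """IoU of two axis-aligned 3D boxes given as (cx, cy, cz, dx, dy, dz)."""
    a_min, a_max = box_a[:3] - box_a[3:] / 2, box_a[:3] + box_a[3:] / 2
    b_min, b_max = box_b[:3] - box_b[3:] / 2, box_b[:3] + box_b[3:] / 2
    # Overlap extent along each axis, clamped at zero when boxes are disjoint.
    inter = np.maximum(0.0, np.minimum(a_max, b_max) - np.maximum(a_min, b_min))
    inter_vol = inter.prod()
    union_vol = box_a[3:].prod() + box_b[3:].prod() - inter_vol
    return float(inter_vol / (union_vol + 1e-8))

def scanrefer_accuracy(pred_boxes, gt_boxes, thresholds=(0.25, 0.5)):
    """Acc@kIoU over paired (prediction, ground truth) boxes."""
    ious = np.array([box3d_iou_aabb(p, g) for p, g in zip(pred_boxes, gt_boxes)])
    return {f"Acc@{k}": float((ious >= k).mean()) for k in thresholds}
```

On losses, baselines in this space typically pair the detector's box-regression and objectness losses with a cross-entropy term over proposals for selecting the referred object.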
-
Thank you for your outstanding work, but I have run into several problems while reproducing the pre-training results.
I use the following command to pre-train groundingdino_swint:
bash …
-
I'm using 6 GPUs on a single machine. This is my command:
```shell
python -m lamorel_launcher.launch --config-path Absolute/Path/To/Grounding_LLMs_with_online_RL/experiments/configs --config-name lo…
```
-
### Self Checks
- [X] I have searched for existing issues [search for existing issues](https://github.com/langgenius/dify/issues), including closed ones.
- [X] I confirm that I am using English to su…
-
Thanks for the awesome Grounding-DINO! I'd like to share our recent work, 🦖OV-DINO: Unified Open-Vocabulary Detection with Language-Aware Selective Fusion.
* OV-DINO is a novel unified open vocabulary detecti…
-
### Model description
Combines the best practices of CLIP and object detectors.
Allows localization and grounding of text and image content.
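The shared mechanism in this family of models is scoring detector region features against text embeddings in a joint space. A minimal PyTorch sketch of that idea (names are illustrative, not this model's actual API):

```python
import torch
import torch.nn.functional as F

def region_text_logits(region_feats: torch.Tensor,
                       text_feats: torch.Tensor,
                       temperature: float = 0.07) -> torch.Tensor:
    """Cosine-similarity logits between region and phrase embeddings.

    region_feats: (num_regions, d) features from a detector's box head.
    text_feats:   (num_phrases, d) embeddings from a CLIP-style text encoder.
    Returns a (num_regions, num_phrases) matrix; argmax over the region axis
    grounds each phrase to one predicted box.
    """
    region_feats = F.normalize(region_feats, dim=-1)
    text_feats = F.normalize(text_feats, dim=-1)
    return region_feats @ text_feats.T / temperature
```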
### Open source status
- [X] The model implementation i…
-
Aiming to link natural language descriptions to specific regions of a 3D scene represented as 3D point clouds, 3D visual grounding is a fundamental task for human-robot interaction. The recogniti…
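Concretely, each training sample pairs a free-form description with one annotated object in a reconstructed scan. An illustrative record in the spirit of ScanRefer's annotation format (field names are an assumption, shown for clarity):

```python
# Illustrative ScanRefer-style annotation record (field names are an assumption).
sample = {
    "scene_id": "scene0025_00",  # ScanNet scan providing the point cloud
    "object_id": "14",           # ID of the referred object in that scan
    "object_name": "chair",
    "description": "the chair closest to the window, facing the desk",
}
# The model consumes the scene's point cloud plus `description`
# and must output the 3D bounding box of object `object_id`.
```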
-
Hello!
Thanks for the great re-implementation of GroundingDino. I am trying to understand your code.
In the [usage.md](https://github.com/open-mmlab/mmdetection/blob/main/configs/mm_grounding_din…
-
Apologies for the questions about another of your significant works. I really appreciate your work AffordanceLLM: Grounding Affordance from Vision Language Models, and 3DOI, for their groundbreaking contrib…
-
Hello. Your work claims to be "zero-shot," yet it requires training, which is completely inconsistent with ReCLIP's setting. How do you explain this? Did no reviewer raise this during review? The paper also does not clearly explain the training procedure or data, instead deliberately deferring them to the supplementary material.