mbzuai-oryx / GeoChat

[CVPR 2024 🔥] GeoChat, the first grounded Large Vision Language Model for Remote Sensing
https://mbzuai-oryx.github.io/GeoChat

Evaluation results about Grounding #41

Open xuliu-cyber opened 4 months ago

xuliu-cyber commented 4 months ago

Hi, I used the provided script to evaluate on the grounding task, but the prediction jsonl file it produced contains obviously wrong answers. For example, the first row is:

{"question_id": "fast_6217", "image_id": "train_5007_0017", "answer": "{<89><47><97><55>|<58>}{<50><24><54><28>|<58>}{<48><16><52><20>|<58>}", "ground_truth": [[[584.0, 337.0], [619.0, 313.0], [601.0, 282.0], [565.0, 304.0]], [[553.0, 287.0], [592.0, 262.0], [573.0, 229.0], [534.0, 254.0]], [[517.0, 237.0], [555.0, 214.0], [534.0, 181.0], [498.0, 204.0]]], "question": "3 airplanes at the right", "type": "ref", "dataset": "FAST", "obj_ids": [1, 2, 3], "size_group": "small"}

The difference between the answer and the ground truth is too large. Is this normal? Thanks!
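To make the mismatch easier to inspect, here is a minimal sketch of how the two fields in that row could be put side by side. It rests on assumptions that are not confirmed anywhere in this thread: that each `{<x1><y1><x2><y2>|<theta>}` group in `answer` is a box whose coordinates are percentages (0 to 100) of the image width and height followed by a rotation angle, and that `ground_truth` holds four-point polygons in pixel coordinates. The helper names `parse_answer`, `polygon_to_aabb`, and `scale_box` are hypothetical and are not part of the GeoChat code, and the image size needed by `scale_box` is not given in the row.

```python
import re

def parse_answer(answer):
    """Parse '{<x1><y1><x2><y2>|<theta>}' groups into (x1, y1, x2, y2, theta) tuples."""
    pattern = r"\{<(\d+)><(\d+)><(\d+)><(\d+)>\|<(\d+)>\}"
    return [tuple(int(v) for v in match) for match in re.findall(pattern, answer)]

def polygon_to_aabb(polygon):
    """Axis-aligned bounding box (xmin, ymin, xmax, ymax) of a list of [x, y] points."""
    xs = [point[0] for point in polygon]
    ys = [point[1] for point in polygon]
    return (min(xs), min(ys), max(xs), max(ys))

def scale_box(box, width, height):
    """Assumption: box coordinates are percentages of the image size; the angle is dropped."""
    x1, y1, x2, y2, _theta = box
    return (x1 * width / 100, y1 * height / 100, x2 * width / 100, y2 * height / 100)

# Values copied from the row quoted above.
answer = "{<89><47><97><55>|<58>}{<50><24><54><28>|<58>}{<48><16><52><20>|<58>}"
ground_truth = [
    [[584.0, 337.0], [619.0, 313.0], [601.0, 282.0], [565.0, 304.0]],
    [[553.0, 287.0], [592.0, 262.0], [573.0, 229.0], [534.0, 254.0]],
    [[517.0, 237.0], [555.0, 214.0], [534.0, 181.0], [498.0, 204.0]],
]

predicted_boxes = parse_answer(answer)
gt_boxes = [polygon_to_aabb(polygon) for polygon in ground_truth]

for pred, gt in zip(predicted_boxes, gt_boxes):
    print("pred (assumed 0-100 scale):", pred, "| gt (pixels):", gt)
```

Once the real image width and height are known, `scale_box(pred, width, height)` would bring each prediction into pixel units so that it can be compared with the ground-truth boxes directly.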