Hello, I want to reproduce the results on RefCOCO, RefCOCO+ and RefCOCOg, but I found that VLMEvalKit doesn't support these datasets, and lmms-eval only support REG(Grounded Captioning) evaluation on these datsets. Did you evaluate REC(Visual Grounding) Evaluation results with your own scripts? Thank you!
Hello, I want to reproduce the results on RefCOCO, RefCOCO+ and RefCOCOg, but I found that VLMEvalKit doesn't support these datasets, and lmms-eval only support REG(Grounded Captioning) evaluation on these datsets. Did you evaluate REC(Visual Grounding) Evaluation results with your own scripts? Thank you!