REC Results - Githubissues

The paper mentioned referring expression comprehension (REC) - a vital task that measures the language-driven grounding ability of a visual-language multimodal model. RefCOCO/+/g are also used for training in Stage 4 as mentioned in paper. However, the reported experiments does not have the RefCOCO's results even though Table 2 states it can do grounding task. Will these test results be updated? A comparison between TinyGPT-V and its counterpart Shrika would be very useful for a more comprehensive evaluation of the mentioned method.

DLYuanGod / TinyGPT-V

REC Results #16