Closed ApolloRay closed 4 months ago
For test visual_correspondence.
Hi, the "full prediction" means raw model output, and "prediction" means the output after choice extraction (GPT3.5)
Hi, the "full prediction" means raw model output, and "prediction" means the output after choice extraction (GPT3.5)
But I got totally different result. LLaVa offer a online demo, can you use the same prompt and the same concat image as input to get the same full prediction result ? Maybe you can offer three images and three prompts in this demo.
Hi it seems our settings are different. we use temperature=0 and locally ran llava1.6. Can you try this setting?
Test difference.
In saved output, llava v1.5 13b,full prediction gave full sentences. But in llava v1.6 34b, full predictions were just A/B/C/D ?
Can you provide the original output ( before gpt3.5 )for llava_v1.6_34b ? For task multi-view_reasoning, I got prediction results which are almost (A).