Can a proper inference sample code be provided on which we can try VCD? I've been trying to set it up with LLaVA but I think I'm doing things wrong. If the authors can provide a simple script in which we can try and see the difference between a VCD and non VCD LVLM, that would be really helpful.
Hi,
Can a proper inference sample code be provided on which we can try VCD? I've been trying to set it up with LLaVA but I think I'm doing things wrong. If the authors can provide a simple script in which we can try and see the difference between a VCD and non VCD LVLM, that would be really helpful.