showlab / Awesome-MLLM-Hallucination

📖 A curated list of resources dedicated to hallucination in multimodal large language models (MLLMs).

New paper for hallucination benchmarks #5

Closed: LFhase closed this issue 2 weeks ago

LFhase commented 2 weeks ago

Hi, thanks for curating and maintaining this amazing list of resources on MLLM hallucination! We would like to contribute our work, released a few months ago, to this list:

We trace the hallucination issue back to the multimodal alignment in CLIP models, which are often used as MLLM visual encoders. We find the surprising phenomenon that CLIP models are highly vulnerable to the natural distribution shifts captured by CounterAnimal, performing even worse than earlier ImageNet-based models. This weakness in turn leads to severe hallucinations in MLLMs such as LLaVA and MiniGPT-4.
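To make the failure mode concrete, here is a minimal sketch (ours, not from the paper) of a zero-shot CLIP probe in the spirit of CounterAnimal, which pairs each animal class with "easy" (common) and "hard" (counterfactual) backgrounds. The split names and the `load_split` helper are hypothetical placeholders; only the Hugging Face CLIP calls are standard API.

```python
# Minimal sketch: probing a CLIP encoder's zero-shot robustness to
# background shifts, in the spirit of CounterAnimal. Dataset loading
# below is a hypothetical placeholder, not the benchmark's actual API.
import torch
from PIL import Image
from transformers import CLIPModel, CLIPProcessor

model = CLIPModel.from_pretrained("openai/clip-vit-base-patch32")
processor = CLIPProcessor.from_pretrained("openai/clip-vit-base-patch32")
model.eval()

labels = ["a photo of a polar bear", "a photo of a brown bear"]

def zero_shot_predict(image: Image.Image) -> int:
    """Return the index of the best-matching text label for one image."""
    inputs = processor(text=labels, images=image, return_tensors="pt", padding=True)
    with torch.no_grad():
        logits = model(**inputs).logits_per_image  # shape: (1, num_labels)
    return logits.argmax(dim=-1).item()

# Hypothetical evaluation loop over the two CounterAnimal-style splits.
# Accuracy on the "hard" split dropping well below the "easy" split would
# reflect the background-shift vulnerability described above.
# for split in ("easy", "hard"):
#     pairs = load_split(split)  # placeholder: yields (PIL image, label index)
#     acc = sum(zero_shot_predict(img) == y for img, y in pairs) / len(pairs)
#     print(split, acc)
```

Since the same CLIP checkpoints serve as visual encoders in LLaVA-style MLLMs, a large easy-to-hard accuracy gap in such a probe would be consistent with the downstream hallucinations reported above.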

By the way, we also released a project page with a link to our benchmark. Feel free to decide whether and how best to add it to the list :) Page

Thank you very much!

JosephPai commented 2 weeks ago

Hi @LFhase, thanks for your interest in this repo. I recently refactored this paper list a bit. Your paper has been added to the Hallucination Evaluation & Analysis section. Very interesting work! Hope it helps.