LAION-AI / CLIP_benchmark

CLIP-like model evaluation
MIT License
534 stars 68 forks source link

Add compositionality benchmarks #97

Open mehdidc opened 12 months ago

mehdidc commented 12 months ago
HarmanDotpy commented 11 months ago

listing some others: SugarCREPE: https://github.com/RAIVNLab/sugar-crepe/tree/main VL-Checklist: https://github.com/om-ai-lab/VL-CheckList Winoground: https://huggingface.co/datasets/facebook/winoground maybe SVO (verb understanding): https://github.com/deepmind/svo_probes maybe VALSE: https://github.com/Heidelberg-NLP/VALSE

mehdidc commented 11 months ago

Thank you very much @HarmanDotpy !

mehdidc commented 11 months ago

Ok SugarCREPE is there, will look at the rest

mehdidc commented 7 months ago

Another one to consider: https://github.com/arijitray1993/COLA, https://cs-people.bu.edu/array/research/cola/

mehdidc commented 5 months ago

Also:

vishaal27 commented 5 months ago

Add MMVP-VLM (winoground-style dataset): https://github.com/tsb0601/MMVP?tab=readme-ov-file https://huggingface.co/datasets/MMVP/MMVP_VLM/

vishaal27 commented 4 months ago

Add https://github.com/Top34051/colorswap