LAION-AI / CLIP_benchmark

CLIP-like model evaluation
MIT License

Add compositionality benchmarks #97

Open mehdidc opened 1 year ago

mehdidc commented 1 year ago
HarmanDotpy commented 1 year ago

Listing some others:
SugarCREPE: https://github.com/RAIVNLab/sugar-crepe/tree/main
VL-Checklist: https://github.com/om-ai-lab/VL-CheckList
Winoground: https://huggingface.co/datasets/facebook/winoground
maybe SVO (verb understanding): https://github.com/deepmind/svo_probes
maybe VALSE: https://github.com/Heidelberg-NLP/VALSE

mehdidc commented 1 year ago

Thank you very much @HarmanDotpy !

mehdidc commented 1 year ago

OK, SugarCREPE is there; I will look at the rest.

mehdidc commented 11 months ago

Another one to consider: https://github.com/arijitray1993/COLA, https://cs-people.bu.edu/array/research/cola/

mehdidc commented 10 months ago

Also:

vishaal27 commented 10 months ago

Add MMVP-VLM (Winoground-style dataset): https://github.com/tsb0601/MMVP?tab=readme-ov-file https://huggingface.co/datasets/MMVP/MMVP_VLM/

vishaal27 commented 9 months ago

Add https://github.com/Top34051/colorswap

escorciav commented 3 weeks ago

@mehdidc SugarCREPE is there and runs, but there is an issue with the run-all functionality; see #128.