qcri / LLMeBench

Benchmarking Large Language Models
76 stars 15 forks source link

add asset GPT4-o EN-QA #359

Closed AridHasan closed 1 week ago