Closed zhimin-z closed 10 months ago
BTW, are those metrics all accuracy
?
Desiderata is a newly proposed evaluation metric in ChEF, focusing on dimensions of capabilities beyond the visual abilities of MLLMs. These metrics include the trustworthiness and interactivity. Please refer to our paper ChEF for more details. Thanks for your interest.
But I still fail to find this one in terms of the exact evaluation metrics. Is the evaluation result accuracy
or not?
But I still fail to find this one in terms of the exact evaluation metrics. Is the evaluation result
accuracy
or not?
Any update? @Coach257
I fail to find any of those recipes in the original paper...