I'm trying to evaluate Claude v3's performance for some document understanding tasks, with a workflow that includes passing the image of the page in as one of the inputs.
Is fmeval considering native handling for image/multi-modal fields in input datasets?