Hi Zechun,
great work!
Could you please share the details about the evaluation code? like which codebase was used to run inference etc.
thank you,
Kalyani
Would like to get my hand on the evaluation code.
For example, do you prompt the model for evaluating the zero shot accuracy and do you take argmax over the whole vocab or only on A/B/C/D.
Thanks!
Hi Zechun, great work! Could you please share the details about the evaluation code? like which codebase was used to run inference etc. thank you, Kalyani