Open chancharikmitra opened 10 months ago
Hi, for the SEEDBench and Q-Bench, we treat this as the Multiple Choice Question for open-generation. You can refer to the MMBench (which is also a MC benchmark) for reference, especially for the prompt reference.
Hello! First of all, this is really fascinating work. Thanks for the contribution.
I wanted to reach out and ask if you could share the evaluation scripts for mPLUG-OWL2 you used for benchmarks shown in the main figure (e.g. SEEDBench, QBench, etc.). It would also be great if you could provide (or include in the script) any specific prompting you might have done for your zero-shot evaluation on those datasets.