Hi!
Thank you for sharing the data and code for this leaderboard.
Would it be possible to share the generations on the AIR-Bench dataset for the models on the leaderboard? This would greatly help in reproducing the results and exploring new directions for evaluating LALMs.
Thank you for your time and effort!