Open agm-eratosth opened 3 weeks ago
Testing Llama-3.1-405B-Instruct locally would be challenging due to the large memory requirements. The 405B parameter model size exceeds typical local testing capabilities.
Would using an API be feasible? There are quite some available: https://artificialanalysis.ai/models/llama-3-1-instruct-405b/providers
@Wyyyb @wenhuchen I would really love to have Llama-3.1-405B-Instruct on the leaderboard.
Model: https://huggingface.co/meta-llama/Llama-3.1-405B-Instruct
They claim a MMLU-Pro score of 73.3%.
Thanks again for all you hard work maintaining this leaderboard!