A joint community effort to create a Swahili central leaderboard for LLMs. These models are currently evaluated on the Open LLM Leaderboard. Contributions and corrections are welcome!
We refer to a model being "open" if it can be locally deployed and used commercially.
https://swahili-llm-leaderboard.streamlit.app/
https://huggingface.co/spaces/Mollel/swahili-llm-leaderboard
Model Name | Publisher | Open? | Basemodel | Average | ARC | HellaSwag | MMLU | TruthfulQA | Winogrande | GSM8K |
---|---|---|---|---|---|---|---|---|---|---|
Mollel/Swahili Gemma | Mollel | Yes | Gemma | 61.32 | 58.96 | 76.4 | 61.02 | 52.1 | 75.61 | 43.82 |
LeroyDyer/Mixtral_AI_SwahiliTron_7b | leroy Samuel Dyer | Yes | Mixtral | 61.14 | 57.08 | 81.59 | 58.49 | 60.66 | 75.53 | 33.51 |
Mollel/Swahili-Alpaca-Llama-3-8b_16bit | Mollel | Yes | Llama-3 | 60.97 | 57.25 | 78.03 | 63.97 | 50.13 | 76.87 | 39.58 |
mwitiderrick/SwahiliInstruct-v0.1 | Derrick | Yes | Mistral | 58.92 | 57.59 | 80.92 | 57 | 58.08 | 74.66 | 25.25 |
mwitiderrick/SwahiliInstruct-v0.2 | Derrick | Yes | Mistral | 54.25 | 55.2 | 78.22 | 50.3 | 57.08 | 73.24 | 11.45 |
We are always happy for contributions! You can contribute by the following:
The benchmark is English-based, and we need support translating it into Swahili. We welcome sponsorships to help advance this endeavor. Your sponsorship would facilitate this essential translation effort, bridging language barriers and making the benchmark accessible to a broader audience. We're grateful for the dedication shown by our collaborators and aim to extend this impact further with the support of sponsors committed to advancing language technologies."