ScandEval / ScandEval

Evaluation of language models on mono- or multilingual tasks.
https://scandeval.com
MIT License
66 stars 13 forks source link

[BENCHMARK DATASET REQUEST] Högskoleprovet #257

Open saattrupdan opened 3 months ago

saattrupdan commented 3 months ago

Dataset name

hogskoleprovet

Dataset link

https://www.hogskoleprovet.nu/gamla-hogskoleprov/

Dataset languages

Describe the dataset

These are previous Swedish qualification exam for higher education, which feature math questions, vocab questions, reading comprehension and image+text questions (typically related to reading plots). This might be relevant as knowledge, reasoning and/or extractive QA, but it might be a bit too specialised.

saattrupdan commented 3 months ago

https://github.com/ScandEval/ScandEval/issues/233 seems to be a subset of these.