A benchmark for evaluating sentence/document embeddings of Scandinavian language models.
You can install the Scandinavian Embedding Benchmark (seb
) via pip from PyPI:
pip install seb
To see more examples, see the documentation.
Documentation | |
---|---|
π§ Installation | Installation instructions on how to install this package |
π©βπ» Usage | Introduction on how to use the package |
π Documentation | A minimal and developing documentation |
Type | |
---|---|
π¨ Bug Reports | GitHub Issue Tracker |
π Feature Requests & Ideas | GitHub Issue Tracker |
π©βπ» Usage Questions | GitHub Discussions |
π― General Discussion | GitHub Discussions |
To cite this work please refer to the following article:
Enevoldsen, K., Kardos, M., Muennighoff, N., & Nielbo, K. (2024). The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding. https://openreview.net/forum?id=pJl_i7HIA72
or use the following BibTeX:
@misc{enevoldsen2024scandinavian,
title={The Scandinavian Embedding Benchmarks: Comprehensive Assessment of Multilingual and Monolingual Text Embedding},
author={Kenneth Enevoldsen and MΓ‘rton Kardos and Niklas Muennighoff and Kristoffer Laigaard Nielbo},
year={2024},
eprint={2406.02396},
archivePrefix={arXiv},
primaryClass={cs.CL}
}