KennethEnevoldsen / scandinavian-embedding-benchmark

A Scandinavian Benchmark for sentence embeddings
https://kennethenevoldsen.github.io/scandinavian-embedding-benchmark/
MIT License
27 stars, 3 forks

Author style clustering? #144

Open KennethEnevoldsen opened 9 months ago

KennethEnevoldsen commented 9 months ago

Might be interesting to add author-style clustering based on:

https://huggingface.co/datasets/MiMe-MeMo/Corpus-v1.1
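For reference, a minimal sketch of how such a task could be scored, assuming the author labels act as gold clusters and v-measure is the metric (both are assumptions, nothing settled here). The function names are illustrative only, and the cluster assignments would come from running some clustering algorithm on a model's embeddings. Nothing about the dataset's columns or splits is assumed.

```python
import math
from collections import Counter


def entropy(labels):
    """Shannon entropy (in nats) of a list of discrete labels."""
    n = len(labels)
    return -sum((c / n) * math.log(c / n) for c in Counter(labels).values())


def conditional_entropy(targets, given):
    """H(targets | given) for two aligned label lists."""
    n = len(targets)
    joint = Counter(zip(given, targets))
    given_counts = Counter(given)
    return -sum(
        (c / n) * math.log(c / given_counts[g]) for (g, _), c in joint.items()
    )


def v_measure(authors, clusters):
    """Harmonic mean of homogeneity and completeness (beta = 1).

    `authors` are the gold author labels (hypothetical here), and
    `clusters` are the cluster ids assigned to the same texts.
    """
    h_authors = entropy(authors)
    h_clusters = entropy(clusters)
    # Homogeneity: each cluster should contain texts by a single author.
    homogeneity = (
        1.0 if h_authors == 0
        else 1.0 - conditional_entropy(authors, clusters) / h_authors
    )
    # Completeness: all texts by one author should land in the same cluster.
    completeness = (
        1.0 if h_clusters == 0
        else 1.0 - conditional_entropy(clusters, authors) / h_clusters
    )
    if homogeneity + completeness == 0:
        return 0.0
    return 2 * homogeneity * completeness / (homogeneity + completeness)


# Toy example with made-up labels: two authors, two texts each.
toy_authors = ["author_a", "author_a", "author_b", "author_b"]
perfect = v_measure(toy_authors, [1, 1, 0, 0])    # clusters match authors
unrelated = v_measure(toy_authors, [0, 1, 0, 1])  # clusters ignore authors
```

A model whose embeddings capture author style would score close to 1 on a setup like this, while one that only captures topic would likely score lower, depending on how much topic and author overlap in the corpus.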

x-tabdeveloping commented 9 months ago

That's actually a great idea. We probs have a dataset for that in-house, don't we?