embeddings-benchmark / mteb

MTEB: Massive Text Embedding Benchmark
https://arxiv.org/abs/2210.07316
Apache License 2.0
1.83k stars 246 forks source link

extracting embedding from other than CLS token during eval mteb #999

Open riyajatar37003 opened 3 months ago

riyajatar37003 commented 3 months ago

Hi,

Where exactly i need to made changes in order to use the other tokens representation as embedding instead of 'cls' token?

can u point out to the file?

thanks

malteos commented 2 months ago

You can change that in the config file of the corresponding sentence transformers model, e.g., here.

KennethEnevoldsen commented 2 months ago

You might also take a look at the documentation for how to use a custom model.