UKPLab / sentence-transformers

State-of-the-Art Text Embeddings
https://www.sbert.net
Apache License 2.0
14.83k stars 2.44k forks

STS training, score normalization [0,1] vs [-1,1] #393

Closed cccntu closed 4 years ago

cccntu commented 4 years ago

https://github.com/UKPLab/sentence-transformers/blob/cfd4e3d4d4ac38f2d06438af783f36c94a571bd1/examples/training/sts/training_stsbenchmark.py#L68

The scores are normalized to [0,1], but cosine similarity is in the range of [-1, 1]. Is this a bug, or is this intended?
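For context, the linked line rescales the STSbenchmark gold scores, which range from 0 to 5. A minimal sketch of the two normalization choices being discussed (the helper names here are hypothetical, not part of the library):

```python
def normalize_01(score: float, max_score: float = 5.0) -> float:
    """Map a gold score in [0, max_score] to [0, 1] (what the example script does)."""
    return score / max_score

def normalize_pm1(score: float, max_score: float = 5.0) -> float:
    """Map a gold score in [0, max_score] to [-1, 1], the full cosine-similarity range."""
    return 2.0 * score / max_score - 1.0

# A gold score of 4.0 becomes 0.8 under [0,1] and 0.6 under [-1,1].
```

Since human-annotated STS pairs rarely express "opposite meaning", mapping the lowest score to -1 may not match what the data actually encodes, which is one plausible reason the [0,1] mapping can work better in practice.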

nreimers commented 4 years ago

I tested both back in 2019, and normalizing to 0...1 worked a bit better.

But you can easily test both yourself and see which works better for your data.