Cosine similarity is inconsistent with the cluster

resemble-ai / Resemblyzer

A python package to analyze and compare voices with deep learning

Apache License 2.0

2.79k stars 429 forks source link

Hi, when I tried visualizing the voices, it is shown that there is one sample (female voice) that is actually far away from the male speaker's utterances (which is expected).

However, when I compute the cosine similarity between the female's utterance versus the male ones, the value is quite high (0.88). I don't know if I perform the cosine similarity correctly here.

embed_1 = encoder.embed_utterance(y1)
embed_2 = encoder.embed_utterance(y2)
cosine_sim = embed_1 @ embed_2

Any help is very much appreciated !

resemble-ai / Resemblyzer

Cosine similarity is inconsistent with the cluster #42