Closed cjrh closed 10 months ago
Hello !
Sadly I'm afraid that I won't be of much help on this one. In my use cases I haven't used the normalized distances, I have made them available in this crate for completeness.
However I would also make the same assumptions as you, that is:
The vectors I am adding to the index are not normalized. Is this correct?
I think this is correct.
My search vector in the code above is not normalized. Is that correct?
I also thinks this is how it should be done
Is there a simple way I can reason about the quantitative value of the radius parameter for the NormalizedCosine distance type?
I am not much knowledgeable for this one neither, although I think you can find a kind of graphical explanation for the radius parameter on this blog post. For more precise information I would indeed recommend to get in touch with NGT's original author.
Thanks!
Hi @lerouxrgd!
I hope this is an easy one to answer.
I have simple code that does the following:
NormalizedCosine
, dimension 3radius
setting.I am trying to understand how the radius numerically affects the search results. I am also asking about whether the normalization is fully handled for me, or whether I need to do my own normalization on search vector, for example.
Basic code, for discussion, looks something like this:
These are my questions:
[1.1, 2.1, 3.1]
with its normalized version[0.5280169 , 0.57601843, 0.62401997]
and I got very different behaviour in the search results using the same radius values, which leads me to think I don't understand how the normalization works.radius
is defined by the upstream NGT library so it isn't really a question for you, but since I'm here anyway: Is there a simple way I can reason about the quantitative value of theradius
parameter for theNormalizedCosine
distance type?