Open dopc opened 1 year ago
@jroose-jv is this an expected behavior? My understanding is that minhash_many is a batch version of minhash.
Sorry for the late response. If you want consistency across all weighted minhash, I recommend picking either minhash
or minhash_many
but not both.
I want to use minhash_many
, but its result does not have any meaning, as far as I understand. In above, I used both of them to show the difference between them.
hey, thanks for this great project. I want to use min hash for my text embedding vectors which have both negative and positive numbers. I have searched the issues and found that weighted min hash can be used for that. I tried it and it actually works we.
my problem is about
minhash_many
function. its result is different thanminhash
function. below is a minimal code to reproduce and a screenshot to demonstrate without running the code.I want to use
minhash_many
since it is faster than for loop. So is this normal or something unexpected. thx.