magsol / pyspark-lsh

Locality-sensitive hashing in PySpark.
Other
27 stars 12 forks source link

how to compute similar items? #1

Closed richardxy closed 8 years ago

richardxy commented 9 years ago

Hi, Thanks for writing the package. It seems to me that the compute function of LSH model is still missing. In the Scala counterpart, the compute function finds the similar items to the input entry.

thanks, Richard

magsol commented 9 years ago

You are correct--I was pulled onto other projects and have not had time to complete this one. It is definitely still something I want to finish. Feel free to submit a pull request if you're interested in doing it :) Otherwise I'll get to it soon.

aman010 commented 9 years ago

sorry i meant "len(x[1]) > c" should "len(x[1]) >= c"

thanks

magsol commented 9 years ago

I'm still not sure what file / line number you're referring to.

magsol commented 8 years ago

Closing this issue as the question was never clarified.