dkoslicki / CMash

Fast and accurate set similarity estimation via containment min hash
BSD 3-Clause "New" or "Revised" License
42 stars 9 forks source link

Make sure jaccard isn't counting blanks/Inf as matches #10

Closed dkoslicki closed 4 years ago

dkoslicki commented 5 years ago

See https://github.com/dkoslicki/CMash/blob/master/CMash/MinHash.py#L662-L671

dkoslicki commented 4 years ago

Closing as is addressed by 18da15444b (inf->p for initialization, then is already checked for in jaccard, common, common_count` etc.)