mattilyra / LSH

Locality Sensitive Hashing using MinHash in Python/Cython to detect near duplicate text documents
MIT License
278 stars 78 forks source link