chrismattmann / tika-similarity

Tika-Similarity uses the Tika-Python package (Python port of Apache Tika) to compute file similarity based on Metadata features.
Apache License 2.0
106 stars 59 forks source link

Added command line options for clustering based on edit or cosine similarity #69

Closed AravindRam closed 8 years ago

AravindRam commented 8 years ago

Professor @chrismattmann ,

Modified edit-cosine-circle-packing.py and edit-cosine-cluster.py to cluster based on the command line option passed by the user depending on the necessary use case. Please review and let me know if this is fine.

chrismattmann commented 8 years ago

awesome thank you @AravindRam !