Tika-Similarity uses the Tika-Python package (a Python port of Apache Tika) to compute file similarity based on metadata features.
Apache License 2.0
107 stars
59 forks
- Added command-line options for stylistic feature similarity. Added more usage description for metalevenshtein and bell curve intersection
- Moved imports to top #75
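
To make "file similarity based on metadata features" concrete, here is a minimal sketch that scores two files by the Jaccard overlap of their metadata key names. It is an illustration under stated assumptions, not the project's own code: the function name `jaccard_similarity` and the sample dictionaries are hypothetical, and the choice of Jaccard over key sets is one simple option among several possible measures. In practice the dictionaries could come from Tika-Python, for example `tika.parser.from_file(path)["metadata"]`, which requires a running Tika server and is therefore left out of the example.

```python
def jaccard_similarity(meta_a, meta_b):
    """Return the Jaccard similarity of the metadata key sets of two files.

    Both arguments are plain dicts mapping metadata field names to values.
    Only the field names are compared; the values are ignored.
    """
    keys_a, keys_b = set(meta_a), set(meta_b)
    union = keys_a | keys_b
    if not union:
        # Assumption: two files with no metadata at all count as identical.
        return 1.0
    return len(keys_a & keys_b) / len(union)


# Hypothetical metadata dictionaries standing in for parsed Tika output.
doc_a = {
    "Content-Type": "application/pdf",
    "Content-Length": "1024",
    "Author": "A. Writer",
}
doc_b = {
    "Content-Type": "image/png",
    "Content-Length": "2048",
    "Image-Width": "640",
}

score = jaccard_similarity(doc_a, doc_b)
print(score)  # 2 shared keys out of 4 distinct keys -> 0.5
```

Comparing only key names keeps the score independent of value formats; a value-aware measure such as an edit distance over field values would need extra normalization.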