PaulLereverend / NextcloudDuplicateFinder

Save some space by finding your duplicate files
GNU Affero General Public License v3.0
78 stars 16 forks source link

How is duplicates defined? #47

Open carcinocron opened 3 years ago

carcinocron commented 3 years ago

Are duplicates done by md5/sha hash? perceptual hash? Does it only detect literal identical files or does it help find files that are the same photo but one is a smaller resolution or cropped?

chrros95 commented 3 years ago

The duplicatefinder app considers a file as a duplicate if they are binary equal. This equality is proven by comparing their SHA-256 hashes.

For a semantical comparison of images, there already exists a feature request #10

linuzer commented 2 years ago

Thanks a lot for this clarification! I think it would be very helpful to include this in the Readme, pointing out, that duplicates are not found by filename, path, creation-timestamp or anything else "superficial". Maybe even a link to Wikipedia -> SHA256 would be helpful, pointing out the de facto impossibility of wrong matches.