dogsheep / dogsheep-photos

Upload your photos to S3 and import metadata about them into a SQLite database
Apache License 2.0
170 stars 15 forks source link

Integrate image content hashing #7

Open simonw opened 4 years ago

simonw commented 4 years ago

To spot duplicate images (where the file content differs such that the sha256 is no longer a match) it would be useful to calculate and store perceptual hashes of some sort.

simonw commented 4 years ago

https://pypi.org/project/ImageHash/ Is one option.

dkam commented 2 years ago

Perceptual hashes might be what you're after : http://phash.org