thomasheckmann / image-indexer

This utility provides phash calculation on images. It also provides a standalone service built on the spark framework.
5 stars 5 forks source link

Compare pHashes of images #1

Open martinzavodny opened 7 years ago

martinzavodny commented 7 years ago

Hey Thomas,

I tried your implementation of phash and it works well but I am curious whether there is solution to compare those hashes and evaluate whether two pictures are the same or not. For example the project of phash.org has this method and I dont know exactly how that works so I am little but stuck in the moment.

Thank you for answer in advance and have a nice day.

thomasheckmann commented 7 years ago

In ImageHashedType there are a couple of distance method to calculate the distance(degree of similarity) between to hash value, sound like what you are looking for. /thomas

On 25 May 2017, at 16.34, martinzavodny notifications@github.com wrote:

Hey Thomas,

I tried your implementation of phash and it works well but I am curious whether there is solution to compare those hashes and evaluate whether two pictures are the same or not. For example the project of phash.org has this method and I dont know exactly how that works so I am little but stuck in the moment.

Thank you for answer in advance and have a nice day.

— You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub, or mute the thread.

martinzavodny commented 7 years ago

So you are basically using plain hamming distance, nothing more special. Ok, as I mentioned I have been using library of phash.org until recently and the comparison of two almost similar pictures (actually it is one pic, just the copy of it has higher contrast ) returns hashes where their hamming distance is quite big, but still their method returns that the pics are similar.

After using your hash method I got even greater hamming distance of those pictures I mentioned, so I am curious how to evaluate those results to be able to conclude that the pictures are similar.

thomasheckmann commented 7 years ago

Yes, phash is all about hamming distance, phash.org does the same. Take a look at the test program, it computes the distance from images. You can just replace the test images with your own

/thomas

On 26 May 2017, at 08.04, martinzavodny notifications@github.com wrote:

So you are basically using plain hamming distance, nothing more special. Ok, as I mentioned I have been using library of phash.org until recently and the comparison of two almost similar pictures (actually it is one pic, just the copy of it has higher contrast ) returns hashes where their hamming distance is quite big, but still their method returns that the pics are similar.

After using your hash method I got even greater hamming distance of those pictures I mentioned, so I am curious how to evaluate those results to be able to conclude that the pictures are similar.

— You are receiving this because you commented. Reply to this email directly, view it on GitHub, or mute the thread.