elisemercury / Duplicate-Image-Finder

difPy - Python package for finding duplicate and similar images
https://difpy.readthedocs.io
MIT License

Incorrect results and a few further observations #79

Closed audiomuze closed 1 month ago

audiomuze commented 1 year ago

@elisemercury, I've just pulled and tested your latest commit and have encountered what I assume are bugs:

Running python /home/x/git/Duplicate-Image-Finder/difPy/dif.py --directory /mnt/sdc/2tag/ --output_directory /tmp --recursive True --limit_extensions True --show_progress True:

Edited extract from /tmp/difPy_20230927222221_lower_quality.json:

{"lower_quality": ["/pathtofile/xfolder.jpg"]}
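
For anyone trying to reproduce this, a minimal sketch for inspecting such a report is shown here. It assumes only what the extract above shows: a JSON object with a "lower_quality" key holding a list of file paths. The helper names `load_lower_quality` and `split_by_existence` are hypothetical and are not part of difPy.

```python
import json
from pathlib import Path


def load_lower_quality(report_path):
    """Return the list of paths stored under the "lower_quality" key."""
    with open(report_path, encoding="utf-8") as fh:
        report = json.load(fh)
    # Key name taken from the extract above; a missing key yields an empty list.
    return report.get("lower_quality", [])


def split_by_existence(paths):
    """Separate flagged paths into those present on disk and those missing."""
    existing = [p for p in paths if Path(p).is_file()]
    missing = [p for p in paths if not Path(p).is_file()]
    return existing, missing
```

Calling `load_lower_quality("/tmp/difPy_20230927222221_lower_quality.json")` and passing the result to `split_by_existence` gives a quick view of which flagged files can actually be opened and checked by hand.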

audiomuze commented 1 year ago

I've just run it against another group of files, and this time difPy reported no lower-quality images, whereas in reality there were many instances of a smaller image alongside a larger, higher-resolution version of the same image.

Perhaps the easiest way to illustrate would be for me to send you the image files to run against and compare results locally?
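
To make the expectation concrete, here is a rough sketch of the result described above: within each group of matching images, everything except the largest image (by pixel count) would be listed as lower quality. This is only an illustration of the expected outcome, not difPy's actual algorithm. The function name `expected_lower_quality` and the input shape are assumptions, and the dimensions are supplied by the caller (for example, read with Pillow's `Image.open(path).size`).

```python
def expected_lower_quality(groups):
    """For each group of matching images, return every path except the largest.

    `groups` is a list of dicts mapping a file path to its (width, height).
    This input shape is hypothetical and chosen only for illustration.
    """
    flagged = []
    for sizes in groups:
        if len(sizes) < 2:
            continue  # a single image has nothing to be compared against
        # Treat the image with the most pixels as the one to keep.
        best = max(sizes, key=lambda p: sizes[p][0] * sizes[p][1])
        flagged.extend(sorted(p for p in sizes if p != best))
    return flagged
```

Comparing the output of such a check with the paths in the generated lower_quality JSON file would show which smaller images were missed.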

audiomuze commented 1 year ago

@elisemercury, just flagging this in case you missed it.

elisemercury commented 11 months ago

Hi @audiomuze

Thanks so much for flagging these issues! They will be investigated and considered with the next difPy release.

Thanks again! Best, Elise

elisemercury commented 9 months ago

Hi @audiomuze,

difPy v4.1.0 has been released, and I would recommend testing it on your dataset to see whether the results improve. The new version comes with an improved comparison algorithm.

Feel free to reach out if the issue persists.

Thanks, Elise