elisemercury / Duplicate-Image-Finder

difPy - Python package for finding duplicate or similar images within folders
https://difpy.readthedocs.io
MIT License
420 stars 65 forks source link

Incorrect results and a few further observations #79

Open audiomuze opened 9 months ago

audiomuze commented 9 months ago

@elisemercury , I've just pulled and tested your latest commit and have encountered what I assume are bugs:

running python /home/x/git/Duplicate-Image-Finder/difPy/dif.py --directory /mnt/sdc/2tag/ --output_directory /tmp --recursive True --limit_extensions True --show_progress True:

Edited extract from /tmp/difPy_20230927222221_lower_quality.json:

{"lower_quality": ["/pathtofile/xfolder.jpg"]}
audiomuze commented 9 months ago

I've just run it against another group of files and on this occasion difpy reported no lower quality images whereas in reality there were many instances of a smaller image and a larger, higher resolution image.

Perhaps the easiest way to illustrate would be for me to send you the image files to run against and compare results locally?

audiomuze commented 8 months ago

@elisemercury,just flagging in case you missed this?

elisemercury commented 7 months ago

Hi @audiomuze

Thanks so much for flagging these issues! They will be investigated and considered with the next difPy release.

Thanks again! Best Elise

elisemercury commented 4 months ago

Hi @audiomuze,

difPy v4.1.0 has been release and I would recommend testing it on your dataset to see if you can see some improvements. The new version comes with an improved comparison algorithm.

Feel free to reach out if the issue should still persist.

Thanks, Elise