elisemercury / Duplicate-Image-Finder

difPy - Python package for finding duplicate or similar images within folders
https://difpy.readthedocs.io
MIT License
448 stars 67 forks source link

Suggestion: scan folder and compare against an archived folder #95

Open phpology opened 5 months ago

phpology commented 5 months ago

Hi Elise,

Hope you are well.

I wanted to run a scenario past you and was wondering if this is possible, please? So we deal with timesheets and daily we do get duplicates. DifPy (during POC was amazing) when testing against a single folder and moving out the duplicates to another folder. Then my situation changed and was asked if it could search against an archived folder just incase we get backdated timesheets. So it goes like this: I have a working folder where the timesheets get dropped into. Images in here will change throughout the day. Any images dropped into the working will automatically copy into the archived folder where we will store up to a month.

If an image arrives 2 weeks later in the working folder, it would look into the archived folder and say 'hey, you look similar to what is in the archived folder' - and move that image from the working folder into the duplicates folder. The idea is that the archived folder images is not moved but is used as a 'database' to refer to, which is just sitting there storing images (until we do our monthly housekeeping).

I am not an hardcore Python dev (just in case someone says read the code) but just throwing out an idea if that is possible.

Thanks

Raj