qarmin / czkawka

Multi functional app to find duplicates, empty folders, similar images etc.

Similar Images: Order files by directory? #208

Open alexislefebvre opened 3 years ago

alexislefebvre commented 3 years ago

Are the results from Similar Images ordered, or do they appear in random order?

I have similar images in 2 or more folders and it's not obvious from this view that these directories have similar content.

Would it be possible to order them?

I suggest:

  1. take the image with the largest dimensions ¹ from each group
  2. use this image to order the groups of images

¹: this is not always the best image to keep, but I think it's a good estimate
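
The two steps above could be sketched roughly as follows. This is only an illustration in Python, not czkawka's code: the `Image` class, the function names, and the sample paths are all hypothetical. It also assumes that "largest dimensions" means largest pixel area and that groups should be sorted by the parent directory of that image.

```python
from dataclasses import dataclass
from pathlib import PurePosixPath


@dataclass(frozen=True)
class Image:
    # Hypothetical record for one file in a group of similar images.
    path: str
    width: int
    height: int


def representative(group):
    """Step 1: pick the image with the largest pixel area in a group."""
    return max(group, key=lambda img: img.width * img.height)


def order_groups_by_directory(groups):
    """Step 2: sort groups by the directory (then name) of their representative."""
    def sort_key(group):
        rep = PurePosixPath(representative(group).path)
        return (str(rep.parent), rep.name)
    return sorted(groups, key=sort_key)


# Made-up sample data: three groups spread over two directories.
groups = [
    [Image("/photos/trip/b.jpg", 800, 600), Image("/backup/trip/b.jpg", 1600, 1200)],
    [Image("/photos/trip/a.jpg", 4000, 3000), Image("/backup/trip/a.jpg", 2000, 1500)],
    [Image("/backup/trip/c.jpg", 1024, 768), Image("/photos/trip/c.jpg", 640, 480)],
]

for group in order_groups_by_directory(groups):
    print(representative(group).path)
```

With this ordering, groups whose largest image lives in the same directory end up next to each other, which should make it easier to spot directories with similar content.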

blsz commented 3 years ago

This would be a very useful feature. Ordering files by directory would generate very long lists that would have to be compared manually. One often needs to find identical or similar directories. Disk space is often wasted by copies of large projects that differ by one or two files.

Would it be possible to generate a listing of sets of directories ordered by similarity, from most to least similar? Similarity would decrease with the number of differing files or paths in each tree, up to n recursive differences, where n is a threshold set by the user. I think you would have to set a low threshold or the comparison could take too long. It would be useful to see the number of identical and different files under each set of paths.
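
To make the idea more concrete, here is a rough Python sketch of such a listing. It is not an existing czkawka feature. It assumes each directory tree has already been scanned into a mapping of relative path to content hash, and it compares directories pairwise; `compare_directories`, `max_differences` (the threshold n), and the sample data are hypothetical names chosen for illustration.

```python
from itertools import combinations


def compare_directories(trees, max_differences):
    """Rank pairs of directories from most to least similar.

    `trees` maps a directory path to a dict of {relative path: content hash}.
    Pairs with more than `max_differences` differing entries are dropped,
    as are pairs that share no identical file at all.
    """
    results = []
    for (dir_a, files_a), (dir_b, files_b) in combinations(sorted(trees.items()), 2):
        all_paths = files_a.keys() | files_b.keys()
        # A path counts as identical only if both trees have it with the same hash.
        identical = sum(
            1 for p in all_paths
            if p in files_a and p in files_b and files_a[p] == files_b[p]
        )
        different = len(all_paths) - identical
        if identical > 0 and different <= max_differences:
            results.append((dir_a, dir_b, identical, different))
    # Fewest differences first; ties broken by the larger number of identical files.
    results.sort(key=lambda r: (r[3], -r[2]))
    return results


# Made-up sample data: three copies of a project that drifted apart.
trees = {
    "/work/project": {"main.c": "h1", "util.c": "h2", "README": "h3"},
    "/backup/project": {"main.c": "h1", "util.c": "h2", "README": "h3x"},
    "/old/project": {"main.c": "h1", "util.c": "h2old", "notes.txt": "h4"},
}

for dir_a, dir_b, identical, different in compare_directories(trees, max_differences=3):
    print(f"{dir_a} <-> {dir_b}: {identical} identical, {different} different")
```

Comparing every pair is quadratic in the number of directories, which is one reason a low threshold (or a pre-filter on shared hashes) would probably be needed in practice.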

An alternative would be a view that lists all paths that contain identical files, without repeating identical paths. If files were excluded from this list and the list could be sorted, then this view might help identify recursively similar paths.
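
A minimal sketch of that alternative view, again in Python and only as an illustration: it assumes the duplicate finder already returns groups of identical files as lists of paths, and it collapses them into one line per set of directories. The function name and the sample paths are hypothetical.

```python
from collections import Counter
from pathlib import PurePosixPath


def directories_sharing_files(duplicate_groups):
    """Count, for each set of directories, how many duplicate groups span it."""
    counts = Counter()
    for group in duplicate_groups:
        parents = tuple(sorted({str(PurePosixPath(p).parent) for p in group}))
        if len(parents) > 1:  # ignore duplicates that stay inside one directory
            counts[parents] += 1
    # Most shared files first, then alphabetical so the output is stable.
    return sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))


# Made-up sample data: each inner list is one group of identical files.
duplicate_groups = [
    ["/a/x.txt", "/b/x.txt"],
    ["/a/y.txt", "/b/y.txt"],
    ["/a/z.txt", "/c/z.txt", "/b/z.txt"],
    ["/a/one.txt", "/a/two.txt"],
]

for dirs, shared in directories_sharing_files(duplicate_groups):
    print(f"{shared} identical file(s) shared by: {', '.join(dirs)}")
```

File names are dropped from the output, so each combination of directories appears once, together with the number of identical files it shares.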

Update: I just found https://gitlab.com/alvarezp2000/poda, which tries to compare the similarity of two paths but does not find similarities across an arbitrary number of paths.