qarmin / czkawka

Multi functional app to find duplicates, empty folders, similar images etc.
Other
18.38k stars 606 forks source link

Duplicate finding by name and size (With a change) #1162

Open Ret2lib opened 6 months ago

Ret2lib commented 6 months ago

Feature Description ... The current name and size is a really cool addition but it can be improved if we add a checkmark that checks names that are subset.. For example somerandomimage.jpg and somerandomimage(1).jpg having the same size and name.. Although its not exactly the same name, maybe we can check if size is indeed the same, than check if the name of one is a subset of the other.

See example of a situation I currently have:

image

In pseudocode something like If the file1.size==file2.size (//somerandomimage.jpg == somerandomimage01.jpg if file1.has an extension //ie somerandomimage.jpg tempFile1Name= file1name.name(no extension) //somerandomimage check if tempFile1Name in file2.name

If the routine runs for every single file than eventually file2 will remove its extension and check if its a subset of file1 as well.

I am not really sure why the hash for these files are different (I manually checked and they indeed were different), they really should be the same. The filesize is eactly the same, image is the same. Maybe different metadata?

Ret2lib commented 6 months ago

https://github.com/qarmin/czkawka/issues/539 This issue is somewhat similar but for names only.