arsenetar / dupeguru

Find duplicate files
https://dupeguru.voltaicideas.net
GNU General Public License v3.0
5.37k stars 415 forks

DupeGuru completely misses multiple exact duplicate iphone .mov video files #1078

Open sunlit777 opened 1 year ago

sunlit777 commented 1 year ago

Put a bunch of duplicate iPhone video files (in my case from an iPhone 13 Pro Max) in a number of folders. The files are exactly the same, and the filenames and extensions are also identical; only the names of the folders they are in differ. DupeGuru completely misses them and reports "no duplicates found". I tested on both Mac and Windows 10! I attach the debug log. Please pay some attention to this, it's a major bug! dupeGuru.log

mtaalas commented 1 year ago

This happens on my mp4 files that were shot with Android cellphone... this is a major bug! Winmerge tells me the files are identical, but dupeguru doesn't recognize them as identical since they're in two different folders, even if both are added to the dupeguru folder list.

Only after completely clearing the cache and CLOSING DUPEGURU does it give proper results. There's a bug somewhere that causes false negatives in certain situations. It might be related to the program being open when the computer went to sleep, but I'm going to investigate further.

If scan is not active, even if computer sleeps, it shouldn't cause issues with dupeguru after wake up...

arsenetar commented 1 year ago

I have not been able to reproduce the behavior here, so I'm going to need some additional information. What scan type was used for these, contents? What options were set in the preferences for the scan? Which version of dupeGuru is being used?

@sunlit777 From your log I can see you ran both content and picture mode scans (picture mode would not pick up .mp4). It seems the content mode found 147 files across some copied directories. Can you provide any additional details on these files, such as their naming, that might help pinpoint the issue?

@mtaalas Clearing the cache takes effect immediately and will affect the next scan; I have not seen an instance of a cache not clearing out. I don't think dupeGuru being open when the computer goes to sleep should impact the next scan. If you can provide additional details on your scan preferences and on the folders and files you saw this issue with, that would be helpful.

sunlit777 commented 1 year ago

Thank you for replying! I have since found out why not only yours but all other dupe-finding apps fail to find my duplicate videos. The culprit is Final Cut Pro (Apple's in-house video editor). It alters the files on import by adding some metadata INSIDE the file itself, thus changing its MD5 checksum. The video stream inside remains 100% identical, but since all apps compare checksums, those files are not detected. If you could upgrade your app to detect identical video streams and ignore the MD5, you'll be my heroes!

Thank you,
Alex

(In reply to Andrew Senetar's message of 10 Jan 2023, 10:16.)
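To illustrate the difference between a whole-file checksum and a checksum of the media data only, here is a minimal Python sketch. It is not how dupeGuru works, and it rests on assumptions: that the files are MP4/QuickTime containers, that the encoded samples live in top-level `mdat` boxes, and that the editor changes only other boxes. The helper names (`iter_top_level_boxes`, `media_digest`, `make_box`) and the synthetic test data are hypothetical. A tool that actually demuxes the streams would be more robust.

```python
import hashlib
import struct


def iter_top_level_boxes(data: bytes):
    """Yield (box_type, payload) for each top-level box in an MP4/MOV byte string."""
    pos = 0
    while pos + 8 <= len(data):
        size, box_type = struct.unpack(">I4s", data[pos:pos + 8])
        header = 8
        if size == 1:  # a 64-bit size follows the type field
            if pos + 16 > len(data):
                break
            size = struct.unpack(">Q", data[pos + 8:pos + 16])[0]
            header = 16
        elif size == 0:  # box runs to the end of the file
            size = len(data) - pos
        if size < header or pos + size > len(data):
            break  # malformed or truncated box: stop parsing
        yield box_type, data[pos + header:pos + size]
        pos += size


def media_digest(data: bytes) -> str:
    """MD5 over the payload of every top-level 'mdat' box (assumed to hold the samples)."""
    digest = hashlib.md5()
    for box_type, payload in iter_top_level_boxes(data):
        if box_type == b"mdat":
            digest.update(payload)
    return digest.hexdigest()


def make_box(box_type: bytes, payload: bytes) -> bytes:
    """Build a synthetic box for the demo below (hypothetical test data, not real video)."""
    return struct.pack(">I4s", 8 + len(payload), box_type) + payload


stream = b"\x00\x01fake-video-samples"
original = make_box(b"ftyp", b"qt  ") + make_box(b"mdat", stream) + make_box(b"moov", b"meta")
imported = make_box(b"ftyp", b"qt  ") + make_box(b"mdat", stream) + make_box(b"moov", b"meta+editor-tags")

# Whole-file checksums differ because the metadata box differs.
print(hashlib.md5(original).hexdigest() == hashlib.md5(imported).hexdigest())
# Media-only checksums match because the 'mdat' payload is unchanged.
print(media_digest(original) == media_digest(imported))
```

For real videos the file would need to be read in chunks rather than loaded into memory, and this approach only helps if the editor leaves the sample bytes untouched.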
