I had 17000 duplicates in my library. I was already antipating the deduplication functionality.
Many video-files (which i copied from VHS-tape) start with a few seconds of all-black screen. The deduplication-algorithm sees them all as duplicates, which i don't think it should. As this makes it very easy to delete files of which only the first frames match with each other.
I don't think the algorithm should only take the first frame into account. The algorithm is accurate for photos, but very inaccurate for video-files. It's too easy for people to delete their video files by mistake.
I also noticed that the algorithm marks images (black.gif) and movie files (with first black frame) as duplicates. I don't think this should be the case.
The bug
I had 17000 duplicates in my library. I was already antipating the deduplication functionality. Many video-files (which i copied from VHS-tape) start with a few seconds of all-black screen. The deduplication-algorithm sees them all as duplicates, which i don't think it should. As this makes it very easy to delete files of which only the first frames match with each other. I don't think the algorithm should only take the first frame into account. The algorithm is accurate for photos, but very inaccurate for video-files. It's too easy for people to delete their video files by mistake.
I also noticed that the algorithm marks images (black.gif) and movie files (with first black frame) as duplicates. I don't think this should be the case.
The OS that Immich Server is running on
Debian 12
Version of Immich Server
v1.105.1
Version of Immich Mobile App
NA
Platform with the issue
Your docker-compose.yml content
Your .env content
Reproduction steps
Relevant log output
No response
Additional information
No response