markfasheh / duperemove

Tools for deduping file systems
GNU General Public License v2.0
794 stars 78 forks source link

optimization of extents already shared? #290

Closed brainchild0 closed 1 year ago

brainchild0 commented 1 year ago

I have noticed through casual use that if files in the input set already share extents, then each file is still processed separately for checksum computation. Is it possible to avoid computation of checksums for extents previously determined in the same operation, because they were referenced by files earlier processed?

I am using version 0.11.2 under Linux Mint 21 with kernel 6.0.0.

JackSlateur commented 1 year ago

Hello,

Unchanged files are no longer checksumed nor deduplicated, unless some extents have to be shared with a new file Could you check the latest code ?