pauldreik / rdfind

find duplicate files utility
Other
979 stars 79 forks source link

Faster hashes #141

Open ido opened 1 year ago

ido commented 1 year ago

If you check the entire file's contents after comparing hashes, then it is probably worthwhile to explore faster hashes such as xxhash or to use hardware-accelerated checksums such as CRC32 on x86_64 if it's available, as a first pass before more expensive hashes or block by block file comparisons.

Here's a benchmark of fast hashes: https://github.com/rurban/smhasher#summary

Here's the crc32c Intel intrinsics: https://www.intel.com/content/www/us/en/docs/intrinsics-guide/index.html#ig_expand=1494,1494&text=crc32