pkolaczk / fclones

Efficient Duplicate File Finder
MIT License
1.87k stars 70 forks source link

Persistent caching of file hashes #111

Closed pkolaczk closed 2 years ago

pkolaczk commented 2 years ago

This commit implements a new feature, enabled by --cache flag.

Caching can dramatically improve grouping speed on subsequent runs of fclones at the expense of some additional storage space needed for the cache. Caching also allows for resuming work quickly after interruption, so it is recommended if you plan to run fclones on huge data sets.

The cache works as follows: