pauldreik / rdfind

find duplicate files utility
Other
986 stars 81 forks source link

Support dry run results as input for execution #100

Open chicanery86 opened 2 years ago

chicanery86 commented 2 years ago

Provide means to use dry run output as input thus avoiding need to duplicate work where workflow is as follows: step 1 dry run performed, Step 2 volume of matches and specific samples verified from dry run file Step 3 rerun specifying the dry run reports as input to skip to execution.

PoC-dev commented 2 years ago

On a side note to this request, it could be highly beneficial to (optionally) cache calculated checksums in a separate file with a time stamp, also on dry run. Flat file? Sqlite? Doesn't really matter. But this might save a lot of time for repeated runs: Only recreate the checksum of files with a newer mtime than saved in the cache file.