markfasheh / duperemove

Tools for deduping file systems
GNU General Public License v2.0
816 stars 81 forks source link

Lack of documented functions #113

Closed Wintch closed 8 years ago

Wintch commented 8 years ago

Hi,

In the performance test on this page: https://github.com/markfasheh/duperemove/wiki/Performance-Numbers an example of using this switches is given:

--read-hashes --write-hashes

I am using duperemove v0.10 and would really appreciate those "hidden" switches to be explained in "--help" output. There is an option to "update_hashes" in duperemove.c, does it work?

Thanks for the tool!

lpirl commented 8 years ago

Did you see the documentation about --*-hashes in the man page?

Wintch commented 8 years ago

Sorry, i missed that, update_hashes is not in man, so my grep failed :) This is not something stable, right? I would like to reuse the hash file and not generate a new one each time for dedup. Thanks!

lpirl commented 8 years ago
It is more or less deprecated but might still be used for development. It uses plain text files to store hashes and some magic to detect dupes and so on. There is the newer `--hashfile=` that creates (and for subsequent runs: _re-uses_) a SQLite database.
Wintch commented 8 years ago

Thanks!