Open scruffynerf opened 2 years ago
https://github.com/r-salas/oshash
This is being used more and more, it's very fast, and resistant. It's a hash of filesize, and the beginning and end of the file.
In pseudo-code, the hash is computed in the following way:
file_buffer = open("/path/to/file/") head_checksum = checksum(file_buffer.head(64 * 1024)) # 64KB tail_checksum = checksum(file_buffer.tail(64 * 1024)) # 64KB file_hash = file_buffer.size + head_checksum + tail_checksum
More code (including C code) at https://trac.opensubtitles.org/projects/opensubtitles/wiki/HashSourceCodes
@rhash similar algorithm for reference.
Another vote for both oshash and imohash.
https://github.com/r-salas/oshash
This is being used more and more, it's very fast, and resistant. It's a hash of filesize, and the beginning and end of the file.
In pseudo-code, the hash is computed in the following way:
More code (including C code) at https://trac.opensubtitles.org/projects/opensubtitles/wiki/HashSourceCodes