Feature Request: Sync based on hash missmatch

etag: "md5sum"

ETag header values of S3-compatible object stores only directly correspond to the file's MD5 hash if the file has not been created via multipart upload.
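As a rough illustration of what this means for a sync check, here is a minimal Python sketch. It assumes the store returns a quoted 32-character hex string for non-multipart objects, which holds for plain AWS S3 uploads but is not guaranteed for every provider. The helper name `etag_matches_md5` is hypothetical and not part of s5cmd.

```python
import hashlib
import re
from typing import Optional

# Assumption: a non-multipart ETag is a bare 32-character hex MD5 digest.
_PLAIN_MD5 = re.compile(r"^[0-9a-f]{32}$")


def etag_matches_md5(etag: str, data: bytes) -> Optional[bool]:
    """Compare an ETag against the MD5 of local data.

    Returns True or False when the ETag looks like a plain MD5 digest,
    and None when it does not (for example a multipart ETag with a
    "-N" suffix), because a direct MD5 comparison is meaningless there.
    """
    value = etag.strip('"').lower()  # ETag header values are usually quoted
    if _PLAIN_MD5.match(value):
        return value == hashlib.md5(data).hexdigest()
    return None
```

A caller would treat `None` as "cannot decide from the ETag alone" and fall back to another comparison strategy.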

For multipart uploads, the following applies:

The ETag of each individual part is the MD5 hash of the contents of the part. The ETag of the completed multipart object is the hash of the MD5 sums of each of the constituent parts concatenated together followed by a hyphen and the number of parts uploaded.
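The calculation described above can be sketched in a few lines of Python. This is only an illustration of the publicly known scheme: it assumes every part except the last has the same size, that the binary (not hex) MD5 digests of the parts are concatenated before the final hash, and that the object is unencrypted. The function name `multipart_etag` is made up for this example.

```python
import hashlib


def multipart_etag(data: bytes, part_size: int) -> str:
    """Compute the ETag a multipart upload of `data` would be expected to get.

    Assumes fixed-size parts (the last one may be shorter). Empty input is
    outside the scope of this sketch.
    """
    if part_size <= 0:
        raise ValueError("part_size must be positive")
    if not data:
        raise ValueError("empty input is not handled by this sketch")

    # Split the content into parts the way a multipart upload would.
    parts = [data[i:i + part_size] for i in range(0, len(data), part_size)]

    # MD5 of each part, kept as raw bytes.
    digests = [hashlib.md5(part).digest() for part in parts]

    # MD5 over the concatenated part digests, then "-<number of parts>".
    combined = hashlib.md5(b"".join(digests)).hexdigest()
    return f"{combined}-{len(parts)}"
```

Calling it on the same bytes with two different `part_size` values gives two different results, so a local recomputation only matches the remote ETag if the original part size is known.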

As igungor pointed out here:

Since we use multipart upload, object ETag changes if user changes part-size of a file.

The above does not speak against the usefulness of ETag-based sync in general. It is just important to keep in mind, since not all object storage providers with an "S3-compatible API" implement this the way the original AWS S3 does, not least because AWS doesn't publicly document how they implement such important details (though that particular linked statement is clearly outdated, as the multipart ETag calculation is publicly known by now).

peak/s5cmd, issue #561