Closed JustinKyleJames closed 3 years ago
i think the convention in the filesystem library / checksum library was to use the latest good replica (so... max()). @korydraughn is that correct?
That is true @trel. It uses the checksum of the latest good replica
Looks like it returns an empty string if there is no checksum. I believe that is fine.
min()
gives us the 'oldest' good replica... do we want the 'latest' (max()
) good replica instead?
min()
gives us the 'oldest' good replica... do we want the 'latest' (max()
) good replica instead?
I put the rationale in the commit comments.
"Used min under the assumption that if multiple replicas are good, a checksum calculated against the oldest replica would be valid for all replicas."
Basically, isn't it true that if we have N good replicas, they all should have the same data? So min should suffice and make us recalculate less often. Example, if we replicated to another resource, there should be no reason to recalculate the checksum.
yes, it will be valid, but that's not the convention used elsewhere (aka filesystem)... if they're equivalent, then i'd rather we keep consistent.
if there's a legitimate reason (aka, the min() will change less often)... then that's okay, but we should say that's the reason in the commit message rather than just 'it would be valid'.
yes, it will be valid, but that's not the convention used elsewhere (aka filesystem)... if they're equivalent, then i'd rather we keep consistent.
if there's a legitimate reason (aka, the min() will change less often)... then that's okay, but we should say that's the reason in the commit message rather than just 'it would be valid'.
My thinking was that if a file is replicated from one resource to another, that would trigger a recalculation of the checksums. Do we want that? Added this to the commit message.
That makes sense to me. Let's # these and get them in. Thanks.
That makes sense to me. Let's # these and get them in. Thanks.
Done
Please point to this particular repo in the commit message - it is trying to link to an unrelated issue in the 'upstream' of the fork-tree.
This is the first time i've seen this happen... not sure what we can do about it outside of removing our fork from the tree itself.
Please point to this particular repo in the commit message - it is trying to link to an unrelated issue in the 'upstream' of the fork-tree.
This is the first time i've seen this happen... not sure what we can do about it outside of removing our fork from the tree itself.
Done
Note: Used min under the assumption that if multiple replicas are good, a checksum calculated against the oldest replica would be valid for all replicas.