We can prevent duplicate file ingestions by calculating a signature for each file. The signature would be based off of the entire file bytes. We can store this signature in the submission table. When a ingestion request is being processed the first thing we should do it calculate the signature and determine whether those bytes have been processed before.
We can prevent duplicate file ingestions by calculating a signature for each file. The signature would be based off of the entire file bytes. We can store this signature in the
submission
table. When a ingestion request is being processed the first thing we should do it calculate the signature and determine whether those bytes have been processed before.