Open emilioalvap opened 3 weeks ago
cc @bturquet
From my understanding of this PR, if ESF receives a document that was already sent to ES, it will skip it and continue execution, instead of stopping and returning an error as it does now. Can you confirm this? This seems OK to me.
Additionally, this PR needs to increase the ESF version and add an entry to the CHANGELOG.
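To make the skip-and-continue behavior discussed in this comment concrete, here is a minimal sketch of how a _bulk response could be split into items worth replaying and items that are only _id collisions. It assumes documents are indexed with the `create` action, so that Elasticsearch reports an existing _id as HTTP 409 on the individual item. The function name `split_bulk_items` and the rule that any other status of 300 or above is replayed are hypothetical, not taken from the ESF code.

```python
from __future__ import annotations

from typing import Any

# Elasticsearch reports 409 on a bulk item whose `create` action targets
# an _id that already exists (version conflict).
CONFLICT_STATUS = 409


def split_bulk_items(
    response: dict[str, Any],
) -> tuple[list[dict[str, Any]], list[dict[str, Any]]]:
    """Return (to_replay, duplicates) from a _bulk response body.

    Hypothetical helper: duplicates are skipped, every other failed
    item is kept for the replay queue.
    """
    to_replay: list[dict[str, Any]] = []
    duplicates: list[dict[str, Any]] = []
    for item in response.get("items", []):
        # Each item is keyed by its action name, e.g. {"create": {...}}.
        for result in item.values():
            status = result.get("status", 0)
            if status == CONFLICT_STATUS:
                duplicates.append(result)  # already in ES: skip it
            elif status >= 300:
                to_replay.append(result)  # real failure: replay later
    return to_replay, duplicates


if __name__ == "__main__":
    sample = {
        "errors": True,
        "items": [
            {"create": {"_id": "a", "status": 201}},
            {"create": {"_id": "b", "status": 409}},
            {"create": {"_id": "c", "status": 429}},
        ],
    }
    to_replay, duplicates = split_bulk_items(sample)
    print("replay:", [r["_id"] for r in to_replay])
    print("skipped:", [r["_id"] for r in duplicates])
```

With this split, a collision no longer aborts the batch, and only genuine failures (for example a 429 rejection) would reach the replay queue.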
From the implementation side, everything looks OK. The key point is this comment:
events that were so close to each other that they were given the same timestamp
If we can verify that this does not happen, and can guarantee that the timestamps created here are unique, then I think we are OK.
What is the timestamp's precision? We include milliseconds, as in here, right?
If the timestamp is not unique, we would need to change how the _id field is computed. One option would be to hash the entire document using something fast and non-cryptographic, like xxhash or murmur3.
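As a rough illustration of the whole-document hashing idea, the sketch below derives an _id from a canonical JSON serialization of the event. To keep it runnable with the standard library only, it uses `hashlib.blake2b` as a stand-in; that is a cryptographic hash, not xxhash or murmur3, which would need third-party packages. The function name `document_id` and the sample field names are hypothetical.

```python
from __future__ import annotations

import hashlib
import json
from typing import Any


def document_id(document: dict[str, Any]) -> str:
    """Derive a deterministic _id from the full document content.

    Hypothetical helper. BLAKE2b stands in for a fast non-cryptographic
    hash such as xxhash or murmur3.
    """
    # Sorted keys and fixed separators make the serialization canonical,
    # so key order does not change the resulting id.
    payload = json.dumps(
        document, sort_keys=True, separators=(",", ":"), ensure_ascii=False
    ).encode("utf-8")
    # A 16-byte digest yields a 32-character hex string.
    return hashlib.blake2b(payload, digest_size=16).hexdigest()


if __name__ == "__main__":
    first = {"@timestamp": "2024-01-01T00:00:00.123Z", "message": "first"}
    second = {"@timestamp": "2024-01-01T00:00:00.123Z", "message": "second"}
    # Same timestamp, different content: the ids differ.
    print(document_id(first))
    print(document_id(second))
```

Two events that share a timestamp but differ in content get different ids under this scheme. Two events that are identical in every field would still map to the same id, so they would be treated as one document.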
What does this PR do?
Fixes #677.
Check status codes in _bulk request responses to detect _id collisions and prevent them from going into the replay queue.
Why is it important?
Checklist
CHANGELOG.md