Open przemekwitek opened 6 months ago
Pinging @elastic/ml-core (Team:ML)
Are there currently any workarounds or settings that could be adjusted? In our use case, records between checkpoints / executions must never be skipped.
Any news or version ETA on this issue?
Elasticsearch Version
8.13
Installed Plugins
No response
Java Version
bundled
OS Version
MacOS
Problem Description
Latest transform was reported to skip some source documents.
I identified 2 potential issues:
1. When multiple source documents have the same `@timestamp` value, the `latest` transform only picks one of them.
2. The `sync.time.delay` field does not seem to influence the `range` filter queries issued by the `latest` transform.
Ad 1.: This is how we build the range query in the code:
So I think it may be that, because of this `lt`, documents that have the same timestamp as a document that was already included in a checkpoint will not get processed. This should be taken care of by `sync.time.delay`, but apparently it doesn't work in this case (Ad 2.).
Steps to Reproduce
This has been reproduced by the Kibana team (https://github.com/elastic/security-team/issues/8893). Now I'm working on reproducing it locally.
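Until a local reproduction against a real cluster exists, the suspected mechanism can be modelled outside Elasticsearch. The sketch below is a simplified simulation and not the transform's actual code. It assumes that each checkpoint queries a half-open window `lower <= @timestamp < now - delay`, where `lower` is the previous checkpoint's upper bound. The `Doc` class, the `run_checkpoints` helper, and the integer clock are hypothetical names introduced only for illustration.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Doc:
    doc_id: str
    timestamp: int   # value of the @timestamp field
    indexed_at: int  # wall-clock time at which the document becomes searchable


def run_checkpoints(docs, checkpoint_times, delay=0):
    """Return the ids of documents picked up by at least one checkpoint.

    Assumed model (not taken from the Elasticsearch source): a checkpoint
    that runs at wall-clock time `now` selects documents with
    lower <= timestamp < now - delay, where `lower` is the previous
    checkpoint's upper bound (no lower bound for the first checkpoint).
    """
    seen = set()
    lower = None
    for now in checkpoint_times:
        upper = now - delay
        for doc in docs:
            searchable = doc.indexed_at <= now
            in_range = (lower is None or doc.timestamp >= lower) and doc.timestamp < upper
            if searchable and in_range:
                seen.add(doc.doc_id)
        lower = upper  # the next window starts where this one ended
    return seen


docs = [
    Doc("a", timestamp=95, indexed_at=96),
    Doc("b", timestamp=95, indexed_at=105),  # same @timestamp, becomes searchable later
]

# Without a delay, "b" arrives after the window covering timestamp 95 has closed.
print(sorted(run_checkpoints(docs, checkpoint_times=[100, 110, 120])))            # ['a']
# With a delay of 10, the window covering timestamp 95 is evaluated only at time 110.
print(sorted(run_checkpoints(docs, checkpoint_times=[100, 110, 120], delay=10)))  # ['a', 'b']
```

In this model a sufficiently large delay is enough to pick up the second document, which is the behaviour expected from `sync.time.delay`. The report is that the real `latest` transform does not behave this way, so a local reproduction should compare the `range` filters actually issued with and without the delay configured.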
Logs (if relevant)
No response