Open bryjen opened 1 week ago
In addition to the calculation of the predicted acquisition and publish dates, we also return a confidence
value, which is a value bounded between [0, 1]. It represents how confident we are in our predicted date based on variance between previous dates (how consistent the intervals are between dates).
We calculate and normalize the timespan intervals, then clamp it to some maximum allowed variance. We then calculate the confidence value using the formula $1-(\dfrac{variance}{0.65 * maxVariance})^5$. The graph looks like the one below:
>90%
is considered pretty accurate - the margin of error of which is still unknown, but looks like a pretty accurate calculation just by looking at it.
Problem
We need to verify that the given date time prediction for the next acquisition time is accurate to some margin of error (ex.
5%
or10%
).Approaches
For example, for the path
p
and rowr
, if the have the acquisition times [tn, tn-1, ..., t3, t2, t1], where t1 represents the most recent acquisition time. We'll see if we can accurately predict t1 given t2 to tn. We assert this for all possible combinations ofp
andr
.An approach we could take for this is creating a worker service calling the prediction logic from the backend and having the worker service assert that the new acquisition image is taken within some margin of error of the predicted date. If a path and row combination has
n
correctly predicted values, then we can say that the predictions for that combination is accurate enough. We try to assert this for all possible combinations of paths and rows.