Open chuwy opened 7 years ago
SUCCEEDED not COMPLETED for clarity...
I actually think that instead of a State key there should be a separate Started key with a date and a Succeeded key with a date/null. This way we can get additional information on when folders were processed.
Edit: Suceeded
StartedAt, SucceededAt - I think that works...
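To make the StartedAt / SucceededAt idea concrete, here is a minimal in-memory sketch. It is not the Analytics SDK's API: the RunRecord class, its field names and the mark_succeeded / duration helpers are all hypothetical, and the timestamps are illustrative values. The only point is that a null SucceededAt marks a run that has not finished, while a filled one also tells us when and how long.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class RunRecord:
    """Hypothetical manifest row; names follow this thread, not a real schema."""
    run_id: str
    started_at: datetime
    # Stays None (the "null" from the proposal) until the run succeeds.
    succeeded_at: Optional[datetime] = None

    def mark_succeeded(self, when: datetime) -> None:
        # Record the moment the folder was fully processed.
        self.succeeded_at = when

    def duration(self) -> Optional[timedelta]:
        # No duration is available for a run that has not succeeded.
        if self.succeeded_at is None:
            return None
        return self.succeeded_at - self.started_at


# Illustrative start time; only the run id format comes from the issue.
record = RunRecord("run=2017-05-01-12-00-00", started_at=datetime(2017, 5, 1, 12, 0, 0))
```

Compared with a single State key, this keeps the same "started but not finished" signal and adds timing information at no extra cost.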
@alexanderdean do you think this feature can go outside the Analytics SDK? Like maintaining state LoadedAt from RDB Loader or AddedAt from EmrEtlRunner (or whatever will be responsible at that moment).
I am not sure if this answers your question @chuwy, but I 100% envisage the Scala Analytics SDK being used in e.g. the RDB Loader, and thus yes using this state tracking...
Currently we have only the RunId key in the run manifest table -enriched-archive/run=2017-05-01-12-00-00. It would be useful to add State with STARTED and COMPLETED to see which folders failed.
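A rough sketch of what this would enable, assuming the manifest can be read as a plain mapping from RunId to State. The State enum, the unfinished_runs helper and the second run id are made up for illustration; nothing here is the real run manifest API.

```python
from enum import Enum
from typing import Dict, List


class State(Enum):
    STARTED = "STARTED"
    COMPLETED = "COMPLETED"


def unfinished_runs(manifest: Dict[str, State]) -> List[str]:
    """Return run ids that were started but never marked completed."""
    return sorted(run_id for run_id, state in manifest.items()
                  if state is State.STARTED)


# Hypothetical manifest contents; the second run id is invented.
manifest = {
    "run=2017-05-01-12-00-00": State.COMPLETED,
    "run=2017-05-02-12-00-00": State.STARTED,
}

print(unfinished_runs(manifest))
```

Note that a run left in STARTED may also simply still be in progress, so a caller would need some other signal (for example a timeout) to treat it as failed.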