Is your feature request related to a problem? Please describe.
Ability to quickly detect an issue in the CQLReplicator, e.g., connection issue, OOMs, and etc.
Describe the solution you'd like
Persist the error from discovery and/or replication Glue jobs to Amazon S3 bucket s3://bucket/ks/tbl/discovery/errors/epoch.err,
s3://bucket/ks/tbl/discovery/errors/epoch.err during an Exception event, e.g. {"process_type": "replication", "tile": 0, "error_message": "message", "original_exception": "Scala exception"}
Add a new state get-error-cause to retrieve the latest error from Amazon S3 based on epoch time, check the discovery first and after the replicators.
Add a new state cleanup-errors to remove all *.err files from the S3 bucket
Is your feature request related to a problem? Please describe. Ability to quickly detect an issue in the
CQLReplicator
, e.g., connection issue, OOMs, and etc.Describe the solution you'd like
get-error-cause
to retrieve the latest error from Amazon S3 based on epoch time, check the discovery first and after the replicators.cleanup-errors
to remove all *.err files from the S3 bucket