aws-samples / cql-replicator

CQLReplicator is a migration tool that helps you to replicate data from Cassandra to AWS Services
Apache License 2.0
15 stars 8 forks source link

Offload discovery and replication errors/exception to Amazon S3 #132

Open nwheeler81 opened 6 months ago

nwheeler81 commented 6 months ago

Is your feature request related to a problem? Please describe. Ability to quickly detect an issue in the CQLReplicator, e.g., connection issue, OOMs, and etc.

Describe the solution you'd like

  1. Persist the error from discovery and/or replication Glue jobs to Amazon S3 bucket s3://bucket/ks/tbl/discovery/errors/epoch.err, s3://bucket/ks/tbl/discovery/errors/epoch.err during an Exception event, e.g. {"process_type": "replication", "tile": 0, "error_message": "message", "original_exception": "Scala exception"}
  2. Add a new state get-error-cause to retrieve the latest error from Amazon S3 based on epoch time, check the discovery first and after the replicators.
  3. Add a new state cleanup-errors to remove all *.err files from the S3 bucket