aws-samples / cql-replicator

CQLReplicator is a migration tool that helps you to replicate data from Cassandra to AWS Services
Apache License 2.0
16 stars 8 forks source link

[CQLReplicator on Glue] Add support DQ/DC #52

Open nwheeler81 opened 11 months ago

nwheeler81 commented 11 months ago

Data quality and completeness. Sample and validate data between C* and Keyspaces.

timota commented 9 months ago

As initial (historical) replication it will be good to have a simple counter how many rows have been synced or, if its possible to count rows in target similar to this one https://github.com/aws-samples/amazon-keyspaces-examples/blob/main/scala/datastax-v4/aws-glue/count-table-rows/count-example.scala where you can get a number of rows in target table.

nwheeler81 commented 8 months ago

[] Add count the primary keys in the source or the target [] Add sample the primary keys in the source and check in the target. Compare a hash of the all columns [] Add sample the primary keys in the target and check in the source. Compare a hash of the all columns [] Add sample the row size in the source, returns the stats