awslabs / deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache License 2.0

Referential/2.0.0 spark 3.1 #436

Closed dariobig closed 1 year ago

dariobig commented 2 years ago

Issue #, if available:

Description of changes:

New referential integrity and data synchronization checks.

mentekid commented 1 year ago

A different implementation of this was merged in https://github.com/awslabs/deequ/pull/449