awslabs / deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache License 2.0
3.18k stars 519 forks source link

Configurable RetainCompletenessRule #564

Closed zeotuan closed 2 months ago

zeotuan commented 2 months ago

Close #340 By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

zeotuan commented 2 months ago

@rdsharma26 Hi, Please help review this PR.