awslabs / deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.
Apache License 2.0
3.27k stars, 536 forks

support col match and change to DatasetMatch #529

Closed. VenkataKarthikP closed this 7 months ago.

VenkataKarthikP commented 8 months ago

Issue, if available: https://github.com/awslabs/deequ/issues/501

Description of changes:

Address review comments from here for DatasetMatch.

cc: @rdsharma26

By submitting this pull request, I confirm that my contribution is made under the terms of the Apache 2.0 license.

VenkataKarthikP commented 8 months ago

@rdsharma26 could you please review this? Thanks.

VenkataKarthikP commented 7 months ago

> 2 minor comments, rest looks good. Thanks again for the PR!

Thanks for the review, @rdsharma26. I have updated the docstrings; could you please re-approve and merge the PR? Thanks.