FRosner / drunken-data-quality

Spark package for checking data quality
Apache License 2.0
222 stars 69 forks source link

Exact DataFrame Equality #114

Closed FRosner closed 7 years ago

FRosner commented 7 years ago

Problem

Is it possible to check the equality of two dataframes in an exact way (without collecting if possible)?

Documentation

https://github.com/FRosner/drunken-data-quality/wiki/Drunken-Data-Quality-4.0.0#equality

FRosner commented 7 years ago

@Gerrrr can you provide the python API eventually?

FRosner commented 7 years ago

Please find the changes in branch issue/114. Feel free to ask me if you need anything for the Python API @Gerrrr. There is already a pending merge request so we can review there.