FINRAOS / MegaSparkDiff

A Spark-based data comparison tool at scale which facilitates software development engineers to compare a plethora of pair combinations of possible data sources. Multiple execution modes in multiple environments enable the user to generate a diff report as a Java/Scala-friendly DataFrame or as a file for future use. Comes with out of the box SparkFactory and SparkCompare tools.
https://finraos.github.io/MegaSparkDiff/
Apache License 2.0
49 stars 26 forks source link

Return a boolean indicating the comparison result along with saving file and exclude some columns in the saved file. #51

Closed Yuming-Wei closed 2 years ago

Yuming-Wei commented 5 years ago

Feature implemented. Created a pull request https://github.com/FINRAOS/MegaSparkDiff/pull/53 Please review and make comments.