FRosner / drunken-data-quality

Spark package for checking data quality
Apache License 2.0
222 stars 69 forks source link

Dataset[T] instead of DataFrame #132

Open FRosner opened 7 years ago

FRosner commented 7 years ago

Problem

Following the discussion of https://github.com/FRosner/drunken-data-quality/pull/113#issuecomment-277662486 we should consider changing the API to be more flexible and not only work for DataFrames. We need to see if this is possible or if we have any implementation issues.

Links