awslabs / python-deequ

Python API for Deequ
Apache License 2.0
713 stars 134 forks source link

Support Deequ 2.0.0 (Spark 3.1) and 2.0.1 (Spark 3.2) #100

Closed lecardozo closed 2 years ago

lecardozo commented 2 years ago

Related issues

Related PRs

Changes

In order to support latest deequ releases, a few adjustments were necessary:

Additionally, a few technical debts were addressed

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.

lecardozo commented 2 years ago

Hey, @gucciwang!

Wanted to hear your thoughts on this PR. Let me know if there's anything else we could do to enable this support for Deequ>=2.x 😃

Thanks!

RaptorRob commented 2 years ago

Can't wait for the fix of #70

albjoaov commented 2 years ago

🥺

chenliu0831 commented 2 years ago

Is there a test report (if not from CI) that we can review? Just to make sure everything passes.

lecardozo commented 2 years ago

@chenliu0831 You can check the reports of tests here while I'm addressing your suggestions. As soon as we merge it, I'd expect to see them running on awslabs/pydeequ. :)

chenliu0831 commented 2 years ago

@chenliu0831 You can check the reports of tests here while I'm addressing your suggestions. As soon as we merge it, I'd expect to see them running on awslabs/pydeequ. :)

Thanks, please just post a final link again when you are ready, I will review & approve

lecardozo commented 2 years ago

Done! Here is the last run of the CI workflow for c410f72

mycaule commented 2 years ago

Hi are you going to release the python package soon ?

matbmeijer commented 1 year ago

Hi, I would love to use this library. Does this PR support for spark 3.2.1, databricks runtime 10.4 LTS?