awslabs / python-deequ

Python API for Deequ
Apache License 2.0
713 stars 134 forks source link

Added a dockerfile for building and testing the package #195

Closed rdsharma26 closed 5 months ago

rdsharma26 commented 5 months ago

Description of changes:

This dockerfile can be used to setup and run the tests in the Python Deequ package. This way, we do not need to install any dependencies in our local workspaces. Right now, it only builds against Spark version 3.3. Will be adding other versions in a future PR.

Verified that the docker run output is the same as that of the PR workflow.

===== 103 passed, 1 skipped, 63 xfailed, 172 warnings in 276.10s (0:04:36) =====

By submitting this pull request, I confirm that you can use, modify, copy, and redistribute this contribution, under the terms of your choice.