awslabs / python-deequ

Python API for Deequ
Apache License 2.0
676 stars 131 forks source link

Issue #1 : Improve documentation for Spark session creation (Databricks etc.) #85

Open rwitzel opened 2 years ago

rwitzel commented 2 years ago

Issue #1

Why?

The many comments in #1 suggest that users overlook the required installation of JARs when installing Deequ.

What?

Now the documentation clarifies the importance of installing the JARs.

arpheno commented 2 years ago

Looks good to me

bakintunde commented 2 years ago

Perhaps include more description about the type of error obtained which requires this installation of the jar file. Also, that both pydeequ package from pypi and the jar file from maven central need to be installed.

Thanks.