awslabs / python-deequ

Python API for Deequ
Apache License 2.0
669 stars 131 forks source link

spark_version not capturing from OS environ in Azure databricks cluster #143

Open sujitgh opened 11 months ago

sujitgh commented 11 months ago

Describe the bug spark_version not capturing from OS environ in Azure databricks cluster. To Reproduce Steps to reproduce the behavior:

  1. Go to 'Azure databricks workspace'
  2. Click on 'notebook'
  3. Scroll down to 'try to run the pydeequ as a job'
  4. See error

image

Expected behavior We can use spark session to retrieve spark version. image

Screenshots

Desktop (please complete the following information):

Smartphone (please complete the following information):

Additional context Add any other context about the problem here.

mykola-yesypchuk-inflection commented 11 months ago

AWS Glue is also failed with the same error.

chenliu0831 commented 10 months ago

There's a workaround and we are tracking the improvement here https://github.com/awslabs/python-deequ/issues/140.