-
**Describe the bug**
When running Pydeequ 1.0.1 the test generated by ConstraintSuggestionRunner include tests using the isContainedIn() function that fail during execution.
The cause is that the …
-
Пытаюсь запустить pydeequ на кластере databricks на azure.
Пробую данный код
```
from pydeequ.checks import *
from pydeequ.verification import *
# примерные данные
df = spark.sparkContext.par…
-
I saw the big needs for deequ to be used in the python friendly data pipelines. And I have tried this one on Databricks env. I saw a big potential for this project to be very successful. What do you g…
-
Describe the bug
Link to PyDeequ Slack no longer active.
To Reproduce
Steps to reproduce the behavior:
Go to repo Readme
Click on PyDeequ Slack
Expected behavior
Slack opening
…
-
*This is not a bug or feature report.*
I am going to use pydeequ on Glue Notebook instance.
* What is the latest version of pydeequ? 0.1.8? or 1.0.1?
* I can see 1.0.1 on pip repo. https://p…
-
**Describe the bug**
The PyDeequ slack link on this Github page as well as PyPi seems to be outdated and no longer active to join the slack.
**To Reproduce**
Steps to reproduce the behavior:
1. …
-
from pydeequ.checks import *
from pydeequ.verification import *
check = Check(spark, CheckLevel.Warning, "Review Check")
checkResult = VerificationSuite(spark) \
.onData(df) \
.addChe…
-
**Describe the bug**
We are trying to run pydeeque on an already running EMR cluster using ' spark-submit --deploy-mode cluster --py-files s3://path/pydeeque.zip --jars s3://path/deequ-2.0.1-spark-3.…
-
Code:
from pydeequ.profiles import *
result = ColumnProfilerRunner(spark) \
.onData(df) \
.run()
for col, profile in result.profiles.items():
print(profile)
Error:
KeyError: …
-
**Describe the bug**
How did you handle this issue if you run many spark jobs which is running pydeequ checks at the same time?
in my case only one job is running rest of the other jobs are failin…