Open ammar-nizami opened 2 years ago
Same as issue #64
I was having the same issue, please make sure you shutdown the spark app before spawning another one. This solved it for me
spark.sparkContext._gateway.shutdown_callback_server()
Then
spark.stop()
@chenliu0831 do you know what will be the solution for this?
Hi @chenliu0831 @ammar-nizami @oscarcampos-c How did you handle this issue if you run many spark jobs which is running pydeequ checks at the same time?
in my case only one job is running rest of the other jobs are failing with same issue.
Traceback (most recent call last): File "/usr/lib/spark/python/lib/py4j-0.10.9-src.zip/py4j/java_gateway.py", line 2207, in start OSError: [Errno 98] Address already in use
During handling of the above exception, another exception occurred: Any solutions or suggestions
Traceback (most recent call last):
File "/opt/ammar/pydeequ_poc_pyspark.py", line 26, in
Describe the bug When creating a check which accepts an assertion as a parameter, I get an error "OSError: [Errno 98] Address already in use"
To Reproduce
Expected behavior Check is created, and results are generated.
Log
Additional context I am submitting the spark job using LIVY from an airflow task. Spark has default configs.