-
I connect my IDE to a Databricks cluster via databricks-connect, and I get this error when I try to execute the following code:
code:
```
from pyspark.sql import SparkSession
spark = SparkSess…
```
-
As discussed with @mnrozhkov and @tapadipti, it would be great if MLEM would somehow simplify model application within Airflow DAGs. Two questions to start with:
1. We can create something like MLEMO…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/incubator-xtable/issues?q=is%3Aissue) and found no similar issues.
### Please describe the bug 🐞
I wr…
-
**Describe the bug**
spark-rapids-ml RandomForestClassifier/Regressor, which is built on cuML, throws an exception when n_streams=2 (or any value > 1) is set on a node with 2 processes running, each…
-
I've been working on a new PySpark JupyterLab build and was testing extension support one at a time using the Docker container `quay.io/jupyter/pyspark-notebook:x86_64-spark-3.5.0`. I installed the `s…
-
When I try to create a new Jupyter notebook, I get the options below:
Python 2
Python 3
Python 3.5
Python 3.6
xpython
xpython does not give me an option to choose the Python version.
When I create xp…
-
### SynapseML version
com.microsoft.azure:synapseml_2.12:0.10.2
### System information
- **Language version**: Python 2.7.18
- **Spark Version**: 3.3.1
- **Spark Platform**: PySpark 3.8.10
-**…
-
I am using the extension to connect to an Azure Synapse interactive PySpark session. The session expires every 1 to 1.5 hours with the following error message:
`An error was encountered:
Invalid status c…
-
> I used an XLX file and 3 Parquet files as sources and performed some transformations. The code ran fine and I was able to see the lineage in Spline, but I could see only the 3 Parquet files as s…
wajda updated
3 months ago
-
https://github.com/AlexIoannides/pyspark-example-project/blob/13d6fb2f5fb45135499dbd1bc3f1bdac5b8451db/tests/test_etl_job.py#L64
You should use `data_transformed`, not `expected_data`, for actual tra…