-
i had write some code to consume kafka stream data, but got error
following is my raydp code
```python
import ray
import raydp
ray.init()
packages = "org.apache.spark:spark-sql-kafka-0-10_2.12…
-
### SynapseML version
0.10.2
### System information
- **Language version** (e.g. python 3.8, scala 2.12): Python 3.7.4
- **Spark Version** (e.g. 3.2.3): 3.0.0
- **Spark Platform** (e.g. Syn…
-
Problem:TimeDelta cannot be cast to java.time.Duration error
catboost version: ai.catboost:catboost-spark_3.1_2.12:1.0.4
Operating System: DataBricks 10.4 LTS ML (includes Apache Spark 3.1.2, Scala …
-
Hey folks, I have a Spark Application that reads from a source bucket and writes into a target bucket. I'm experiencing some issues when setting the keyfile for the second operation, as a Hadoop confi…
-
**Describe the bug**
spark-rapids-ml RandomForestClassifier/Regressor which is built on cuml throws an exception when setting n_streams=2 (or any value > 1) on the node with 2 processes running, each…
-
Hello, I followed all the installation steps in the [documentation](https://nlu.johnsnowlabs.com/docs/en/install), but it was not enough to get the library working.
Then I had to install the JDK, s…
-
Ibis is a popular project that provides Pandas-like API for backend compute engines and databases. People from the Ibis community is interested in adding backend support for Apache Sedona: https://git…
-
**Hi! :) I changed some of the code to use with Python 3,**, however, I have some issues.
I cannot find a library with the FP-growth algorithm that works. I tried the pyspark one and the FP-growth. I…
-
https://docs.aws.amazon.com/glue/latest/dg/aws-glue-programming-etl-libraries.html#develop-local-docker-image
I followed the instructions above to setup its docker image container from DockerHub [l…
-
https://hsh0107.github.io/2019/12/19/pyspark-%E6%B7%BB%E5%8A%A0checkpoint.html
sthoo的博客