-
### Code of Conduct
- [X] I agree to follow this project's [Code of Conduct](https://www.apache.org/foundation/policies/conduct)
### Search before asking
- [X] I have searched in the [issue…
-
When using char-dist-features + header features for the domain "dbpedia", we get many features (400+). The training of RandomForestClassifier with Spark fails with the error:
Cause: org.codehaus.jani…
-
I am trying to run mnist.py in standalone cluster mode. For this I changed `master = "local[*]"` to `"spark://MY_MASTER_IP:7077"`, and I submit my task with the following command:
spark-submit --master sp…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/datasophon/datasophon/issues?q=is%3Aissue) and found no similar issues.
### What happened
After correctly configuring Hive on Spark, via Data…
-
I'm running Spark locally: Spark v3.0.1 with Scala 2.12.10. I've also made sure that my local version of Scala is 2.12.
Does the RestDataSource package need to be recompiled for Scala 2.12?
I ran…
-
### Current Behaviour
OS:Mac
Python:3.11
Interface: Jupyter Lab
pip: 22.3.1
[dataset](https://github.com/plotly/datasets/blob/master/2015_flights.parquet)
|DEPARTURE_DELAY|ARRIVAL_DELA…
-
I am using Spark `ML_pipelines` to easily deploy operations that I have developed in `Sparklyr` in a production environment using `SCALA`. It is working pretty well, except for one part: it seems that…
-
For this test case, https://github.com/apache/incubator-iceberg/blob/6f28abfa62838d531be4faa93273965665af933d/spark/src/test/java/org/apache/iceberg/spark/source/TestPartitionValues.java
if I repla…
-
Hi guys,
I'm using Kafka, Spark Streaming 1.62, and spark-redshift. It works well at first, and then I suddenly get the error messages below:
> ========= 2016-09-28 08:47:00 =========
> 16/09/28 08:47…
-
Total data: 60 million
I use Spark to read Hive data and then save it to Doris using **spark-doris-connector-2.3_2.11**, but after loading about 20 million, the Spark job dies with the following log:
```
2022-04…