-
**Is your feature request related to a problem? Please describe.**
I'm currently facing issues with the PyDeequ support to Apache Spark version 3.4.0, since it is impacting several projects in my org…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/dolphinscheduler/issues?q=is%3Aissue) and found no similar issues.
### What happened
创建工作流,选择数据质量组件,自定义数…
-
**Describe the problem you faced**
We are creating empty hudi tables from java as follows
```
Dataset emptyDF = spark.createDataFrame(new ArrayList(), schemaStruct);
emptyDF.wr…
-
### Current Behaviour
I'm making a very simple Spark dataframe with only one column. Apparently, ProfileReport does not generate the report when I am using Databricks notebook.:
Below is the code th…
-
ConfigMaps don't get cleaned up when sparkapplications are deleted. I think it might be good to include owner references for the configmaps that are created so cascading deletes can happen. I had ~160…
-
### Is your feature request related to a problem? Please describe
## 1. Current status
Currently, users can use the Spark Dataset API to directly read and write OpenSearch indices. The OpenSearch …
-
Hello,
I am trying to update my application to spark 3.5.1 (Scala version 2.12.18, OpenJDK 64-Bit Server VM, 17.0.10) but the scala 2.12 connector keeps throwing this error.
While trying to str…
-
### Backend
VL (Velox)
### Bug description
We’re getting into a weird situation with a method (`org.apache.spark.shuffle.IndexShuffleBlockResolver.writeMetadataFileAndCommit`) that appears to be mi…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues.
### What happened
We ar…
-
I am using the example provided in the Java docs and running this on a local spark cluster.
```
public void run() throws Exception {
SparkSession sparkSession = SparkSession.builder()
…