-
### Feature & Improvement
1. Refactor and optimize the compile script
2. Add support for importing CSV format
3. Support pushdown of `doris.filter.query` in Spark SQL
4. Support doris datev2/datetimev2/d…
-
Hey folks, I have a Spark Application that reads from a source bucket and writes into a target bucket. I'm experiencing some issues when setting the keyfile for the second operation, as a Hadoop confi…
-
I want to use Flink and Spark to write to a MOR table, using bucket CONSISTENT_HASHING for the index, but I find that Spark writes the full data set very quickly while Flink is very slow (Flink write…
-
**Describe the bug**
A user may see an error like the one below when using the Python package, sometimes due to limited network reachability. It is hard to tell which connection (host:port) the error occurred on.…
-
## Bug
#### Which Delta project/connector is this regarding?
- [x] Spark
- [ ] Standalone
- [ ] Flink
- [ ] Kernel
- [ ] Other (fill in here)
### Describe the problem
I'm using a Jup…
-
Hi,
I believe there is an issue specific to macOS users, as I encountered problems getting the workspace example to work correctly on my Mac. I found a similar issue reported by another macOS user…
-
## Expected Behavior
This library works the same with Spark Connect.
## Current Behavior
This library uses `sparkSession.sparkContext` which doesn't work with Spark Connect, here is an exampl…
-
I'm getting the error below while using parameterized data types in BigQuery (https://cloud.google.com/bigquery/docs/reference/standard-sql/data-types#parameterized_data_types) in a Spark Java projec…
-
My team wants to use PySpark in Kubeflow pipeline nodes.
This PySpark pipeline node communicates with a completely independent MinIO instance and runs ANSI SQL commands against it.
When we create…
-
## Bug
#### Which Delta project/connector is this regarding?
- [ ] Spark
- [ ] Standalone
- [X] Flink
- [ ] Kernel
- [ ] Other (fill in here)
### Describe the problem
We have a Flink a…