-
When I use the current version of the package with Spark 3.5, I get the following error:
```
23/10/27 16:06:36 WARN TaskSetManager: Lost task 2.0 in stage 3.0 (TID 4) (192.168.178.63 executor 0): jav…
-
**Describe the problem you faced**
We run an ETL job on the table "Aggregate X Details". When running it on a large dataset (~30M rows), we partition the dataset into 50 partitions using RDD.co…
-
## Bug
#### Which Delta project/connector is this regarding?
- [x] Spark
- [ ] Standalone
- [ ] Flink
- [ ] Kernel
- [ ] Other (fill in here)
### Describe the problem
We write data to …
-
**Is your feature request related to a problem? Please describe.**
Provides users with the ability to connect and set up `Spark SQL` as a data source, enabling integration with distributed data proce…
-
**Describe the bug**
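When running the intersection job, I found that the three spark-worker containers did not split the tasks evenly. There are three threads in total: one container took two, another took one, and the third was idle. Memory usage differs greatly across the three containers.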
**To Reproduce**
Steps to reproduce the behavior:
1. Go to '...'
2. Click on '....'
3. Scroll d…
-
### Describe the bug
I'm having issues reading data from CH in my AWS Glue job. There are no connection issues between them, but reading returns no rows.
### Steps to reproduce
1. Set up a CH Clo…
-
I've tried using Sail for local development of Spark jobs, but on a simple query over a dataset of a few GB, Sail is slower than Spark.
Without the join, the query runs within 10…
-
I'm assuming that, since the app will still work, the functions it previously supported will still work as well. Is that correct?
-
Hello everyone. I am using Jupyter Enterprise Gateway with PySpark sessions on Kubernetes. The elyra/kernel-spark-py:3.2.3 image works as expected.
I modified the image and rebuilt it to upgrade t…
-
### Backend
CH (ClickHouse)
### Bug description
SQL query:
```
select to_date('2025-07-22 10:00:00', 'yyyy-MM-dd')
```
In vanilla Spark, when `spark.sql.legacy.timeParserPolicy` is set to `…