spark Search Results - Githubissues

1000+ results
for spark


simplexspatial/osm4scala #113

Spark 3.5

When I use the current version of the package with spark 3.5 I get the following error: ``` 23/10/27 16:06:36 WARN TaskSetManager: Lost task 2.0 in stage 3.0 (TID 4) (192.168.178.63 executor 0): jav…

BjoernWaechter updated 2 weeks ago
2
apache/hudi #12186

[SUPPORT] HUDI upsert fails at distinct at MapPartitionRDD

**Describe the problem you faced** Running an ETL on a table "Aggregate X Details". When running this on a large dataset i.e ~30M rows of data we partition our dataset into 50 partitions using RDD.co…

IamJusKyrat updated 2 weeks ago
3
delta-io/delta #3892

Optimize on delta table not encoding correct URL for Dayhour…

## Bug #### Which Delta project/connector is this regarding? - [x] Spark - [ ] Standalone - [ ] Flink - [ ] Kernel - [ ] Other (fill in here) ### Describe the problem We write data to …

gprashmi updated 6 days ago
2
Canner/WrenAI #708

Support Spark SQL data source

**Is your feature request related to a problem? Please describe.** Provides users with the ability to connect and set up `Spark SQL` as a data source, enabling integration with distributed data proce…

andreashimin updated 3 weeks ago
8
FederatedAI/FATE #5734

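Uneven resource usage across spark-workers

**Describe the bug** While running an intersection job, I found that the three spark-worker containers did not split the tasks evenly: of three threads in total, one container took two, another took one, and the third was idle, and memory usage differed greatly across the three containers. **To Reproduce** Steps to reproduce the behavior: 1. Go to '...' 2. Click on '....' 3. Scroll d…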

kikyoulg updated 1 week ago
1
ClickHouse/clickhouse-java #1951

Clickhouse JDBC connection with AWS Glue succeeds but all re…

### Describe the bug I'm having issues reading data from CH in my AWS Glue job. There are no connection issues between them, but reading returns no rows. ### Steps to reproduce 1. Set up a CH Clo…

BeautyFades updated 12 hours ago
3
lakehq/sail #301

Slow wide table left outer join on local machine

I've tried using sail for local development of spark jobs. But running simple query on dataset that has size of few GBs makes sail slower than spark. When join is not there then query runs within 10…

eredzik updated 4 days ago
4
happyhappysundays/SparkBox #53

Question: Spark 2

I'm assuming that since the app will still work, that this will still work for the functions that it previously did, correct?

boctok updated 2 months ago
2
jupyter-server/enterprise_gateway #1392

Custom Spark 3.5.3 Kernel

Hello everyone. I am using Jupyter Enterprise Gateway with PySpark sessions on Kubernetes. The elyra/kernel-spark-py:3.2.3 image works as expected. I modified the image and rebuilt it to upgrade t…

fatihmete updated 3 weeks ago
1
apache/incubator-gluten #7896

[CH] To_date diff while set `spark.sql.legacy.timeParserPoli…

### Backend CH (ClickHouse) ### Bug description sql query ``` select to_date('2025-07-22 10:00:00', 'yyyy-MM-dd') ``` in vanilla spark, when set `spark.sql.legacy.timeParserPolicy` value as `…

KevinyhZou updated 1 week ago
1

