-
@onurbaran is helping us create the data pipelines; the draft schema is below.
![image](https://user-images.githubusercontent.com/9213230/103807123-03c49300-5067-11eb-959b-3acae20666a9.png)
…
-
The more I work with Spark, the more promising I find the ecosystem. Architecturally, it is very flexible, even though most of the APIs are still immature. But the fact that the computation can be separated…
-
### Search before asking
- [X] I had searched in the [issues](https://github.com/apache/incubator-seatunnel/issues?q=is%3Aissue+label%3A%22bug%22) and found no similar issues.
### What happened
Th…
-
### What happened?
I tried running a few different pipelines using IOs based on `FileBasedSink`. They all failed on Dataflow and/or locally with various errors when the `--streaming` flag is used. Addi…
-
Hey man, checking in on your project after a long time. It looks like you've been busy making updates, but the releases are either impossible to download, horribly out of date, or both.
-
replaces #745
Bulk rejections are a constant pain point when running very large Hadoop/Spark jobs. Each outgoing bulk request is split on the Elasticsearch network thread and sent to each node's b…
-
**Describe the bug**
Precommit build fails with the following error for PRs targeted to 24.12
```
/home/runner/work/spark-rapids/spark-rapids/sql-plugin/src/main/spark332db/scala/org/apache/spark/sq…
```
-
Hi, I am not sure what this issue is related to, so I'll try my luck here. I am trying to get records from my stream with the Qubole Kinesis Spark library:
val kinesis = sparkContextService.SQLC.sparkSes…
-
### What happened?
I have deployed a Spark v3.1.2 cluster on Kubernetes. My Beam job server and Beam SDK container are running on two separate Linux virtual machines. The following keeps executing and …