-
**Describe the problem you faced**
When writing files into S3 after batch streaming from Kafka, it takes around 2 hours to finish the step "Tagging" while the EMR Cluster looks like being almost i…
-
**Description:**
$subject in jdbc and mysql modules. Basically the requirement is to send out large data set in the http response without loading them in the memory at once.
Earlier this was su…
-
Hi,
I could not find a way to start the streaming job from where it left off previously in the kinesis stream even with using Spark streaming's "checkpointLocation" option.
TRIM_HORIZON starts a…
-
Hi Jdbc2S team,
Thank you so much for this repo.
I'm working on a personal project where I need to fetch real-time data from the Database using Spark Structured Streaming API.
I tried 2 approach…
-
This tutorial https://getindata.com/blog/dbt-run-real-time-analytics-on-apache-flink-announcing-the-dbt-flink-adapter/
...after downloading the Kafka Table Connector `curl -LO https://repo.maven.…
-
### Is your feature request related to a problem? Please describe.
Often I find myself needing to dump some content quickly into sql server, and map the columns sequentially. While I can create a `Da…
-
### What kind an issue is this?
- [x] Bug report. If you’ve found a bug, please provide a code snippet or test to reproduce it below.
The easier it is to track down the bug, the faster it i…
-
Some users complains that they do not want some rows' expression error prevent the whole query's processing and result.
### `TRY()` function
presto and trino do this
https://trino.io/docs/current/f…
-
Hi,SHC Team
I got some trouble when i writing my structured streaming application with shc.
I got the new data from kafka source ,and I want to check whether the new data is different from the old o…
-
**Describe the bug**
Execution fails with the following error message on UI
```
Buffer overflow. Available: 0, required: 80893
Serialization trace:
underlying (org.apache.spark.util.BoundedPr…