-
**Why would this plugin be helpful to the Flyte community**
Users often want to process data using Spark, but the data is passed to a TensorFlow training process. Parquet or other columnar structur…
-
Hello, I am implementing a consumer application with the kinesis-sql library on Spark 3.0 and am running into the following issue:
- Start my consumer and there is no data available.
- Start the prod…
-
Hi,
I could not find a way to restart the streaming job from where it previously left off in the Kinesis stream, even when using Spark streaming's "checkpointLocation" option.
TRIM_HORIZON starts a…
-
Hi, I am getting `java.lang.IllegalStateException: Error while creating OptimisticTransactionDb instance lock : file:/home/centos/rocksdb/rdb/db/state_-1619083917/0/0/LOCK: No locks available.` Error w…
-
I loaded kafka-connect in distributed mode on 2 machines (using the Confluent Kafka Connect Docker image). I configured it to work with streamx and created a job with tasks.max=1 while using s3a (configured in the hdfs-…
-
Someday we want DivConq to offer connections to, or embed, some sort of high-performance distributed processing like Hadoop or [HPCC](http://hpccsystems.com/Why-HPCC).
-
Is sparklens supported with Spark 3.x?
-
Hi,
I am doing a POC with Kinesis using this connector, and I am hoping to use it in production. When "TRIM_HORIZON" is used in a newly created stream, things work fine, but when trying this …
-
### Body
Some of the Hooks provide connection ability; however, quite a few of them do not provide any documentation and/or connection type (missing in the UI). It would be nice if we add missing parts …
-
Add support for Google Cloud Storage.