Currently, the SparkIngestor doesn't accept any configuration. This will be needed for long-running jobs to choose either StreamingQuery.awaitTermination or StreamingQuery.processAllAvailable.
Additionally, it's currently not possible to pass configuration options to the sparksession via the config file or command line
Tasks
Make SparkIngestor a class that accepts a SparkSession and Configuration which is instantiated by a companion object, much like the other components e.g. KafkaStreamReader etc.
Pass all options with prefix ingestor.spark.options. on to the SparkSession
Currently, the
SparkIngestor
doesn't accept any configuration. This will be needed for long-running jobs to choose eitherStreamingQuery.awaitTermination
orStreamingQuery.processAllAvailable
.Additionally, it's currently not possible to pass configuration options to the sparksession via the config file or command line
Tasks
SparkIngestor
a class that accepts aSparkSession
andConfiguration
which is instantiated by a companion object, much like the other components e.g.KafkaStreamReader
etc.ingestor.spark.options.
on to the SparkSession