StarRocks / starrocks

The world's fastest open query engine for sub-second analytics both on and off the data lakehouse. With the flexibility to support nearly any scenario, StarRocks provides best-in-class performance for multi-dimensional analytics, real-time analytics, and ad-hoc queries. A Linux Foundation project.
https://starrocks.io
Apache License 2.0
9.03k stars 1.82k forks source link

Kafka Resource for Simplified Routine Load Broker Management #53113

Open jselvareef opened 5 hours ago

jselvareef commented 5 hours ago

Feature request

Is your feature request related to a problem? Please describe. Currently, when creating a routine load in StarRocks to consume data from a Kafka topic, we must specify the broker_list property directly in the routine load. This creates significant overhead when managing multiple routine loads.

For instance, with 20 or more routine loads, the same broker_list must be repeated in all of them. If maintenance occurs in Kafka, such as adding or removing brokers, we must update the broker_list in each of these routine loads individually. This is inefficient and prone to error.

Describe the solution you'd like Introduce a new type of "RESOURCE" where type=kafka. This resource would include properties like broker_list. When creating a routine load, instead of specifying the broker_list, users would reference this Kafka resource.

This approach centralizes the management of the broker_list in the Kafka resource. If updates are needed, they can be made in a single place without requiring changes to every routine load that relies on it.

Describe alternatives you've considered

Additional context This feature would significantly improve the user experience for managing Kafka-based routine loads in environments with high routine load counts. It would also reduce the risk of misconfiguration during Kafka maintenance activities.