StarRocks / starrocks

StarRocks, a Linux Foundation project, is a next-generation sub-second MPP OLAP database for full analytics scenarios, including multi-dimensional analytics, real-time analytics, and ad-hoc queries.
https://starrocks.io
Apache License 2.0
8.75k stars 1.76k forks source link

Scheduled Recurring Execution of SUBMIT TASK for INSERT...SELECT... Statements #36739

Closed AsaiWiz closed 3 weeks ago

AsaiWiz commented 10 months ago

Enhancement

Current Challenge:

In the current StarRocks system, there is a lack of functionality for scheduling recurring SUBMIT TASK operations for INSERT...SELECT... statements. This limitation requires manual intervention or external scheduling systems to regularly execute these tasks for loading data from one StarRocks table to another. This setup can be cumbersome and prone to errors, especially in cases requiring frequent and consistent data transfers.

Proposed Enhancement:

Automated Scheduling for SUBMIT TASK of INSERT...SELECT... Statements:

Function: Introduce a feature that allows users to schedule recurring SUBMIT TASK operations for INSERT...SELECT... statements within StarRocks. This feature would enable automatic execution of these tasks at predefined intervals (e.g., hourly, daily, weekly). Pros: Automation and Efficiency: Automating these tasks reduces manual effort and the risk of human error, ensuring consistent data loading. Time-Saving: Saves significant time for database administrators and developers by eliminating the need for manual scheduling. Improved Data Consistency and Availability: Regular and timely updates of data ensure better consistency and availability for downstream applications and analyses. Integration and Simplicity: Offers a more integrated solution within StarRocks, simplifying the overall data management process. Cons: Resource Management: Automated tasks may lead to resource contention if not managed properly, especially in systems with limited resources or high workloads. Complexity in Scheduling: Implementing a scheduling feature could introduce complexity in terms of managing overlapping tasks or handling failures. Potential for Increased System Load: Frequent automated tasks could increase the overall load on the system, impacting performance. Conclusion:

Introducing a scheduled recurring execution feature for SUBMIT TASK of INSERT...SELECT... statements in StarRocks would significantly enhance the platform's capabilities in terms of automation, efficiency, and data management. This feature would be a valuable addition for users who require regular data loading and transformation, providing a more streamlined and error-resistant approach to database operations.

murphyatwork commented 7 months ago

41827

github-actions[bot] commented 1 month ago

We have marked this issue as stale because it has been inactive for 6 months. If this issue is still relevant, removing the stale label or adding a comment will keep it active. Otherwise, we'll close it in 10 days to keep the issue queue tidy. Thank you for your contribution to StarRocks!