Open Xuzhengz opened 2 weeks ago
help!!!
come here
I am not get your question. The lifecycle is source -> transform -> sink
Only source can generate data, transform only receive the data from source.
How does Transform know if the source has been executed?
When transfrom recived data, it means sourcce has been executed.
Do you want to ask how to judge source job total complete, has read all data?
it seem can't solve it with flatmap. It seems to be multi-line data processing. @corgy-w cc
I am not get your question. The lifecycle is source -> transform -> sink
Only source can generate data, transform only receive the data from source.
How does Transform know if the source has been executed?
When transfrom recived data, it means sourcce has been executed. My requirement is to customize a tansform component for grouping aggregation. Aggregate operations will be carried out after each row of data comes in, and the aggregate results will be output to sink after the source is completed
Recently I saw that tansform supports multiple lines of output, so I'm trying to see if I can implement an aggregate transform
Is there a better solution to implement aggregation operations on seatunnel, even if the parallelism is set to 1,Because I am currently facing many requirements that require aggregation operations to be implemented on the portal instead of the database side @liunaijie cc
@corgy-w is implementing multi-line transform and aggregation , You can communicate the progress
Is there a better solution to implement aggregation operations on seatunnel, even if the parallelism is set to 1,Because I am currently facing many requirements that require aggregation operations to be implemented on the portal instead of the database side @liunaijie cc
No, SeaTunnel is focus on Data integration, the flatMap
function recently support is use to explode data from one row to multiple rows.
From your description, you want the aggregation function, it's compute function, I think we are not going to support it. cc @Hisoka-X
Is there a better solution to implement aggregation operations on seatunnel, even if the parallelism is set to 1,Because I am currently facing many requirements that require aggregation operations to be implemented on the portal instead of the database side @liunaijie cc
No, SeaTunnel is focus on Data integration, the
flatMap
function recently support is use to explode data from one row to multiple rows.From your description, you want the aggregation function, it's compute function, I think we are not going to support it. cc @Hisoka-X
Yes, there is no way to solve data sharing between multiple degrees of parallelism, possibly on different nodes.
Search before asking
Description
How does Transform know if the source has been executed? Because I want to use the newly added FlatMap multi line output function of Transform to achieve aggregation effect, that is, after the source is completed, the result of Transform and aggregation is output line by line to the destination
Usage Scenario
No response
Related issues
No response
Are you willing to submit a PR?
Code of Conduct