Open lmatz opened 1 year ago
q15, q16, q17 have similar query patterns, and they may suffer from the same problem.
Q6 requires AVG over
. Flink would output error when running this query.
Q11 requires session_start
.
Q12 requires proc_time
.
Q13 requires proc_time
.
Q14 requires count_char
UDF.
By 4/4, Q19 is supported but not enabled on the performance dashboard.
Validation:
create sink XXX with ( connector = 'blackhole' )
instead ofcreate materialized view
as the other system doesn't have a materialized view. Although some plans shown in the sub-issues are still in the form ofmaterialized view
.Optimization Tasks
Watermark
We notice that at https://github.com/nexmark/nexmark/blob/master/nexmark-flink/src/main/resources/queries/ddl_gen.sql#L37, Nexmark's source table has specified watermark as
FOR dateTime AS dateTime - INTERVAL '4' SECOND
. As watermark helps clean the state table and thus potentially improve the performance of state access, we also track the progress of watermark here:The queries
blackhole
sink.Group By
. After this, all aggregation withGroup By
(non-simple) will no longer choose 2-phase aggregation.Queries made up by @BugenZhao to cover stream operators that are not covered by the standard Nexmark: