Open skyzh opened 1 month ago
> so probably we should set it to derived
What do you mean? It looks like it's just storing the group-by column. I'm not sure I follow where the distribution of a column is stored here.
I think it's probably better to store it as Distinct(v1) in the column ref logical property, so that the cost model can take that information into account.
Currently, the logical property for an aggregation's group-by looks like:
But actually, group-by could change the distribution of the column, so probably we should set it to derived, or find a way to represent it. If a later join refers to this column, we should treat it differently.
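To make the Distinct(v1) idea concrete, here is a minimal sketch of how a column ref property could record that a group-by changed a column's distribution. Everything in it is hypothetical and not taken from the codebase: the `ColumnRef` enum, its variants, and the `agg_output_column_refs` and `is_unique` helpers are illustrative names only. It also assumes that only a single-column group-by makes that column distinct; with several group-by columns only the combination is unique, so the sketch falls back to derived in that case.

```rust
/// Hypothetical column-ref logical property (illustrative names only).
#[derive(Debug, Clone, PartialEq)]
enum ColumnRef {
    /// Column passed through unchanged from a base table.
    BaseTable { table: String, col_idx: usize },
    /// Column whose values were de-duplicated by a group-by.
    Distinct(Box<ColumnRef>),
    /// Column with no trackable origin or distribution.
    Derived,
}

/// Column refs produced by an aggregation: group-by columns first,
/// then one derived column per aggregate expression.
/// Panics if a group-by index is out of range for `input`.
fn agg_output_column_refs(
    input: &[ColumnRef],
    group_by: &[usize],
    num_aggs: usize,
) -> Vec<ColumnRef> {
    let mut out: Vec<ColumnRef> = if group_by.len() == 1 {
        // Assumption: a single group-by column is unique in the output.
        let col = input[group_by[0]].clone();
        let wrapped = if matches!(col, ColumnRef::Distinct(_)) {
            col // already marked distinct, avoid double wrapping
        } else {
            ColumnRef::Distinct(Box::new(col))
        };
        vec![wrapped]
    } else {
        // Individual columns are not guaranteed distinct here.
        vec![ColumnRef::Derived; group_by.len()]
    };
    out.extend(std::iter::repeat(ColumnRef::Derived).take(num_aggs));
    out
}

/// What a cost model could ask before estimating a later join.
fn is_unique(col: &ColumnRef) -> bool {
    matches!(col, ColumnRef::Distinct(_))
}
```

With something like this, a later join on the group-by column could check `is_unique` and pick a different cardinality estimate than it would for the original base table column, instead of treating both the same.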