Light Aggregator (not keeping the whole field group)

miguelantonio commented 9 years ago

Right now the aggregators keep the field group and do the math on the fly, I've seen the following approach in Tibco Streambase to process aggregation: 1- the tuple (as group of attributes/columns, basically a map) gets into the aggregator and only the aggregator input fields are considered and the rest is forgotten (the rest of the tuple) 2- the value gets processed into simple math: add, subtract. This is in a deconstructed manner, ej. avg is just one Long/BigInteger count and one Long/BigInteger sum. The actual field gets forgotten at this time. 3- when the aggregation window closes (emits, etc.) the 'heavy' math is done: multiplication, division, etc.

I've seen millions of tuples get processed into this type of aggregator with very low CPU and memory consumption and attack ships on fire off the shoulder of Orion

cjnolet commented 9 years ago

I think this ticket may be invalid after some discussions we've had. Can it be closed?

miguelantonio commented 9 years ago

It´s super invalid, I was seeing things that day, like the Ice King, I´ll close it

calrissian / flowmix

Light Aggregator (not keeping the whole field group) #45