Closed luckyasser closed 7 years ago
From @sekruse on August 30, 2016 8:51
When SampleOperator
s lack the dataset size, they materialize the dataset in order to count it. While this is logically sound, it tricks the statistics collection. Thus, it would be better if the SampleOperator
s exposed that behavior so that Rheem can better keep track of what is happening.
From @sekruse on August 30, 2016 13:42
The handling of eager execution and channel evaluation will be postponed.
From @sekruse on August 29, 2016 11:1
We have the
SampleOperator
in the basic plugin, but it's not reflected in the API. It should be added. Also, it would be nice if there was no need to make specifying the sampling method optional.Copied from original issue: daqcri/rheem#20