Account for platform start-up costs - Githubissues

rheem-ecosystem / rheem

Rheem - a cross-platform data processing system

https://rheem-ecosystem.github.io

5 stars 0 forks source link

Account for platform start-up costs #13

Closed luckyasser closed 7 years ago

luckyasser commented 7 years ago

From @sekruse on July 10, 2016 11:20

In general, platforms might incur some overhead for initialization (e.g., Spark does so). This overhead cannot be pinned to single Operators and can therefore not be expressed in our current cost model. However, it is an important criterion to determine whether to use only Java (no overhead) or some "heavy-weight" framework. Thus, we should model this overhead explicitly:

[ ] model the overhead as part of PlanImplementations' TimeEstimates
[ ] in LatentOperatorPruningStrategy, treat the usage of a Platform as an interesting property (the initial overhead for single Operators might be redeemed over the complete PlanImplementation)

Copied from original issue: daqcri/rheem#4