Open betatim opened 9 years ago
Contribution: "Low latency, scalable concurrency control in Gaudi Hive"
The move from sequential to concurrent (multithreaded, task-based) Gaudi complicates the problem of decision making in the space of control flow and data flow constraints of algorithm execution precedence and requires a dedicated concurrency control system for it. Concurrent environment imposes the requirements of at least 2D scalability and soft real-time reasoning on this system. In this contribution I will describe a new, low latency and scalable, concurrency control system (CCS), deployed by me to the Gaudi Hive prototype. I will present its new algorithmic approach, performance measurements, as well as will point out its advantages over the previously used CCS.
Contribution: "Predictive scheduling for peak throughput in Gaudi Hive"
The Gaudi Hive prototype introduces two dimensions of concurrency: at intra- and inter-event levels. Previous generation of the prototype was based on "myopic" concurrency control with only reactive decision making effectively possible at the intra-event level. Due to significant heterogeneity/asymmetry of typical LHCb/ATLAS algorithms' precedence constraints and some degree of precedence freedom for a given data processing workflow, this may lead to frequent degradation of intra-event disclosure dynamics. In such situations, the only strategy to saturate the framework's occupancy is to go for higher degrees of inter-event concurrency, which, in turn, leads to increase of framework's overall overhead and, subsequently, to degradation of its peak throughput. In this contribution I will present an alternative scheduling strategy, which utilizes the ability of the new generation concurrency control for efficient proactive decision making to generically maximize the intra-event concurrency disclosure dynamics by predicting optimal intra-event execution plans. This allows to minimize the need in excessive degrees of inter-event concurrency, and thus to maximize the peak throughput of the framework.
We could have two talks of about 15 each on Wednesday morning about this.
Some thoughts:
Given that the subject of the contributions just cannot be any closer to the topic of the session and is just it, may I have 20m/talk? I could well fit in 15m, but I prefer some spare time not to rush. More over, the sessions on Wed and Thu seem to be completely empty anyway for the moment..
Correspondingly to bullets:
This issue is to organise the schedule of the Scheduling/Framework sessions.
Please post your contributions/ideas below and we will update the table