Is scheduling taken as a Markovian Process which allows Reinforcement Learning to be used?

hongzimao / decima-sim

Learning Scheduling Algorithms for Data Processing Clusters

Thanks for your interest! Yes, we formulate the scheduling problem as a Markov decision process (MDP). Section 5.2 gives more detail on how we design the scheduling events and scheduling actions so that the resulting MDP is easy for the learning agent to train on.
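To make the MDP framing concrete, here is a minimal sketch of a toy scheduling MDP. It is not the decima-sim environment: the class `ToySchedulingMDP`, the single-executor setup, and the reward (elapsed time multiplied by the number of jobs still in the system) are all assumptions chosen for illustration. Each call to `step` is one scheduling event, and the action is the index of the pending job to run next.

```python
from dataclasses import dataclass


@dataclass
class ToySchedulingMDP:
    """Toy single-executor scheduling MDP (hypothetical, not decima-sim)."""

    durations: list  # run time of each pending job; assumed non-empty at start
    clock: float = 0.0

    def state(self):
        # The pending durations are all that is needed to predict the future,
        # which is what makes this state Markovian.
        return tuple(self.durations)

    def step(self, action):
        """Run pending job `action` to completion.

        Returns (next_state, reward, done).
        """
        jobs_in_system = len(self.durations)
        elapsed = self.durations.pop(action)
        self.clock += elapsed
        # Assumed reward: penalize the time that passed, once per waiting job.
        reward = -elapsed * jobs_in_system
        return self.state(), reward, not self.durations


def run_episode(durations, policy):
    """Roll out `policy` (a function state -> action) and sum the rewards."""
    env = ToySchedulingMDP(list(durations))
    total, done = 0.0, False
    while not done:
        action = policy(env.state())
        _, reward, done = env.step(action)
        total += reward
    return total


def shortest_job_first(state):
    return min(range(len(state)), key=lambda i: state[i])


def first_in_first_out(state):
    return 0
```

With this reward, the episode return equals the negative sum of job completion times, so a policy that earns a higher return also achieves a lower average completion time. An RL agent would replace the hand-written policies with a learned one.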

The neural network (NN) design is for processing the state information. Section 5.1 describes how we use a graph neural network to process job information embedded in computation graphs of arbitrary shape and size. The NN and the MDP are two different things: you can think of the NN as an information-processing tool and the MDP as the problem formulation.
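As a rough illustration of how one function can process graphs of any shape and size, the sketch below computes a per-node embedding by passing messages from children to parents in a DAG. It is a simplification under stated assumptions, not the network from the paper: `embed_dag`, the scalar features, and the fixed `weight` (a stand-in for learned parameters) are all hypothetical.

```python
import math


def embed_dag(features, children, weight=0.5):
    """Return one embedding per node of a DAG.

    features: dict mapping node -> float (raw feature, e.g. remaining work)
    children: dict mapping node -> list of child nodes (must be acyclic)
    weight:   stand-in for a learned parameter (hypothetical)
    """
    cache = {}

    def embed(node):
        if node not in cache:
            # Aggregate the embeddings of all children, however many exist.
            message = sum(embed(child) for child in children.get(node, []))
            # A single tanh unit stands in for the learned transform.
            cache[node] = features[node] + math.tanh(weight * message)
        return cache[node]

    return {node: embed(node) for node in features}


# The same function handles a three-node job and a single-node job.
job_a = embed_dag({"a": 1.0, "b": 2.0, "c": 0.5}, {"a": ["b", "c"]})
job_b = embed_dag({"x": 4.0}, {})
```

The embeddings only summarize the state. Which action to take, and what reward follows, is still defined by the MDP.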

Hope this helps.

Is scheduling taken as a Markovian Process which allows Reinforcement Learning to be used? #9