ray-project / ray_beam_runner

Ray-based Apache Beam runner
Apache License 2.0
41 stars 12 forks source link

[batch] Create Work Item Scheduler #3

Open pdames opened 2 years ago

pdames commented 2 years ago

The Ray Work Item Scheduler takes work items from the batch FnApiRunner's topological scheduler as input, and submits them for execution as Ray worker tasks. Its design should be based on the FnApiRunner's WorkerHandlerManager.

Ray worker tasks should be based on Beam's EmbeddedPythonWorker and other worker handlers. Each Ray worker task takes a ProcessBundleRequest proto to execute, and stores/retrieves its state as object references in the central Pipeline State Manager.

pdames commented 2 years ago

This work is required as part of https://github.com/ray-project/ray_beam_runner/issues/2