Parallel map-style actions

elijahbenizzy commented 4 months ago

Overview

We should be able to launch a whole bunch of actions in parallel. Walking the state machine in parallel is tricky, so this proposes the following:

A MultiAction that performs a map operation over some state field
This then spawns and delegates subactions
These sub actions each have access to a portion of that state field + everything else they need in state
These then write back to state
The user specifies how to reduce the result
This is done via a configurable executor -- asyncio, threading, etc... With default of multithreading (for sync) and asyncio (for async)

Use-cases

PR-bot -- want to determine how to respond for each comment, also for each file
Firing off requests to multiple LLMs at once to determine the best/quickest
Optimizing over multiple ML algorithms at once.
Web scraper -- send a bunch of requests out

Requirements:

Recursive, all the way down. Should use the same executor for sub-actions, sub-sub-actions, etc...
Each individual update (from an action completing) is treated as a state update
Idempotency is encouraged, if not built in (for instance, it'll decide the tasks on the input fields minus the output fields)
Configurable parallelism (max rate, etc...)
Hooks, everything respected as expected
Quick-exit mode (get the first then cancel the others), join-all mode (join them all)

Design Qs:

Should the actions that can be launched from an action all be the same? Or should there be a possibility to launch two of them?
How to handle action failures? Should we allow some? Do retries? End up having the actions themselves handle failure cases and not deal with them at all?
API -- class-based? Function-based?

elijahbenizzy commented 4 months ago

This is very likely dependent on #33 -- we'll need to layer/apply updates to state in a nice way. We can probably get around that but it is worth considering.

skrawcz commented 4 months ago

What I would do today is just do it internal to the action:

@action(...)
def my_parallel_action(state: State, .. ) -> tuple(dict, state):
      # do the parallelization myself with threading/asyncio/etc.
      futures = [ ... ]
      # wait for first, or wait for all
     result = wait_for_futures(futures)
     # do state update here / handle any failures etc
     state = update_state(state, result)
     return result, state

So the question if we provide framework support for something like this is "what are we removing/simplifying/enhancing'"?

elijahbenizzy commented 4 months ago

What I would do today is just do it internal to the action:
@action(...)

def my_parallel_action(state: State, .. ) -> tuple(dict, state):

      # do the parallelization myself with threading/asyncio/etc.

      futures = [ ... ]

      # wait for first, or wait for all

     result = wait_for_futures(futures)
     # do state update here / handle any failures etc
     state = update_state(state, result)
     return result, state
So the question if we provide framework support for something like this is "what are we removing/simplifying/enhancing'"?

Not a requirement (yet). But the enhancement is visibility/telemetry/checkpointing for long-running jobs.

Furthermore, when we combine this with the ability to build actions recursively then it really simplifies things.

Say you're building a PR bot such as ellipsis. You'll want to respond to each comment in parallel, otherwise it'll take too long. Each comment could be a multi-step process. This + composable actions make that feasible. Again, not necessary (there are multiple ways to model it).

elijahbenizzy commented 1 month ago

Plan of action:

Use-case (sample, for scrapegraph):

[ ] List a bunch of URLs
[ ] Run a single step that spawns multiple graphs
[ ] Join the results together
(1) Tracking/pointer storage

One subgraph points to the other. This is a spawner/subgraph relationship (not to be confused with parent
[ ] Pass a common parameter (pointer/container) into the app so we can store it with the tracker
[ ] Don’t have Burr automatically know about it beforehand (yet)
[ ] MVP — view pointer links in the UI — click back and forth, maybe show substeps in graph
(2) Store statically — we can store the subgraph relationships statically so we can display easily:
[ ] Have a node be a subgraph type and register it
[ ] Have it still responsible for running itself (or running many in parallel)
[ ] We can view these as clusters/subgraphs in the UI, although it’ll be a little tricky to display current
(3) Burr handles execution — we can wrap the prior one in this so we automatically launch a ton of them
[ ] More complex — state is either multiple at once, queue-based, or something clever
[ ] Scrapegraph doesn’t need it as it can handle parallelism

elijahbenizzy commented 1 month ago

Design for phase (1):

@action(...)
def spawning_action(..., __context: AppContext):
    app = (
        ApplicationBuilder()...
        .with_spawning_parent(
            __context.app_id, 
            __context.sequence_id)
        .with_tracker(__context.tracker)
        .build()
     )
    ... = app.run(...)
    return ...

DAGWorks-Inc / burr