rapidsai / deployment

RAPIDS Deployment Documentation
https://docs.rapids.ai/deployment/stable/

Using Apache Beam with Dask has a scaling limit of 100-200 GPUs #235

Closed jacobtomlinson closed 1 year ago

jacobtomlinson commented 1 year ago

This is a high-level tracker issue for issues/PRs in both Dask and Beam.

### Issues
- [ ] https://github.com/apache/beam/issues/26669
- [ ] https://github.com/dask/dask/issues/10291
### PRs
- [ ] https://github.com/dask/dask/pull/10294

### Summary

I'm investigating some unexpected behaviour that limits how far Apache Beam workloads can scale with the Dask runner. This was originally discovered on NGC Base Command Platform, although the bug appears to affect all Beam-on-Dask deployments.

Beam uses dask.bag to store collections of items (which Beam calls a PCollection). Calling beam.Create(iterable) is translated to dask.bag.from_sequence(iterable). It then translates calls to functions like beam.ParDo to dask.bag.map.
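As a rough sketch of the semantics of that translation (plain Python, no Beam or Dask required; the function names here are illustrative stand-ins, not real APIs): `beam.Create` materialises an input iterable as a collection, and `beam.ParDo` maps a function over every element of it.

```python
# Plain-Python sketch of the Beam -> Dask translation described above.
# `create` and `par_do` are hypothetical helpers, not Beam/Dask APIs.

def create(iterable):
    # beam.Create(iterable) ~ dask.bag.from_sequence(iterable):
    # turn the input into a collection of items (a "PCollection").
    return list(iterable)

def par_do(collection, fn):
    # beam.ParDo(fn) ~ dask.bag.map(fn): apply fn to each item.
    return [fn(item) for item in collection]

pcollection = create(range(5))
result = par_do(pcollection, lambda x: x * 2)
print(result)  # [0, 2, 4, 6, 8]
```

In the real runner the collection is a `dask.bag`, so the map step is distributed across the bag's partitions rather than run in a single list comprehension.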

The problem we are investigating was discovered when a Beam workload wouldn't scale beyond ~200 workers. Additional workers added to the Dask cluster would sit idle. After some investigation, the root cause appears to be Dask Bag's default partitioning scheme, which caps the bag at 100 partitions (and therefore 100 tasks) once it contains more than a certain number of items.
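To illustrate the effect of that cap (this is a hypothetical reimplementation of the behaviour described above, not Dask's actual code), the task count stops growing once the bag is large enough, so extra workers have nothing to run:

```python
# Illustrative sketch of a capped default partitioning scheme.
# `npartitions` is a hypothetical helper mimicking the behaviour
# described in this issue, not a Dask function.

def npartitions(n_items, cap=100):
    if n_items < cap:
        return n_items  # small bags: roughly one task per item
    return cap          # large bags: task count capped at 100

# A million-item bag still yields only 100 tasks, so a cluster
# with 300 workers leaves at least 200 of them idle.
tasks = npartitions(1_000_000)
print(tasks)                 # 100
print(max(0, 300 - tasks))   # 200 idle workers
```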

This means it is never possible to fully utilize clusters with more than 200 GPUs.


jacobtomlinson commented 1 year ago

This has been fixed in Dask. There are further improvements that could be made to Beam, but there seems to be little to no engagement from the Beam community on this. So I'm going to close this out as completed and leave https://github.com/apache/beam/issues/26669 open in case Beam folks want to collaborate and iterate further.