ApolloResearch / rib

Library for methods related to the Local Interaction Basis (LIB)
MIT License
2 stars 0 forks source link

Allow distributed edge splitting over out_dim #319

Closed danbraunai-apollo closed 8 months ago

danbraunai-apollo commented 8 months ago

Allow distributed edge splitting over out_dim

Description

Motivation and Context

Splitting over the dataset is not possible for large runs that only use a few n_ctx length samples, because there aren't enough samples to split over efficiently. By instead splitting over out_dim, we can distribute the computation much more effectively.

How Has This Been Tested?

Added the following tests for modadd with dist_split_over="out_dim", which mirror the structure of the existing modadd tests which split over the dataset:

NOTE: Multiple distributed tests cannot be run in the test suite without breaking things. Tests with mpi must be run in separate process. I've thus added a tests/run_distributed_tests.sh script for running each mpi test in a separate process, and added a --runmpi flag which, unless given, will not run mpi tests.

Does this PR introduce a breaking change?

No