Closed · @LOVEANDNINENINE closed this 2 weeks ago
Thanks for the feedback, @LOVEANDNINENINE.
You're correct that supporting multi-accelerator configurations is not a design goal for this codebase, which is focused on reproducibility of the associated research paper. Instead, we assume that users will be limited to whatever is supported by Colab, which today is a single GPU per runtime.
Given this assumption, setting `device_map="auto"` will infer the correct device map and fill the available accelerator (i.e., `cuda:0`). We find this to be sufficiently user-friendly for our goals, and definitely more maintainable than defining custom `device_map`s using the `torch.device` identifier from `DEVICE` for each model/accelerator configuration. If users find that their model does not fit on the accelerators available to them, they will need to consider purchasing a Colab subscription.
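For concreteness, the loading pattern described above looks roughly like this (a minimal sketch; `google/gemma-2b` is a placeholder model ID, not necessarily the one this repo uses):

```python
from transformers import AutoModelForCausalLM

MODEL_ID = "google/gemma-2b"  # placeholder; substitute the repo's model

# device_map="auto" lets Accelerate infer the placement; on a single-GPU
# Colab runtime the entire model lands on cuda:0.
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")
print(model.hf_device_map)  # e.g. {'': 0} on a single-GPU runtime
```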
We strongly encourage users interested in multi-accelerator configurations to first consider the production-grade SynthID Text implementation available in Hugging Face Transformers.
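For reference, watermarked generation with that implementation looks roughly like the following, per the Transformers documentation (the model ID and watermarking keys here are placeholders; use your own private keys):

```python
from transformers import (
    AutoModelForCausalLM,
    AutoTokenizer,
    SynthIDTextWatermarkingConfig,
)

MODEL_ID = "google/gemma-2b"  # placeholder model ID

tokenizer = AutoTokenizer.from_pretrained(MODEL_ID)
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map="auto")

# Watermarking keys are placeholders; keep your real keys private.
watermarking_config = SynthIDTextWatermarkingConfig(
    keys=[654, 400, 836, 123, 340, 443, 597, 160, 57, 29],
    ngram_len=5,
)

inputs = tokenizer(["Once upon a time, "], return_tensors="pt").to(model.device)
outputs = model.generate(
    **inputs,
    watermarking_config=watermarking_config,
    do_sample=True,
    max_new_tokens=50,
)
print(tokenizer.batch_decode(outputs, skip_special_tokens=True))
```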
Researchers looking to directly reproduce this work on multi-accelerator setups (e.g., VMs or enterprise resources) will need to fork this repo and update the code accordingly.
This is just a very minor issue; I personally think your code is very well-structured. However, it seems that multi-GPU machines were not considered (even though I'm only using one GPU). In Colab, the model is loaded with `device_map="auto"`, while the global `DEVICE` is set to `cuda:0`. It would be more user-friendly, especially for beginners, to assign `device_map` from the global variable `DEVICE`, as sketched below.
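Something like the following is what I have in mind (just a sketch; `MODEL_ID` is a placeholder and `DEVICE` stands in for the repo's existing global):

```python
import torch
from transformers import AutoModelForCausalLM

MODEL_ID = "google/gemma-2b"  # placeholder model ID

# Illustrative definition of the repo's global DEVICE.
DEVICE = torch.device("cuda:0" if torch.cuda.is_available() else "cpu")

# Pin the whole model to DEVICE instead of using device_map="auto",
# so placement always follows the global, even on multi-GPU machines.
model = AutoModelForCausalLM.from_pretrained(MODEL_ID, device_map={"": DEVICE})
```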