Closed: heilrahc closed this issue 1 year ago.

Original post by heilrahc:

I'm trying to reproduce the results for the pre-trained WRN28-10, but training runs very slowly: after 7 days it has only reached dp-epsilon 1.2, and I can't imagine how long it would take to reach dp-epsilon 8. I'm using an NVIDIA Tesla V100 (32 GB) with the default config file from GitHub. Is it supposed to take this long, or is there a way to speed it up?
Reply:

Hi, this experiment is indeed expected to take a long time, because the hyper-parameters were optimized for pure accuracy regardless of a potential computational trade-off. With that said, your running times do seem a few times longer than expected. Some suggestions:

- Increase `batch_size.per_device_per_step` as far as it can fit in memory on your GPU, which could increase the number of samples processed per second. Overall, it is probably worth tuning that number to optimize the throughput on your specific device (see the sketch after this list).
- Reduce `augmult`: in this fine-tuning setting, it is quite likely that the returns from augmentation multiplicity are somewhat small, so you could try lowering it. This would not perfectly reproduce our results, but the resulting accuracy should remain somewhat close.
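If it helps, below is a minimal throughput probe you could adapt. It is a hypothetical sketch, not the jax_privacy API: `dummy_step`, `throughput`, `FEATURE_DIM`, and `AUGMULT` are made-up stand-ins for your real jitted update step and config values. The idea is to time a few candidate per-device batch sizes and keep the largest one that still fits in memory.

```python
# Hypothetical sketch: time a jitted DP-SGD-style step at several
# per-device batch sizes to find the best throughput on your device.
import time

import jax
import jax.numpy as jnp

FEATURE_DIM = 1024
AUGMULT = 16  # stand-in for the augmult config value


@jax.jit
def dummy_step(params, batch):
    """Stand-in for one DP-SGD update: per-example grads + clipping."""

    def loss_fn(p, x):
        return jnp.mean((x @ p) ** 2)

    # Per-example gradients over the (augmented) batch.
    grads = jax.vmap(jax.grad(loss_fn), in_axes=(None, 0))(params, batch)
    # Clip each per-example gradient to norm 1 before averaging.
    norms = jnp.linalg.norm(grads.reshape(batch.shape[0], -1), axis=1)
    scale = jnp.minimum(1.0, 1.0 / (norms + 1e-12))
    clipped = grads * scale[:, None, None]
    return params - 0.1 * jnp.mean(clipped, axis=0)


def throughput(per_device_batch, n_steps=20):
    key = jax.random.PRNGKey(0)
    params = jax.random.normal(key, (FEATURE_DIM, 10))
    # With augmult > 1, each logical example appears AUGMULT times,
    # so per-step compute scales with per_device_batch * AUGMULT.
    batch = jax.random.normal(key, (per_device_batch * AUGMULT, FEATURE_DIM))
    dummy_step(params, batch).block_until_ready()  # warm-up / compile
    start = time.perf_counter()
    for _ in range(n_steps):
        params = dummy_step(params, batch)
    params.block_until_ready()
    elapsed = time.perf_counter() - start
    return per_device_batch * n_steps / elapsed  # logical examples / second


for bs in (16, 32, 64, 128):
    print(f"per_device_batch={bs}: {throughput(bs):,.0f} examples/s")
```

This also illustrates why reducing `augmult` helps: per-step compute scales with `per_device_batch * augmult`, so halving `augmult` roughly halves the cost of each step.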