chufanchen / read-paper-and-code

DiLoCo: Distributed Low-Communication Training of Language Models #82

Open chufanchen opened 7 months ago

chufanchen commented 7 months ago

https://arxiv.org/abs/2311.08105