chufanchen/read-paper-and-code
DiLoCo: Distributed Low-Communication Training of Language Models
#82, Open, opened by chufanchen 7 months ago
chufanchen commented 7 months ago:
https://arxiv.org/abs/2311.08105