pentium3 / sys_reading

system paper reading notes
235 stars 12 forks source link

Scaling Large Language Model Training to More Than 10,000 GPUs #330

Open pentium3 opened 9 months ago