chufanchen / read-paper-and-code

0 stars 0 forks source link

NeurIPS 2023 | Depthwise Hyperparameter Transfer in Residual Networks: Dynamics and Scaling Limit #106

Open chufanchen opened 7 months ago

chufanchen commented 7 months ago

https://arxiv.org/abs/2309.16620