Open lepodl opened 1 year ago
I have some questions about the implementation of muP in the rescale and transfer hyperparameter. Specifically, in
I would greatly appreciate it if you could take the time to answer my question!
I have some questions about the implementation of muP in the rescale and transfer hyperparameter. Specifically, in
I would greatly appreciate it if you could take the time to answer my question!