Open sshleifer opened 2 years ago
I would love to test your method out on language modeling tasks in fairseq.
Do you have the code to make table 2 (or just the GradInit rows in Table 2) handy?
GradInit
Same, waiting for the code on IWSLT. @zhuchen03
Maybe it is too late, but the code is now available!
I would love to test your method out on language modeling tasks in fairseq.
Do you have the code to make table 2 (or just the
GradInit
rows in Table 2) handy?