Closed dohlee closed 1 year ago
@dohlee oh yup, i later updated it here
probably should have just gone with your solution 😅
Oh sorry I didn't notice that commit. No problem at all :)
I'll keep on reviewing the code. Really informative!
@dohlee ok, do send me an email if you get any great results with fine tuning!
Thank you for the quick fix for my recent issue (https://github.com/lucidrains/enformer-pytorch/issues/21)!
nn.init.dirac_
is a great solution.However there's one more thing: we have to initialize the weight with a Identity matrix multiplied by 2!
Of note, the official deepmind Sonnet implementation of enformer uses
snt.initializers.Identity
withgain=2
. (Please refer code lines here and here in deepmind implementation)Thanks, Dohoon