Open DavieHR opened 5 years ago
hi. I check the original code , which supplied by the paper author, that weight_scale was not like yours. the weight will multiply a constant from he's initializer . I just want to know why you set the weight not like that. THX
hi. I check the original code , which supplied by the paper author, that weight_scale was not like yours. the weight will multiply a constant from he's initializer . I just want to know why you set the weight not like that. THX