Closed anonymousA123 closed 5 months ago
it seems self.gamma and self.beta all need squeeze, i.e. self.gamma.squeeze()
Thank you so much for your reply. May I ask if you have encountered the following error message?? Because when I add squeeze to self.gamma and self.beta, it reports an error: Additionally, does training with the latest version of the code still generate gibberish messages like in issue #23?
Stale issue message
While running the updated code, I encountered an issue as follows:
I would greatly appreciate any guidance or assistance you can provide to help resolve this issue.
Upvote & Fund