Open wulaoshi opened 1 year ago
This is great work, but I got an error when I ran it. I guess torch.nn.GELU() should be used here?
Sorry for the late reply. I suspect this is caused by a difference in torch versions?
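One way to test the version hypothesis is to print the installed torch version and check whether `torch.nn.GELU` is available in that environment. This is only a minimal sketch, not code from the repository; `gelu_support` is a hypothetical helper name, and it assumes the error is related to how GELU is exposed in the installed version, which the thread does not confirm.

```python
import importlib.util


def gelu_support():
    """Report the installed torch version and whether torch.nn.GELU exists.

    Hypothetical diagnostic helper; it does not touch the project's code.
    """
    # If torch is not installed at all, report that instead of raising.
    if importlib.util.find_spec("torch") is None:
        return {"torch_version": None, "has_nn_gelu": False}

    import torch

    return {
        "torch_version": torch.__version__,
        # True when the module-style GELU layer is present in this build.
        "has_nn_gelu": hasattr(torch.nn, "GELU"),
    }


if __name__ == "__main__":
    print(gelu_support())
```

Posting this output together with the full traceback would make it easier to tell whether the two environments really differ.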