Closed Sillobear closed 1 year ago
An implementation for the Swish activation function, with learnable scale and offset parameter, as described in: https://arxiv.org/abs/2004.09576
An implementation for the Swish activation function, with learnable scale and offset parameter, as described in: https://arxiv.org/abs/2004.09576