Closed WangDeyu closed 4 years ago
After excitation layer, there is "clamp operation" but not "sigmoid", so why does it have this modification?
because clamp is faster than sigmoid.
Well, does this modification affect performance?
The performance has no significant difference.
OK, thanks!
After excitation layer, there is "clamp operation" but not "sigmoid", so why does it have this modification?