torch.nn.functional.gelu¶
- torch.nn.functional.gelu(input, approximate='none') Tensor ¶
When the approximate argument is ‘none’, it applies element-wise the function
where is the Cumulative Distribution Function for Gaussian Distribution.
- When the approximate argument is ‘tanh’, Gelu is estimated with:
- math::
text{GELU}(x) = 0.5 * x * (1 + text{Tanh}(sqrt(2 / pi) * (x + 0.044715 * x^3)))