深度学习：激活函数曲线总结

编程入门行业动态更新时间:2024-10-23 16:14:23

深度学习：激活<a href=https://www.elefans.com/category/jswz/34/1771370.html style= 函数曲线总结"/>

在深度学习中有很多时候需要利用激活函数进行非线性处理，在搭建网路的时候也是非常重要的，为了更好的理解不同的激活函数的区别和差异，在这里做一个简单的总结，在pytorch中常用的激活函数的数学表达形式，同时为了更直观的感受，给出不同激活函数的曲线形式，方便查询。

import torch
import torch.nn as nn
x = torch.linspace(-4, 4, 400) # 在-4和4之间画400个点。

给负值一个斜率，不全为零。

数学公式：
LeakyReLU ( x ) = { x , if x ≥ 0 negative_slope × x , otherwise \text{LeakyReLU}(x) = \begin{cases} x, & \text{ if } x \geq 0 \\ \text{negative\_slope} \times x, & \text{ otherwise } \end{cases} LeakyReLU(x)={x,negative_slope×x, if x≥0 otherwise

leakyrelu = nn.LeakyReLU(negative_slope=0.01)

negative_slope 是一个小于1的值，通常设置为0.01，用于控制在输入小于0时的输出斜率。这意味着在nn.LeakyReLU中，负数输入会乘以negative_slope，而正数输入保持不变。

ReLU是一个常用的激活函数，它将负数值设为0，保持正数值不变。

数学公式：
ReLU ( x ) = ( x ) + = max ⁡ ( 0 , x ) \text{ReLU}(x) = (x)^+ = \max(0, x) ReLU(x)=(x)+=max(0,x)

relu = nn.ReLU()

Tanh函数将输入映射到-1和1之间

数学公式
Tanh ( x ) = tanh ⁡ ( x ) = exp ⁡ ( x ) − exp ⁡ ( − x ) exp ⁡ ( x ) + exp ⁡ ( − x ) \text{Tanh}(x) = \tanh(x) = \frac{\exp(x) - \exp(-x)} {\exp(x) + \exp(-x)} Tanh(x)=tanh(x)=exp(x)+exp(−x)exp(x)−exp(−x)

Tanh = nn.Tanh()

PReLU是具有可学习参数的激活函数，用于克服ReLU的一些问题

数学公式
RReLU ( x ) = { x if x ≥ 0 a x otherwise \text{RReLU}(x) = \begin{cases} x & \text{if } x \geq 0 \\ ax & \text{ otherwise } \end{cases} RReLU(x)={xaxif x≥0 otherwise

PReLU = nn.PReLU(num_parameters=1)

所有点上都是连续的和可微的，训练快

数学公式：
ELU ( x ) = { x , if x > 0 α ∗ ( exp ⁡ ( x ) − 1 ) , if x ≤ 0 \text{ELU}(x) = \begin{cases} x, & \text{ if } x > 0\\ \alpha * (\exp(x) - 1), & \text{ if } x \leq 0 \end{cases} ELU(x)={x,α∗(exp(x)−1), if x>0 if x≤0

ELU = nn.ELU()

数学公式：
SELU ( x ) = scale ∗ ( max ⁡ ( 0 , x ) + min ⁡ ( 0 , α ∗ ( exp ⁡ ( x ) − 1 ) ) ) \text{SELU}(x) = \text{scale} * (\max(0,x) + \min(0, \alpha * (\exp(x) - 1))) SELU(x)=scale∗(max(0,x)+min(0,α∗(exp(x)−1)))