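activations
Published at 2025-07-10. Licensed under CC BY-NC-SA 4.0.
langs: python, pytorch

nn.Mish(): non-monotonic; dips below zero just left of 0, and its minimum is lower than SiLU's.
nn.SiLU(): non-monotonic; dips below zero just left of 0.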
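A quick way to check the "dips below zero, Mish lower than SiLU" note is to evaluate both functions on a grid and compare their minima. The sketch below is plain Python rather than PyTorch: it assumes the standard definitions SiLU(x) = x * sigmoid(x) and Mish(x) = x * tanh(softplus(x)), which are the formulas nn.SiLU and nn.Mish are documented to compute. The helper `grid_min` and the grid range are illustrative choices, not part of any library.

```python
import math


def silu(x: float) -> float:
    # SiLU(x) = x * sigmoid(x)
    return x / (1.0 + math.exp(-x))


def softplus(x: float) -> float:
    # log(1 + e^x), written in a numerically stable form
    return max(x, 0.0) + math.log1p(math.exp(-abs(x)))


def mish(x: float) -> float:
    # Mish(x) = x * tanh(softplus(x))
    return x * math.tanh(softplus(x))


def grid_min(fn, lo: float = -5.0, hi: float = 5.0, steps: int = 10001):
    # Hypothetical helper: brute-force minimum of fn on an evenly spaced grid.
    xs = [lo + (hi - lo) * i / (steps - 1) for i in range(steps)]
    x_best = min(xs, key=fn)
    return x_best, fn(x_best)


silu_x, silu_min = grid_min(silu)
mish_x, mish_min = grid_min(mish)

print(f"SiLU min: {silu_min:.4f} at x = {silu_x:.3f}")
print(f"Mish min: {mish_min:.4f} at x = {mish_x:.3f}")
print("Mish dips lower than SiLU:", mish_min < silu_min)
```

Both minima sit between x = -1.5 and x = -1, at roughly -0.28 for SiLU and -0.31 for Mish. Because each function returns toward 0 as x goes to negative infinity and also passes through 0 at x = 0, neither is monotonic.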