Skip to content
Technical GlossaryDeep Learning

Swish Activation

A modern activation function that multiplies the input by a sigmoid to create a smooth nonlinear transformation.

Swish is one of the modern activation functions that has attracted attention for its smoothness and expressive behavior. Because it does not shut off completely in the negative region and provides smooth transitions, it can outperform ReLU in some settings. It is often considered a strong alternative in studies examining deep optimization dynamics.