Can you explain the behavior of the Swish activation function?

The Swish activation function is a relatively new activation function that has gained popularity in deep learning neural networks. It was introduced by researchers at Google in 2017.

The Swish activation function is defined as follows:

f(x) = x * sigmoid(beta * x)