ReLU Activation Function

ReLU stands for Rectified Linear Unit. It is one of the most widely used activation functions in deep learning, appearing in almost all convolutional neural networks and other deep learning models.

The ReLU (Rectified Linear Unit) function is an activation function that is currently more popular than the sigmoid and tanh functions.
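For reference, here is a minimal sketch of the three functions in Python (assuming NumPy; the function names are ours):

    import numpy as np

    def relu(x):
        # ReLU outputs the input for positive values and 0 otherwise: max(0, x)
        return np.maximum(0.0, x)

    def sigmoid(x):
        # Sigmoid squashes the input into the range (0, 1)
        return 1.0 / (1.0 + np.exp(-x))

    def tanh(x):
        # Tanh squashes the input into the range (-1, 1)
        return np.tanh(x)

    x = np.array([-2.0, -0.5, 0.0, 0.5, 2.0])
    print(relu(x))  # [0.  0.  0.  0.5 2. ]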



Advantages of ReLU function

When the input is positive, there is no gradient saturation problem.

The calculation is very fast. The ReLU function involves only a simple linear relationship, so both the forward and the backward pass are much faster than with tanh and sigmoid (tanh and sigmoid require computing exponentials, which is slower), as sketched below.
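A rough sketch of why the ReLU passes are cheap (the helper names here are illustrative, assuming NumPy):

    import numpy as np

    def relu_forward(x):
        # Forward pass: one element-wise comparison, no exponentials
        return np.maximum(0.0, x)

    def relu_backward(x, grad_output):
        # Backward pass: the local gradient is 1 where x > 0, and 0 elsewhere
        return grad_output * (x > 0)

    def sigmoid_backward(x, grad_output):
        # For comparison: sigmoid needs exp() even in the backward pass
        s = 1.0 / (1.0 + np.exp(-x))
        return grad_output * s * (1.0 - s)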

Disadvantages of ReLU function

When the input is negative, ReLU is completely inactive: a negative input produces an output of zero, and the neuron can effectively die. This is known as the Dead Neurons problem. During forward propagation this is not a problem in itself; some regions of the input are sensitive while others are not. But during back propagation, if the input is negative, the gradient is exactly zero, which is the same problem the sigmoid and tanh functions have.
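A small illustration of this zero gradient for negative inputs (a sketch, assuming NumPy; the values are made up):

    import numpy as np

    x = np.array([-3.0, -1.0, 2.0])          # pre-activations of three neurons
    grad_output = np.array([1.0, 1.0, 1.0])  # gradient arriving from the next layer

    relu_grad = grad_output * (x > 0)        # ReLU gradient is 0 wherever the input was negative
    print(relu_grad)                         # [0. 0. 1.] -> the first two neurons get no update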

The output of the ReLU function is either 0 or a positive number, which means that ReLU is not a zero-centered activation.

The ReLU function is typically used only within the hidden layers of a neural network model.
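For instance, a common pattern (a sketch assuming TensorFlow/Keras; the layer sizes are arbitrary) is ReLU in the hidden layers and a different activation, such as sigmoid or softmax, at the output layer:

    from tensorflow import keras

    model = keras.Sequential([
        keras.layers.Dense(64, activation='relu', input_shape=(20,)),  # hidden layer with ReLU
        keras.layers.Dense(32, activation='relu'),                     # hidden layer with ReLU
        keras.layers.Dense(1, activation='sigmoid'),                   # output layer for binary classification
    ])
    model.compile(optimizer='adam', loss='binary_crossentropy')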

To overcome the Dead Neurons problem of the ReLU function, a modification called Leaky ReLU was introduced. It adds a small slope on the negative side to keep the updates alive and avoid dead neurons.
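A minimal sketch of Leaky ReLU (assuming NumPy; the slope of 0.01 is a common but arbitrary choice):

    import numpy as np

    def leaky_relu(x, alpha=0.01):
        # Like ReLU, but negative inputs keep a small slope instead of being zeroed out
        return np.where(x > 0, x, alpha * x)

    x = np.array([-3.0, -1.0, 2.0])
    print(leaky_relu(x))  # [-0.03 -0.01  2.  ] -- negative inputs still receive updates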

Another variant, derived from both ReLU and Leaky ReLU, is known as the Maxout function, which we will discuss in detail in other articles.

