Feed Forward Neural: Learning Parameters Intuition

How can we go about learning the parameters of a feedforward neural network? Remember that the gradient descent algorithm for a simple neural network was as follows:

Now, instead of $W$ and $b$ , we have $[W_{1}, W_{2}, ..., W_{L}]$ and $[b_{1}, b_{2}, ..., b_{L}]$ . We can put both of these into one vector called $θ$ , modifying the algorithm to:

Where:

θ = [W_{1}, W_{2}, ..., W_{L}, b_{1}, b_{2}, ..., b_{L}]

\nabla θ_{t} = [\frac{\partial L ( θ )}{\partial W _{1, t}}, ..., \frac{\partial L ( θ )}{\partial W _{L, t}}, \frac{\partial L ( θ )}{\partial b _{1, t}}, ..., \frac{\partial L ( θ )}{\partial b _{L, t}}]

$\nabla θ$ is composed of the gradients of the weight and bias of each layer in the network. So, how can we calculate the loss function and how can we calculate the gradient?

IITM-BS Notes

Feed Forward Neural: Learning Parameters Intuition