The Taylor Series is a way of approximating continuously differentiable functions within a particular window using polynomials of degree . The higher the degree, the better the approximation. For example, if we have the function , its linear approximation () would look like this:
Notice that outside the interval marked by the black points, linear approximation does not provide an accurate picture of the original function.Hence, we’d like as small an interval as possible. A quadratic approximation of would look like:
This is a more accurate approximation than the linear one.
If we know the value of a function at a particular point, we can approximate the value at a different point near it based on our chosen degree of approximation. If we’d like to approximate our loss function and we know the value of the function at , we can approximate the function’s value at a point as follows:
- Linear approximation:
- Quadratic:
and refer to the first and second order derivatives of the function.