Contents
Linearly Inseparable Functions
Sigmoid Neurons
Typical Supervised ML Setup
Infeasible Method to Learn Parameters
Taylor Series
Gradient Descent Introduction
Gradient Descent - Weight Update Rule
Representation Power of Sigmoid Neurons