L35: Nesterov accelarated gradient descent

Welcome to Lecture 35 of the course "Deep Learning" by Prof. Mitesh M.Khapra Full Course: https://study.iitm.ac.in/ds/course_pa... Video Overview Nesterov Accelerated Gradient Descent (NAG) is an optimization technique that enhances the standard momentum method by incorporating a lookahead mechanism. Introduced by Yurii Nesterov in 1983, NAG anticipates the future gradient direction, allowing the algorithm to make more informed updates and accelerate convergence. This approach reduces oscillations and overshooting, leading to more stable and efficient training in machine learning models. In NAG, the update rule involves computing the gradient at the estimated future position, which combines the current position and the momentum term. This foresight enables the algorithm to adjust its trajectory more effectively, especially in regions with steep gradients. As a result, NAG often outperforms standard gradient descent and momentum methods, particularly in convex optimization problems." About IIT Madras' online Bachelor of Science programme IIT Madras offers four-year BS programmes that aim to provide quality education to all, irrespective of age, educational background, or location. The BS programme has multiple levels, which provide flexibility to students to exit at any of these levels. Depending on the courses completed and credits earned, the learner can receive a Foundation Certificate from IITM CODE (Centre for Outreach and Digital Education), Diploma(s) from IIT Madras, or BSc/BS Degrees from IIT Madras. For more details, Visit: https://www.iitm.ac.in/academics/stud... #NesterovAcceleratedGradient #GradientDescent #Optimization #MachineLearning #DeepLearning #ConvexOptimization #MomentumMethod #AIAlgorithms #NeuralNetworks