Momentum vs Normalization in Deep Learning - Part 2 – Day 34
Comparing Momentum and Normalization in Deep Learning: A Mathematical Perspective

Momentum and normalization are two pivotal techniques in deep learning that improve the efficiency and stability of training. This article explores the mathematics behind these methods, provides examples with and without these techniques, and demonstrates why they are beneficial for deep learning models.

Comparing Momentum and Normalization

Momentum: Smoothing and Accelerating Convergence

Momentum is an optimization technique that modifies standard gradient descent by adding a velocity term to the update rule. This velocity term is a running average of past gradients, which helps the optimizer to continue moving in...
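To make the velocity term concrete, here is a minimal sketch in plain Python that compares gradient descent with and without momentum. Everything in it is an illustrative assumption rather than something taken from this article: the toy loss f(w) = 0.5 * (w1^2 + 25 * w2^2), the starting point, the learning rate of 0.03, the momentum coefficient of 0.9, and the classical update form v = beta * v + g followed by w = w - lr * v. Some libraries and texts scale the gradient by (1 - beta) instead, which changes the effective step size.

```python
def loss(w):
    # Toy quadratic bowl (an assumption for illustration):
    # shallow along w[0], steep along w[1].
    return 0.5 * (w[0] ** 2 + 25.0 * w[1] ** 2)


def gradient(w):
    # Analytic gradient of the toy loss above.
    return [w[0], 25.0 * w[1]]


def run(lr, beta, steps, w0=(10.0, 1.0)):
    """Gradient descent with a velocity term; beta=0.0 gives plain gradient descent."""
    w = list(w0)
    v = [0.0, 0.0]  # velocity starts at zero
    for _ in range(steps):
        g = gradient(w)
        # Velocity accumulates past gradients with exponential decay.
        v = [beta * vi + gi for vi, gi in zip(v, g)]
        # Parameters move along the velocity, not the raw gradient.
        w = [wi - lr * vi for wi, vi in zip(w, v)]
    return w


plain = run(lr=0.03, beta=0.0, steps=100)
with_momentum = run(lr=0.03, beta=0.9, steps=100)

print("loss without momentum:", loss(plain))
print("loss with momentum:   ", loss(with_momentum))
```

With these particular settings, the run with momentum should end at a noticeably lower loss after the same number of steps, because the velocity keeps building along the shallow w[0] direction where plain gradient descent only creeps forward. Other learning rates or momentum coefficients can behave differently, so treat this as a demonstration of the mechanism and not as a tuning recommendation.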