Skip to content
Information Sciences
Machine Learning
1847/modern
Intermediate

Gradient Descent Update Rule

θt+1=θtηθL(θt)\theta_{t+1} = \theta_t - \eta \nabla_\theta L(\theta_t)

Update parameters by stepping opposite to the gradient of the loss—learning by hill descent.

By Augustin-Louis Cauchy, Various

Information Sciences
Gradient Descent Update Rule
1847/modern · Augustin-Louis Cauchy
Source Verified
89%

Rabbit Hole Mode

Five doors into the universe behind this equation. Choose your path.

Launch collection — full story, visual, sound & share card
Why it matters: The engine behind virtually all deep learning training.

Discoverers: Augustin-Louis Cauchy, Various (1847/modern)

What does it mean?

Update parameters by stepping opposite to the gradient of the loss—learning by hill descent.

Why should I care?

The engine behind virtually all deep learning training.

Equation Compass

Variables & Units

SymbolNameUnitMeaning
θθParametersModel weights
ηηLearning rateStep size
LLLossObjective function
GradientDirection of steepest ascent

Worked Example

η too large → divergence; too small → slow convergence.

AI Guide (Pro)

Ask questions about equations and get answers grounded in the Equation Universe catalog.

Share this equation

Equation Universe

Gradient Descent Update Rule

θt+1=θtηθL(θt)\theta_{t+1} = \theta_t - \eta \nabla_\theta L(\theta_t)

Real-world impact

Quantum technology

Wave mechanics enables next-generation devices.

Photo: Unsplash — quantum hardware

Update parameters by stepping opposite to the gradient of the loss—learning by hill descent.

equation-universe.vercel.app

Post