Learn Before
Formula

Batch Gradient Descent Update Formula

Assuming that the error function is J(w)J(w) with one parameter ww, to minimize the error, we can update the weight ww as follows:

w=wαdJ(w)dww = w - \alpha \cdot \frac{dJ(w)}{dw}

where α\alpha is the learning rate, and dJ(w)dw\frac{dJ(w)}{dw} is the derivative of J(w)J(w) with respect to ww. If the error function has two or more parameters, for example, a weight ww and a bias bb, we can update them one by one:

w=wαJ(w,b)ww = w - \alpha \cdot \frac{\partial J(w,b)}{\partial w}

b=bαJ(w,b)bb = b - \alpha \cdot \frac{\partial J(w,b)}{\partial b}

where \partial denotes the partial derivative.

Image 0

0

2

Updated 2026-06-07

Tags

Data Science