5.C Gradient Search
Gradient Search Example
Let’s use gradient search to find the maximum of the function \[ f(x,y) = - x^4 - x^3 + 10 x y + 2y - 8 y^2 \] whose partial derivatives are \[ \frac{\partial f}{\partial x} = -4 x^3 - 3 x^2 + 10 y \quad \mbox{and} \quad \frac{\partial f}{\partial y} = 10 x + 2 - 16 y \]
First, you define the partial derivatives and then choose your starting point (newx, newy). In this case, we start at (1,1).
partialx = makeFun(-4*x^3 - 3*x^2 + 10*y ~ x&y)
partialy = makeFun(10*x + 2 - 16*y ~ x&y)
newx = 1
newy = 1
Next, you repeatedly run the following code block, which updates the current point by moving 0.05 times the gradient vector. This takes a small step in the uphill direction.
slopex = partialx(newx, newy)
slopey = partialy(newx, newy)
newx = newx + 0.05*slopex
newy = newy + 0.05*slopey
# print new partial derivatives
c(partialx(newx, newy), partialy(newx, newy))
# print new point
c(newx, newy)
Repeatedly run this code block until the partial derivatives are essentially zero (at least two zeros after the decimal point). Congrats! You have found your local maximum.
You will end at the point (1.04, 0.77). But note that starting at another initial point might take you to a different local maximum!
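Rather than re-running the block by hand, you can let a loop repeat the uphill step until both partial derivatives are essentially zero. This is a sketch using plain R functions in place of makeFun; the stopping tolerance 0.005 is an arbitrary choice, not part of the method above.

```r
# Gradient ascent: same uphill update as above, repeated automatically.
partialx <- function(x, y) -4*x^3 - 3*x^2 + 10*y
partialy <- function(x, y) 10*x + 2 - 16*y

newx <- 1
newy <- 1
repeat {
  slopex <- partialx(newx, newy)
  slopey <- partialy(newx, newy)
  # stop once both partials are essentially zero (tolerance is our choice)
  if (max(abs(slopex), abs(slopey)) < 0.005) break
  newx <- newx + 0.05*slopex   # step uphill in x
  newy <- newy + 0.05*slopey   # step uphill in y
}
c(newx, newy)   # should land near the critical point (1.0405, 0.7753)
```

Because the update rule is unchanged, the loop ends at the same local maximum as the manual iterations.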
You can check that this critical point is a local maximum (and not a saddle point) by using the same method that we used for 2D Optimization.
f = makeFun(-x^4 - x^3 + 10*x*y + 2*y - 8*y^2 ~ x&y)
a = 1.0405
b = 0.7753
r = 0.1
theta = seq(0, 2*pi, pi/10)
f(a,b) - f(a + r*cos(theta), b + r*sin(theta))
## [1] 0.10152236 0.06983861 0.04586756 0.03913009 0.05231624
## [6] 0.07998000 0.11073639 0.13203698 0.13536588 0.11957064
## [11] 0.09102447 0.06080448 0.04028374 0.03696369 0.05199590
## [16] 0.08002000 0.11113281 0.13426810 0.14099672 0.12862949
## [21] 0.10152236
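Instead of scanning the printed list by eye, you can ask R directly whether every difference is positive. This is a sketch with plain R functions standing in for makeFun:

```r
# f(a,b) is a local max if it beats f at every sampled point on the
# surrounding circle of radius r, i.e. every difference is positive.
f <- function(x, y) -x^4 - x^3 + 10*x*y + 2*y - 8*y^2
a <- 1.0405
b <- 0.7753
r <- 0.1
theta <- seq(0, 2*pi, pi/10)
all(f(a, b) - f(a + r*cos(theta), b + r*sin(theta)) > 0)   # TRUE
```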
Solutions
Finding a Local Minimum
In order to find a local minimum, we need to move in the opposite direction of the gradient. This is the “downhill direction.” All we need to do in the code is replace the two + 0.05 with - 0.05. We want to find the local minimum of \[f(x,y) = x^2 + 2 x y + 3 x + 4 y + 5 y^2.\] We have \[ f_x(x,y) = 2x + 2y + 3, \qquad f_y(x,y) = 2x + 4 + 10y. \] Here is our updated code to walk downhill. We set up our partials and pick our starting point.
partialx = makeFun(2*x + 2*y + 3 ~ x&y)
partialy = makeFun(2*x + 4 + 10*y ~ x&y)
newx = 1
newy = 1
Now we repeatedly run this “move downhill” block until the partial derivatives are (basically) zero.
slopex = partialx(newx, newy)
slopey = partialy(newx, newy)
newx = newx - 0.05*slopex
newy = newy - 0.05*slopey
# print new partial derivatives
c(partialx(newx, newy), partialy(newx, newy))
## [1] 4.7 7.3
# print new point
c(newx, newy)
## [1] 0.65 0.20
If we run this block 20 or 30 times, the partial derivatives become essentially zero, and we estimate our local minimum to be the point \((-1.37, -0.13)\).
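Because both partial derivatives of this quadratic function are linear in x and y, we can check the gradient-search estimate exactly by solving the 2x2 linear system \(\nabla f = 0\) with R's built-in solve(). This is a verification sketch, not part of the gradient search itself:

```r
# Setting the partials to zero gives the linear system
#   2x +  2y = -3
#   2x + 10y = -4
A <- matrix(c(2, 2,
              2, 10), nrow = 2, byrow = TRUE)
b <- c(-3, -4)
solve(A, b)   # -1.375 -0.125: the exact critical point
```

The exact answer \((-1.375, -0.125)\) agrees with the gradient-search estimate above.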