FE501 – 1214 Nonlinear Programming Week 3


Calculation of the Hessian Matrix

The Hessian matrix is used to determine whether a function $f(x_1,x_2,\dots,x_n)$ is convex, concave or neither.

Expl $f(x,y)=x^2 - y^2$

[Figure: Hessian calculation for $f(x,y)=x^2 - y^2$]

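Note: The first-order principal minors (PMs) are the entries on the diagonal of the Hessian; the second-order PM is the determinant of the matrix.

Here the Hessian is $\begin{bmatrix} 2 & 0 \\ 0 & -2 \end{bmatrix}$, so the first-order PMs are $2$ and $-2$. Because they differ in sign, the function is neither convex nor concave (NCNC).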
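The principal-minor check for a two-variable function can be sketched in a few lines of Python. This is only an illustration under the assumption that the Hessian is constant (as it is for quadratic functions); the helper name `classify_2x2_hessian` is made up for this note.

```python
def classify_2x2_hessian(h):
    """Classify a function whose Hessian is the constant matrix [[a, b], [b, c]]."""
    (a, b), (_, c) = h
    first_order = [a, c]          # first-order principal minors: the diagonal entries
    second_order = a * c - b * b  # second-order principal minor: the determinant
    # Convex: every principal minor is non-negative.
    if all(m >= 0 for m in first_order) and second_order >= 0:
        return "convex"
    # Concave: first-order minors non-positive, second-order minor non-negative.
    if all(m <= 0 for m in first_order) and second_order >= 0:
        return "concave"
    return "neither"


# The Hessian of f(x, y) = x^2 - y^2 is [[2, 0], [0, -2]].
result = classify_2x2_hessian([[2, 0], [0, -2]])
print(result)  # neither
```

Note that an all-zero Hessian (a linear function) is reported as "convex" by this sketch, although such a function is both convex and concave.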

Expl $f(x,y)=x^2 + y^2$

[Figure: Hessian calculation for $f(x,y)=x^2 + y^2$]

Actually, we do not need to calculate the Hessian here, because:

The sum of convex (CVX) functions is CVX.
The sum of concave (CCV) functions is CCV.
If $f(\vec x)$ is CVX, then $-f(\vec x)$ is CCV.

Expl $f(x,y)=-x^2 - y^2$

This function is concave (CCV), because $-x^2$ and $-y^2$ are each CCV and the sum of CCV functions is CCV.

[Figure: Hessian calculation for $f(x,y)=-x^2 - y^2$]

Expl. $f(x,y)=x^2 + y^2 + xy$

[Figure: Hessian calculation for $f(x,y)=x^2 + y^2 + xy$]

Expl. $f(x,y)=-x^2 - y^2 - xy$

NOTICE: A wrong result for this problem was discussed during the class!

[Figure: Hessian calculation for $f(x,y)=-x^2 - y^2 - xy$]
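As a cross-check on hand calculations like the ones in these examples, the Hessian can also be approximated numerically. The sketch below uses central finite differences; the function name `numerical_hessian`, the step size and the evaluation point are choices made for this illustration, not part of the lecture.

```python
def numerical_hessian(f, point, h=1e-3):
    """Approximate the Hessian of f at `point` with central differences."""
    n = len(point)
    hess = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(n):
            def shifted(di, dj):
                # Evaluate f with coordinate i moved by di and coordinate j by dj.
                p = list(point)
                p[i] += di
                p[j] += dj
                return f(*p)
            hess[i][j] = (shifted(h, h) - shifted(h, -h)
                          - shifted(-h, h) + shifted(-h, -h)) / (4 * h * h)
    return hess


def f_pos(x, y):
    return x**2 + y**2 + x * y


def f_neg(x, y):
    return -x**2 - y**2 - x * y


hess_pos = numerical_hessian(f_pos, [1.0, 2.0])  # approximately [[2, 1], [1, 2]]
hess_neg = numerical_hessian(f_neg, [1.0, 2.0])  # approximately [[-2, -1], [-1, -2]]
print(hess_pos)
print(hess_neg)
```

For the first function the principal minors are about 2, 2 and 3, all positive, which agrees with a convex function; for the second the first-order minors are about -2 and the determinant is about 3, which agrees with a concave function.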

Unconstrained Nonlinear Optimization Problems

B.N.11.6-655

We now discuss how to find an optimal solution (if it exists) or a local extremum (local optimum point) for the following unconstrained NLP:

$$\begin{aligned} \max\ (\text{or } \min) \quad & f(x_1, x_2, \dots, x_n) \\ \text{s.t.} \quad & (x_1, x_2, \dots, x_n) \in \mathbb{R}^n \end{aligned} \tag{1}$$

We assume that all first and second partial derivatives of the given function exist.

Gradient and Stationary Point

Collecting the first partial derivatives $\frac{\partial f(\vec x)}{\partial x_i}$, $i=1,2,\dots,n$, into a vector gives:

$$\nabla f(\vec x) = \begin{bmatrix} \frac{\partial f}{\partial x_1} \\ \frac{\partial f}{\partial x_2} \\ \frac{\partial f}{\partial x_3} \\ \vdots \\ \frac{\partial f}{\partial x_n} \end{bmatrix} \tag{2}$$

The vector $\nabla f(\vec x)$ in formula (2) is called the gradient of the function.
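To make the gradient concrete, here is a minimal Python sketch that approximates it with central finite differences. The helper `numerical_gradient`, the step size and the test function are assumptions chosen for illustration.

```python
def numerical_gradient(f, point, h=1e-6):
    """Approximate the gradient of f at `point`, one partial derivative per coordinate."""
    grad = []
    for i in range(len(point)):
        forward = list(point)
        backward = list(point)
        forward[i] += h
        backward[i] -= h
        grad.append((f(*forward) - f(*backward)) / (2 * h))
    return grad


def f(x, y):
    return x**2 + y**2 + x * y


# Analytically, the gradient is (2x + y, 2y + x).
grad_at_origin = numerical_gradient(f, [0.0, 0.0])  # approximately [0, 0]
grad_at_1_2 = numerical_gradient(f, [1.0, 2.0])     # approximately [4, 5]
print(grad_at_origin, grad_at_1_2)
```

The gradient vanishes at the origin, so the origin is a candidate for a local extremum of this function.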

A necessary condition for a point $\vec x = (x_1,x_2,\dots,x_n)$ to be a local extremum for the NLP is:

$$\text{If } \vec x \text{ is an extremum point (max or min) of the function, then } \nabla f(\vec x) = 0 \text{ at that point.} \tag{3}$$

For a single-variable function, this amounts to checking whether $f'(x)$ is zero at the point.

Expl. $f(x)=x^2$

[Figure: illustration for the example $f(x)=x^2$]

We have learned how to determine whether the domain is convex and whether the objective function is convex. We now turn to local maximum and local minimum points and their properties.

Points satisfying $\nabla f(\vec x)=0$ are candidate points for being a local optimum solution.
In other words:
A point $\vec x$ with $\nabla f(\vec x)=0$ is called a stationary point of $f$.

More about stationary point (Optional Information¹)

NOTE: you can skip this.

A stationary point, or critical point, is a point at which the curve's gradient equals zero. Consequently, if a curve has equation $y=f(x)$, then at a stationary point we always have:

$$f'(x) = 0 \tag{4}$$

which can also be written:

$$\frac{dy}{dx} = 0 \tag{5}$$

In other words, the derivative function equals zero at a stationary point.

Different types of stationary points

There are three types of stationary points:

local (or global) maximum points
local (or global) minimum points
horizontal (increasing or decreasing) points of inflexion.

It is worth pointing out that maximum and minimum points are often called turning points.

Turning points

A turning point is a stationary point that is either:

a local (or global) minimum
a local (or global) maximum

each of which is illustrated in the graphs shown here, where the horizontal tangent is shown in orange:

[Figure: the different types of turning points]

How to find a Stationary point?

Given a function f(x) and its curve y=f(x), to find any stationary point(s) we follow three steps:

Step 1: find f′(x)
Step 2: solve the equation f′(x)=0, this will give us the x-coordinate(s) of any stationary point(s).
Step 3 (if needed/asked): calculate the y-coordinate(s) of the stationary point(s) by plugging the x values found in step 2 into f(x).
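The three steps can be traced on a small example. The function $f(x)=x^3-3x$ below is a hypothetical choice for illustration (it does not come from the lecture); its derivative is a quadratic, so Step 2 reduces to the quadratic formula.

```python
import math


def f(x):
    return x**3 - 3 * x


# Step 1: differentiate by hand. f'(x) = 3x^2 - 3, stored as a*x^2 + b*x + c.
a, b, c = 3.0, 0.0, -3.0

# Step 2: solve f'(x) = 0 with the quadratic formula
# (this sketch assumes the discriminant is non-negative).
disc = b * b - 4 * a * c
roots = sorted([(-b - math.sqrt(disc)) / (2 * a),
                (-b + math.sqrt(disc)) / (2 * a)])

# Step 3: plug the x values into f to obtain the y-coordinates.
stationary_points = [(x, f(x)) for x in roots]
print(stationary_points)  # [(-1.0, 2.0), (1.0, -2.0)]
```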

Stationary point and Local Minimum or Local Maximum (1)

(Choose either method (1) or method (2); both lead to the same result but take slightly different approaches to finding the local minimum or maximum.)

As we know:

A point $\vec x$ with $\nabla f(\vec x)=0$ is a stationary point of $f$.

Suppose that for a function $f(\vec x)$ we found a graph like this:

[Figure: graph of $f$ with the stationary point marked in red and a nearby point marked in blue]

(Apparently this is a CCV function).

Since $\nabla f$ vanishes at the stationary point $\vec x$ (the red point), we have $\frac{\partial f(\vec x)}{\partial x_i}=0$, $i=1,2,\dots,n$.

Suppose we increase the value of x slightly, without changing the values of y or z on the graph, and obtain the blue point. The function value at the blue point satisfies:

$$f(\text{blue}) \leq f(\text{red}) \tag{6}$$

which indicates that, within its neighbourhood, the red point is a local maximum point by definition.

To observe this more formally, we use the Hessian matrix.

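Let $x^s$ be a stationary point, and let $H_k(x^s)$ denote the $k$th leading principal minor of the Hessian evaluated at $x^s$. Then:
If $H_k(x^s) > 0$ for $k=1,2,\dots,n$, then $x^s$ is a local minimum point for NLP.
If, for $k=1,2,\dots,n$, $H_k(x^s)$ is nonzero and has the same sign as $(-1)^k$, then $x^s$ is a local maximum for NLP.
If $H_n(x^s) \ne 0$ and neither of the two conditions above holds, then $x^s$ is not a local extremum; it is called a saddle point.
If $H_n(x^s)=0$ for a stationary point, then the stationary point may be a local maximum, a local minimum or a saddle point, and the preceding tests are inconclusive.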
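The four cases above can be sketched as a small Python routine. This is an illustrative sketch, not a library routine: it assumes the Hessian at the stationary point is supplied with exact entries (so comparisons with zero are meaningful), and the helper names are invented here.

```python
def determinant(m):
    """Determinant by cofactor expansion along the first row (fine for small matrices)."""
    if len(m) == 1:
        return m[0][0]
    total = 0
    for col in range(len(m)):
        minor = [row[:col] + row[col + 1:] for row in m[1:]]
        total += (-1) ** col * m[0][col] * determinant(minor)
    return total


def leading_principal_minors(hessian):
    """H_k is the determinant of the top-left k-by-k block, k = 1..n."""
    n = len(hessian)
    return [determinant([row[:k] for row in hessian[:k]]) for k in range(1, n + 1)]


def classify_stationary_point(hessian):
    minors = leading_principal_minors(hessian)
    if all(h > 0 for h in minors):
        return "local minimum"
    # H_k nonzero with the sign of (-1)^k: negative for odd k, positive for even k.
    if all(h != 0 and (h > 0) == (k % 2 == 0) for k, h in enumerate(minors, start=1)):
        return "local maximum"
    if minors[-1] != 0:
        return "saddle point"
    return "inconclusive"


# Hessians at the stationary point (0, 0) of the earlier examples.
print(classify_stationary_point([[2, 1], [1, 2]]))      # local minimum  (x^2 + y^2 + xy)
print(classify_stationary_point([[-2, -1], [-1, -2]]))  # local maximum  (-x^2 - y^2 - xy)
print(classify_stationary_point([[2, 0], [0, -2]]))     # saddle point   (x^2 - y^2)
```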

Stationary point and Local Minimum or Local Maximum (2)

This definition and process were given by our instructor, and you might find them more useful.

Points with $\nabla f(\vec x) = 0$ are the candidate points for being a local optimum solution.

Expl $f(x) =x^2$

[Figure: illustration for the example $f(x)=x^2$]

$$\begin{aligned} f'(x) &= 2x \\ \text{let } 2x = 0 &\Rightarrow x = 0 \end{aligned} \tag{7}$$

So x=0 is a candidate local optimum point (for this problem it is a minimum).

In general, there are three possibilities for a stationary point:

local minimum
local maximum
inflection point

Expl. $f(x)=-x^2$

$$\begin{aligned} f'(x) &= -2x \\ \text{let } -2x = 0 &\Rightarrow x = 0 \ \text{(stationary point)} \\ f''(x) &= -2 \\ f''(0) &= -2 \end{aligned} \tag{8}$$

So for this particular problem, x=0 is a local maximum point.

Expl. $f(x) = x^3$

$$\begin{aligned} f'(x) &= 3x^2 \\ \text{let } 3x^2 = 0 &\Rightarrow x = 0 \end{aligned} \tag{9}$$

But x=0 is neither a local maximum nor a local minimum point; it is an inflection point.

So some other steps are needed to determine the type of the point.

Expl. $f(x)=-2x^2$

$$\begin{aligned} &\text{1. Find the stationary point:} \\ &f'(x) = -4x \\ &\text{let } -4x = 0 \Rightarrow x = 0 \\ &\text{2. Check the second-order derivative at the stationary point:} \\ &f''(x) = -4 < 0 \\ &\text{(the second-order derivative at the given stationary point is negative)} \\ &\therefore \text{local maximum} \end{aligned} \tag{10}$$

Expl. $f(x)=x^3$

$$\begin{aligned} f'(x) &= 3x^2 \\ \text{let } 3x^2 = 0 &\Rightarrow x = 0 \\ f''(x) &= 6x \\ f''(0) &= 0 \\ &\therefore \text{inconclusive} \end{aligned} \tag{11}$$

Actually, it is an inflection point.

Expl. $f(x)=x^4$

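Here x=0 is a local minimum, because $f(x)=x^4 \ge 0 = f(0)$ for every $x$. Note that $f''(0)=0$, so the second-derivative test alone is inconclusive for this function.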

Unconstrained Theorems

Let x be a stationary point, so $f'(x)=0$, and evaluate the higher-order derivatives $f''(x), f'''(x), \dots$ at x.

If the first non-zero derivative occurs at an odd-ordered derivative, then x is an inflection point.

If the first non-zero derivative occurs at an even-ordered derivative, and:

- The value is **positive**, the stationary point x is a **local minimum**.
- The value is **negative**, the stationary point x is a **local maximum**.
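The rule about the first non-zero derivative can be written as a short Python sketch. It assumes the derivative values at the stationary point have already been computed by hand and are passed in order, starting from the second derivative; the function name is invented for this illustration.

```python
def classify_by_higher_derivatives(derivatives):
    """derivatives[0] is f''(x), derivatives[1] is f'''(x), and so on,
    all evaluated at a stationary point x (where f'(x) = 0)."""
    for order, value in enumerate(derivatives, start=2):
        if value != 0:
            if order % 2 == 1:
                return "inflection point"
            return "local minimum" if value > 0 else "local maximum"
    return "inconclusive"  # every supplied derivative was zero


# Derivative values at x = 0 for the examples in these notes.
print(classify_by_higher_derivatives([2]))         # f = x^2:  local minimum
print(classify_by_higher_derivatives([-2]))        # f = -x^2: local maximum
print(classify_by_higher_derivatives([0, 6]))      # f = x^3:  inflection point
print(classify_by_higher_derivatives([0, 0, 24]))  # f = x^4:  local minimum
```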

For a convex function, a local minimum is a global minimum.
For a concave function, a local maximum is a global maximum.
If we cannot determine whether the function is convex or concave, then we need to:
check all the local minimum points and local maximum points
check the boundary points of the function's domain on the feasible region.
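A minimal sketch of this comparison for a single-variable function on a closed interval follows. The function $f(x)=x^3-3x$, the interval $[-3, 3]$ and the stationary points $x=\pm 1$ (from $f'(x)=3x^2-3=0$) are hypothetical choices for illustration, since this function is neither convex nor concave on the interval.

```python
def f(x):
    return x**3 - 3 * x


lower, upper = -3.0, 3.0         # boundary points of the feasible region
stationary_points = [-1.0, 1.0]  # solutions of f'(x) = 3x^2 - 3 = 0

# Candidates: the boundary points plus every stationary point inside the interval.
candidates = [lower, upper] + [x for x in stationary_points if lower <= x <= upper]

global_min = min(candidates, key=f)
global_max = max(candidates, key=f)
print(global_min, f(global_min))  # -3.0 -18.0
print(global_max, f(global_max))  # 3.0 18.0
```

In this example both global optima sit on the boundary, while the local maximum at $x=-1$ and the local minimum at $x=1$ are not global, which is why the boundary points have to be checked as well.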
