1. Introduction
Consider the following constrained optimization problem:
\[
(P)\qquad \min\ f(x)\quad \text{s.t.}\quad g_i(x)\le 0,\ \ i=1,\dots,m,\ \ x\in\mathbb{R}^{n},
\]
where the functions $f,g_i:\mathbb{R}^{n}\to\mathbb{R}$, $i=1,\dots,m$, are continuously differentiable. Let $X=\{x\in\mathbb{R}^{n}: g_i(x)\le 0,\ i=1,\dots,m\}$ be the feasible solution set, which we assume to be nonempty.
For a general constrained optimization problem, the penalty function method has attracted much attention from researchers in both theoretical and practical aspects. However, to obtain an optimal solution of the original problem, the conventional quadratic penalty function method usually requires the penalty parameter to tend to infinity, which is undesirable in practical computation. In order to overcome this drawback of the quadratic penalty function method, exact penalty functions were proposed to solve problem (P). Zangwill [1] first proposed the $l_1$ exact penalty function
\[
F_1(x,\rho)=f(x)+\rho\sum_{i=1}^{m}\max\{g_i(x),0\},
\]
where $\rho>0$ is a penalty parameter. It was proved that there exists a fixed constant $\rho^{*}>0$ such that, for any $\rho\ge\rho^{*}$, any global solution of the exact penalty problem is also a global solution of the original problem. Therefore, exact penalty function methods have been widely used for solving constrained optimization problems (see, e.g., [2,3,4,5,6,7,8,9]).
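As a concrete illustration (not taken from the paper), the Python sketch below evaluates the $l_1$ exact penalty function $f(x)+\rho\sum_{i=1}^{m}\max\{g_i(x),0\}$ for a user-supplied objective f and constraint list g_list; all names and the toy problem are ours.

```python
import numpy as np

def penalty_l1(f, g_list, rho):
    """Classical l1 exact penalty function for constraints g_i(x) <= 0:
    F(x, rho) = f(x) + rho * sum_i max(g_i(x), 0)."""
    def F(x):
        violation = sum(max(g(x), 0.0) for g in g_list)
        return f(x) + rho * violation
    return F

# Toy usage: minimize x^2 subject to 1 - x <= 0 (i.e., x >= 1).
f = lambda x: float(x[0]) ** 2
g_list = [lambda x: 1.0 - float(x[0])]
F = penalty_l1(f, g_list, rho=10.0)
print(F(np.array([0.5])), F(np.array([1.5])))  # the infeasible point is penalized
```

For $\rho$ larger than the exact penalty threshold (here any $\rho>2$), the unconstrained minimizer of F coincides with the constrained minimizer $x=1$ of the toy problem, which is exactly the property exploited by exact penalty methods.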
Recently, the nonlinear penalty function of the following form has been investigated in [
10,
11,
12,
13]:
where the objective function is assumed to be positive and the exponent $k$ is positive
. It is called the
k-th power penalty function in [
14,
15]. Obviously, if $k=1$, this nonlinear penalty function reduces to the $l_1$ exact penalty function. In [12], it was shown that the exact penalty parameter of this nonlinear penalty function can be substantially smaller than that of the $l_1$ exact penalty function. Rubinov and Yang [
13] also studied a penalty function as follows:
where
such that
for any
, and
. The corresponding penalty problem of
is defined as
In fact, the original problem
is equivalent to the following problem:
Obviously, the penalty problem
is the
exact penalty problem of problem
defined as (
1).
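For orientation only, a representative form of the k-th power penalty function, as it commonly appears in the literature on k-th power penalties, is
\[
F_k(x,\rho)=\Big[f(x)^{k}+\rho\sum_{i=1}^{m}\big(\max\{g_i(x),0\}\big)^{k}\Big]^{1/k},\qquad k>0,
\]
with $f(x)>0$; for $k=1$ it reduces to the $l_1$ exact penalty function. This should be read as a representative form, which may differ in detail from the penalty functions defined above.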
It is noted that these penalty functions
and
are not differentiable at any point x such that $g_i(x)=0$ for some $i$, which prevents the use of gradient-based methods and causes numerical instability in their implementation when the value of the penalty parameter becomes large [
3,
5,
6,
8]. In order to use existing gradient-based algorithms, such as Newton's method, it is necessary to smooth the exact penalty function. Thus, the smoothing of exact penalty functions has attracted much attention [
16,
17,
18,
19,
20,
21,
22,
23,
24]. Pinar and Zenios [
21] and Wu et al. [
22] discussed a quadratic smoothing approximation to nondifferentiable exact penalty functions for constrained optimization. Binh [
17] and Xu et al. [
23] proposed smoothing techniques that make the
exact penalty function second-order differentiable. It was shown that the optimal solution of the smoothed penalty problem is an approximate optimal solution of the original optimization problem. Zenios et al. [
24] discussed an algorithm for the solution of large-scale optimization problems.
In this study, we aim to develop the smoothing technique for the nonlinear penalty function (
3). First, we define the following smoothing function
by
where
and
. By considering this smoothing function, a new smoothing nonlinear penalty function is obtained. This smoothing nonlinear penalty function allows us to convert a constrained optimization problem into the minimization of a sequence of continuously differentiable functions, and we propose a corresponding algorithm for solving constrained optimization problems.
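For illustration, a commonly used continuously differentiable approximation of $\max\{t,0\}$ is the piecewise-quadratic smoothing sketched below in Python, in the spirit of the quadratic approximations of [21,22]; it is given only as an example and may differ from the smoothing function defined above.

```python
def smooth_plus(t, eps):
    """C^1 piecewise-quadratic approximation of max(t, 0), with eps > 0.

    Illustrative only; the paper defines its own smoothing function.
        0               if t <= 0,
        t^2 / (2 eps)   if 0 < t <= eps,
        t - eps / 2     if t > eps.
    The pieces agree in value and first derivative at t = 0 and t = eps,
    and 0 <= max(t, 0) - smooth_plus(t, eps) <= eps / 2 for all t.
    """
    if t <= 0.0:
        return 0.0
    if t <= eps:
        return t * t / (2.0 * eps)
    return t - eps / 2.0
```

Replacing each term $\max\{g_i(x),0\}$ in a penalty function by such a smooth approximation yields a continuously differentiable penalty function whose minimizers approach those of the nonsmooth penalty as the smoothing parameter tends to zero.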
The rest of this paper is organized as follows. In
Section 2, we propose a new smoothing penalty function for inequality constrained optimization problems and prove some of its fundamental properties. In
Section 3, an algorithm based on the smoothed penalty function is presented and its global convergence is proved. In
Section 4, we report the results of applying this algorithm to three test problems and compare them with the results obtained by similar algorithms. Finally, conclusions are given in
Section 5.
2. Smoothing Nonlinear Penalty Functions
In this section, we first construct a new smoothing function. Then, we introduce our smoothing nonlinear penalty function and discuss its properties.
Let
be as follows:
where
. Obviously, the function
is
on
for
, but it is not
for
. It is useful in defining exact penalty functions for constrained optimization problems (see, e.g., [
14,
15,
21]). Consider the nonlinear penalty function
and the corresponding penalty problem
As previously mentioned, for any
and
, the function
is defined as:
where
.
Figure 1 shows the behavior of
and
.
In the following, we discuss the properties of .
Lemma 1. For and any , we have
- (i)
is continuously differentiable for on , where
- (ii)
.
- (iii)
.
Proof. (i) First, we prove that is continuous. Obviously, the function is continuous at any . We only need to prove that it is continuous at the separating points: 0 and .
(1) For
, we have
which implies
Thus, is continuous at .
(2) For
, we have
which implies
Thus, is continuous at .
Next, we will show that is continuously differentiable, i.e., is continuous. Actually, we only need to prove that is continuous at the separating points: 0 and .
(1) For
, we have
which implies
Thus, is continuous at .
(2) For
, we have
which implies
Thus, is continuous at .
(ii) For
, by the definition of
and
we have
When
, let
. Then, we have
. Consider the function:
and we have
Obviously,
for
. Moreover,
and
. Hence, we have
When
, we have
(iii) For
, from (ii), we have
which is
.
This completes the proof. ☐
In this study, we always assume that
and
is large enough, such that
for all
. Let
Then,
is continuously differentiable at any
and is a smooth approximation of
. We have the following smoothed penalty problem:
Lemma 2. We have that
for any and .
Proof. For any
, we have
Note that
for any
.
By Lemma 1, we have
which implies
Hence,
This completes the proof. ☐
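For the illustrative piecewise-quadratic smoothing $\tilde p_\varepsilon(t)$ sketched in the Introduction (an assumption of ours, not the smoothing function analyzed in Lemma 2), the analogous uniform error bound can be verified directly:
\[
0 \le \max\{t,0\}-\tilde p_\varepsilon(t)=
\begin{cases}
0, & t\le 0,\\
t-\tfrac{t^{2}}{2\varepsilon}, & 0<t\le \varepsilon,\\
\tfrac{\varepsilon}{2}, & t>\varepsilon,
\end{cases}
\qquad\text{so}\qquad
0 \le \max\{t,0\}-\tilde p_\varepsilon(t)\le \frac{\varepsilon}{2}\ \text{ for all } t.
\]
Thus the gap between the smoothed and nonsmooth penalty terms vanishes uniformly as $\varepsilon\to 0$; bounds of this type are what drive error estimates such as those in Lemma 2 and Theorem 1.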
Lemma 3. Let and be optimal solutions of problem and problem , respectively. If is a feasible solution to problem , then is an optimal solution for problem .
Proof. Under the given conditions, we have that
Therefore, , which is .
Since
is an optimal solution and
is feasible to problem
, which is
Therefore, is an optimal solution for problem .
This completes the proof. ☐
Theorem 1. Let and be the optimal solutions of problem and problem , respectively, for some and . Then, we have that
Furthermore, if satisfies the conditions of Lemma 3 and is feasible to problem , then is an optimal solution for problem .
Proof. By Lemma 2, for
and
, we obtain
From the definition of
and the fact that
are feasible for problem
, we have
Note that
, and from (
8), we have
Therefore, , which is .
As
is feasible to
and by Lemma 3,
is an optimal solution to
, we have
Thus, is an optimal solution for problem .
This completes the proof. ☐
Definition 1. A feasible solution of problem is called a KKT point if there exists a such that the solution pair satisfies the following conditions:
Theorem 2. Suppose the functions in problem (P) are convex. Let and be the optimal solutions of problem and problem , respectively. If is feasible to problem , and there exists a such that the pair satisfies the conditions in Equations (9) and (10), then we have that
Proof. Since the functions
are continuously differentiable and convex, we see that
After applying the conditions given in Equations (
9), (10), (
12) and (13), we see that
Therefore,
. Thus,
Since
is feasible to
, which is
then
and, by
, we have
Combining Equations (
14) and (
15), we have that
which is
This completes the proof. ☐
3. Algorithm
In this section, by considering the above smoothed penalty function, we propose an algorithm, stated as Algorithm 1, for finding an optimal solution of problem .
Definition 2. For , a point is called an ϵ-feasible solution to if it satisfies .
Algorithm 1: Algorithm for solving problem
Step 1: Let the initial point . Let and choose a constant such that . Let and go to Step 2.
Step 2: Use as the starting point to solve the following problem:
Let be an optimal solution of (the solution of is obtained by the BFGS method given in [25]).
Step 3: If is -feasible for problem , then the algorithm stops and is an approximate optimal solution of problem . Otherwise, let and . Then, go to Step 2.
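The following Python code is a minimal sketch of the loop in Algorithm 1, not the authors' implementation: it uses scipy.optimize.minimize with BFGS as the inner solver in place of the BFGS method of [25], and the smoothing term _smooth_plus, the update factors sigma and eta, and the toy test problem are illustrative assumptions rather than the quantities defined above.

```python
import numpy as np
from scipy.optimize import minimize

def _smooth_plus(t, eps):
    # C^1 piecewise-quadratic approximation of max(t, 0); illustrative only.
    if t <= 0.0:
        return 0.0
    return t * t / (2.0 * eps) if t <= eps else t - eps / 2.0

def smoothed_penalty_algorithm(f, g_list, x0, rho0=1.0, eps0=1.0,
                               sigma=10.0, eta=0.1, tol=1e-6, max_iter=50):
    """Sketch of Algorithm 1: repeatedly minimize a smoothed penalty
    function, enlarging rho and shrinking eps, until the inner minimizer
    is tol-feasible (max_i g_i(x) <= tol)."""
    rho, eps = rho0, eps0
    x = np.asarray(x0, dtype=float)
    for _ in range(max_iter):
        def F(z):  # smoothed penalty function for the current rho and eps
            return f(z) + rho * sum(_smooth_plus(g(z), eps) for g in g_list)
        x = minimize(F, x, method='BFGS').x        # Step 2: inner smooth problem
        if max(g(x) for g in g_list) <= tol:       # Step 3: tol-feasibility test
            return x, rho, eps
        rho, eps = sigma * rho, eta * eps          # enlarge rho, shrink eps
    return x, rho, eps

# Toy usage (not one of the paper's test problems):
# minimize (x1 - 2)^2 + (x2 - 1)^2  subject to  x1 + x2 - 2 <= 0.
f = lambda z: (z[0] - 2.0) ** 2 + (z[1] - 1.0) ** 2
g_list = [lambda z: z[0] + z[1] - 2.0]
x_star, _, _ = smoothed_penalty_algorithm(f, g_list, x0=np.zeros(2))
print(x_star, f(x_star))  # expected close to (1.5, 0.5) with value near 0.5
```

The stopping test mirrors the ϵ-feasibility check of Definition 2 (here with tolerance tol), and the parameter updates correspond to Step 3 of Algorithm 1: the penalty parameter is enlarged and the smoothing parameter is reduced until an approximately feasible minimizer is found.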
Remark 1. From we can easily see that as , the sequence and the sequence .
Theorem 3. For , suppose that for and the set
Let be the sequence generated by Algorithm 1. If and the sequence is bounded, then is bounded and the limit point of is the solution of .
Proof. First, we prove that
is bounded. Note that
From the definition of
, we have
Suppose, on the contrary, that the sequence
is unbounded and without loss of generality
as
, and
. Then,
, and from Equations (
17) and (
18), we have
which contradicts the sequence
being bounded. Thus,
is bounded.
Next, we prove that the limit point of is the solution of problem . Let be a limit point of . Then, there exists a subset such that for , where is the set of natural numbers. We have to show that is an optimal solution of problem . Thus, it is sufficient to show and .
(i) Suppose . Then, there exist and a subset such that for any and some .
If
, from the definition of
and the fact that
is the optimal solution corresponding to the
j-th values of the parameters
for any
, we have
which contradicts
and
.
If
or
, from the definition of
and the fact that
is the optimal solution corresponding to the
j-th values of the parameters
for any
, we have
which contradicts
and
.
Thus, .
(ii) For any
, we have
We know that , so . Therefore, holds.
This completes the proof. ☐
4. Numerical Examples
In this section, we apply Algorithm 1 to three test problems. The proposed algorithm is implemented in Matlab (R2011A, The MathWorks Inc., Natick, MA, USA).
In each example, we take . Then, we expect to obtain an -solution to problem with Algorithm 1; the numerical results are presented in the following tables.
Example 1. Consider the following problem ([20], Example 4.1) For
, let
and choose
. The results are shown in
Table 1.
For
, let
and choose
. The results are shown in
Table 2.
The results in
Table 1 and
Table 2 show the convergence of Algorithm 1, and the objective function values in the two cases are almost the same. From
Table 1, we obtain an approximate optimal solution
after two iterations, with function value
. In [
20], the obtained approximate optimal solution is
with function value
. Numerical results obtained by our algorithm are slightly better than the results in [
20].
Example 2. Consider the following problem ([22], Example 3.2) For
, let
, and choose
. The results are shown in
Table 3.
For
, let
and choose
. The results are shown in
Table 4.
The results in
Table 3 and
Table 4 show the convergence of Algorithm 1, and the objective function values in the two cases are almost the same. From
Table 3, we obtain an approximate optimal solution
after two iterations, with function value
. In [
22], the obtained global solution is
with function value
. Numerical results obtained by our algorithm are much better than the results in [
22].
Example 3. Consider the following problem ([26], Example 4.1) For
, let
and choose
. The results are shown in
Table 5.
For
, let
, and choose
. The results are shown in
Table 6.
The results in
Table 5 and
Table 6 show the convergence of Algorithm 1, and the objective function values in the two cases are almost the same. From
Table 5, we obtain an approximate optimal solution
after two iterations, with function value
. In [
26], the obtained approximate optimal solution is
with function value
. Numerical results obtained by our algorithm are slightly better than the results in [
26].