Abstract
The subspace minimization conjugate gradient (SMCG) methods proposed by Yuan and Stoer are efficient iterative methods for unconstrained optimization, in which the search directions are generated by minimizing a quadratic approximate model of the objective function at the current iterate. Although the SMCG methods have exhibited excellent numerical performance, so far they have only been applied to unconstrained optimization problems. In this paper, we extend the SMCG methods and present an efficient SMCG method for solving nonlinear monotone equations with convex constraints by combining them with the projection technique, where the search direction satisfies the sufficient descent condition. Under mild conditions, we establish the global convergence and the R-linear convergence rate of the proposed method. Numerical experiments indicate that the proposed method is very promising.
Keywords:
nonlinear monotone equations; subspace minimization conjugate gradient method; convex constraints; global convergence; R-linear convergence rate
MSC:
90C06; 65K
1. Introduction
We consider the following nonlinear equations with convex constraints:
$F(x) = 0, \quad x \in C,$  (1)
where $C \subseteq \mathbb{R}^n$ is a non-empty closed convex set and $F\colon \mathbb{R}^n \to \mathbb{R}^n$ is a continuous mapping that satisfies the monotonicity condition
$\left(F(x) - F(y)\right)^{\mathsf T}(x - y) \ge 0$  (2)
for all $x, y \in \mathbb{R}^n$. It is easy to verify that the solution set of problem (1) is convex under condition (2).
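To make the monotonicity requirement concrete, the following Python snippet (an illustration of ours, not part of the original presentation) numerically checks condition (2) for a simple mapping $F(x) = Ax + x + \sin(x)$ with $A = M^{\mathsf T}M$ positive semidefinite; such a mapping is monotone because it is the sum of a monotone linear map and a componentwise non-decreasing map.

```python
import numpy as np

rng = np.random.default_rng(0)
n = 50
M = rng.standard_normal((n, n))
A = M.T @ M                      # positive semidefinite, so x -> A @ x is monotone

def F(x):
    # sum of a monotone linear map and a componentwise non-decreasing map
    return A @ x + x + np.sin(x)

# empirical check of (F(x) - F(y))^T (x - y) >= 0 on random pairs
vals = []
for _ in range(1000):
    x, y = rng.standard_normal(n), rng.standard_normal(n)
    vals.append((F(x) - F(y)) @ (x - y))
print(min(vals) >= 0)            # expected output: True
```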
Nonlinear equations have numerous practical applications, e.g., machinery manufacturing problems [], neural networks [], economic equilibrium problems [], image recovery problems [], and so on. Motivated by these applications, problem (1) has attracted substantial attention, and many effective iterative methods have been proposed for finding its solutions, such as Newton's method, quasi-Newton methods, trust region methods, Levenberg–Marquardt methods, and their variants ([,,,,]). Although these methods are very popular and converge quickly from a sufficiently good initial point, they are not suitable for solving large-scale nonlinear equations because of the computation and storage of the Jacobian matrix or its approximation.
Due to their simple form and low memory requirements, conjugate gradient (CG) methods have been combined with the projection technique proposed by Solodov and Svaiter [] to solve problem (1) (see [,]). Xiao and Zhu [] extended the well-known CG_DESCENT method [], owing to its effectiveness, to nonlinear monotone equations with convex constraints. Liu and Li [] presented an efficient projection method for convex constrained monotone nonlinear equations, which can be viewed as another extension of the CG_DESCENT method [] and was applied to sparse signal reconstruction in compressive sensing. Based on the Dai–Yuan (DY) method [], Liu and Feng [] presented an efficient derivative-free iterative method and established its Q-linear convergence rate under the local error bound condition. By minimizing, in the Frobenius norm, the distance between the relevant matrix and the self-scaling memoryless BFGS matrix, Gao et al. [] proposed an adaptive projection method for solving nonlinear equations and applied it to recovering sparse signals from incomplete and contaminated sampling measurements. Based on [], Li and Zheng [] proposed two effective derivative-free methods for solving large-scale nonsmooth monotone nonlinear equations. Waziri et al. [] proposed two DY-type iterative methods for solving (1). By using the projection method [], Abdulkarim et al. [] introduced two classes of three-term methods for solving (1) and established global convergence under a weaker monotonicity condition.
The subspace minimization conjugate gradient (SMCG) methods proposed by Yuan and Stoer [] are generalizations of the traditional CG methods and form a class of iterative methods for unconstrained optimization. The SMCG methods have exhibited excellent numerical performance and have received much attention recently. However, so far they have only been applied to unconstrained optimization. Therefore, it is very interesting to study SMCG methods for solving nonlinear equations with convex constraints. In this paper, we propose an efficient SMCG method for solving nonlinear monotone equations with convex constraints by combining it with the projection technique, where the search direction satisfies the sufficient descent condition. Under suitable conditions, the global convergence and the convergence rate of the proposed method are established. Numerical experiments indicate that the proposed method is superior to some efficient conjugate gradient methods.
The remainder of this paper is organized as follows. In Section 2, an efficient SMCG method for solving nonlinear monotone equations with convex constraints is presented. We prove the global convergence and the convergence rate of the proposed method in Section 3. In Section 4, numerical experiments are reported to verify the effectiveness of the proposed method. The conclusion is presented in Section 5.
2. The SMCG Method for Solving Nonlinear Monotone Equations with Convex Constraints
In this section, we first review the SMCG methods for unconstrained optimization and then propose an efficient SMCG method for solving (1) by combining it with the projection technique, together with some of its important properties.
2.1. The SMCG Method for Unconstrained Optimization
We review the SMCG methods here.
The SMCG methods were proposed by Yuan and Stoer [] to solve the unconstrained optimization problem
$\min_{x \in \mathbb{R}^n} f(x),$
where $f\colon \mathbb{R}^n \to \mathbb{R}$ is continuously differentiable. The SMCG methods are of the form $x_{k+1} = x_k + \alpha_k d_k$, where $\alpha_k$ is the stepsize and $d_k$ is the search direction, which is generated by minimizing a quadratic approximate model of the objective function $f$ at the current iterate $x_k$ over the subspace $\Omega_k = \operatorname{span}\{g_k, s_{k-1}\}$, namely
$\min_{d \in \Omega_k} \ q_k(d) = g_k^{\mathsf T} d + \tfrac{1}{2} d^{\mathsf T} B_k d,$  (3)
where $g_k = \nabla f(x_k)$, $s_{k-1} = x_k - x_{k-1}$, and $B_k$ is an approximation of the Hessian matrix that is required to satisfy the quasi-Newton equation $B_k s_{k-1} = y_{k-1}$ with $y_{k-1} = g_k - g_{k-1}$.
In the following, we consider the case where $g_k$ and $s_{k-1}$ are not collinear. Since any vector $d$ in $\Omega_k$ can be expressed as
$d = \mu g_k + \nu s_{k-1},$  (4)
where $\mu, \nu \in \mathbb{R}$, by substituting (4) into (3), we obtain
$\min_{\mu,\nu \in \mathbb{R}} \ \phi(\mu,\nu) = \mu \|g_k\|^2 + \nu\, g_k^{\mathsf T} s_{k-1} + \tfrac{1}{2}\left(\mu^2 \rho_k + 2\mu\nu\, g_k^{\mathsf T} y_{k-1} + \nu^2 s_{k-1}^{\mathsf T} y_{k-1}\right),$  (5)
where $\rho_k$ denotes $g_k^{\mathsf T} B_k g_k$ or an estimate of it. When $B_k$ is positive definite, by imposing $\nabla \phi(\mu,\nu) = 0$, we obtain the optimal solution of subproblem (5) (for more details, please see []):
$\mu_k = \dfrac{(g_k^{\mathsf T} y_{k-1})(g_k^{\mathsf T} s_{k-1}) - (s_{k-1}^{\mathsf T} y_{k-1})\|g_k\|^2}{\Delta_k}, \qquad \nu_k = \dfrac{(g_k^{\mathsf T} y_{k-1})\|g_k\|^2 - \rho_k\, g_k^{\mathsf T} s_{k-1}}{\Delta_k},$  (6)
where $\Delta_k = \rho_k\, s_{k-1}^{\mathsf T} y_{k-1} - (g_k^{\mathsf T} y_{k-1})^2$.
An important property of the SMCG methods was given by Dai and Kou [] in 2016. They established the two-dimensional finite termination property of the SMCG methods and presented some Barzilai–Borwein conjugate gradient (BBCG) methods with different choices of $\rho_k$, the most efficient of which is
$\rho_k^{\mathrm{BBCG3}} = \dfrac{3\|y_{k-1}\|^2}{2\, s_{k-1}^{\mathsf T} y_{k-1}}\,\|g_k\|^2.$  (7)
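As a quick illustration of how (6) and (7) combine, the following Python sketch computes the subspace coefficients and the resulting direction. It follows the formulas as reconstructed above; the function name and the assumption $s_{k-1}^{\mathsf T} y_{k-1} > 0$ are ours, and the snippet is illustrative rather than an implementation taken from the paper.

```python
import numpy as np

def smcg_direction(g, s, y):
    """Direction d = mu*g + nu*s minimizing the two-dimensional quadratic
    model (3) over span{g, s}, with rho chosen by the BBCG3 formula (7).
    Assumes s^T y > 0, so the 2x2 coefficient matrix is positive definite."""
    gg, gs, gy, sy = g @ g, g @ s, g @ y, s @ y
    rho = 1.5 * (y @ y) / sy * gg          # BBCG3 estimate of g^T B g
    delta = rho * sy - gy ** 2             # determinant of the 2x2 system
    mu = (gy * gs - sy * gg) / delta
    nu = (gy * gg - rho * gs) / delta
    return mu * g + nu * s
```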
Motivated by the SMCG methods [] and the BBCG3 formula (7), Liu and Liu [] extended the BBCG3 method to general unconstrained optimization and presented an efficient subspace minimization conjugate gradient method (SMCG_BB). Since then, many SMCG methods [,,] have been proposed for unconstrained optimization. The SMCG methods are very efficient and have received much attention.
2.2. The SMCG Method for Solving (1) and Some of Its Important Properties
In this subsection, we extend the SMCG methods for unconstrained optimization to solve (1) by combining them with the projection technique, and we establish some important properties of the search direction. The motivation for extending the SMCG methods to solve (1) is that they have the following characteristics: (i) The search directions of the SMCG methods are parallel to those of the traditional CG methods when the exact line search is performed, and thus the SMCG methods reduce to the traditional CG methods in that case. This implies that the SMCG methods inherit the finite termination property of the traditional CG methods for convex quadratic minimization. (ii) The search directions of the SMCG methods are generated by solving (3) over the whole two-dimensional subspace $\Omega_k$, while those of the traditional CG methods take the form $d_k = -g_k + \beta_k d_{k-1}$, where $\beta_k$ is called the conjugate parameter. Obviously, the search directions of the traditional CG methods are derived in a special subset of $\Omega_k$ so that they possess the conjugate property. As a result, the SMCG methods have more freedom and thus more potential in both theoretical properties and numerical performance. In theory, the SMCG methods can possess the finite termination property without the exact line search when solving two-dimensional strictly convex quadratic minimization problems [], which is impossible for the traditional CG methods when the line search is not exact. In terms of numerical performance, the results in [,,,] indicate that the SMCG methods are very efficient.
For simplicity, we abbreviate $F(x_k)$ as $F_k$ in the following. We are particularly interested in the SMCG methods proposed by Yuan and Stoer [], whose search directions are given by
$d_k = \mu_k g_k + \nu_k s_{k-1},$  (8)
where $\mu_k$ and $\nu_k$ are determined by (6). For the choice of $\rho_k$ in (6), we take the form (7) due to its effectiveness []. Therefore, based on (8) and (7), the search direction of the SMCG method for solving problem (1) can be arranged as
$d_k = \mu_k F_k + \nu_k s_{k-1},$  (9)
where $\mu_k$ and $\nu_k$ are obtained from (6) and (7) with $g_k$ replaced by $F_k$, $s_{k-1} = x_k - x_{k-1}$, and $y_{k-1} = F_k - F_{k-1}$.
In order to analyze some properties of the search direction, the search direction is reset as $d_k = -F_k$ whenever the quantities in (9) fail the corresponding safeguard test. Therefore, the search direction is truncated as
$d_k = \begin{cases} \mu_k F_k + \nu_k s_{k-1}, & \text{if the safeguard test holds}, \\ -F_k, & \text{otherwise}, \end{cases}$  (10)
where $\mu_k$ and $\nu_k$ are given in (9).
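The following Python sketch illustrates the derivative-free direction just described, under our stated assumption that the coefficients in (9) come from (6) and (7) with $F_k$ in place of the gradient. The safeguard tests below are simple illustrative choices; the exact truncation condition used in (10) is not reproduced here.

```python
import numpy as np

def smcg_direction_F(Fk, s, y):
    """Derivative-free analogue of the SMCG direction: F_k plays the role of
    the gradient in (6)-(7).  The fallback to -F_k mimics the truncation in
    (10); the paper's exact safeguard test may differ (illustrative only)."""
    sy = s @ y
    if sy <= 1e-12 * np.linalg.norm(s) * np.linalg.norm(y):
        return -Fk                         # curvature information unreliable
    gg, gs, gy = Fk @ Fk, Fk @ s, Fk @ y
    rho = 1.5 * (y @ y) / sy * gg          # BBCG3-type choice of rho with F_k
    delta = rho * sy - gy ** 2
    if delta <= 0.0:
        return -Fk                         # 2x2 system not positive definite
    mu = (gy * gs - sy * gg) / delta
    nu = (gy * gg - rho * gs) / delta
    return mu * Fk + nu * s
```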
The projection technique, which will be used in the proposed method, is described as follows.
By setting $z_k = x_k + \alpha_k d_k$ as a trial point, we define a hyperplane
$H_k = \{x \in \mathbb{R}^n \mid F(z_k)^{\mathsf T}(x - z_k) = 0\},$
which strictly separates $x_k$ from the zero points of $F$ in (1). The projection operator $P_C[\cdot]$ is a mapping from $\mathbb{R}^n$ to the non-empty closed convex subset $C$:
$P_C[x] = \arg\min\{\|x - y\| \mid y \in C\},$
which enjoys the non-expansive property
$\|P_C[x] - P_C[y]\| \le \|x - y\|, \quad \forall\, x, y \in \mathbb{R}^n.$
Solodov and Svaiter [] showed that the next iterate can be taken as the projection of $x_k$ onto the hyperplane $H_k$, mapped back onto $C$, namely
$x_{k+1} = P_C\!\left[x_k - \dfrac{F(z_k)^{\mathsf T}(x_k - z_k)}{\|F(z_k)\|^2}\,F(z_k)\right].$
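For illustration, the following Python sketch implements the projection step as reconstructed above. The choice $C = \{x \mid x \ge 0\}$ and the helper names are ours; for a general closed convex set, the corresponding projection $P_C$ must be supplied.

```python
import numpy as np

def project_onto_nonneg(x):
    """P_C for the illustrative choice C = {x : x >= 0}; replace with the
    projection onto the actual feasible set C when it differs."""
    return np.maximum(x, 0.0)

def projection_step(xk, zk, Fzk, proj=project_onto_nonneg):
    """Hyperplane projection step: project x_k onto
    H_k = {x : F(z_k)^T (x - z_k) = 0}, then back onto C."""
    t = Fzk @ (xk - zk) / (Fzk @ Fzk)      # step length toward H_k
    return proj(xk - t * Fzk)
```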
By combining (10) with the projection technique, we present an SMCG method for solving (1), which is described in detail as follows.
The following lemma indicates that the search direction satisfies the sufficient descent property.
Lemma 1.
The search direction $d_k$ generated by Algorithm 1 always satisfies the sufficient descent condition
$F_k^{\mathsf T} d_k \le -C\|F_k\|^2$  (11)
for all $k \ge 0$, where $C > 0$ is a constant.
Proof.
According to (10), we know that (11) holds with $C = 1$ when $d_k = -F_k$. We next consider the opposite situation. It follows that
where the inequality comes from the fact that treating the relevant quantity as a function of a single variable and minimizing it yields the desired bound. Consequently, by the choice of $\rho_k$ and (10), it holds that
In sum, (11) holds for all $k \ge 0$. The proof is completed. □
Algorithm 1 Subspace Minimization Conjugate Gradient Method for Solving (1)
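Since the detailed algorithm box is not reproduced here, the following Python sketch outlines a generic loop of the kind described in the text: compute the direction, obtain the trial point $z_k$ by a backtracking line search, and apply the projection step. The parameter values, the acceptance test, and the helper names are illustrative assumptions of ours and are not the exact choices of Algorithm 1; the direction routine smcg_direction_F is the sketch given earlier.

```python
import numpy as np

def smcg_projection_solver(F, x0, proj, tol=1e-6, max_iter=10000,
                           beta=1.0, r=0.5, sigma=1e-4):
    """Schematic derivative-free projection loop in the spirit of Algorithm 1
    (direction, backtracking line search, projection step); illustrative only."""
    xk = np.asarray(x0, dtype=float)
    Fk = F(xk)
    d = -Fk                                    # initial direction
    for _ in range(max_iter):
        if np.linalg.norm(Fk) <= tol:
            break
        # backtracking line search: a condition commonly used with projection
        # methods; the paper's line search (13) may differ
        alpha = beta
        for _ in range(50):                    # cap backtracking for safety
            zk = xk + alpha * d
            Fzk = F(zk)
            if -(Fzk @ d) >= sigma * alpha * np.linalg.norm(d) ** 2:
                break
            alpha *= r
        if np.linalg.norm(Fzk) <= tol:         # z_k is already a solution
            return zk
        # hyperplane projection step of Solodov and Svaiter (see above)
        t = Fzk @ (xk - zk) / (Fzk @ Fzk)
        x_new = proj(xk - t * Fzk)
        F_new = F(x_new)
        s, y = x_new - xk, F_new - Fk
        d = smcg_direction_F(F_new, s, y)      # direction sketch given earlier
        xk, Fk = x_new, F_new
    return xk
```

For example, smcg_projection_solver(F, np.ones(n), proj=lambda x: np.maximum(x, 0.0)) would run the loop for a monotone F over the non-negative orthant.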
Lemma 2.
Let the sequences $\{x_k\}$ and $\{d_k\}$ be generated by Algorithm 1. Then there always exists a stepsize $\alpha_k$ satisfying the line search (13).
3. Convergence Analysis
In this section, we will establish the global convergence and the convergence rate of Algorithm 1.
3.1. Global Convergence
We first make the following assumptions.
Assumption 1.
There is a solution $x^*$ of problem (1); that is, $x^* \in C$ and $F(x^*) = 0$.
Assumption 2.
The mapping F is continuous and monotone.
By utilizing (2), we can obtain
$F(x)^{\mathsf T}(x - x^*) \ge F(x^*)^{\mathsf T}(x - x^*) = 0, \quad \forall\, x \in \mathbb{R}^n,$
for any $x^*$ with $F(x^*) = 0$.
The next lemma indicates that the sequence $\{x_k\}$ generated by Algorithm 1 is Fejér monotone with respect to the solution set of problem (1).
Lemma 3.
Suppose that Assumptions 1 and 2 hold, and that $\{x_k\}$ and $\{z_k\}$ are generated by Algorithm 1. Then, for any solution $x^*$ of problem (1), it holds that
$\|x_{k+1} - x^*\|^2 \le \|x_k - x^*\|^2 - \|x_{k+1} - x_k\|^2.$
Moreover, the sequence $\{x_k\}$ is bounded and
$\sum_{k=0}^{\infty}\|x_{k+1} - x_k\|^2 < \infty.$
Proof.
It follows that the sequence $\{\|x_k - x^*\|\}$ is non-increasing and, thus, the sequence $\{x_k\}$ is bounded. We also have $\lim_{k\to\infty}\|x_{k+1} - x_k\| = 0$. By the definition of $z_k$, we can further determine that $\lim_{k\to\infty}\alpha_k\|d_k\| = 0$. □
The following lemma is proved based only on the continuity of F.
Lemma 4.
Suppose that $\{x_k\}$ is generated by Algorithm 1. Then, for all $k \ge 0$, we have
$C\|F_k\| \le \|d_k\| \le m\|F_k\|,$  (20)
where $C$ is the constant in (11) and $m > 0$ is a constant.
Proof.
From (11) and by utilizing the Cauchy–Schwarz inequality, it follows that
$\|F_k\|\,\|d_k\| \ge -F_k^{\mathsf T} d_k \ge C\|F_k\|^2,$  (21)
which yields the lower bound in (20). In the following, we consider two cases for the upper bound: when the first case of (10) holds, we have
Therefore, by (10), (16) and (22), as well as the Cauchy–Schwarz inequality, we obtain
In sum, (20) holds for all $k \ge 0$ with the constant $C$ in (11). The proof is completed. □
In the following theorem, we establish the global convergence of Algorithm 1.
Theorem 1.
Suppose that Assumption 2 holds, and that the sequences $\{x_k\}$ and $\{d_k\}$ are generated by Algorithm 1. Then, the following holds:
$\liminf_{k\to\infty}\|F_k\| = 0.$  (24)
Proof.
We prove it by contradiction. Suppose that (24) does not hold; i.e., there exists a constant $\varepsilon > 0$ such that $\|F_k\| \ge \varepsilon$ for all $k \ge 0$. Together with (21), this implies that
$\|d_k\| \ge C\varepsilon, \quad \forall\, k \ge 0.$  (25)
By utilizing (19) and (25), we can determine that $\lim_{k\to\infty}\alpha_k = 0$. By this and the line search (13), for a large enough k, we can determine that
It follows from (20), the continuity of F, and the boundedness of $\{x_k\}$ that the sequence $\{d_k\}$ is bounded. Together with the boundedness of $\{x_k\}$, we know that there exist convergent subsequences of both $\{x_k\}$ and $\{d_k\}$. Without loss of generality, we assume that the two sequences $\{x_k\}$ and $\{d_k\}$ are convergent. Hence, taking limits on (26) yields
where $\bar{x}$ and $\bar{d}$ are the corresponding limit points. By taking limits on both sides of (11), we obtain
It follows from (27) and (28) that $\|F(\bar{x})\| = 0$, which contradicts $\|F(\bar{x})\| \ge \varepsilon$. Therefore, we obtain (24). The proof is completed. □
3.2. R-Linear Convergence Rate
We now analyze the Q-linear convergence and R-linear convergence of Algorithm 1. We say that a method enjoys Q-linear convergence to $x^*$ if its iterative sequence $\{x_k\}$ satisfies $\|x_{k+1} - x^*\| \le \vartheta\|x_k - x^*\|$ for all sufficiently large $k$, where $\vartheta \in (0,1)$; we say that a method enjoys R-linear convergence to $x^*$ if, for its iterative sequence $\{x_k\}$, there exist two positive constants $c > 0$ and $q \in (0,1)$ such that $\|x_k - x^*\| \le c\,q^k$ holds for all $k$ (see []).
Assumption 3.
There exist constants $\lambda > 0$ and $\delta > 0$ such that, for any $x$ with $\operatorname{dist}(x, X^*) \le \delta$,
$\lambda\,\operatorname{dist}(x, X^*) \le \|F(x)\|,$
where $\operatorname{dist}(x, X^*)$ denotes the distance from $x$ to the solution set $X^*$ of problem (1).
Theorem 2.
Suppose that Assumptions 2 and 3 hold, and let the sequence $\{x_k\}$ be generated by Algorithm 1. Then, the sequence $\{\operatorname{dist}(x_k, X^*)\}$ converges Q-linearly to 0, and the sequence $\{x_k\}$ converges R-linearly to a solution of problem (1).
4. Numerical Experiments
In this section, numerical experiments are conducted to compare the performance of Algorithm 1 with that of the HTTCGP method [], the PDY method [], the MPRPA method [], and the PCG method [], which are very effective projection-type algorithms for solving (1). All codes of the tested methods were implemented in MATLAB R2019a and were run on an HP personal desktop computer with an Intel(R) Core(TM) i5-10500 CPU (3.10 GHz), 8.00 GB of RAM, and the Windows 10 operating system.
In Algorithm 1, we choose the following parameter values:
The parameters of the other four tested algorithms use the default values from [,,,], respectively. In the numerical experiments, all tested methods are terminated if the number of iterations exceeds 10,000 or if the current iterate satisfies the prescribed stopping condition on $\|F(x_k)\|$.
Denote
The test problems are given as follows.
Problem 1.
This problem is a logarithmic function with [], i.e.,
Problem 2.
This problem is a discrete boundary value problem with [], i.e.,
where ,
Problem 3.
This problem is a trigexp function with [], i.e.,
where
Problem 4.
This problem is a tridiagonal exponential problem with [], i.e.,
where
Problem 5.
This problem is problem 4.6 in [] with , i.e.,
where
Problem 6.
This problem is problem 4.7 in [], i.e.,
where ,
Problem 7.
This problem is problem 4.8 in [], i.e.,
where ,
Problem 8.
This problem is problem 3 in [], i.e.,
where ,
Problem 9.
This problem is problem 4.3 in [], i.e.,
where ,
Problem 10.
This problem is problem 4.8 in [], i.e.,
where ,
Problem 11.
This problem is problem 4.5 in [], i.e.,
where ,
Problem 12.
This problem is problem 5 in [], i.e.,
where ,
Problem 13.
This problem is problem 6 in [], i.e.,
where ,
Problem 14.
This problem is problem 4.3 in [], i.e.,
where ,
Problem 15.
This problem is a complementarity problem in [], i.e.,
where ,
The above 15 problems with different dimensions are used to test the five methods, together with different initial points , where , and . Some of the numerical results are listed in Table 1, where “Al” represents Algorithm 1, “Pi” stands for the i-th test problem listed above, and “Ni” and “NF” denote the number of iterations and the number of function evaluations, respectively. Other numerical results are available at https://www.cnblogs.com/888-0516-2333/p/18026523 (accessed on 6 January 2024).

Table 1.
The numerical results (n = 10,000).
The performance profiles proposed by Dolan and Moré [] are used to compare the numerical performance of the tested methods in terms of Ni, NF, and T, respectively. We explain the performance profile by taking the number of iterations as an example. Denote the test set and the set of algorithms by P and A, respectively, and assume that there are $n_a$ algorithms and $n_p$ problems. For each problem $p \in P$ and algorithm $a \in A$, let $N_{p,a}$ denote the number of iterations required to solve problem p by algorithm a. We use the performance ratio
$r_{p,a} = \dfrac{N_{p,a}}{\min\{N_{p,a} \mid a \in A\}}$
to compare the performance on problem p by algorithm a with the best performance by any algorithm on this problem. To obtain an overall assessment of the performance of an algorithm, we define
$\rho_a(\tau) = \dfrac{1}{n_p}\,\operatorname{size}\{p \in P \mid r_{p,a} \le \tau\},$
which is the probability for algorithm $a \in A$ that its performance ratio $r_{p,a}$ is within a factor $\tau \ge 1$ of the best possible ratio; it reflects the numerical performance of algorithm a relative to the other tested algorithms in A. Obviously, algorithms with a large probability $\rho_a(\tau)$ are preferable. Therefore, in the figures plotted with these profiles of the tested methods, the higher the curve, the better the corresponding algorithm performs.
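The following Python sketch (ours) computes these profile curves from a cost matrix in the sense of the definitions above; the toy numbers are placeholders and are not taken from Table 1.

```python
import numpy as np

def performance_profile(T):
    """T[p, a] is the cost (Ni, NF, or CPU time) of algorithm a on problem p;
    failures can be encoded as np.inf.  Returns the tau grid and the curves
    rho_a(tau) defined above (Dolan and More)."""
    ratios = T / T.min(axis=1, keepdims=True)          # r_{p,a}
    taus = np.unique(ratios[np.isfinite(ratios)])
    curves = np.array([[np.mean(ratios[:, a] <= tau) for tau in taus]
                       for a in range(T.shape[1])])    # rho_a(tau)
    return taus, curves

# toy data: 4 problems x 3 solvers (placeholder numbers, not from Table 1)
T = np.array([[10., 12., 30.],
              [25., 20., 22.],
              [15., 60., 18.],
              [40., 35., 90.]])
taus, curves = performance_profile(T)   # curves[a] is plotted against taus
```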
As shown in Figure 1, in terms of the number of iterations, Algorithm 1 performs best, followed by the HTTCGP, MPRPA, and PCG methods, while the PDY method performs worst. Figure 2 indicates that Algorithm 1 improves significantly over the other four tested methods in terms of the number of function evaluations, since it successfully solves about 78% of the test problems with the fewest function evaluations, while the corresponding percentages of the other four methods are all below 10%. The significant improvement in terms of NF is due to the fact that the search direction of Algorithm 1 is generated by minimizing a quadratic approximate model over the two-dimensional subspace, so that the search direction involves the additional coefficients $\mu_k$ and $\nu_k$, and consequently fewer function evaluations are required in Step 2. This is also an advantage of the SMCG methods over other CG methods. We can see from Figure 3 that Algorithm 1 is much faster than the other four tested methods.

Figure 1.
Performance profiles of the five algorithms with respect to the number of iterations (Ni).

Figure 2.
Performance profiles of the five algorithms with respect to number of function evaluations (NF).

Figure 3.
Performance profiles of the five algorithms with respect to CPU time (T).
The numerical experiments indicate that Algorithm 1 is superior to the other four tested methods.
5. Conclusions
In this paper, an efficient SMCG method is presented for solving nonlinear monotone equations with convex constraints. The sufficient descent property of the search direction is analyzed, and the global convergence and convergence rate of the proposed algorithm are established under suitable assumptions. The numerical results confirm the effectiveness of the proposed method.
The proposed SMCG method has exhibited good numerical performance for solving nonlinear monotone equations with convex constraints. There remains considerable room for studying SMCG methods for solving nonlinear monotone equations with convex constraints, including exploiting suitable quadratic or non-quadratic approximate models to derive new search directions. This is also the focus of our future research.
Author Contributions
Conceptualization, T.S. and Z.L.; methodology, T.S. and Z.L.; software, T.S. and Z.L.; validation, T.S. and Z.L.; formal analysis, T.S. and Z.L.; investigation, T.S. and Z.L.; resources, T.S. and Z.L.; data curation, T.S. and Z.L.; writing—original draft preparation, T.S.; writing—review and editing, Z.L.; visualization, T.S. and Z.L.; supervision, T.S. and Z.L.; project administration, T.S. and Z.L.; funding acquisition, Z.L. All authors have read and agreed to the published version of the manuscript.
Funding
This research was supported by the National Natural Science Foundation of China (No. 12261019) and the Guizhou Science Foundation (No. QHKJC-ZK[2022]YB084).
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data are contained within the article and corresponding link.
Acknowledgments
We would like to thank the Associate Editor and the anonymous referees for their valuable comments and suggestions.
Conflicts of Interest
The authors declare no competing interests.
References
- Guo, D.S.; Nie, Z.Y.; Yan, L.C. The application of noise-tolerant ZD design formula to robots’ kinematic control via time-varying nonlinear equations solving. IEEE Trans. Syst. Man Cybern. Syst. 2017, 48, 2188–2197. [Google Scholar] [CrossRef]
- Shi, Y.; Zhang, Y. New discrete-time models of zeroing neural network solving systems of time-variant linear and nonlinear inequalities. IEEE Trans. Syst. Man Cybern. Syst. 2017, 50, 565–576. [Google Scholar] [CrossRef]
- Dirkse, S.P.; Ferris, M.C. MCPLIB: A collection of nonlinear mixed complementarity problems. Optim. Methods Softw. 1995, 5, 319–345. [Google Scholar] [CrossRef]
- Xiao, Y.H.; Wang, Q.Y.; Hu, Q.J. Non-smooth equations based methods for l1-norm problems with applications to compressed sensing. Nonlinear Anal. 2011, 74, 3570–3577. [Google Scholar] [CrossRef]
- Yuan, Y.X. Subspace methods for large scale nonlinear equations and nonlinear least squares. Optim. Eng. 2009, 10, 207–218. [Google Scholar] [CrossRef]
- Ahmad, F.; Tohidi, E.; Carrasco, J.A. A parameterized multi-step Newton method for solving systems of nonlinear equations. Numer. Algorithms 2016, 71, 631–653. [Google Scholar] [CrossRef]
- Lukšan, L.; Vlček, J. New quasi-Newton method for solving systems of nonlinear equations. Appl. Math. 2017, 62, 121–134. [Google Scholar] [CrossRef]
- Yu, Z. On the global convergence of a Levenberg-Marquardt method for constrained nonlinear equations. JAMC 2004, 16, 183–194. [Google Scholar] [CrossRef]
- Zhang, J.L.; Wang, Y. A new trust region method for nonlinear equations. Math. Methods Oper. Res. 2003, 58, 283–298. [Google Scholar] [CrossRef]
- Solodov, M.V.; Svaiter, B.F. A globally convergent inexact Newton method for systems of monotone equations. In Reformulation: Nonsmooth, Piecewise Smooth, Semismooth and Smoothing Methods; Fukushima, M., Qi, L., Eds.; Kluwer Academic: Boston, MA, USA, 1998; pp. 355–369. [Google Scholar]
- Zheng, Y.; Zheng, B. Two new Dai–Liao-type conjugate gradient methods for unconstrained optimization problems. J. Optim. Theory Appl. 2017, 175, 502–509. [Google Scholar] [CrossRef]
- Li, M.; Liu, H.W.; Liu, Z.X. A new family of conjugate gradient methods for unconstrained optimization. J. Appl. Math. Comput. 2018, 58, 219–234. [Google Scholar] [CrossRef]
- Xiao, Y.H.; Zhu, H. A conjugate gradient method to solve convex constrained monotone equations with applications in compressive sensing. J. Math. Anal. Appl. 2013, 405, 310–319. [Google Scholar] [CrossRef]
- Hager, W.W.; Zhang, H. A new conjugate gradient method with guaranteed descent and an efficient line search. SIAM J. Optim. 2005, 16, 170–192. [Google Scholar] [CrossRef]
- Liu, J.K.; Li, S.J. A projection method for convex constrained monotone nonlinear equations with applications. Comput. Math. Appl. 2015, 70, 2442–2453. [Google Scholar] [CrossRef]
- Dai, Y.H.; Yuan, Y.X. A nonlinear conjugate gradient with a strong global convergence property. SIAM J. Optim. 1999, 10, 177–182. [Google Scholar] [CrossRef]
- Liu, J.K.; Feng, Y.M. A derivative-free iterative method for nonlinear monotone equations with convex constraints. Numer. Algorithms 2019, 82, 245–262. [Google Scholar] [CrossRef]
- Gao, P.T.; He, C.J.; Liu, Y. An adaptive family of projection methods for constrained monotone nonlinear equations with applications. Appl. Math. Comput. 2019, 359, 1–16. [Google Scholar] [CrossRef]
- Bojari, S.; Eslahchi, M.R. Two families of scaled three-term conjugate gradient methods with sufficient descent property for nonconvex optimization. Numer. Algorithms 2020, 83, 901–933. [Google Scholar] [CrossRef]
- Li, Q.; Zheng, B. Scaled three-term derivative-free methods for solving large-scale nonlinear monotone equations. Numer. Algorithms 2021, 87, 1343–1367. [Google Scholar] [CrossRef]
- Waziri, M.Y.; Ahmed, K. Two Descent Dai-Yuan Conjugate Gradient Methods for Systems of Monotone Nonlinear Equations. J. Sci. Comput. 2022, 90, 36. [Google Scholar] [CrossRef]
- Ibrahim, A.H.; Alshahrani, M.; Al-Homidan, S. Two classes of spectral three-term derivative-free method for solving nonlinear equations with application. Numer. Algorithms 2023. [Google Scholar] [CrossRef]
- Yuan, Y.X.; Stoer, J. A subspace study on conjugate gradient algorithms. Z. Angew. Math. Mech. 1995, 75, 69–77. [Google Scholar] [CrossRef]
- Dai, Y.H.; Kou, C.X. A Barzilai-Borwein conjugate gradient method. Sci. China Math. 2016, 59, 1511–1524. [Google Scholar] [CrossRef]
- Liu, H.W.; Liu, Z.X. An efficient Barzilai–Borwein conjugate gradient method for unconstrained optimization. J. Optim. Theory Appl. 2019, 180, 879–906. [Google Scholar] [CrossRef]
- Li, Y.F.; Liu, Z.X.; Liu, H.W. A subspace minimization conjugate gradient method based on conic model for unconstrained optimization. Comput. Appl. Math. 2019, 38, 16. [Google Scholar] [CrossRef]
- Zhao, T.; Liu, H.W.; Liu, Z.X. New subspace minimization conjugate gradient methods based on regularization model for unconstrained optimization. Numer. Algorithms 2021, 87, 1501–1534. [Google Scholar] [CrossRef]
- Wang, T.; Liu, Z.; Liu, H. A new subspace minimization conjugate gradient method based on tensor model for unconstrained optimization. Int. J. Comput. Math. 2019, 96, 1924–1942. [Google Scholar] [CrossRef]
- Ortega, J.M.; Rheinboldt, W.C. Iterative Solution of Nonlinear Equation in Several Variables; Academic Press: New York, NY, USA; London, UK, 1970. [Google Scholar]
- Yin, J.H.; Jian, J.B.; Jiang, X.Z.; Liu, M.X.; Wang, L.Z. A hybrid three-term conjugate gradient projection method for constrained nonlinear monotone equations with applications. Numer. Algorithms 2021, 88, 389–418. [Google Scholar] [CrossRef]
- Ou, Y.G.; Li, J.Y. A new derivative-free SCG-type projection method for nonlinear monotone equations with convex constraints. J. Appl. Math. Comput. 2018, 56, 195–216. [Google Scholar] [CrossRef]
- Ma, G.D.; Jin, J.C.; Jian, J.B.; Yin, J.H.; Han, D.L. A modified inertial three-term conjugate gradient projection method for constrained nonlinear equations with applications in compressed sensing. Numer. Algorithms 2023, 92, 1621–1653. [Google Scholar] [CrossRef]
- Dolan, E.D.; Moré, J.J. Benchmarking optimization software with performance profiles. Math. Program. 2002, 91, 201–213. [Google Scholar] [CrossRef]
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).