Extended Convergence Analysis of the Newton–Hermitian and Skew–Hermitian Splitting Method

Argyros, Ioannis K; George, Santhosh; Godavarma, Chandhini; Magreñán, Alberto A

doi:10.3390/sym11080981

Open AccessArticle

Extended Convergence Analysis of the Newton–Hermitian and Skew–Hermitian Splitting Method

by

Ioannis K Argyros

^1,*,

Santhosh George

²

,

Chandhini Godavarma

² and

Alberto A Magreñán

³

¹

Department of Mathematical Sciences, Cameron University, Lawton, OK 73505, USA

²

Department of Mathematical and Computational Sciences, National Institute of Technology, Karnataka 575 025, India

³

Departamento de Matemáticas y Computación, Universidad de la Rioja, 26006 Logroño, Spain

^*

Author to whom correspondence should be addressed.

Symmetry 2019, 11(8), 981; https://doi.org/10.3390/sym11080981

Submission received: 24 June 2019 / Revised: 17 July 2019 / Accepted: 25 July 2019 / Published: 2 August 2019

(This article belongs to the Special Issue Symmetry with Operator Theory and Equations)

Download

Browse Figures

Versions Notes

Abstract

:

Many problems in diverse disciplines such as applied mathematics, mathematical biology, chemistry, economics, and engineering, to mention a few, reduce to solving a nonlinear equation or a system of nonlinear equations. Then various iterative methods are considered to generate a sequence of approximations converging to a solution of such problems. The goal of this article is two-fold: On the one hand, we present a correct convergence criterion for Newton–Hermitian splitting (NHSS) method under the Kantorovich theory, since the criterion given in Numer. Linear Algebra Appl., 2011, 18, 299–315 is not correct. Indeed, the radius of convergence cannot be defined under the given criterion, since the discriminant of the quadratic polynomial from which this radius is derived is negative (See Remark 1 and the conclusions of the present article for more details). On the other hand, we have extended the corrected convergence criterion using our idea of recurrent functions. Numerical examples involving convection–diffusion equations further validate the theoretical results.

Keywords:

Newton–HSS method; systems of nonlinear equations; semi-local convergence

1. Introduction

Numerous problems in computational disciplines can be reduced to solving a system of nonlinear equations with n equations in n variables like

F (x) = 0

(1)

using Mathematical Modelling [1,2,3,4,5,6,7,8,9,10,11]. Here, F is a continuously differentiable nonlinear mapping defined on a convex subset

Ω

of the

n -

dimensional complex linear space

C^{n}

into

C^{n} .

In general, the corresponding Jacobian matrix

F^{'} (x)

is sparse, non-symmetric and positive definite. The solution methods for the nonlinear problem

F (x) = 0

are iterative in nature, since an exact solution

x^{*}

could be obtained only for a few special cases. In the rest of the article, some of the well established and standard results and notations are used to establish our results (See [3,4,5,6,10,11,12,13,14] and the references there in). Undoubtedly, some of the well known methods for generating a sequence to approximate

x^{*}

are the inexact Newton (IN) methods [1,2,3,5,6,7,8,9,10,11,12,13,14]. The IN algorithm involves the steps as given in the following:

Algorithm IN [6]

Step 1: Choose initial guess $x_{0}$ , tolerance value $t o l$ ; Set $k = 0$
Step 2: While $F (x_{k}) > t o l \times F (x_{0})$ , Do
- Choose $η_{k} \in [0, 1)$ . Find $d_{k}$ so that $∥ F (x_{k}) + F^{'} (x_{k}) d_{k} ∥ \leq η_{k} ∥ F (x_{k}) ∥$ .
- Set $x_{k + 1} = x_{k} + d_{k}$ ; $k = k + 1$

Furthermore, if A is sparse, non-Hermitian and positive definite, the Hermitian and skew-Hermitian splitting (HSS) algorithm [4] for solving the linear system

A x = b

is given by,

Algorithm HSS [4]

Step 1: Choose initial guess $x_{0}$ , tolerance value $t o l$ and $α > 0$ ; Set $l = 0$
Step 2: Set $H = \frac{1}{2} (A + A^{*}) and S = \frac{1}{2} (A - A^{*})$ , where H is Hermitian and S is skew-Hermitian parts of A.
Step 3: While $∥ b - A x_{ł} ∥ > t o l \times ∥ b - A x_{0} ∥$ , Do
- Solve $(α I + H) x_{l + 1 / 2} = (α I - S) x_{l} + b$
- Solve $(α I + S) x_{l} = (α I - H) x_{ł + 1 / 2} + b$
- Set $l = l + 1$

Newton–HSS [5] algorithm combines appropriately both IN and HSS methods for the solution of the large nonlinear system of equations with positive definite Jacobian matrix. The algorithm is as follows:

Algorithm NHSS (The Newton–HSS method [5])

Step 1: Choose initial guess $x_{0}$ , positive constants $α$ and $t o l$ ; Set $k = 0$
Step 2: While $∥ F (x_{k}) ∥ > t o l \times ∥ F (x_{0}) ∥$
-
Compute Jacobian $J_{k} = F^{'} (x_{k})$
-
Set

$H_{k} (x_{k}) = \frac{1}{2} (J_{k} + J_{k}^{*}) and S_{k} (x_{k}) = \frac{1}{2} (J_{k} - J_{k}^{*}),$

(2)

where $H_{k}$ is Hermitian and $S_{k}$ is skew-Hermitian parts of $J_{k}$ .
-
Set $d_{k, 0} = 0; l = 0$
-
While

$∥ F (x_{k}) + J_{k} d_{k, ł} ∥ \geq η_{k} \times ∥ F (x_{k}) ∥ (η_{k} \in [0, 1))$

(3)

Do
{
Solve sequentially:

$\begin{matrix} (α I + H_{k}) d_{k, l + 1 / 2} = (α I - S_{k}) d_{k, l} + b \end{matrix}$

(4)

$\begin{matrix} (α I + S_{k}) d_{k, l} = (α I - H_{k}) d_{k, l + 1 / 2} + b \end{matrix}$

(5)

Set $l = l + 1$
}
-
Set

$x_{k + 1} = x_{k} + d_{k, l}; k = k + 1$

(6)

-
Compute $J_{k}$ , $H_{k}$ and $S_{k}$ for new $x_{k}$

Please note that

η_{k}

is varying in each iterative step, unlike a fixed positive constant value in used in [5]. Further observe that if

d_{k, ℓ_{k}}

in (6) is given in terms of

d_{k, 0}

, we get

d_{k, ℓ_{k}} = (I - T_{k}^{ℓ}) {(I - T_{k})}^{- 1} B_{k}^{- 1} F (x_{k})

(7)

where

T_{k} : = T (α, k), B_{k} : = B (α, k)

and

\begin{matrix} T (α, x) & = & B {(α, x)}^{- 1} C (α, x) \\ B (α, x) & = & \frac{1}{2 α} (α I + H (x)) (α I + S (x)) \\ C (α, x) & = & \frac{1}{2 α} (α I - H (x)) (α I - S (x)) . \end{matrix}

(8)

Using the above expressions for

T_{k}

and

d_{k, ℓ_{k}}

, we can write the Newton–HSS in (6) as

x_{k + 1} = x_{k} - {(I - T_{k}^{ℓ})}^{- 1} F {(x_{k})}^{- 1} F (x_{k}) .

(9)

A Kantorovich-type semi-local convergence analysis was presented in [7] for NHSS. However, there are shortcomings:

(i): The semi-local sufficient convergence criterion provided in (15) of [7] is false. The details are given in Remark 1. Accordingly, Theorem 3.2 in [7] as well as all the followings results based on (15) in [7] are inaccurate. Further, the upper bound function $g_{3}$ (to be defined later) on the norm of the initial point is not the best that can be used under the conditions given in [7].
(ii): The convergence domain of NHSS is small in general, even if we use the corrected sufficient convergence criterion (12). That is why, using our technique of recurrent functions, we present a new semi-local convergence criterion for NHSS, which improves the corrected convergence criterion (12) (see also Section 3 and Section 4, Example 4.4).
(iii): Example 4.5 taken from [7] is provided to show as in [7] that convergence can be attained even if these criteria are not checked or not satisfied, since these criteria are not sufficient too. The convergence criteria presented here are only sufficient.
Moreover, we refer the reader to [3,4,5,6,7,8,9,10,11,13,14] and the references therein to avoid repetitions for the importance of these methods for solving large systems of equations.

The rest of the note is organized as follows. Section 2 contains the semi-local convergence analysis of NHSS under the Kantorovich theory. In Section 3, we present the semi-local convergence analysis using our idea of recurrent functions. Numerical examples are discussed in Section 4. The article ends with a few concluding remarks.

2. Semi-Local Convergence Analysis

To make the paper as self-contained as possible we present some results from [3] (see also [7]). The semi-local convergence of NHSS is based on the conditions (

A

). Let

x_{0} \in C^{n}

and

F : Ω \subset C^{n} ⟶ C^{n}

be

G -

differentiable on an open neighborhood

Ω_{0} \subset Ω

on which

F^{'} (x)

is continuous and positive definite. Suppose

F^{'} (x) = H (x) + S (x)

where

H (x)

and

S (x)

are as in (2) with

x_{k} = x .

$(𝒜$ ₁): There exist positive constants $β, γ$ and $δ$ such that

$max {∥ H (x_{0}) ∥, ∥ S (x_{0}) ∥} \leq β, ∥ F^{'} {(x_{0})}^{- 1} ∥ \leq γ, ∥ F (x_{0}) ∥ \leq δ,$

(10)
$(𝒜$ ₂): There exist nonnegative constants $L_{h}$ and $L_{s}$ such that for all $x, y \in U (x_{0}, r) \subset Ω_{0},$

$\begin{matrix} ∥ H (x) - H (y) ∥ & \leq & L_{h} ∥ x - y ∥ \\ ∥ S (x) - S (y) ∥ & \leq & L_{s} ∥ x - y ∥ . \end{matrix}$

(11)

Next, we present the corrected version of Theorem 3.2 in [7].

Theorem 1.

Assume that conditions (

A

) hold with the constants satisfying

δ γ^{2} L \leq {\bar{g}}_{3} (η)

(12)

where

{\bar{g}}_{3} (t) : = \frac{{(1 - t)}^{2}}{2 (2 + t + 2 t^{2} - t^{3})},

η = max {η_{k}} < 1, r = max {r_{1}, r_{2}}

with

\begin{matrix} r_{1} & = & \frac{α + β}{L} (\sqrt{1 + \frac{2 α τ θ}{(2 γ + γ τ θ) {(α + β)}^{2}}} - 1) \\ r_{2} & = & \frac{b - \sqrt{b^{2} - 2 a c}}{a} \\ a & = & \frac{γ L (1 + η)}{1 + 2 γ^{2} δ L η}, b = 1 - η, c = 2 γ δ, \end{matrix}

(13)

and with

ℓ_{*} = lim {inf}_{k ⟶ \infty} ℓ_{k}

satisfying

ℓ_{*} > ⌊ \frac{ln η}{ln ((τ + 1) θ} ⌋,

(Here

⌊ . ⌋

represents the largest integer less than or equal to the corresponding real number)

τ \in (0, \frac{1 - θ}{θ})

and

θ \equiv θ (α, x_{0}) = ∥ T (α, x_{0}) ∥ < 1 .

(14)

Then, the iteration sequence

{x_{k}}_{k = 0}^{\infty}

generated by Algorithm NHSS is well defined and converges to

x_{*},

so that

F (x_{*}) = 0 .

Proof.

We simply follow the proof of Theorem 3.2 in [7] but use the correct function

{\bar{g}}_{3}

instead of the incorrect function

g_{3}

defined in the following remark. □

Remark 1.

The corresponding result in [7] used the function bound

g_{3} (t) = \frac{1 - t}{2 (1 + t^{2})}

(15)

instead of

{\bar{g}}_{3}

in (12) (simply looking at the bottom of first page of the proof in Theorem 3.2 in [7]), i.e., the inequality they have considered is,

δ γ^{2} L \leq g_{3} (η) .

(16)

However, condition (16) does not necessarily imply

b^{2} - 4 a c \geq 0,

which means that

r_{2}

does not necessarily exist (see (13) where

b^{2} - 2 a c \geq 0

is needed) and the proof of Theorem 3.2 in [7] breaks down. As an example, choose

η = \frac{1}{2},

then

g_{3} (\frac{1}{2}) = \frac{1}{5}, {\bar{g}}_{3} (\frac{1}{2}) = \frac{1}{23}

and for

{\bar{g}}_{3} (\frac{1}{2}) = δ γ^{2} L < g_{3} (\frac{1}{2}),

we have

b^{2} - 4 a c < 0 .

Notice that our condition (12) is equivalent to

b^{2} - 4 a c \geq 0 .

Hence, our version of Theorem 3.2 is correct. Notice also that

{\bar{g}}_{3} (t) < g_{3} (t) for each t \geq 0,

(17)

so (12) implies (16) but not necessarily vice versa.

3. Semi-Local Convergence Analysis II

We need to define some parameters and a sequence needed for the semi-local convergence of NHSS using recurrent functions.

Let

β, γ, δ, L_{0}, L

be positive constants and

η \in [0, 1) .

Then, there exists

μ \geq 0

such that

L = μ L_{0} .

Set

c = 2 γ δ .

Define parameters

p, q, η_{0}

and

δ_{0}

by

p = \frac{(1 + η) μ γ L_{0}}{2}, q = \frac{- p + \sqrt{p^{2} + 4 γ L_{0} p}}{2 γ L_{0}},

(18)

η_{0} = \sqrt{\frac{μ}{μ + 2}}

(19)

and

ξ = \frac{μ}{2} min {\frac{2 (q - η)}{(1 + η) μ + 2 q}, \frac{(1 + η) q - η - q^{2}}{(1 + η) q - η}} .

(20)

Moreover, define scalar sequence

{s_{k}}

by

\begin{matrix} s_{0} & = & 0, s_{1} = c = 2 γ δ and for each k = 1, 2, \dots \\ s_{k + 1} & = & s_{k} + \frac{1}{1 - γ L_{0} s_{k}} [p (s_{k} - s_{k - 1}) + η (1 - γ L_{0} s_{k - 1})] (s_{k} - s_{k - 1}) . \end{matrix}

(21)

We need to show the following auxiliary result of majorizing sequences for NHSS using the aforementioned notation.

Lemma 1.

Let

β, γ, δ, L_{0}, L

be positive constants and

η \in [0, 1) .

Suppose that

γ^{2} L δ \leq ξ

(22)

and

η \leq η_{0},

(23)

where

η_{0}, ξ

are given by (19) and (20), respectively. Then, sequence

{s_{k}}

defined in (21) is nondecreasing, bounded from above by

s^{* *} = \frac{c}{1 - q}

(24)

and converges to its unique least upper bounds

s^{*}

which satisfies

c \leq s^{*} \leq s^{* *} .

(25)

Proof.

Notice that by (18)–(23)

q \in (0, 1), q > η, η_{0} \in [\frac{\sqrt{3}}{3}, 1), c > 0,

(1 + η) q - η > 0,

(1 + η) q - η - q^{2} > 0

and

ξ > 0 .

We shall show using induction on k that

0 < s_{k + 1} - s_{k} \leq q (s_{k} - s_{k - 1})

(26)

or equivalently by (21)

0 \leq \frac{1}{1 - γ L_{0} s_{k}} [p (s_{k} - s_{k - 1}) + η (1 - γ L_{0} s_{k - 1})] \leq q .

(27)

Estimate (27) holds true for

k = 1

by the initial data and since it reduces to showing

δ \leq \frac{η}{γ^{2} L} \frac{q - η}{(1 + η) μ + 2 q},

which is true by (20). Then, by (21) and (27), we have

0 < s_{2} - s_{1} \leq q (s_{1} - s_{0}), γ L_{0} s_{1} < 1

and

s_{2} \leq s_{1} + q (s_{1} - s_{0}) = \frac{1 - q^{2}}{1 - q} (s_{1} - s_{0}) < \frac{s_{1} - s_{0}}{1 - q} = s^{* *} .

Suppose that (26),

γ L_{0} s_{k} < 1

(28)

and

s_{k + 1} \leq \frac{1 - q^{k + 1}}{1 - q} (s_{1} - s_{0}) < s^{* *}

(29)

hold true. Next, we shall show that they are true for k replaced by

k + 1 .

It suffices to show that

0 \leq \frac{1}{1 - γ L_{0} s_{k + 1}} (p (s_{k + 1} - s_{k}) + η (1 - γ L_{0} s_{k})) \leq q

or

p (s_{k + 1} - s_{k}) + η (1 - γ L_{0} s_{k}) \leq q (1 - γ L_{0} s_{k + 1})

or

p (s_{k + 1} - s_{k}) + η (1 - γ L_{0} s_{k}) - q (1 - γ L_{0} s_{k + 1}) \leq 0

or

p (s_{k + 1} - s_{k}) + η (1 - γ L_{0} s_{1}) + γ q L_{0} s_{k + 1}) - q \leq 0

(since

s_{1} \leq s_{k}

) or

2 γ δ p q^{k} + 2 γ^{2} q L_{0} δ (1 + q + \dots + q^{k}) + η (1 - 2 γ^{2} L_{0} δ) - q \leq 0 .

(30)

Estimate (30) motivates us to introduce recurrent functions

f_{k}

defined on the interval

[0, 1)

by

f_{k} (t) = 2 γ δ p t^{k} + 2 γ^{2} L_{0} δ (1 + t + \dots + t^{k}) t - t + η (1 - 2 γ^{2} L_{0} δ) .

(31)

Then, we must show instead of (30) that

f_{k} (q) \leq 0 .

(32)

We need a relationship between two consecutive functions

f_{k} :

\begin{matrix} f_{k + 1} (t) & = & f_{k + 1} (t) - f_{k} (t) + f_{k} (t) \\ = & 2 γ δ p t^{k + 1} + 2 γ^{2} L_{0} δ (1 + t + \dots t^{k + 1}) t - t \\ + η (1 - 2 γ^{2} L_{0} δ) - 2 γ δ p t^{k} - 2 γ^{2} L_{0} δ (1 + t + \dots + t^{k}) t \\ + t - η (1 - 2 γ^{2} L_{0} δ) + f_{k} (t) \\ = & f_{k} (t) + 2 γ δ g (t) t^{k}, \end{matrix}

(33)

where

g (t) = γ L_{0} t^{2} + p t - p .

(34)

Notice that

g (q) = 0 .

It follows from (32) and (34) that

f_{k + 1} (q) = f_{k} (q) for each k .

(35)

Then, since

f_{\infty} (q) = lim_{k ⟶ \infty} f_{k} (q),

(36)

it suffices to show

f_{\infty} (q) \leq 0

(37)

instead of (32). We get by (31) that

f_{\infty} (q) = \frac{2 γ^{2} L_{0} δ q}{1 - q} - q + η (1 - 2 γ^{2} L_{0} δ)

(38)

so, we must show that

\frac{2 γ^{2} L_{0} δ q}{1 - q} - q + η (1 - 2 γ^{2} L_{0} δ) \leq 0,

(39)

which reduces to showing that

δ \leq \frac{μ}{2 γ^{2} L} \frac{(1 + η) q - η - q^{2}}{(1 + η) q - η},

(40)

which is true by (22). Hence, the induction for (26), (28) and (29) is completed. It follows that sequence

{s_{k}}

is nondecreasing, bounded above by

s^{* *}

and as such it converges to its unique least upper bound

s^{*}

which satisfies (25). □

We need the following result.

Lemma 2

([14]). Suppose that conditions (

A

) hold. Then, the following assertions also hold:

(i): $∥ F^{'} (x) - F^{'} (y) ∥ \leq L ∥ x - y ∥$
(ii): $∥ F^{'} (x) ∥ \leq L ∥ x - y ∥ + 2 β$
(iii): If $r < \frac{1}{γ L},$ then $F^{'} (x)$ is nonsingular and satisfies

$∥ F^{'} {(x)}^{- 1} ∥ \leq \frac{γ}{1 - γ L ∥ x - x_{0} ∥},$

(41)

where $L = L_{h} + L_{s} .$

Next, we show how to improve Lemma 2 and the rest of the results in [3,7]. Notice that it follows from (i) in Lemma 2 that there exists

L_{0} > 0

such that

∥ F^{'} (x) - F^{'} (x_{0}) ∥ \leq L_{0} ∥ x - x_{0} ∥ for each x \in Ω .

(42)

We have that

L_{0} \leq L

(43)

holds true and

\frac{L}{L_{0}}

can be arbitrarily large [2,12]. Then, we have the following improvement of Lemma 2.

Lemma 3.

Suppose that conditions (

A

) hold. Then, the following assertions also hold:

(i): $∥ F^{'} (x) - F^{'} (y) ∥ \leq L ∥ x - y ∥$
(ii): $∥ F^{'} (x) ∥ \leq L_{0} ∥ x - y ∥ + 2 β$
(iii): If $r < \frac{1}{γ L_{0}},$ then $F^{'} (x)$ is nonsingular and satisfies

$∥ F^{'} {(x)}^{- 1} ∥ \leq \frac{γ}{1 - γ L_{0} ∥ x - x_{0} ∥} .$

(44)

Proof.

(ii) We have

\begin{matrix} ∥ F^{'} (x) ∥ & = & ∥ F^{'} (x) - F^{'} (x_{0}) + F^{'} (x_{0}) ∥ \\ \leq & ∥ F^{'} (x) - F^{'} (x_{0}) ∥ + ∥ F^{'} (x_{0}) ∥ \\ \leq & L_{0} ∥ x - x_{0} ∥ + ∥ F^{'} (x_{0}) ∥ \leq L_{0} ∥ x - x_{0} ∥ + 2 β . \end{matrix}

(iii)

γ ∥ F^{'} (x) - F^{'} (x_{0}) ∥ \leq γ L_{0} ∥ x - x_{0} ∥ < 1 .

(45)

It follows from the Banach lemma on invertible operators [1] that

F^{'} (x)

is nonsingular, so that (44) holds. □

Remark 2.

The new estimates (ii) and (iii) are more precise than the corresponding ones in Lemma 2, if

L_{0} < L .

Next, we present the semi-local convergence of NHSS using the majorizing sequence

{s_{n}}

introduced in Lemma 1.

Theorem 2.

Assume that conditions (

A

), (22) and (23) hold. Let

η = max {η_{k}} < 1, r = max {r_{1}, t^{*}}

with

\begin{matrix} r_{1} & = & \frac{α + β}{L} (\sqrt{1 + \frac{2 α τ θ}{(2 γ + γ τ θ) {(α + β)}^{2}}} - 1) \end{matrix}

and

s^{*}

is as in Lemma 1 and with

ℓ_{*} = lim {inf}_{k ⟶ \infty} ℓ_{k}

satisfying

ℓ_{*} > ⌊ \frac{ln η}{ln ((τ + 1) θ} ⌋,

(Here

⌊ . ⌋

represents the largest integer less than or equal to the corresponding real number)

τ \in (0, \frac{1 - θ}{θ})

and

θ \equiv θ (α, x_{0}) = ∥ T (α, x_{0}) ∥ < 1 .

(46)

Then, the sequence

{x_{k}}_{k = 0}^{\infty}

generated by Algorithm NHSS is well defined and converges to

x_{*},

so that

F (x_{*}) = 0 .

Proof.

If we follow the proof of Theorem 3.2 in [3,7] but use (44) instead of (41) for the upper bound on the norms

∥ F^{'} {(x_{k})}^{- 1} ∥

we arrive at

∥ x_{k + 1} - x_{k} ∥ \leq \frac{(1 + η) γ}{1 - γ L_{0} s_{k}} ∥ F (x_{k}) ∥,

(47)

where

∥ F (x_{k}) ∥ \leq \frac{L}{2} {(s_{k} - s_{k - 1})}^{2} + η \frac{1 - γ L_{0} s_{k - 1}}{γ (1 + η)} (s_{k} - s_{k - 1}),

(48)

so by (21)

∥ x_{k + 1} - x_{k} ∥ \leq (1 + η) \frac{γ}{1 - γ L_{0} s_{k}} [\frac{L}{2} (s_{k} - s_{k - 1}) + η \frac{1 - γ L_{0} s_{k - 1}}{γ (1 + η)}] (s_{k} - s_{k - 1}) = s_{k + 1} - s_{k} .

(49)

We also have that

∥ x_{k + 1} - x_{0} ∥ \leq ∥ x_{k + 1} - x_{k} ∥ + ∥ x_{k} - x_{k - 1} ∥ + \dots + ∥ x_{1} - x_{0} ∥ \leq s_{k + 1} - s_{k} + s_{k} - s_{k - 1} + \dots + s_{1} - s_{0} = s_{k + 1} - s_{0} < s^{*} .

It follows from Lemma 1 and (49) that sequence

{x_{k}}

is complete in a Banach space

R^{n}

and as such it converges to some

x_{*} \in \bar{U} (x_{0}, r)

(since

\bar{U} (x_{0}, r)

is a closed set).

However,

∥ T (α; x_{*}) ∥ < 1

[4] and NHSS, we deduce that

F (x_{*}) = 0 .

□

Remark 3.

(a): The point $s^{*}$ can be replaced by $s^{* *}$ (given in closed form by (24)) in Theorem 2.
(b): Suppose there exist nonnegative constants $L_{h}^{0}, L_{s}^{0}$ such that for all $x \in U (x_{0}, r) \subset Ω_{0}$

$∥ H (x) - H (x_{0}) ∥ \leq L_{h}^{0} ∥ x - x_{0} ∥$

and

$∥ S (x) - S (x_{0}) ∥ \leq L_{s}^{0} ∥ x - x_{0} ∥ .$

Set $L_{0} = L_{h}^{0} + L_{s}^{0} .$ Define $Ω_{0}^{1} = Ω_{0} \cap U (x_{0}, \frac{1}{γ L_{0}}) .$ Replace condition ( $A_{2}$ ) by
( $A_{2}^{'}$ ) There exist nonnegative constants $L_{h}^{'}$ and $L_{s}^{'}$ such that for all $x, y \in U (x_{0}, r) \subset Ω_{0}^{1}$

$∥ H (x) - H (y) ∥ \leq L_{h}^{'} ∥ x - y ∥$

$∥ S (x) - S (y) ∥ \leq L_{s}^{'} ∥ x - y ∥ .$

Set $L^{'} = L_{h}^{'} + L_{s}^{'} .$ Notice that

$L_{h}^{'} \leq L_{h}, L_{s}^{'} \leq L s and L^{'} \leq L,$

(50)

since $Ω_{0}^{1} \subseteq Ω_{0} .$ Denote the conditions ( $A_{1}$ ) and ( $A_{2}^{'}$ ) by ( $A^{'}$ ). Then, clearly the results of Theorem 2 hold with conditions ( $A^{'}$ ), $Ω_{0}^{1}, L^{'}$ replacing conditions ( $A$ ), $Ω_{0}$ and $L,$ respectively (since the iterates ${x_{k}}$ remain in $Ω_{0}^{1}$ which is a more precise location than $Ω_{0}$ ). Moreover, the results can be improved even further, if we use the more accurate set $Ω_{0}^{2}$ containing iterates ${x_{k}}$ defined by $Ω_{0}^{2} : = Ω \cap U (x_{1}, \frac{1}{γ L_{0}} - γ δ) .$ Denote corresponding to $L^{'}$ constant by $L^{''}$ and corresponding conditions to ( $A^{'}$ ) by ( $A^{''}$ ). Notice that (see also the numerical examples) $Ω_{0}^{2} \subseteq Ω_{0}^{1} \subseteq Ω_{0} .$ In view of (50), the results of Theorem 2 are improved and under the same computational cost.
(c): The same improvements as in (b) can be made in the case of Theorem 1.

The majorizing sequence

{t_{n}}

in [3,7] is defined by

\begin{matrix} t_{0} & = & 0, t_{1} = c = 2 γ δ \\ t_{k + 1} & = & t_{k} + \frac{1}{1 - γ L t_{k}} [p (t_{k} - t_{k - 1}) + η (1 - γ L t_{k - 1})] (t_{k} - t_{k - 1}) . \end{matrix}

(51)

Next, we show that our sequence

{s_{n}}

is tighter than

{t_{n}} .

Proposition 1.

Under the conditions of Theorems 1 and 2, the following items hold

(i): $s_{n} \leq t_{n}$
(ii): $s_{n + 1} - s_{n} \leq t_{n + 1} - t_{n}$ and
(iii): $s^{*} \leq t^{*} = {lim}_{k ⟶ \infty} t_{k} \leq r_{2} .$

Proof.

We use a simple inductive argument, (21), (51) and (43). □

Remark 4.

Majorizing sequences using

L^{'}

or

L^{''}

are even tighter than sequence

{s_{n}}

.

4. Special Cases and Numerical Examples

Example 1.

The semi-local convergence of inexact Newton methods was presented in [14] under the conditions

\begin{matrix} ∥ F^{'} {(x_{0})}^{- 1} F (x_{0}) ∥ & \leq & β, \\ ∥ F^{'} {(x_{0})}^{- 1} (F^{'} (x) - F^{'} (y)) ∥ & \leq & γ ∥ x - y ∥, \\ \frac{∥ F^{'} {(x_{0})}^{- 1} s_{n} ∥}{∥ F^{'} {(x_{0})}^{- 1} F (x_{n}) ∥} & \leq & η_{n} \end{matrix}

and

β γ \leq g_{1} (η),

where

g_{1} (η) = \frac{\sqrt{{(4 η + 5)}^{3}} - (2 η^{3} + 14 η + 11)}{(1 + η) {(1 - η)}^{2}} .

More recently, Shen and Li [11] substituted

g_{1} (η)

with

g_{2} (η),

where

g_{2} (η) = \frac{{(1 - η)}^{2}}{(1 + η) (2 (1 + η) - η {(1 - η)}^{2})} .

Estimate (22) can be replaced by a stronger one but directly comparable to (20). Indeed, let us define a scalar sequence

{u_{n}}

(less tight than

{s_{n}}

) by

\begin{matrix} u_{0} = 0, u_{1} & = & 2 γ δ, \\ u_{k + 1} & = & u_{k} + \frac{(\frac{1}{2} ρ (u_{k} - u_{k - 1}) + η)}{1 - ρ u_{k}} (u_{k} - u_{k - 1}), \end{matrix}

(52)

where

ρ = γ L_{0} (1 + η) μ .

Moreover, define recurrent functions

f_{k}

on the interval

[0, 1)

by

f_{k} (t) = \frac{1}{2} ρ c t^{k - 1} + ρ c (1 + t + \dots + t^{k - 1}) t + η - t

and function

g (t) = t^{2} + \frac{t}{2} - \frac{1}{2} .

Set

q = \frac{1}{2} .

Moreover, define function

g_{4}

on the interval

[0, \frac{1}{2})

by

g_{4} (η) = \frac{1 - 2 η}{4 (1 + η)} .

(53)

Then, following the proof of Lemma 1, we obtain:

Lemma 4.

Let

β, γ, δ, L_{0}, L

be positive constants and

η \in [0, \frac{1}{2}) .

Suppose that

γ^{2} L δ \leq g_{4} (η)

(54)

Then, sequence

{u_{k}}

defined by (52) is nondecreasing, bounded from above by

u^{* *} = \frac{c}{1 - q}

and converges to its unique least upper bound

u^{*}

which satisfies

c \leq u^{*} \leq u^{* *} .

Proposition 2.

Suppose that conditions (

A

) and (54) hold with

r = min {r_{1}, u^{*}} .

Then, sequence

{x_{n}}

generated by algorithm NHSS is well defined and converges to

x_{*}

which satisfies

F (x_{*}) = 0 .

These bound functions are used to obtain semi-local convergence results for the Newton–HSS method as a subclass of these techniques. In Figure 1 and Figure 2, we can see the graphs of the four bound functions

g_{1}, g_{2}, {\bar{g}}_{3}

and

g_{4} .

Clearly our bound function

{\bar{g}}_{3}

improves all the earlier results. Moreover, as noted before, function

g_{3}

cannot be used, since it is an incorrect bound function.

In the second example we compare the convergence criteria (22) and (12).

Example 2.

Let

η = 1, Ω_{0} = Ω = U (x_{0}, 1 - λ), x_{0} = 1, λ \in [0, 1) .

Define function F on Ω by

F (x) = x^{3} - λ .

(55)

Then, using (55) and the condition (

A

), we get

γ = \frac{1}{3}, δ = 1 - λ, L = 6 (2 - λ),

L_{0} = 3 (3 - λ)

and

μ = \frac{2 (2 - λ)}{3 - λ} .

Choosing

λ = 0.8 .

, we get

L = 7.2, L_{0} = 6.6, δ = 0.2, μ = 1.0909091, η_{0} = 0.594088525, p = 1.392, q = 0.539681469,

γ^{2} L δ = 0.16

. Let

η = 0.16 < η_{0},

then,

{\bar{g}}_{3} (0.16) = 0.159847474,

ξ = min {0.176715533, 0.20456064} = 0.176715533

. Hence the old condition (12) is not satisfied, since

γ^{2} L δ > {\bar{g}}_{3} (0.16)

. However, the new condition (22) is satisfied, since

γ^{2} L δ < ξ .

Hence, the new results expand the applicability of NHSS method.

The next example is used for the reason already mentioned in (iii) of the introduction.

Example 3.

Consider the two-dimensional nonlinear convection–diffusion equation [7]

\begin{matrix} - (u_{x x} + u_{y y}) + q (u_{x} + u_{y}) & = & - e^{u}, (x, y) \in Ω \\ u (x, y) & = & 0 (x, y) \in \partial Ω \end{matrix}

(56)

where

Ω = (0, 1) \times (0, 1)

and

\partial Ω

is the boundary of

Ω .

Here

q > 0

is a constant to control the magnitude of the convection terms (see [7,15,16]). As in [7], we use classical five-point finite difference scheme with second order central difference for both convection and diffusion terms. If N defines number of interior nodes along one co-ordinate direction, then

h = \frac{1}{N + 1}

and

R e = \frac{q h}{2}

denotes the equidistant step-size and the mesh Reynolds number, respectively. Applying the above scheme to (56), we obtain the following system of nonlinear equations:

\begin{matrix} \bar{A} u + h^{2} e^{u} & = & 0 \\ u & = & {(u_{1}, u_{2}, \dots, u_{N})}^{T}, u_{i} = {(u_{i 1}, u_{i 2}, \dots, u_{i N})}^{T}, i = 1, 2, \dots, N, \end{matrix}

where the coefficient matrix

\bar{A}

is given by

\bar{A} = T_{x} \otimes I + I \otimes T_{y} .

Here, ⊗ is the Kronecker product,

T_{x}

and

T_{y}

are the tridiagonal matrices

T_{x} = t r i d i a g (- 1 - R e, 4, - 1 + R e), T_{y} = t r i d i a g (- 1 - R e, 0, - 1 + R e) .

In our computations, N is chosen as 99 so that the total number of nodes are

100 \times 100

. We use

α = \frac{q h}{2}

as in [7] and we consider two choices for

η_{k}

i.e.,

η_{k} = 0.1

and

η_{k} = 0.01

for all k.

The results obtained in our computation is given in Figure 3, Figure 4, Figure 5 and Figure 6. The total number of inner iterations is denoted by

I T

, the total number of outer iterations is denoted by

O T

and the total CPU time is denoted by t.

5. Conclusions

A major problem for iterative methods is the fact that the convergence domain is small in general, limiting the applicability of these methods. Therefore, the same is true, in particular for Newton–Hermitian, skew-Hermitian and their variants such as the NHSS and other related methods [4,5,6,11,13,14]. Motivated by the work in [7] (see also [4,5,6,11,13,14]) we:

(a): Extend the convergence domain of NHSS method without additional hypotheses. This is done in Section 3 using our new idea of recurrent functions. Examples, where the new sufficient convergence criteria hold (but not previous ones), are given in Section 4 (see also the remarks in Section 3).
(b): The sufficient convergence criterion (16) given in [7] is false. Therefore, the rest of the results based on (16) do not hold. We have revisited the proofs to rectify this problem. Fortunately, the results can hold if (16) is replaced with (12). This can easily be observed in the proof of Theorem 3.2 in [7]. Notice that the issue related to the criteria (16) is not shown in Example 4.5, where convergence is established due to the fact that the validity of (16) is not checked. The convergence criteria obtained here are not necessary too. Along the same lines, our technique in Section 3 can be used to extend the applicability of other iterative methods discussed in [1,2,3,4,5,6,8,9,12,13,14,15,16].

Author Contributions

Conceptualization: I.K.A., S.G.; Editing: S.G., C.G.; Data curation: C.G. and A.A.M.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

References

Argyros, I.K.; Szidarovszky, F. The Theory and Applications of Iteration Methods; CRC Press: Boca Raton, FL, USA, 1993. [Google Scholar]
Argyros, I.K.; Magréñan, A.A. A Contemporary Study of Iterative Methods; Elsevier (Academic Press): New York, NY, USA, 2018. [Google Scholar]
Argyros, I.K.; George, S. Local convergence for an almost sixth order method for solving equations under weak conditions. SeMA J. 2018, 75, 163–171. [Google Scholar] [CrossRef]
Bai, Z.Z.; Golub, G.H.; Ng, M.K. Hermitian and skew-Hermitian splitting methods for non-Hermitian positive definite linear systems. SIAM J. Matrix Anal. Appl. 2003, 24, 603–626. [Google Scholar] [CrossRef]
Bai, Z.Z.; Guo, X.P. The Newton-HSS methods for systems of nonlinear equations with positive-definite Jacobian matrices. J. Comput. Math. 2010, 28, 235–260. [Google Scholar]
Dembo, R.S.; Eisenstat, S.C.; Steihaug, T. Inexact Newton methods. SIAM J. Numer. Anal. 1982, 19, 400–408. [Google Scholar] [CrossRef]
Guo, X.P.; Duff, I.S. Semi-local and global convergence of the Newton-HSS method for systems of nonlinear equations. Numer. Linear Algebra Appl. 2011, 18, 299–315. [Google Scholar] [CrossRef]
Magreñán, A.A. Different anomalies in a Jarratt family of iterative root finding methods. Appl. Math. Comput. 2014, 233, 29–38. [Google Scholar]
Magreñán, A.A. A new tool to study real dynamics: The convergence plane. Appl. Math. Comput. 2014, 248, 29–38. [Google Scholar] [CrossRef]
Ortega, J.M.; Rheinboldt, W.C. Iterative Solution of Nonlinear Equations in Several Variables; Academic Press: New York, NY, USA, 1970. [Google Scholar]
Shen, W.P.; Li, C. Kantorovich-type convergence criterion for inexact Newton methods. Appl. Numer. Math. 2009, 59, 1599–1611. [Google Scholar] [CrossRef]
Argyros, I.K. Local convergence of inexact Newton-like-iterative methods and applications. Comput. Math. Appl. 2000, 39, 69–75. [Google Scholar] [CrossRef] [Green Version]
Eisenstat, S.C.; Walker, H.F. Choosing the forcing terms in an inexact Newton method. SIAM J. Sci. Comput. 1996, 17, 16–32. [Google Scholar] [CrossRef]
Guo, X.P. On semilocal convergence of inexact Newton methods. J. Comput. Math. 2007, 25, 231–242. [Google Scholar]
Axelsson, O.; Catey, G.F. On the numerical solution of two-point singularly perturbed value problems, Computer Methods in Applied Mechanics and Engineering. Comput. Methods Appl. Mech. Eng. 1985, 50, 217–229. [Google Scholar] [CrossRef]
Axelsson, O.; Nikolova, M. Avoiding slave points in an adaptive refinement procedure for convection-diffusion problems in 2D. Computing 1998, 61, 331–357. [Google Scholar] [CrossRef]

Figure 1. Graphs of

g_{1} (t)

(Violet),

g_{2} (t)

(Green),

{\bar{g}}_{3}

(Red).

Figure 1. Graphs of

g_{1} (t)

(Violet),

g_{2} (t)

(Green),

{\bar{g}}_{3}

(Red).

Figure 2. Graphs of

g_{1} (t)

(Violet),

g_{2} (t)

(Green),

{\bar{g}}_{3}

(Red) and

g_{4}

(Blue).

Figure 2. Graphs of

g_{1} (t)

(Violet),

g_{2} (t)

(Green),

{\bar{g}}_{3}

(Red) and

g_{4}

(Blue).

Figure 3. Plots of (a) inner iterations vs.

log (∥ F (x_{k}) ∥)

, (b) outer iterations vs.

log (∥ F (x_{k}) ∥)

, (c) CPU time vs.

log (∥ F (x_{k}) ∥)

for

q = 600

and

x_{0} = e

.

Figure 3. Plots of (a) inner iterations vs.

log (∥ F (x_{k}) ∥)

, (b) outer iterations vs.

log (∥ F (x_{k}) ∥)

, (c) CPU time vs.

log (∥ F (x_{k}) ∥)

for

q = 600

and

x_{0} = e

.

Figure 4. Plots of (a) inner iterations vs.

log (∥ F (x_{k}) ∥)

, (b) outer iterations vs.

log (∥ F (x_{k}) ∥)

, (c) CPU time vs.

log (∥ F (x_{k}) ∥)

for

q = 2000

and

x_{0} = e

.

Figure 4. Plots of (a) inner iterations vs.

log (∥ F (x_{k}) ∥)

, (b) outer iterations vs.

log (∥ F (x_{k}) ∥)

, (c) CPU time vs.

log (∥ F (x_{k}) ∥)

for

q = 2000

and

x_{0} = e

.

Figure 5. Plots of (a) inner iterations vs.

log (∥ F (x_{k}) ∥)

, (b) outer iterations vs.

log (∥ F (x_{k}) ∥)

, (c) CPU time vs.

log (∥ F (x_{k}) ∥)

for

q = 600

and

x_{0} = 6 e

.

Figure 5. Plots of (a) inner iterations vs.

log (∥ F (x_{k}) ∥)

, (b) outer iterations vs.

log (∥ F (x_{k}) ∥)

, (c) CPU time vs.

log (∥ F (x_{k}) ∥)

for

q = 600

and

x_{0} = 6 e

.

Figure 6. Plots of (a) inner iterations vs.

log (∥ F (x_{k}) ∥)

, (b) outer iterations vs.

log (∥ F (x_{k}) ∥)

, (c) CPU time vs.

log (∥ F (x_{k}) ∥)

for

q = 2000

and

x_{0} = 6 e

.

Figure 6. Plots of (a) inner iterations vs.

log (∥ F (x_{k}) ∥)

, (b) outer iterations vs.

log (∥ F (x_{k}) ∥)

, (c) CPU time vs.

log (∥ F (x_{k}) ∥)

for

q = 2000

and

x_{0} = 6 e

.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Argyros, I.K.; George, S.; Godavarma, C.; Magreñán, A.A. Extended Convergence Analysis of the Newton–Hermitian and Skew–Hermitian Splitting Method. Symmetry 2019, 11, 981. https://doi.org/10.3390/sym11080981

AMA Style

Argyros IK, George S, Godavarma C, Magreñán AA. Extended Convergence Analysis of the Newton–Hermitian and Skew–Hermitian Splitting Method. Symmetry. 2019; 11(8):981. https://doi.org/10.3390/sym11080981

Chicago/Turabian Style

Argyros, Ioannis K, Santhosh George, Chandhini Godavarma, and Alberto A Magreñán. 2019. "Extended Convergence Analysis of the Newton–Hermitian and Skew–Hermitian Splitting Method" Symmetry 11, no. 8: 981. https://doi.org/10.3390/sym11080981

APA Style

Argyros, I. K., George, S., Godavarma, C., & Magreñán, A. A. (2019). Extended Convergence Analysis of the Newton–Hermitian and Skew–Hermitian Splitting Method. Symmetry, 11(8), 981. https://doi.org/10.3390/sym11080981

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Extended Convergence Analysis of the Newton–Hermitian and Skew–Hermitian Splitting Method

Abstract

1. Introduction

2. Semi-Local Convergence Analysis

3. Semi-Local Convergence Analysis II

4. Special Cases and Numerical Examples

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI