1. Introduction
Let X and Y stand for Banach spaces and let Ω be a convex and nonempty subset of X. A plethora of applications from diverse disciplines may be solved once they are reduced to a nonlinear equation of the form
F(x) = 0. (1)
This reduction takes place by using mathematical modeling [1,2,3,4,5,6]. Then, a solution, denoted by x*, is to be found that answers the application. The solution may be a number, a vector, a matrix, or, in general, an operator. This task is very challenging in general. Obviously, the solution x* is desired in an analytical form. However, in practice, this is achievable only in rare cases. That is why researchers mostly develop iterative methods convergent to x* under some conditions on the initial information.
Popular methods are the modified Newton's method (MNM) and Newton's method (NM) defined, respectively, for a starting point x_0 ∈ Ω and all n = 0, 1, 2, … by
x_{n+1} = x_n − F'(x_0)^{-1} F(x_n) (2)
and
x_{n+1} = x_n − F'(x_n)^{-1} F(x_n). (3)
Here, F'(x) is the notation for the Fréchet derivative of the operator F [7].
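For a computational illustration of the difference between (2) and (3), a minimal scalar sketch follows; the test function and starting point are hypothetical choices made for this illustration only and are not taken from the article.

```python
# Minimal illustration of the modified Newton method (MNM) and Newton's method (NM)
# for a scalar equation f(x) = 0.  The test function and the starting point are
# illustrative only; they are not taken from the article.

def mnm(f, df, x0, tol=1e-12, max_iter=200):
    """Modified Newton: the derivative is frozen at the starting point x0."""
    d0 = df(x0)                      # F'(x0), computed once
    x = x0
    for _ in range(max_iter):
        step = f(x) / d0
        x -= step
        if abs(step) < tol:
            break
    return x

def nm(f, df, x0, tol=1e-12, max_iter=200):
    """Newton's method: the derivative is re-evaluated at every iterate."""
    x = x0
    for _ in range(max_iter):
        step = f(x) / df(x)          # F'(x_n)^{-1} F(x_n)
        x -= step
        if abs(step) < tol:
            break
    return x

if __name__ == "__main__":
    f = lambda x: x**3 - 2.0
    df = lambda x: 3.0 * x**2
    print("MNM:", mnm(f, df, 1.0))   # linear convergence
    print("NM :", nm(f, df, 1.0))    # quadratic convergence
```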
Numerous articles have been written on the convergence of these two methods [7,8,9,10,11]. The convergence conditions are mostly sufficient and only in rare cases necessary. This observation indicates that there is room to weaken the conditions, especially since these methods may converge even when the conditions are not fulfilled. That is why the objective of this article is to consider alternatives.
Let us look at the main convergence conditions for these methods.
Remark 1. Suppose that there exist parameters and such that the conditions (4) and (5) hold for all . Moreover, consider a parameter such that the condition (6) holds. Then, the corresponding sufficient convergence conditions for MNM and NM are, respectively [9,12,13], (7) and (8). The conditions (7) and (8) are due to Kantorovich [11]. Clearly, it follows that (8) implies (7); that is, the condition (7) is weaker than (8). However, the convergence of MNM is only linear, whereas that of NM is quadratic [11]. Moreover, one can construct even scalar equations where neither condition (7) nor (8) is fulfilled. That is, convergence is not assured by either convergence result, although these methods may converge. Let us look at an elementary but motivational example.
Example 1. Let us consider the domain for , the starting point , and the function defined by . The conditions (4)–(6) are fulfilled if and . By plugging these values into the conditions (7) and (8) and solving for the parameter λ, we deduce that (8) does not hold for any , whereas (7) does hold provided that . However, the convergence is only linear in the MNM case.
Remark 2. In view of Example 1, the question arises: can we weaken the condition (8) but maintain the quadratic convergence of NM? The answer, given in [1,9,12,13], is positive. In those articles we looked at condition (8) and realized that a weakening can take place if in condition (8) we replace:
Case 1. Parameter by a smaller one;
Case 2. Parameter η by a smaller one; and
Case 3. Parameters and η by smaller ones.
Positive results for Case 1 are reported in [1,11]. Additional benefits include more precise error estimates on the distances and an at least as large uniqueness ball. The novelty is that all these benefits are achieved without additional conditions. Relevant research can be found in [14,15,16,17]. In this article, we present similar contributions for Cases 2 and 3.
The idea is to replace NM with a Newton-type method (NTM) defined, for and all , by (12), where are continuous operators and is a bounded sequence of nonzero parameters. The notation stands for and , with .
Next, we present a general auxiliary result for the convergence of iterative methods.
Lemma 1. Let and be normed spaces and let be a continuous operator, where the set D is open. Define the NTM by (13), where is a sequence of linear operators. Suppose that
(i) the sequence exists and is bounded; and
(ii) the sequence is Cauchy.
Then, there exists a parameter such that (14) holds for all .
Proof. The sequence exists and is bounded by hypothesis. Thus, (14) holds. Then, it follows by the method (13) for that , since the sequence is Cauchy. □
Remark 3.
- (a) The space does not have to be complete for the sequence to converge. In the case of method (12), set . Then, solves the equation .
- (b) Possible choices for H are or .
- (c) Special cases of NTM are MNM (if and for all ) and NM (if and for all ).
Other choices lead to Newton-like methods [1]. That is why, by studying the convergence of the method NTM, we also unify the convergence of its specializations. Moreover, we may weaken the convergence criteria and improve the error bounds or the information on the uniqueness of the solution , at least in some cases (see Section 4). Notice that the smallness of is determined by for some . Then, in some cases, this parameter is such that ; hence, this case is favorable to our expectations.
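To make the specializations listed in Remark 3(c) concrete, the following sketch frames NTM as a single driver that takes a rule for building the linear operator used at each step; freezing the derivative at the starting point recovers MNM, while re-evaluating it at every iterate recovers NM. The helper name build_A and the two-dimensional test problem are hypothetical and not part of the article's notation.

```python
import numpy as np

def ntm(F, build_A, x0, tol=1e-12, max_iter=300):
    """Generic Newton-type iteration x_{n+1} = x_n - A_n^{-1} F(x_n).

    build_A(x, x_start, n) returns the linear operator A_n (a matrix here);
    different choices of build_A recover MNM, NM, and Newton-like methods.
    """
    x = np.asarray(x0, dtype=float)
    for n in range(max_iter):
        A = build_A(x, x0, n)
        step = np.linalg.solve(A, F(x))
        x = x - step
        if np.linalg.norm(step) < tol:
            break
    return x

# Illustrative two-dimensional test problem (not from the article).
def F(x):
    return np.array([x[0]**2 - 2.0, x[1]**2 - 3.0])

def J(x):  # Frechet derivative (Jacobian) of F
    return np.diag([2.0 * x[0], 2.0 * x[1]])

x_start = np.array([1.0, 1.0])
x_nm  = ntm(F, lambda x, xs, n: J(x),  x_start)   # NM:  A_n = F'(x_n), quadratic
x_mnm = ntm(F, lambda x, xs, n: J(xs), x_start)   # MNM: A_n = F'(x_0), linear
print("NM :", x_nm)     # ~ [sqrt(2), sqrt(3)]
print("MNM:", x_mnm)
```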
The semilocal convergence for the method (12) is given in Section 2, followed by the local convergence in Section 3. The examples and the concluding remarks appear in Section 4 and Section 5, respectively.
2. The Semi-Local Convergence of the Method NTM
We introduce certain parameters and real sequences that are important in the convergence of the method (12). Let for be given parameters. Define the parameters . Some of these parameters can be zero (see also Example 2). Moreover, define the sequence for all by (16). This sequence appears often in the study of Newton-like methods [1].
The first general convergence result for the sequence follows.
Lemma 2. Suppose that the condition (17) holds for and all . Then, the sequence generated by the Formula (16) is convergent and satisfies .
Proof. It follows from (16) and (17) that the sequence is nondecreasing and bounded from above by ; as such, it is convergent to . □
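Because the recurrence (16) is not reproduced here, the following sketch uses the classical Newton–Kantorovich majorizing sequence as a stand-in, only to show how one checks numerically that such a scalar sequence is nondecreasing and bounded above (hence convergent), which is precisely the role Lemma 2 assigns to (16). The constants L and eta are hypothetical.

```python
# Stand-in illustration: the classical Kantorovich majorizing sequence for NM,
#   t_0 = 0, t_1 = eta, t_{n+2} = t_{n+1} + L (t_{n+1} - t_n)^2 / (2 (1 - L t_{n+1})),
# used here only to show how monotonicity and boundedness of a majorizing
# sequence can be verified numerically.  L and eta are hypothetical values.
def majorizing_sequence(L, eta, n_terms=25):
    t = [0.0, eta]
    for _ in range(n_terms - 2):
        tn, tn1 = t[-2], t[-1]
        t.append(tn1 + L * (tn1 - tn) ** 2 / (2.0 * (1.0 - L * tn1)))
    return t

L, eta = 1.0, 0.4   # satisfies the classical condition 2*L*eta <= 1
t = majorizing_sequence(L, eta)
assert all(t[i] <= t[i + 1] for i in range(len(t) - 1)), "sequence must be nondecreasing"
print("the sequence appears to converge to about", t[-1])
```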
Remark 4. We can provide some stronger alternatives to the verification of (17). It is convenient to introduce some polynomials and functions defined on the interval in order to show the second convergence result for the sequence , as follows: . Thus, we get by these definitions that . Define the function on the interval by . Then, we get . Set . Then, the function can be rewritten as . By these definitions, and . It then follows by the intermediate value theorem that the polynomial has zeros in the interval . Notice that by Descartes' rule of signs there is only one such zero, which we denote by γ. Suppose that
, where is the smallest positive solution, assured to exist by ;
or
.
Notice that these conditions are verified in Example 2, which is used in Theorem 1 below.
Lemma 3. Suppose that any of the conditions – holds. Then, the sequence generated by the Formula (16) is convergent with and .
Proof. Mathematical induction is utilized to show the assertion (19) for all . This assertion holds if , by (16) and the second condition in – . It follows that . Suppose that the assertion (19) holds for all integer values smaller than k. Then, we get and . Then, evidently, (19) holds if the estimate (20) does. Assertion (20) motivates the introduction of the recurrent polynomials , and it suffices to show, instead of (20), that (21) holds. However, , because . Thus, we have . Consequently, the estimate can be shown instead of (21), which is true by any of the last conditions in – . The induction for the assertion (19) is completed. Therefore, the sequence is nondecreasing and bounded from above by . Hence, it is convergent to some . □
The notations and are used for the open and closed balls in X of center x and radius , respectively. We denote by the space of bounded linear operators from Y to X.
The following set of conditions is used in the semi-local convergence of NTM.
There exist and such that and .
Define the region as .
The conditions of any of the last two lemmas are fulfilled, and .
Next, the semilocal convergence of the method NTM follows.
Theorem 1. Suppose that the conditions – and any of the conditions , or (17) hold. Then, the sequence generated by the NTM is well defined in , remains in , and is convergent to some solving the equation . Moreover, the following assertion holds: .
Proof. Mathematical induction is used to show the assertion (23) for all . The definition of (16) and the condition imply , so the iterate and the assertion (23) hold for . Let . It follows from the conditions – that (24). Then, the estimate (24) and the Banach perturbation lemma on invertible linear operators [11,18,19] assert that and . Moreover, the iterate is well defined. The definition of NTM implies the identity (26). By applying the condition to the identity (26), we obtain in turn the estimates and . By summing up the preceding estimates, we get , and thus and . Hence, the iterate , and the estimate (23) holds for all k. Moreover, the sequence is complete as convergent. Thus, the sequence is also complete in the Banach space X. Hence, it is convergent to some . Furthermore, by the continuity of F, Remark 3(a), and letting in the estimate (27), we obtain . □
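The analytic tool driving this proof is the Banach perturbation lemma: if a linear operator A satisfies ||I − A|| < 1, then A is invertible and ||A^{-1}|| ≤ 1/(1 − ||I − A||). A small numerical check of this bound is sketched below; the perturbed matrix is illustrative only.

```python
import numpy as np

# Numerical check of the Banach perturbation lemma used in the proof:
# if ||I - A|| < 1, then A is invertible and ||A^{-1}|| <= 1 / (1 - ||I - A||).
rng = np.random.default_rng(0)
n = 5
E = 0.05 * rng.standard_normal((n, n))   # small perturbation, illustrative only
A = np.eye(n) + E
norm_IA = np.linalg.norm(E, 2)           # ||I - A|| = ||E|| (spectral norm)
assert norm_IA < 1.0
lhs = np.linalg.norm(np.linalg.inv(A), 2)
rhs = 1.0 / (1.0 - norm_IA)
print(f"||A^-1|| = {lhs:.4f} <= {rhs:.4f} = 1/(1 - ||I - A||): {lhs <= rhs}")
```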
Next, a result is presented concerning the uniqueness of the solution for the equation .
Proposition 1. Suppose that
there exists a solution of the equation for some ;
the condition holds on the ball ; and
there exists such that (28) holds.
Define the region . Then, the equation is uniquely solvable by the element z in the region .
Proof. Let be a solution of the equation . Then, we have . Define the linear operator . By using the conditions and (28), we obtain in turn that (29), where we also used and . It follows by (29) that . Consequently, we have . That is, we conclude that . □
Remark 5. The assumption can be dropped if the second condition in is replaced by any of or , and , where . Then, the proof of Theorem 1 still goes through. The limit point can be replaced by or given in closed form in the condition .
Notice that only the condition is used in Proposition 1. However, if all the conditions are used, then we can set .
An alternative to the majorizing sequence (16) and the convergence condition can be obtained as follows. Let , , and .
Moreover, define the sequence by . Then, if the condition (31) holds, it was shown in [8] that the sequence is nondecreasing and convergent to . The parameter is the smaller of the two roots of the quadratic polynomial , with the larger being given by . Moreover, simple induction shows , and . Hence, the sequence and the conditions (31) can replace the sequence and the conditions in Theorem 1, respectively. Concerning the uniqueness of the solution in this case, we already have Proposition 1. However, the uniqueness of the solution (see [8]) can also be established in the region . In practice, we shall choose the largest region and the tighter sequence, provided that both the conditions and (31) hold.
The sequence of numbers can be replaced by a sequence of continuous operators from Ω into X. In this case, the proof of Proposition 1 also goes through, provided that ; that is, the operators and must commute. A more general method than (12) is given by the Picard iteration , where are continuous operators and the operator T has the same fixed points as P. Suppose that P satisfies or not, and or not. However, , , and . Then, according to the contraction mapping principle [2], the operator P has a fixed point, provided also that it maps a closed ball into itself. A possible choice for P and T can be .
3. Local Convergence
Let be given parameters with and . Moreover, define the parameter r by (33), provided that . These parameters are connected with the operators appearing in the NTM through the conditions .
Suppose that
there exists a solution of the equation such that and ;
for all .
Define the region and for each and , where the parameter r is given by the Formula (33).
Next, the local convergence analysis uses the parameters “l” and the conditions .
Theorem 2. Suppose that the conditions – hold and that the starting point . Then, the sequence generated by NTM is such that , and (34) holds, where .
Proof. It follows by the conditions , , the hypothesis , and the radius r that . Thus, the operator is invertible and . Moreover, the iterate is well defined by NTM for , and we can write in turn that . By composing the expression in the bracket with and using the conditions , we see that it is bounded above by , leading together with (36) and (37) to the estimate (34) for , where we also used that . Hence, the iterate . Then, the induction for the estimate (34) is completed if we simply replace the iterates by in the preceding calculations. Therefore, we have , where implies that the iterate and . □
Remark 6. The last condition in can be dropped in view of the alternative estimate . By replacing this estimate in the proof of Theorem 2, we see that the radius becomes , where we suppose . Moreover, the new sequence is defined by . Even at this level of generality, Theorem 2 improves earlier results in the interesting case of NM. Indeed, we have in this case , , , , implying that . The corresponding radius, given independently by Traub [4] and Rheinboldt [2], is , where satisfies . However, then the estimate holds, because and . Let us look at the function F defined by for all . Then, we have, for , that , , and . Hence, we have . Moreover, . Furthermore, the new sequence is tighter than the one given in [2,4] and defined by . Finally, notice that the second condition in can be replaced by . Then, it follows again that .
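A short numerical comparison of convergence radii in the NM case is sketched below. It assumes the classical Traub–Rheinboldt radius 2/(3ℓ) and the center-Lipschitz refinement 2/(2ℓ0 + ℓ) with ℓ0 ≤ ℓ; the exact expressions of this article appear in the elided displays, so both the formulas and the sample constants here are stated as assumptions.

```python
# Comparison of two local convergence radii for Newton's method, assuming the
# classical forms (the article's exact expressions appear in the elided displays
# and may differ):
#   r_tr  = 2 / (3 * l)         Traub-Rheinboldt radius (Lipschitz constant l)
#   r_new = 2 / (2 * l0 + l)    refinement using a center-Lipschitz constant l0 <= l
l0, l = 1.7, 2.7                # hypothetical constants with l0 <= l
r_tr = 2.0 / (3.0 * l)
r_new = 2.0 / (2.0 * l0 + l)
print(f"Traub-Rheinboldt radius: {r_tr:.4f}")
print(f"Refined radius         : {r_new:.4f}  (never smaller, since l0 <= l)")
```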
Proposition 2. Suppose that there exists a solution of the equation with ; the condition holds in the set ; and there exists such that (39). Define the region . Then, the only solution of the equation in the region is .
Proof. Define the linear operator S by for some with . It then follows by and (39) that . Then, by the continuity of F, the invertibility of S, and the identity , we conclude that . □
Notice that if all hypotheses of Theorem 2 hold, then we can choose .
4. Special Cases and Numerical Problems
The operators appearing in the method (12) are specialized in some interesting cases. Then, a favorable comparison with existing methods is given.
Example 2. Let us consider the case of NM. We shall verify the parameters in Theorem 1. It follows from (3) and (12) that the conditions ()–() are verified provided that , , and , where is to be determined once the operator is specified. The delta parameters are , , , , and . Then, the conditions () reduce to and , respectively. This system of inequalities can be written for as , where . Notice that , and . It follows that , but not vice versa, unless . Hence, we see that the general Theorem 1, when reduced to this case, provides a weaker convergence criterion than Kantorovich's (8). Let us return to Example 1 given in the introduction. Then, we have , because for each . It follows by the last condition in () that for each . The condition (41) is verified provided that , which improves the convergence range for NM. Recall that the Kantorovich condition (8) does not hold for any .
Application 1. Set and .
Then, NTM (12) reduces to (43). Further special cases of the method (43) are Newton's method, if , and the simplified Newton's method, provided that . Other choices of the operators are possible [20,21,22]. An interesting choice seems to be (44). Next, some local and semilocal convergence results are presented under these choices.
Theorem 3. Suppose that
(i) the inverses are well defined;
(ii) there exists a solution of the Equation (1); and
(iii) there exists a parameter such that (45) holds for each .
Then, the following assertions hold: (46) and (47). Moreover, if the operators exist and are uniformly bounded, then the convergence order of the method (43) is two.
Proof. In view of the choice (44), we only need to show (46), because (47) then follows from it. We can write (48). By applying (44) and (45) to (48), we get in turn , leading to (46) by (43). □
Remark 7. Set in (44); then . Moreover, set . Then, the method (43) further reduces to (49). Suppose . Then, by Theorem 3, we deduce . Thus, the method (49) has convergence order two, like Newton's method, but the computational ease of the simplified Newton's method, since the operator is computed only once. Method (49) can be used provided that there exists an operator h such that (52). Notice that . However, is known, because h is given. Hence, is determined. As an example, define the scalar function F to be (53). Then, we have and . A second local convergence result follows under the condition (52).
Proposition 3. Suppose that
(i) the operator exists and ; and
(ii) there exists such that (54) holds, where .
Then, the following assertions hold: and .
Proof. Mathematical induction is given immediately by (51) and (54). □
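The construction in Remark 7 depends on an operator h linking the derivative of F to F itself; the precise relation is the elided condition (52). Assuming, purely for illustration, a relation of the form F'(x) = h(F(x)), the sketch below shows why the resulting iteration needs no separate derivative evaluations while retaining second-order convergence; the choice F(x) = e^x − c with h(t) = t + c is hypothetical and is not the article's example (53).

```python
import math

# Hypothetical illustration of the idea behind Remark 7: assume the (elided)
# relation (52) has the form F'(x) = h(F(x)) for a known function h.  Then the
# Newton-type step
#     x_{n+1} = x_n - F(x_n) / h(F(x_n))
# requires only evaluations of F, yet reproduces Newton's method exactly.
c = 3.0
F = lambda x: math.exp(x) - c        # illustrative F; solution x* = ln(c)
h = lambda t: t + c                  # then F'(x) = e^x = F(x) + c = h(F(x))

x = 1.0                              # illustrative starting point
for n in range(20):
    Fx = F(x)
    x_new = x - Fx / h(Fx)
    if abs(x_new - x) < 1e-14:
        x = x_new
        break
    x = x_new
print(x, math.log(c))                # both approximately 1.0986
```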
Example 3. The method (49) for F, given by (53), is defined by and converges to with order two, provided that for , because ς satisfies (54).
Proposition 4. Suppose that
(i) the operator T exists with ;
(ii) the operator h is Lipschitz continuous and for some and ; and
(iii) , where .
Then, the following assertion holds: the iteration (49) has a unique solution , and .
Proof. Define the operator . Then, the method (49) can be written as . By using and (ii), we obtain . Then, the result follows from the celebrated contraction mapping principle. □
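The contraction mapping principle invoked here can be visualised with a short Picard (fixed-point) iteration; the map P(x) = cos x below is an illustrative scalar contraction on [0, 1], not an operator from the article.

```python
import math

# Illustration of the contraction mapping principle behind Proposition 4:
# P(x) = cos(x) maps the closed interval [0, 1] into itself and is a contraction
# there (|P'(x)| = |sin x| <= sin 1 < 1), so the Picard iteration x_{n+1} = P(x_n)
# converges to the unique fixed point.  P is an illustrative map only.
P = math.cos
x = 1.0
for n in range(200):
    x_new = P(x)
    if abs(x_new - x) < 1e-14:
        x = x_new
        break
    x = x_new
print("fixed point of cos:", x)      # ~ 0.7390851332
```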
So far, we have presented local convergence results. Next, we provide a semilocal convergence result.
Notice that if h and F are and () Lipschitz continuous, respectively, then by (52) the operator is Lipschitz continuous with parameter . Moreover, we obtain . Set and define the parameter . Furthermore, define the parameters and .
Theorem 4. Suppose that . Then, the Equation (1) has a solution , which is unique in .
Proof. This follows immediately from the contraction mapping principle. □
Remark 8. The contraction mapping principle assures that , where . However, by Theorem 3, the convergence order is two for , where m is the smallest integer satisfying . Hence, we have improved earlier works in this case.
Application 2. Let and consider the fixed-point problem (58). It is known that a fixed point satisfies ; then, clearly, the method (60) [23,24] is another specialization of the method (12). If , we get NM (3) and the method (61) of [23] for and , respectively.
Example 4. Let us apply the methods (3), (60), and (61) to solve the Equation (58) with the function defined by and . Figure 1 contains graphs of the nonlinear functions , . The intersection point of the graphs and is the solution of the corresponding equation. From graphs (A) and (B), we see that and are solutions of the equations and , respectively. It is known that the condition is sufficient for the convergence of the methods (60) and (61) [23,24]. It is possible to find intervals on which this condition holds in both cases (see Figure 2). Table 1 shows the number of iterations needed to obtain the approximate solutions. The results are obtained under the condition . The initial approximations were chosen from the intervals and . The approximations obtained at each iteration by the methods (60) and (61) are contained in the specified intervals. Therefore, the condition was fulfilled. The obtained results show that the method (60) converges faster than Newton's and Stirling's methods. Define the scalar function and choose . Then, the method (60) gives the exact solution after only one iteration. The method (61) converges after four iterations. However, NM does not converge, provided that the function φ connects smoothly the other part of the function with . Note that in the neighborhood of the point , NM converges more slowly than the methods (60) and (61). More advantages of the methods (60) and (61) over Newton's and other methods can be obtained along the same lines as in Application 1. Some possible choices of the function φ are given by and for each .
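Since the displays for the problem (58) and the methods (60) and (61) are elided above, the following sketch only illustrates the general setting of Example 4: a fixed-point equation x = φ(x) solved both by NM applied to F(x) = x − φ(x) and by the textbook form of Stirling's method (which may differ in detail from the elided method (61)). The map φ(x) = e^{−x} is a hypothetical test choice, not the article's function.

```python
import math

# General setting of Example 4: solve a fixed-point problem x = phi(x).
# phi(x) = exp(-x) is a hypothetical test map; the "Stirling" iteration below is
# the textbook form and may differ in detail from the elided method (61).
phi  = lambda x: math.exp(-x)
dphi = lambda x: -math.exp(-x)

def newton_fixed_point(x, iters=50, tol=1e-14):
    # NM (3) applied to F(x) = x - phi(x)
    for _ in range(iters):
        step = (x - phi(x)) / (1.0 - dphi(x))
        x -= step
        if abs(step) < tol:
            break
    return x

def stirling(x, iters=50, tol=1e-14):
    # Textbook Stirling iteration: the derivative is evaluated at phi(x_n)
    for _ in range(iters):
        step = (x - phi(x)) / (1.0 - dphi(phi(x)))
        x -= step
        if abs(step) < tol:
            break
    return x

print(newton_fixed_point(0.5), stirling(0.5))   # both ~ 0.567143
```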