Hybrid Chebyshev-Type Methods for Solving Nonlinear Equations

Ioannis K. Argyros; Santhosh George

doi:10.3390/math13010074

and

¹

Department of Computing and Mathematical Sciences, Cameron University, Lawton, OK 73505, USA

²

Department of Mathematical & Computational Science, National Institute of Technology Karnataka, Surathkal, Mangaluru 575 025, India

^*

Author to whom correspondence should be addressed.

Mathematics2025, 13(1), 74;https://doi.org/10.3390/math13010074

This article belongs to the Special Issue New Trends and Developments in Numerical Analysis: 2nd Edition

Version Notes

Order Reprints

Abstract

Chebyshev-type methods have replaced the Chebyshev method in practice for solving nonlinear equations in abstract spaces. These methods are of the same R-order of three. However, they are easier to deal with, since the computationally expensive second derivative of the operator involved does not appear on these methods. However, the invertibility of the first derivative is still required at each step of the iteration. In this article, the inverse is replaced by a finite sum of linear operators. The convergence of the new Hybrid Chebyshev-Type Method (HCTM) is established under relaxed generalized continuity assumptions on the derivative and majorizing sequences. The iterates of the new methods converge to the original ones, but they are easier to find. Moreover, the numerical examples demonstrate that the new iterates converge essentially as fast to the solution. The methodology of this article can be used on other methods with inverses along the same lines due to its generality.

Keywords:

Chebyshev method; optimized and hybrid Chebyshev-type methods; Banach space; convergence; inverse of an operator

MSC:

65G99; 65 H10; 49H17; 49M15

1. Introduction

Let

X, Y

denote Banach spaces and

D \subset X

be a convex open set. A plethora of problems can be written using mathematical modeling like the following:

\begin{matrix} Λ (x) = 0, \end{matrix}

(1)

where

Λ : D \subset X ⟶ Y

is a Fréchet-differentiable operator [1,2,3,4,5,6,7,8]. A solution

x^{*} \in D

is needed in closed form. However, this is achievable only in special cases. That is why mostly iterative methods have been used to produce sequences converging to

x^{*} .

Newton’s is without a doubt the most popular among methods of convergence of order two. It is defined for

x_{0} \in D

and all

n = 0, 1, 2, \dots

by the following:

\begin{matrix} x_{n + 1} = x_{n} - Λ^{'} {(x_{n})}^{- 1} Λ (x_{n}) . \end{matrix}

(2)

The implementation of Newton’s method requires the inversion of the linear operator

Λ^{'} (x_{n})

at each step, which may not be possible or computationally expensive. That is why in [9], we suggested hybrid Newton and Newton-like methods. In the case of Newton’s method, the hybrid analog is defined for

M \in L (X, Y)

(the space of all linear operators that are bounded) being an invertible operator,

Δ = M^{- 1} (M - Λ^{'} (x))

and

B = B_{m} (x) = I + Δ + \dots + Δ^{k}, k

being a natural number by the following:

\begin{matrix} x_{n + 1} = x_{n} - B M^{- 1} Λ (x_{n}) . \end{matrix}

(3)

Note, that

{lim}_{k ⟶ + \infty} B_{m} M^{- 1} = Λ^{'} .

The inverse of the linear operator is computed only once. The convergence analysis of method (3) and the numerical examples demonstrate that the number of iterations needed to reach the same error tolerance for (2) and (3) is essentially the same. Motivated by these developments and in order to consider methods of an order higher than two, we look at the Chebyshev method [10,11,12,13]:

\begin{matrix} x_{n + 1} = x_{n} - Λ^{'} {(x_{n})}^{- 1} Λ (x_{n}) - \frac{1}{2} Λ^{'} {(x_{n})}^{- 1} Λ^{″} (x_{n}) Λ^{'} {(x_{n})}^{- 1} Λ (x_{n}), \end{matrix}

(4)

and an optimization of the Chebysehev method:

\begin{matrix} y_{n} & = & x_{n} - Λ^{'} {(x_{n})}^{- 1} Λ (x_{n}), \\ z_{n} & = & x_{n} + λ (y_{n} - x_{n}), λ \in (0, 1], \\ A_{n} & = & A (x_{n}, z_{n}) = \frac{1}{λ} Λ^{'} {(x_{n})}^{- 1} (Λ^{'} (x_{n}) - Λ^{'} (z_{n})), \\ x_{n + 1} & = & y_{n} + \frac{1}{2} A_{n} (y_{n} - x_{n}) . \end{matrix}

(5)

Method (5) is of R-order three but does not require the computation of

Λ^{″} (x_{n})

at each step as in (4), which is also of order three [10,11,12,13]. However, the implementation of (5) presents the same difficulties as (2). That is why we introduce the HCTM defined by the following:

\begin{matrix} y_{n} & = & x_{n} - B M^{- 1} Λ (x_{n}), \\ z_{n} & = & x_{n} + λ (y_{n} - x_{n}), λ \in (0, 1], \\ A^{1} (x_{n}, z_{n}) & = & \frac{1}{λ} B M^{- 1} (Λ^{'} (x_{n}) - Λ^{'} (z_{n})), \\ x_{n + 1} & = & y_{n} + \frac{1}{2} A^{1} (x_{n}, z_{n}) (y_{n} - x_{n}) . \end{matrix}

(6)

In this article, we study the local as well as the semi-local analysis of methods (5) and (6). The analysis of convergence relies on generalized continuity assumptions, which relax the usual Lipschitz or Hölder conditions on

Λ^{'} .

In particular, the semi-local analysis also uses majorizing sequences to control the sequence

{x_{n}} .

The conclusions are the same as the case of methods (2) and (3).

The following definitions are used in this paper.

Definition 1.

The computational order of convergence of a sequence

{x_{n}}_{n \geq 0}

is defined by the following:

{\bar{ρ}}_{n} = \frac{ln | e_{n + 1} / e_{n} |}{ln | e_{n} / e_{n - 1} |},

where

x_{n - 1}, x_{n}, x_{n + 1}

are three consecutive iterations near the root α and

e_{n} = x_{n} - α

[7,12,14].

Definition 2.

The approximated computational order of convergence of a sequence

{x_{n}}_{n \geq 0}

is defined by the following:

{\hat{ρ}}_{n} = \frac{ln | {\hat{e}}_{n + 1} / {\hat{e}}_{n} |}{ln | {\hat{e}}_{n} / {\hat{e}}_{n - 1} |},

where

{\hat{e}}_{n} = x_{n} - x_{n - 1} .

x_{n}, x_{n - 1}, x_{n - 2}

are three consecutive iterates [7,12,14].

The rest of the article is structured as follows. The local followed by semi-local analyses of method (5) are presented in Section 2 and Section 3, respectively. The same is carried out for method (6) in Section 4 and Section 5. The numerical examples can be found in Section 6 and the conclusions in Section 7.

2. Local Analysis of Method (5)

The analysis uses certain criteria. Let

T = [0, + \infty) .

It is also convenient to employ the abbreviations CNDF for a continuous and nondecreasing function and SPS for the smallest positive solution. Suppose the following:

(H1): There exists a CNDF $w_{0} : T ⟶ T$ such that the equation $w_{0} (t) - 1 = 0$ has an SPS. Denote such a solution by $ρ_{0} .$ Set $T_{0} = [0, ρ_{0}) .$
(H2): There exists a CNDF $w : T_{0} ⟶ T$ for function $h_{1} : T_{0} ⟶ T$ defined by the following:

$h_{1} (t) = \frac{\int_{0}^{1} w ((1 - θ) t) d θ}{1 - w_{0} (t)}$

such that the equation $h_{1} (t) - 1 = 0$ has an SPS in the interval $(0, ρ_{0}) .$ Denote such a solution by $r_{1} .$
(H3): For the function $h_{2} : T_{0} ⟶ T$ defined by $h_{2} (t) = [(1 - λ) + λ h_{1} (t)] t,$ the equation $h_{2} (t) - 1 = 0$ has an SPS in $(0, ρ_{0}$ ). Denote such a solution by $r_{2} .$
Denote the functions $\bar{w} : T_{0} ⟶ T, \tilde{w} : T_{0} ⟶ T$ and $h_{3} : T_{0} ⟶ T$ by the following:

$\bar{w} (t) = \{\begin{matrix} w ((1 + h_{2} (t)) t \\ o r \\ w_{0} (t) + w_{0} (h_{2} (t) t), \end{matrix}$

$\tilde{w} (t) = \frac{1}{2 λ} \frac{\bar{w} (t)}{1 - w_{0} (t)}$

and

$h_{3} (t) = (\tilde{w} (t) + (1 + \tilde{w} (t)) h_{1} (t)) t .$

Note, that in practice, we select the smallest of the two versions of the functions $\bar{w}$ and $\tilde{w} .$ The real functions $h_{1}, h_{2}$ and $h_{3}$ are used to majorize the error distances appearing in Theorem 1 that follows.
(H4): The equation $h_{3} (t) - 1 = 0$ has an SPS in $(0, ρ_{0}) .$ Denote such a solution by $r_{3} .$ Set

$r = min {r_{j}}, j = 1, 2, 3$

(7)

and $T - 1 = [0, r] .$ It follows by these definitions that for all $t \in T_{1}$ ,

$0 \leq w_{0} (t) < 1$

(8)

and

$0 \leq h_{j} (t) < 1 .$

(9)

There exists a relationship between the functions $w_{0}$ and w with the operators on method (5).
(H5): There exists a solution $x^{*} \in D$ of the equation $Λ (x) = 0$ and an invertible operator $M \in L (X, Y)$ such that for each $u \in D$

$∥ M^{- 1} (Λ^{'} (u) - M) ∥ \leq w_{0} (∥ u - x^{*} ∥) .$

Set $D_{0} = D \cap S (x^{*}, ρ_{0}) .$ The notation $S (x^{*}, ρ_{0})$ denotes the open ball with a center at $x^{*}$ and a radius $ρ_{0} .$ Moreover, $S [x^{*}, ρ_{0})]$ is its closure.
(H6): $∥ M^{- 1} (Λ^{'} (u_{2}) - Λ^{'} (u_{1})) ∥ \leq w (∥ u_{2} - u_{1} ∥)$ for each $u_{1}, u_{2} \in D_{0}$
and
(H7): $S [x^{*}, r] \subset D .$

Remark 1.

(i): The parameter r is shown to be a radius of convergence for the sequence ${x_{n}}$ given by formula (5) in Theorem 1.
(ii): Some choices for M can be $M = I,$ where I is the identity operator on X or $M = Λ^{'} (x^{*}) .$ In the latter case, $x^{*}$ is a simple solution. Note, that this is not assumed or necessarily implied by conditions (H1)–(H2). Consequently, method (5) can be used to find solutions $x^{*}$ of a multiplicity greater than one. Other choices for M are also possible, provided that criteria (H5) and (H6) hold such that $M = Λ^{'} (\bar{x}),$ where $\bar{x} \in D$ is an auxiliary point [11].
(iii): The smaller of the two versions of the function $\bar{w}$ is used. However, if these versions cross on the interval $T_{0},$ say, e.g., as

$w ((1 + h_{2} (t)) t) \leq w_{0} (t) + w_{0} (h_{2} (t) t)$

for $t \in [0, {\bar{ρ}}_{0}]$ and

$w_{0} (t) + w_{0} (h_{2} (t) t) \leq w ((1 + h_{2} (t)) t)$

for $t \in [{\bar{ρ}}_{0}, ρ_{0}],$ where ${\bar{ρ}}_{0} \in [0, ρ_{0}],$ then we choose

$\bar{w} (t) = \{\begin{matrix} w ((1 + h_{2} (t)) t) & f o r t \in [0, {\bar{ρ}}_{0}] \\ w_{0} (t) + w_{0} (h_{2} (t) t) & f o r t \in [{\bar{ρ}}_{0}, ρ_{0}] . \end{matrix}$

The local analysis of convergence uses conditions (H1)–(H7) and the preceding notation. Let

D_{1} = S (x^{*}, r) - {x^{*}} .

Theorem 1.

Suppose that criteria (H1)–(H7) hold and

x_{0} \in D_{1} .

Then, the following assertions hold for method (5) and all

n = 0, 1, 2, \dots

:

{x_{n}} \subset S (x^{*}, r),

(10)

∥ y_{n} - x^{*} ∥ \leq h_{1} (∥ x_{n} - x^{*} ∥) ∥ x_{n} - x^{*} ∥ \leq ∥ x_{n} - x^{*} ∥ < r,

(11)

∥ z_{n} - x^{*} ∥ \leq h_{2} (∥ x_{n} - x^{*} ∥) ∥ x_{n} - x^{*} ∥ \leq ∥ x_{n} - x^{*} ∥,

(12)

∥ x_{n + 1} - x^{*} ∥ \leq h_{3} (∥ x_{n} - x^{*} ∥) ∥ x_{n} - x^{*} ∥ \leq ∥ x_{n} - x^{*} ∥

(13)

and the sequence

{x_{n}}

converges to

x^{*} .

Proof.

Induction on n shall establish assertions (10)–(12). Clearly, Equation (10) holds if

n = 0 .

Let

u \in S (x^{*}, r) .

It follows by (7), (8), and (H5) that:

∥ M^{- 1} (Λ^{'} (u) - M) ∥ \leq w_{0} (∥ u - x^{*} ∥) \leq w_{0} (r) < 1 .

(14)

The estimate (14) and the Banach standard perturbation Lemma on linear operators [10,13,15,16,17] imply that

Λ^{'} {(u)}^{- 1} \in L (Y, X)

and

∥ Λ^{'} {(u)}^{- 1} M ∥ \leq \frac{1}{1 - w_{0} (∥ u - x^{*} ∥)} .

(15)

In particular, if

u = x_{0},

then

Λ^{'} {(x_{0})}^{- 1} \in L (Y, X),

the iterate

y_{0}

is well-defined, and we can write as follows:

\begin{matrix} y_{0} - x^{*} & = & x_{0} - x^{*} - Λ^{'} {(x_{0})}^{- 1} Λ (x_{0}) \\ = & [Λ^{'} {(x_{0})}^{- 1} Λ^{'} (x^{*})] \int_{0}^{1} Λ^{'} {(x^{*})}^{- 1} [Λ^{'} (x^{*} + θ (x_{0} - x^{*})) \\ - Λ^{'} (x_{0})] d θ (x_{0} - x^{*}) . \end{matrix}

(16)

By applying (7), (9) (for

j = 1

), Equation (15) (for

u = x_{0}

), and (H6), we obtain in turn by (16) the following:

\begin{matrix} ∥ y_{0} - x^{*} ∥ & \leq & \frac{\int_{0}^{1} w ((1 - θ) ∥ x_{0} - x^{*} ∥) d θ}{1 - w_{0} (∥ x_{0} - x^{*} ∥)} ∥ x_{0} - x^{*} ∥ \\ = & h_{1} (∥ x_{0} - x^{*} ∥) ∥ x_{0} - x^{*} ∥ \leq ∥ x_{0} - x^{*} ∥ < r . \end{matrix}

(17)

Hence, the iterate

y_{0} \in S (x^{*}, r)

and the assertion (11) hold if

n = 0 .

Then, from the second substep of method (5) for

n = 0,

we have in turn that:

\begin{matrix} z_{0} - x^{*} & = & x_{0} - x^{*} + λ (y_{0} - x^{*} + x^{*} - x_{0}) \\ = & (1 - λ) (x_{0} - x^{*}) + λ (y_{0} - x^{*}) . \end{matrix}

(18)

By (7), (9) (for

j = 2

), and (17) we obtain in turn that:

\begin{matrix} ∥ z_{0} - x^{*} ∥ & = & (1 - λ) ∥ x_{0} - x^{*} ∥ + λ ∥ y_{0} - x^{*} ∥ \\ \leq & h_{2} (∥ x_{0} - x^{*} ∥) ∥ x_{0} - x^{*} ∥ \leq ∥ x_{0} - x^{*} ∥ < r . \end{matrix}

(19)

Thus, the iterate

z_{0} \in S (x^{*}, r)

and the assertion (12) hold if

n = 0 .

Note, that if

λ = 1

, then

h_{1} = h_{2}

. We need some estimates as follows:

\begin{matrix} \frac{1}{2 λ} ∥ Λ^{'} {(x_{0})}^{- 1} M ∥ ∥ M^{- 1} (Λ . (x_{0}) - Λ^{'} (z_{0})) ∥ \\ \leq & \frac{1}{2 λ} \frac{w (∥ x_{0} - z_{0} ∥)}{1 - w_{0} (∥ x_{0} - x^{*} ∥)} \\ \leq & \frac{w (∥ x_{0} - x^{*} ∥ + ∥ z_{0} - x^{*} ∥)}{2 λ (1 - w_{0} (∥ x_{0} - x^{*} ∥))} \\ \leq & {\tilde{w}}_{0} \end{matrix}

(20)

or:

\begin{matrix} \frac{1}{2 λ} ∥ Λ^{'} {(x_{0})}^{- 1} M ∥ ∥ M^{- 1} (Λ . (x_{0}) - Λ^{'} (z_{0})) ∥ \\ \leq & \frac{1}{2 λ (1 - w_{0} (∥ x_{0} - x^{*} ∥))} [∥ M^{- 1} (Λ^{'} (x_{0}) - M) ∥ + ∥ M^{- 1} (Λ^{'} (z_{0}) - M) ∥] \\ \leq & \frac{w_{0} (∥ x_{0} - x^{*} ∥) + w_{0} () ∥ z_{0} - x^{*} ∥)}{2 λ (1 - w_{0} (∥ x_{0} - x^{*} ∥))} \\ \leq & {\tilde{w}}_{0} . \end{matrix}

(21)

Note, that the iterate

x_{1}

is well-defined by the third substep of method (5), since

Λ^{'} {(x_{0})}^{- 1} \in L (Y, X) .

Moreover, we can write as follows:

\begin{matrix} x_{1} - x^{*} & = & y_{0} - x^{*} + \frac{1}{2 λ} Λ^{'} {(x_{0})}^{- 1} (Λ^{'} (x_{0}) - Λ^{'} (z_{0})) (y_{0} - x_{0}) \\ = & [I + \frac{1}{2 λ} Λ^{'} {(x_{0})}^{- 1} (Λ^{'} (x_{0}) - Λ^{'} (z_{0}))] (y_{0} - x^{*}) \\ - \frac{1}{2 λ} Λ^{'} {(x_{0})}^{- 1} (Λ^{'} (x_{0}) - Λ^{'} (z_{0}))] (x_{0} - x^{*}) . \end{matrix}

(22)

Then, by using (7) and (9) (for

j = 3

), Equation (15) (for

u = x_{0}

), and (17)–(22), we obtain in turn that:

\begin{matrix} ∥ x_{1} - x^{*} ∥ & \leq & (1 + {\tilde{w}}_{0}) ∥ y_{0} - x^{*} ∥ + {\tilde{w}}_{0} ∥ x_{0} - x^{*} ∥ \\ \leq & h_{3} (∥ x_{0} - x^{*} ∥) ∥ x_{0} - x^{*} ∥ \leq ∥ x_{0} - x^{*} ∥ . \end{matrix}

(23)

Therefore, the iterate

x_{1} \in S (x^{*}, r)

and the assertion (13) hold if

n = 0 .

Simply exchange

x_{0}, y_{0}, z_{0}, x_{1}

by

x_{i}, y_{i}, z_{i}, x_{i + 1}; i

a natural number in the preceding calculations to terminate the induction for items (10)–(13). Furthermore, from the estimate

∥ x_{i + 1} - x^{*} ∥ \leq c ∥ x_{i} - x^{*} ∥ \leq c^{i + 1} ∥ x_{0} - x^{*} ∥ < r,

(24)

we deduce that the iterate

x_{i + 1} \in S (x^{*}, r)

and

{lim}_{i ⟶ + \infty} x_{i} = x^{*} .

□

Next, a domain is specified with only one solution of the equation

Λ (x) = 0 .

Proposition 1.

Suppose that there exists

ρ > 0

such that condition (H5) holds in the ball

S (x^{*}, ρ)

and

ρ_{1} \geq ρ

such that:

\int_{0}^{1} w_{0} (θ ρ_{1}) d θ < 1 .

(25)

Set

D_{2} = D \cap S [x^{*}, ρ_{1}] .

Then,

x^{*}

is the unique solution of the equation

Λ (x) = 0

in the region

D_{2} .

Proof.

Suppose that there exists a solution

u^{*} \in D_{2}

of the equation

Λ (x) = 0

with

u^{*} \neq x^{*} .

Define the linear operator

M_{1} = \int_{0}^{1} Λ^{'} (x^{*} + θ (u^{*} - x^{*})) d θ .

Then, it follows by (H5) and (25) that:

M^{- 1} (M_{1} - M) ∥ \leq \int_{0}^{1} w_{0} (θ ∥ u^{*} - x^{*} ∥) d θ \leq \int_{0}^{1} w_{0} (θ ρ_{1}) d θ < 1 .

Hence,

M_{1}^{- 1} \in L (Y, X) .

Then, from the identity

u^{*} - x^{*} = M_{1}^{- 1} (Λ (u^{*}) - Λ (x^{*})) = M_{1}^{- 1} (0) = 0,

we conclude that

u^{*} = x^{*} .

□

Remark 2.

If all criteria (H1)–(H7) hold, then we can take

ρ = r

in Proposition 1.

3. Semi-Local Analysis of Method (5)

The analysis is similar to the one of Section 2. However

x^{*}, w_{0}

, and w are exchanged by

x_{0}, v_{0}

, and

v,

respectively. Suppose the following:

(C1): There exists a CNDF $v_{0} : T ⟶ T$ such that the equation $v_{0} (t) - 1 = 0$ has an SPS. Denote such a solution by $ρ_{2} .$ Set $T_{2} = [0, ρ_{2}] .$
(C2): There exists a CNDF $v : T_{2} ⟶ T .$
Define the sequence ${α_{n}}$ for $α_{0} = 0,$ some $β_{0} \in [0, ρ_{2}]$ , and all $n = 0, 1, 2, \dots$ by the following:

$\begin{matrix} γ_{n} & = & β_{n} + (1 - λ) (β_{n} - α_{n}), \\ \bar{v} & = & \{\begin{matrix} v (γ_{n} - α_{n}) \\ o r \\ v_{0} (α_{n}) + v_{0} (γ_{n}), \end{matrix} \\ α_{n + 1} & = & β_{n} + \frac{{\bar{v}}_{n} (β_{n} - α_{n})}{2 λ (1 - v_{0} (α_{n}))} + (1 - λ) (β_{n} - α_{n}), \\ δ_{n + 1} & = & \int_{0}^{1} v ((1 - θ)) (α_{n + 1} - α_{n}) d θ (α_{n + 1} - α_{n}) \\ + (1 + v_{0} (α_{n})) (α_{n + 1} - β_{n}) \end{matrix}$

(26)

and:

$β_{n + 1} = α_{n + 1} + \frac{δ_{n + 1}}{1 - v_{0} (α_{n + 1})} .$

The scalar sequence ${α_{n}}$ is shown to be majorizing for ${x_{n}}$ in Theorem 2. However, let us present a convergence criterion for it.
(C3): There exists $ρ_{3} \in [0, ρ_{2}]$ such that for all $n = 0, 1, 2, \dots$ ,

$v_{0} (α_{n}) < 1 a n d α_{n} \leq ρ_{3} .$

It follows by this condition and (26) that for all $n = 0, 1, 2, \dots$ ,

$0 \leq α_{n} \leq β_{n} \leq γ_{n} \leq α_{n + 1} \leq ρ_{3},$

and there exists $α^{*} \in [0, ρ_{3}]$ such that ${lim}_{n ⟶ + \infty} α_{n} = α^{*} .$ The limit point $α^{*}$ is the unique least upper bound of the sequence ${α_{n}} .$ Notice that if the function $v_{0}$ is strictly increasing, and then we can take $ρ_{3} = v_{0}^{- 1} (1) .$ As in the local analysis, the real functions $v_{0}$ and v relate to the operators on method (5).
(C4): There exist $x_{0} \in D$ and an invertible operator $M \in L (X, Y)$ such that for all $u \in D$

$∥ M^{- 1} (Λ^{'} (u) - M) ∥ \leq v_{0} (∥ u - x_{0} ∥) .$

Note, that $∥ M^{- 1} (Λ^{'} (u) - M) ∥ \leq v_{0} (0) < 1 .$ Hene, $Λ^{'} {(x_{0})}^{- 1} \in L (Y, X),$ and we can set $β_{0} \geq ∥ Λ^{'} {(x_{0})}^{- 1} Λ (x_{0}) ∥ .$ Set $D_{3} = D \cap S (x_{0}, ρ_{2}) .$
(C5): $∥ M^{- 1} (Λ^{'} (u_{2}) - Λ^{'} (u_{1})) ∥ \leq v (∥ u_{2} - u_{1} ∥) .$ for all $u_{1}, u_{2} \in D_{3}$
and
(C6): $S [x_{0}, α^{*}] \subset D .$

Remark 3.

Possible selections for M are

M = 1

or

M = F^{'} (x_{0}) .

Other selections are also possible as long as criteria (C4) and (C5) are satisfied. The semi-local analysis of convergence uses criteria (C1)–(C6) and the developed notation.

Theorem 2.

Suppose that criteria (C1)–(C6) hold. Then, the following assertions hold for method (5):

{x_{n}} \subset S (x_{0}, α^{*}),

(27)

∥ y_{n} - x_{n} ∥ \leq β_{n} - α_{n},

(28)

∥ z_{n} - x_{n} ∥ \leq γ_{n} - β_{n},

(29)

∥ x_{n + 1} - y_{n} ∥ \leq α_{n + 1} - β_{n}

(30)

and there exists a solution

x^{*} \in S [x_{0}, α^{*}]

of the equation

Λ (x) = 0

such that

∥ x^{*} - x_{n} ∥ \leq α^{*} - α_{n} .

(31)

Proof.

Induction on n is used to show assertions (27)–(29). Clearly, assertion (27) holds if

n = 0 .

We also have by (26), method (5), and the definition of

β_{0}

that

∥ y_{0} - x_{0} ∥ = ∥ Λ^{'} {(x_{0})}^{- 1} Λ (x_{0}) ∥ \leq β_{0} - α_{0} < α^{*} .

Thus, assertion (28) holds and the iterate

y_{0} \in S (x_{0}, α^{*}) .

By subtracting the first from the second substep of method (5), we obtain the following:

\begin{matrix} z_{i} - y_{i} & = & λ (y_{i} - x_{i}) + Λ^{'} {(x_{i})}^{- 1} Λ (x_{i}) \\ = & λ (y_{i} - x_{i}) - (y_{i} - x_{i}) \\ = & - (1 - λ) (y_{i} - x_{i}), \end{matrix}

so

∥ z_{i} - y_{i} ∥ \leq (1 - λ) ∥ y_{i} - x_{i} ∥ \leq (1 - λ) (β_{i} - α_{i}) = γ_{i} - β_{i}

and

\begin{matrix} ∥ z_{i} - x_{0} ∥ & \leq & ∥ z_{i} - y_{i} ∥ + ∥ y_{i} - x_{0} ∥ \\ \leq & β_{i} - β_{i} + β_{i} - α_{0} = γ_{i} \leq α^{*} . \end{matrix}

Thus, (29) holds and the iterate

z_{i} \in S (x_{0}, α^{*}) .

Then, by subtracting the second from the third substep, we obtain the following:

\begin{matrix} x_{i + 1} - z_{i} & = & (1 - λ) (y_{i} - x_{i}) \\ + \frac{1}{2 i} Λ^{'} {(x_{i})}^{- 1} (Λ^{'} (x_{i}) - F^{'} (z_{i})) \end{matrix}

leading to

\begin{matrix} ∥ x_{i + 1} - z_{i} ∥ & \leq & \frac{{\bar{v}}_{i} ∥ y_{i} - x_{i} ∥}{2 λ (1 - v_{0} (α_{i}))} + (1 - λ) ∥ y_{i} - x_{i} ∥ \\ \leq & \frac{{\bar{v}}_{i} (β_{i} - α_{i})}{2 λ (1 - v_{0} (α_{i}))} + (1 - λ) (β_{i} - α_{i}) \\ = & α_{i + 1} - γ_{i} \end{matrix}

and

\begin{matrix} ∥ x_{i + 1} - x_{0} ∥ & \leq & ∥ x_{i + 1} - z_{i} ∥ + ∥ z_{i} - x_{0} ∥ \\ \leq & α_{i + 1} - γ_{i} + γ_{i} - α_{0} \\ = & α_{i + 1} < α^{*} . \end{matrix}

Hence, assertion (30) holds and the iterate

x_{i + 1} \in S (x_{0}, α^{*}) .

Then, we can write by the first substep of method (5) the following Ostrowski-type [16] representation:

\begin{matrix} Λ (x_{i + 1}) & = & Λ (x_{i + 1}) - Λ (x_{i}) - Λ^{'} (x_{i}) (x_{i + 1} - y_{i}) \\ = & Λ (x_{i + 1}) - Λ (x_{i}) - Λ^{'} (x_{i}) (x_{i + 1} - x_{i}) + Λ^{'} (x_{i}) (x_{i + 1} - y_{i}) \\ = & \int_{0}^{1} [Λ^{'} (x_{i} + θ (x_{i + 1} - x_{i})) - Λ^{'} (x_{i})] d θ (x_{i + 1} - x_{i}) \\ + Λ^{'} (x_{i}) (x_{i + 1} - y_{i}) \end{matrix}

leading to

\begin{matrix} ∥ M^{- 1} Λ (x_{i + 1}) ∥ & \leq & \int_{0}^{1} v ((1 - θ) ∥ x_{i + 1} - x_{i} ∥) d θ ∥ x_{i + 1} - x_{i} ∥ \\ + ∥ M^{- 1} (Λ^{'} (x_{i}) - M + M) ∥ ∥ x_{i + 1} - x_{i} ∥ \\ \leq & \int_{0}^{1} v ((1 - θ) (α_{i + 1} - α_{i})) d θ (α_{i + 1} - α_{i}) \\ + (1 + v_{0} (α_{i})) (α_{i + 1} - β_{i}) \end{matrix}

(32)

\begin{matrix} = & δ_{i + 1} . \end{matrix}

(33)

Consequently, we obtain the following:

\begin{matrix} ∥ y_{i + 1} - x_{i + 1} ∥ & \leq & ∥ Λ^{'} {(x_{i + 1})}^{- 1} M ∥ ∥ M^{- 1} Λ (x_{i + 1}) ∥ \\ \leq & \frac{δ_{i + 1}}{1 - v_{0} (∥ x_{i + 1} - x_{0} ∥)} \\ \leq & \frac{δ_{i + 1}}{1 - v_{0} (α_{i + 1})} = β_{i + 1} - α_{i + 1} \end{matrix}

and

\begin{matrix} ∥ y_{i + 1} - x_{0} ∥ & \leq & ∥ y_{i + 1} - x_{i + 1} ∥ + ∥ x_{i + 1} - x_{0} ∥ \\ \leq & β_{i + 1} - α_{i + 1} + α_{i + 1} - α_{0} = β_{i + 1} < α^{*} . \end{matrix}

Thus, the induction for assertions (27)–(30) is complete and all the iterates

{x_{i}}

belong in the ball

S (x_{0}, α^{*}) .

We also have the following:

∥ x_{i + 1} - x_{i} ∥ \leq α_{i + 1} - α_{i} .

(34)

However, the sequence

{α_{i}}

is Cauchy as convergent to

α^{*} .

Therefore, by (34), the sequence

{x_{i}}

is also Cauchy in the Banach space X and as such, it is convergent to some

x^{*} \in S [x_{0}, α^{*}] .

Take

i ⟶ + \infty

and use the continuity of

Λ

in (34) to obtain

Λ (x^{*}) = 0 .

Furthermore, by the estimate (34) for

j = 1, 2, \dots

and the triangle inequality, we have the following:

∥ x_{i + j} - x_{i} ∥ \leq α_{i + J} - α_{i} .

(35)

Finally, by letting

j ⟶ + \infty

in (35), we show assertion (31). □

Next, a domain is determined with only one solution of the equation

Λ (x) = 0 .

Proposition 2.

Suppose there exists a solution

u^{*} \in S (x_{0}, ρ_{4})

for some

ρ_{4} > 0,

criterion (C4) holds in the ball

S (x_{0}, ρ_{4})

, and there exists

ρ_{5} \geq ρ_{4}

such that

\int_{0}^{1} v_{0} (θ ρ_{4} + (1 - θ) ρ_{5}) d θ < 1 .

(36)

Set

D_{3} = D \cap S [x_{0}, ρ_{5}] .

Then, the only solution of the equation

Λ (x) = 0

in the domain

D_{3}

is

u^{*} .

Proof.

Let

z^{*} \in D_{3}

be a solution of the equation

Λ (x) = 0

such that

z^{*} \neq u^{*} .

Define the linear operator

M_{2} = \int_{0}^{1} Λ^{'} (u^{*} + θ (z^{*} - u^{*})) d θ .

Using condition (C4) and (36), we obtain in turn that

\begin{matrix} ∥ M^{- 1} (M_{2} - M) ∥ & \leq & \int_{0}^{1} v_{0} (θ ∥ u^{*} - x_{0} ∥ + (1 - θ) ∥ z^{*} - x_{0} ∥) d θ \\ \leq & \int_{0}^{1} v_{0} (θ ρ_{4} + (1 - θ) ρ_{5}) d θ < 1 . \end{matrix}

Thus,

M_{2}^{- 1} \in L (Y, X) .

Finally, from the identity

z^{*} - u^{*} = M_{2}^{- 1} (Λ (z^{*}) - Λ (u^{*})) = M_{2}^{- 1} (0) = 0,

we deduce that

z^{*} = u^{*} .

□

Remark 4.

(i): The limit point $α^{*}$ can be exchanged by $ρ_{2}$ in condition (C6).
(ii): if all conditions (C1)–(C6) hold, then one can take $ρ_{4} = α^{*}$ and $u^{*} = x^{*}$ in Proposition 2.

4. Local Analysis of Method (6)

The analysis is analogous to the one of the Section 2. However, there are some differences. Suppose:

(H1)’ = (H1).
(H2)’ There exists $δ \in [0, 1)$ and a CNDF $w : T_{0} ⟶ T$ such that for the function $g_{1} : T_{0} ⟶ T$ defined by the following:

$g_{1} (t) = \frac{\int_{0}^{1} w ((1 - θ) t) d θ}{1 - w_{0} (t)} + \frac{δ^{m + 1}}{1 - δ} (1 + \int_{0}^{1} w_{0} (θ t) d θ),$

the equation $g_{1} (t) - 1 = 0$ has an SPS in the interval $(0, ρ_{0}) .$ Denote such a solution by $r_{1}^{'} .$
(H3)’ For the function $g_{2} : T_{0} ⟶ T$ defined by the following:

$g_{2} (t) = [(1 - λ) + λ g_{1} (t)] t,$

the equation $g_{2} (t) - 1 = 0$ has an SPS in the interval $(0, ρ_{0}) .$ Denote such a solution by $r_{2}^{'} .$

Define the function

g_{3} : T_{0} ⟶ T

by the following:

g_{3} (t) = ((1 + d (t))) g_{1} (t) + d (t) t,

where

d (t) = \frac{1}{2 λ} \frac{1 - δ^{k + 1}}{1 - δ} \bar{w} (t)

and the function

\bar{w}

is as defined above in condition (H4).

(H4)’ The equation $g_{3} (t) - 1 = 0$ has an SPS. Denote such a solution by $r_{3}^{'} .$ Set

$r^{'} = min {r_{j}^{'}}, j = 1, 2, 3$

(37)

and

$T_{3} = [0, r^{'}] .$

It follows by these definitions that for all $t \in T_{3}$ :

$0 \leq w_{0} (t) < 1$

(38)

and

$0 \leq g_{j} (t) < 1 .$

(39)
(H5)’ There exist a solution $x^{*} \in D$ of the equation $Λ (x) = 0$ and an invertible operator $M \in L (X, Y)$ such that for all $u \in D$ :

$∥ M^{- 1} (Λ^{'} (u) - M) ∥ \leq v_{0} (∥ u - x^{*} ∥) .$

Set $D_{4} = D \cap S [x^{*}, r^{'}] .$
(H6)’ $∥ M^{- 1} (Λ^{'} (u_{2}) - Λ^{'} (u_{1})) ∥ \leq v (∥ u_{2} - u_{1} ∥)$ for all $u_{1}, u_{2} \in D_{4} .$
(H7)’ $S [x^{*}, r^{'}] \subset D$ and set $δ \geq ∥ M^{- 1} (M - Λ^{'} (x)) ∥$ for all $x \in D_{4} .$

We have the following estimate:

∥ B ∥ \leq ∥ I ∥ + ∥ Δ ∥ + \dots + {∥ Δ ∥}^{k} \leq 1 + δ + \dots + δ^{k} = \frac{1 - δ^{k + 1}}{1 - δ} .

(40)

Theorem 3.

Suppose that criteria (H1)’–(H2)’ hold. Then, the conclusions of Theorem 1 hold for method (6), provided that

r, h_{j}

are replaced by

r^{'},

and

g_{j},

respectively.

Proof.

The computations are as in the proof of Theorem 1. Hence, we only stretch the differences. We can write by the first substep of method (6).

\begin{matrix} y_{i} - x^{*} & = & x_{i} - x^{*} - Λ^{'} {(x_{i})}^{- 1} Λ (x_{i}) \end{matrix}

\begin{matrix} + (Λ^{'} {(x_{i})}^{- 1} - B M^{- 1}) Λ (x_{i}) \end{matrix}

(41)

\begin{matrix} = & x_{i} - x^{*} - Λ^{'} {(x_{i})}^{- 1} Λ (x_{i}) + (B_{\infty} - B) M^{- 1} Λ (x_{i}) . \end{matrix}

(42)

The following estimates are needed in turn:

\begin{matrix} ∥ B_{\infty} - B ∥ & = & ∥ Δ^{k + 1} + Δ^{k + 2} + \dots ∥ \\ \leq & δ^{k + 1} + δ^{k + 2} + \dots = δ^{k + 1} (1 + δ + \dots) = \frac{δ^{k + 1}}{1 - δ} \end{matrix}

(43)

and

Λ (x_{i}) = Λ (x_{i}) - Λ (x^{*}) = \int_{0}^{1} Λ^{'} (x^{*} + θ (x_{i} - x^{*}) d θ (x_{i} - x^{*}),

leading to

\begin{matrix} ∥ M^{- 1} Λ (x_{i}) ∥ & \leq & ∥ M^{- 1} [\int_{0}^{1} [Λ^{'} (x^{*} + θ (x_{i} - x^{*}) - M] d θ + M] (x_{i} - x^{*}) ∥ \\ \leq & (1 + \int_{0}^{1} w_{0} (θ ∥ x_{i} - x^{*} ∥) d θ) ∥ x_{i} - x^{*} ∥ . \end{matrix}

(44)

Using (43) and (44), we obtain the following:

\begin{matrix} ∥ y_{i} - x^{*} ∥ & \leq & [\frac{\int_{0}^{1} w ((1 - θ) ∥ x_{i} - x^{*} ∥) d θ}{1 - w_{0} (∥ x_{i} - x^{*} ∥)} \\ + \frac{δ^{k + 1}}{1 - δ} (1 + \int_{0}^{1} w_{0} (θ ∥ x_{i} - x^{*} ∥) d θ)] ∥ x_{i} - x^{*} ∥ \\ \leq & g_{1} (∥ x_{i} - x^{*} ∥) ∥ x_{i} - x^{*} ∥ \leq ∥ x_{i} - x^{*} ∥ < r^{'} . \end{matrix}

Then, by the second substep, we have the following:

\begin{matrix} ∥ z_{i} - x^{*} ∥ & \leq & (1 - λ) ∥ x_{i} - x^{*} ∥ + λ ∥ y_{i} - x^{*} ∥ \\ \leq & g_{2} (∥ x_{i} - x^{*} ∥) ∥ x_{i} - x^{*} ∥ \leq ∥ x_{i} - x^{*} ∥ . \end{matrix}

Moreover, by the third substep of method (6), we obtain in turn that:

\begin{matrix} x_{i + 1} - x^{*} & = & [I + \frac{1}{2 λ} B M^{- 1} (Λ^{'} (x_{i}) - Λ^{'} (z_{i}))] (y_{i} - x_{i}) \\ + \frac{1}{2 λ} B M^{- 1} (Λ^{'} (x_{i}) - Λ^{'} (z_{i})) (x_{i} - x^{*}) . \end{matrix}

(45)

But by (40) and (45), we obtain in turn that:

\begin{matrix} ∥ x_{i + 1} - x^{*} ∥ & \leq & (1 + d_{i}) ∥ y_{i} - x^{*} ∥ + d_{i} ∥ x_{i} - x^{*} ∥ \\ \leq & g_{3} (∥ x_{i} - x^{*} ∥) ∥ x_{i} - x^{*} ∥ \leq ∥ x_{i} - x^{*} ∥, \end{matrix}

where we also used

∥ \frac{1}{2 λ} B M^{- 1} (Λ^{'} (x_{i}) - Λ^{'} (x_{i})) ∥ \leq \frac{1}{2 λ} (1 + δ + \dots + δ^{k}) {\bar{w}}_{i} \leq d_{i} .

The rest of the proof is given in Theorem 1. □

The uniqueness of the solution domain can be found in Proposition 1.

5. Semi-Local Analysis for Method (6)

The analysis is similar to the one given in Section 3. Suppose:

(C1)’ = (C1).
(C2)’ There exists a CNDF $v : T_{2} ⟶ T .$ Define the sequence ${a_{n}}$ for $a_{0} = 0,$ some $b_{0} \in [0, ρ_{2})$ , and all $n = 0, 1, 2, \dots$ by the following:

$c_{n} = b_{n} + (1 - λ) (b_{n} - a_{n}),$

${\bar{μ}}_{n} = \{\begin{matrix} v (c_{n} - a_{n}) \\ o r \\ v_{0} (a_{n}) + v_{0} (c_{n}) \end{matrix},$

(46)

$a_{n + 1} = b_{n} + e_{n},$

$e_{n} = \frac{1}{2 λ} \frac{1 - δ^{k + 1}}{1 - δ} {\bar{μ}}_{n}$

$b = δ \frac{1 - δ^{k}}{1 - δ}, \bar{b} = \frac{1}{1 - b},$

$q_{n + 1} = (1 + \int_{0}^{1} v_{0} (a_{n} + θ (a_{n + 1} - a_{n})) d θ) (a_{n + 1} - a_{n}) + \bar{b} (b_{n} - a_{n}),$

and

$b_{n + 1} = a_{n = 1} + \frac{1 - δ^{k + 1}}{1 - δ} q_{n + 1} .$

A convergence criterion is needed for the sequence ${a_{n}} .$
(C3)’ There exists $ρ_{6} \in [0, ρ_{2})$ such that for all $n = 0, 1, 2, \dots$ $δ < \frac{1}{2}$ and $a_{n} \leq ρ_{6} .$ It follows by this criterion and (46) that for all $n = 0, 1, 2, \dots$ , $0 \leq a_{n} \leq b_{n} \leq c_{n} \leq a_{n + 1},$ and there exists $a^{*} \in [0, ρ_{6}]$ such that ${lim}_{n ⟶ + \infty} a_{n} = a^{*} .$
(C4)’ There exist $x_{0} \in D$ and an invertible operator $M \in L (X, Y)$ such that for all $u \in D$

$∥ M^{- 1} (Λ^{'} (u) - M) ∥ \leq v_{0} (∥ u - x_{0} ∥) .$

Set $D_{5} = D \cap S (x_{0}, ρ_{2}) .$
(C5)’ $∥ M^{- 1} (Λ^{'} (u_{2}) - Λ^{'} (u_{1})) ∥ \leq v (∥ u_{2} - u_{1} ∥)$ for all $u_{1}, u_{2} \in D_{5} .$
and
(C6)’ $S [x_{0}, a^{*}] \subset D,$ where $δ \geq ∥ M^{- 1} (Λ^{'} (u) - M) ∥$ for all $u \in D_{5} .$

Note, that we have the estimate

\begin{matrix} ∥ I - B ∥ & \leq & ∥ Δ ∥ + \dots + {∥ Δ ∥}^{k} \leq δ + \dots + δ^{k} \\ = & δ \frac{1 - δ^{k}}{1 - δ} = b < 1, \end{matrix}

(47)

since

δ \in [0, \frac{1}{2}) .

Hence,

B^{- 1} \in L (Y, X)

and

∥ B^{- 1} \leq \frac{1}{1 - b} = \bar{b} .

(48)

Theorem 4.

Suppose that criteria (C1)’–(C6)’ hold. Then, the conclusions of Theorem 2 holds for method (6) provided that

α^{*}

and

{α_{n}}

are replaced by

a^{*}

and

{a_{n}},

respectively.

Proof.

It follows as in Theorem 2 and the estimates

\begin{matrix} z_{i} - y_{i} & = & λ (y_{i} - x_{i}) + B M^{- 1} Λ (x_{n}) \\ = & λ (y_{i} - x_{i}) - (y_{+} i - x_{i}) \\ = & (λ - 1) (y_{i} - x_{i}), \end{matrix}

thus,

∥ z_{i} - y_{i} ∥ \leq (1 - λ) ∥ y_{i} - x_{i} ∥ \leq (1 - λ) (b_{i} - a_{i}),

and by (47),

∥ x_{i + 1} - y_{i} \leq e_{i} = a_{i + 1} - b_{i} .

Moreover, we can write again the Ostrowski [16] representation for

Λ (x_{i +})

as

\begin{matrix} Λ (x_{i + 1}) & = & Λ (x_{i + 1}) - Λ (x_{i}) - M B^{- 1} (y_{i} - x_{i}), \\ = & \int_{0}^{1} Λ^{'} (x_{i} + θ (x_{i + 1} - x_{i})) d θ (x_{i + 1} - x_{i}) \\ - M B^{- 1} (y_{i} - x_{i}) . \end{matrix}

(49)

Hence, by (48) and (49), we obtain the following:

\begin{matrix} ∥ M^{- 1} Λ (x_{i +}) ∥ & \leq & (1 + \int_{0}^{1} v_{0} (∥ x_{i} - x_{0} ∥ + θ ∥ x_{i + 1} - x_{i} ∥) d θ) ∥ x_{i + 1} - x_{i} ∥ \\ + ∥ B^{- 1} ∥ ∥ y_{i} - x_{i} ∥ \\ \leq & (1 + \int_{0}^{1} v_{0} (a_{n} + θ (a_{n + 1} - a_{n})) d θ) (a_{n + 1} - a_{n}) \\ + \bar{b} (b_{i} - a_{i}) = q_{i + 1} . \end{matrix}

Therefore, we obtain by the first substep of method (6) in turn that:

\begin{matrix} ∥ y_{i + 1} - x_{+ 1} ∥ & \leq & ∥ B M^{- 1} Λ (x_{i +}) ∥ \\ \leq & ∥ B ∥ ∥ y_{i + 1} - x_{i + 1} ∥ \\ \leq & {(1 + ∥ Δ ∥ + \dots + ∥ Δ ∥}^{k}) q_{i + 1} \\ \leq & (1 + δ + \dots + δ^{k}) q_{i + 1} \\ = & \frac{(1 - δ^{k + 1}) q_{i + 1}}{1 - δ} = b_{i + 1} - a_{i + 1} . \end{matrix}

The rest follows as in the proof of Theorem 2. □

The uniqueness part and the comments are given in Proposition 2 and Remark 4.

6. Numerical Examples

In the following example, we consider method (6) for 6

M = I

, which remains independent of both

x_{0}

. Additionally, they are compared with method (5), where

M = Λ^{'} (x_{0})

and

λ = \frac{1}{2} .

Example 1.

The solution is sought for the nonlinear system

\begin{matrix} f_{1} (x, y) & = & x - 0.1 sin x - 0.3 cos y + 0.4 \\ f_{2} (x, y) & = & y - 0.2 cos x + 0.1 sin y + 0.3 \end{matrix}

Let

Λ = (f_{1}, f_{2}) .

Then, the system becomes

\begin{matrix} Λ (s) = 0 f o r s = {(θ_{1}, θ_{2})}^{T} . \end{matrix}

Then:

Λ^{'} ((x, y)) = [\begin{matrix} 1 - 0.1 cos (x) & 0.3 sin (y) \\ 0.2 sin (x) & 0.1 cos (y) + 1 \end{matrix}] .

Method (6),

k = 1

,

M = I

\begin{matrix} B_{1} (x) = I + (I - Λ^{'} (x)), \\ p_{1} (x) = x - (I + (I - Λ^{'} (x))) Λ (x), \\ x_{n + 1} = p_{1} (x_{n}) . \end{matrix}

(50)

Method (6),

k = 2

,

M = I

\begin{matrix} B_{2} (x) = I + (I - Λ^{'} (x)) + {(I - Λ^{'} (x))}^{2}, \\ p_{2} (x) = x - B_{2} (x) Λ (x), \\ x_{n + 1} = p_{2} (x_{n}) . \end{matrix}

(51)

Method (6),

k = 3

,

M = I

\begin{matrix} B_{3} (x) = I + (I - Λ^{'} (x)) + {(I - Λ^{'} (x))}^{2} + {(I - Λ^{'} (x))}^{3}, \\ p_{3} (x) = x - B_{3} (x) Λ (x), \\ x_{n + 1} = p_{3} (x_{n}) . \end{matrix}

(52)

Method (6),

k = 4

,

M = I

\begin{matrix} B_{4} (x) = I + (I - Λ^{'} (x)) + {(I - Λ^{'} (x))}^{2} + {(I - Λ^{'} (x))}^{3} + {(I - Λ^{'} (x))}^{4}, \\ p_{4} (x) = x - B_{4} (x) Λ (x), \\ x_{n + 1} = p_{4} (x_{n}) . \end{matrix}

(53)

Method (6),

k = 5

,

M = I

\begin{matrix} B_{5} (x) = I + (I - Λ^{'} (x)) + {(I - Λ^{'} (x))}^{2} + {(I - Λ^{'} (x))}^{3} + {(I - Λ^{'} (x))}^{4} + {(I - Λ^{'} (x))}^{5}, \\ p_{5} (x) = x - B_{5} (x) Λ (x), \\ x_{n + 1} = p_{5} (x_{n}) . \end{matrix}

(54)

Method (6),

k = \bar{1, 5}

,

M = Λ^{'} (x_{0})

\begin{matrix} x_{n + 1} = x_{n} - B M^{- 1} Λ (x_{n}), \\ A = M^{- 1} (M - Λ^{'} (x)), \\ B = I + \sum_{i = 1}^{k} A^{i} . \end{matrix}

(55)

Thus, the comparison shows that the behavior of method (6) is essentially the same as method (5). However, the iterates of method (6) are cheaper to obtain than (5). As observed in Table 1, Table 2, Table 3 and Table 4, the number of iterations required for the proposed methods with k ranging from 3 to 5 closely aligns with those of method (5).

Table 1. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)) with initial guess

x_{0} = (1, 1)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2593 < 1

.

Table 2. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)) with initial guess

x_{0} = (0, 0)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.1 < 1

.

Table 3. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)), where

x_{0} = (- 15, - 15)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2120 < 1

.

Table 4. The number of iterations needed to achieve tolerance

ε = 10^{- 12}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 12}

)), where

x_{0} = (- 15, - 15)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2120 < 1

.

The approximated solution is

s^{a p p} = (- 0.1124965854417, - 0.0920701967370)

with the function value error

∥ Λ (s^{a p p}) ∥ = 0.3925 \times 10^{- 13}

, which is obtained using the initial point

x_{0} = (- 15, - 15)

with the error tolerance

ε = 10^{- 12}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 12}

)). It is observed that with the same tolerance level, one can obtain the mentioned approximated solution for other initial points considered in Table 1 and Table 2.

Table 5 and Figure 1 show the results of calculations used to determine the Computational Order of Convergence (COC) and the Approximated Computational Order of Convergence (ACOC), aiming to compare the convergence order of method (6) with the convergence order of method (5).

Table 5. The computational order of convergence and the approximated computational order of convergence, where

x_{0} = (- 15, - 15)

,

ε = 10^{- 12}

.

Figure 1. COC and ACOC using

x_{0} = (- 15, - 15)

,

ε = 10^{- 12}

.

Table 5 and Figure 1 demonstrate that the convergence of the proposed methods closely corresponds to the convergence of Newton’s method, particularly for values of k ranging from 4 to 5 with the convergence order closely approximating 2. Furthermore, Figure 2 shows a quick comparison of Table 1, Table 2 and Table 3.

Figure 2. Number of iterations needed to achieve tolerance

ε = 10^{- 9}

.

Example 2.

Let

X = Y = R^{3}

and

D = S [x^{*}, 1] .

We study the motion of a particle moving in three dimensions that has started from rest. The mapping Λ is defined on D for

a = {(a_{1}, a_{2}, a_{3})}^{T} \in R^{3}

as:

\begin{matrix} Λ (a) = {(a_{1}, e^{a_{2}} - 1, \frac{e - 1}{2} a_{3}^{2} + a_{3})}^{T} . \end{matrix}

(56)

Then, the definition of the derivative according to Fréchet [2,10,13,18,19,20] is given for the mapping Λ:

Λ^{'} (a) = [\begin{matrix} 1 & 0 & 0 \\ 0 & e^{a_{2}} & 0 \\ 0 & 0 & (e - 1) a_{3} + 1 \end{matrix}] .

The point

x^{*} = {(0, 0, 0)}^{T}

solves the equation

Λ (a) = 0 .

Moreover,

Λ^{'} (x^{*}) = I .

The conditions of Theorem 1 hold provided that

w_{0} (t) = (e - 1) t, w (t) = e^{\frac{1}{e - 1}} t, ρ_{0} = \frac{1}{e - 1}, k = 1, λ = \frac{1}{2}

and

δ = (e - 1) t .

Then, by (37), we have

r_{1} = 0.231165, r_{2} = 0.393879, r_{3} = 0.131383 = r^{'} .

Example 3.

Let

K [0, 1]

stand as the space of continuous functions mapping the interval

[0, 1]

into the real number system. Let

X = Y = K [0, 1]

and

D = S [x^{*}, 1]

with

x^{*} (ξ) = 0 .

The operator Λ is defined on

K [0, 1]

as:

Λ (z) (ξ) = z (ξ) - 6 \int_{0}^{1} ξ z {(τ)}^{3} d τ .

Nonlinear integral equations of the form

Λ (z) (ξ) = 0

are of Hemmerstein-type and are used to study the motion problems [5,12,18,19]. Here, the definition of the derivative according to Fréchet gives for the function Λ:

Λ^{'} (z (w)) (ξ) = w (ξ) - 18 \int_{0}^{1} ξ τ z {(τ)}^{2} w (τ) d τ

for each

w \in K [0, 1] .

Therefore, the conditions are validated, since for

x^{*} = 0, M = I,

Λ^{'} (x^{*} (ξ)) = I

, provided that

w_{0} (t) = 18 t, w (t) = 36 t, δ = 18 t, λ = \frac{1}{2}, ρ_{0} = \frac{1}{18} .

Then, we obtain by (37) that

r_{1} = 0.016615, r_{2} = 0.051081, r_{3} = 0.009570 = r^{'} .

7. Concluding Remarks

The Chebyshev method is of R-order three. However, it is computationally expensive because it requires the evaluation of the second derivative of the operator involved at each step of the iteration. By replacing the second derivative in terms of first derivatives, new methods are derived of R-order three. However, the implementation still remains for the new methods, since the inverse of a certain linear operator also needs to be computed. That is why we replace the inverse by a finite sum of linear operators converging to that inverse. The local as well as the semi-local analysis of convergence is studied, relying on the concept of related conditions that control the derivative and majorizing sequences. The sequence generated by the new HCTM are easier to find and essentially converge to the solution as fast as the optimized Chebysehev methods. This fact is also demonstrated by numerical examples. Due to its generality, the methodology of this article can also be applied to other methods with inverses [11,16,19,20,21,22,23]. This is the direction of our future work.

Author Contributions

Conceptualization, I.K.A. and S.G.; Algorithm, I.K.A. and S.G.; methodology, I.K.A. and S.G.; software, I.K.A. and S.G.; validation, I.K.A. and S.G.; formal analysis, I.K.A. and S.G.; investigation, I.K.A. and S.G.; resources, I.K.A. and S.G.; data curation, I.K.A. and S.G.; writing—original draft preparation, I.K.A. and S.G.; writing—review and editing, I.K.A. and S.G.; visualization, I.K.A. and S.G.; supervision, I.K.A. and S.G.; project administration, I.K.A. and S.G. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Acknowledgments

We would like to thank Muniyasamy M, from the Department of Mathematical and computational Sciences, the National Institute of Technology Karnataka, India for providing code for Example 1 of this paper.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Argyros, I.K.; George, S. On the complexity of extending the convergence region for Traub’s method. J. Complex. 2020, 56, 101423. [Google Scholar] [CrossRef]
Ben-Israel, A.; Greville, T.N.E. Generalized Inverses: Theory and Applications; John Wiley and Sons: New York, NY, USA, 1974. [Google Scholar]
Moore, R.H.; Nashed, M.Z. Approximations to generalized inverses of linear operators. SIAM J. Appl. Math. 1974, 27, 1–16. [Google Scholar] [CrossRef]
Nashed, M.Z. Generalized Inverses and Applications; Academic Press: New York, NY, USA, 1976. [Google Scholar]
Padcharoen, A.; Kumam, P.; Chaipunya, P.; Shehu, Y. Convergence of inertial modified Krasnoselskii-Mann iteration with application to image recovery. Thai J. Math. 2020, 18, 126–142. [Google Scholar]
Proinov, P.D.; Petkova, M.D. Local and semilocal Convergence of a family of Multi-point Weierstrass-type Root-Finding Methods. Mediterr. J. Maths. 2020, 17, 107. [Google Scholar] [CrossRef]
Regmi, S.; Argyros, I.K.; George, S.; Argyros, C.I. Extended Convergence of Three Step Iterative Methods for Solving Equations in Banach Space with Applications. Symmetry 2022, 14, 1484. [Google Scholar] [CrossRef]
Häubler, W.M. A Kantorovich-type convergence analysis for the Gauss-Newton-method. Numer. Math. 1986, 48, 119–125. [Google Scholar]
Argyros, I.K.; George, S.; Shakhno, S.; Regmi, S.; Havdiak, M.; Argyros, M.I. Asymptotically Newton-Type Methods without Inverses for Solving Equations. Mathematics 2024, 12, 1069. [Google Scholar] [CrossRef]
Kantorovich, L.V.; Akilov, G. Functional Analysis in Normed Spaces; Fizmatgiz: Moscow, Russia, 1959; (German translation, Akademie-Verlag: Berlin, Germany, 1964): (English translation (2nd edition), Pergamon Press: London, UK, 1981), (1964). [Google Scholar]
Ezquerro, J.A.; Hernandez-Veron, M.A. Domains of global convergence for Newtons’s method from auxiliary points. Appl. Math. Lett. 2018, 85, 48–56. [Google Scholar] [CrossRef]
Ezquerro, J.A.; Hernandez-Veron, M.A. Newton’s Method: An Updated Approach of Kantorovich’s Theory; Birkhauser: Basel, Switzerland, 2017. [Google Scholar]
Krasnoselskij, M.A. Two remarks on the method of successive approximations. Uspehi Mat. Nauk. 1995, 10, 123–127. (In Russian) [Google Scholar]
Traub, J.F.; Wozniakowsi, H. Convegence and complexity of Newton iteration for operator equations. J. Assoc. Comput. March. 1979, 26, 250–258. [Google Scholar] [CrossRef]
Proinov, P.D. New general convergence theory for iterative processes and its applications to Newton- Kantarovich type theorems. J. Complex. 2010, 25, 3–42. [Google Scholar] [CrossRef]
Ostrowski, A.M. Solution of Equations in Euclidean and Banach Spaces; Academic Press: New York, NY, USA, 1973. [Google Scholar]
Yamamoto, T. A convergence theorem for Newton-like methods in Banach spaces. Numer. Math. 1987, 51, 545–557. [Google Scholar] [CrossRef]
Berinde, V. Iterative Approximation of Fixed Points; Springer: New York, NY, USA, 2007. [Google Scholar]
Deuflhard, P. Newton Methods for Nonlinear Problems. Affine Invariance and Adaptive Algorithms; Springer Series in Computational Mathematics; Springer: Berlin/Heidelberg, Germany, 2004; Volume 35. [Google Scholar]
Ezquerro, J.A.; Gutierrez, J.M.; Hernandez, M.A.; Romero, N.; Rubio, M.J. The Newton Method: From Newton to Kantorovich. Gac. R. Soc. Mat. Esp. 2010, 13, 53–76. (In Spanish) [Google Scholar]
Rheinboldt, W.C. A unified convergence theory for a class of iterative process. SIAM J. Numer. Anal. 1968, 5, 42–63. [Google Scholar] [CrossRef]
Catinas, E. The inexact, inexact perturbed, and quasi-Newton methods are equivalent models. Math. Comp. 2005, 74, 291–301. [Google Scholar] [CrossRef]
Potra, F.A. Sharp error bounds for a class of Newton-like methods. Lib. Math. 1985, 5, 71–84. [Google Scholar]

Figure 1. COC and ACOC using

x_{0} = (- 15, - 15)

,

ε = 10^{- 12}

.

Figure 2. Number of iterations needed to achieve tolerance

ε = 10^{- 9}

.

Table 1. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)) with initial guess

x_{0} = (1, 1)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2593 < 1

.

Table 1. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)) with initial guess

x_{0} = (1, 1)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2593 < 1

.

Method	Iterations	Method	Iterations
Equation (5)	3	Equation (5)	3
Equation (50), $k = 1$	6	Equation (55), $k = 1$	9
Equation (51), $k = 2$	4	Equation (55), $k = 2$	7
Equation (52), $k = 3$	4	Equation (55), $k = 3$	6
Equation (53), $k = 4$	3	Equation (55), $k = 4$	5
Equation (54), $k = 5$	3	Equation (55), $k = 5$	4

Table 2. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)) with initial guess

x_{0} = (0, 0)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.1 < 1

.

Table 2. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)) with initial guess

x_{0} = (0, 0)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.1 < 1

.

Method	Iterations	Method	Iterations
Equation (5)	2	Equation (5)	2
Equation (50), $k = 1$	5	Equation (55), $k = 1$	9
Equation (51), $k = 2$	4	Equation (55), $k = 2$	7
Equation (52), $k = 3$	3	Equation (55), $k = 3$	6
Equation (53), $k = 4$	3	Equation (55), $k = 4$	5
Equation (54), $k = 5$	2	Equation (55), $k = 5$	4

Table 3. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)), where

x_{0} = (- 15, - 15)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2120 < 1

.

Table 3. The number of iterations needed to achieve tolerance

ε = 10^{- 9}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 9}

)), where

x_{0} = (- 15, - 15)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2120 < 1

.

Method	Iterations	Method	Iterations
Equation (5)	3	Equation (5)	3
Equation (50), $k = 1$	7	Equation (55), $k = 1$	10
Equation (51), $k = 2$	5	Equation (55), $k = 2$	7
Equation (52), $k = 3$	4	Equation (55), $k = 3$	6
Equation (53), $k = 4$	4	Equation (55), $k = 4$	5
Equation (54), $k = 5$	3	Equation (55), $k = 5$	4

Table 4. The number of iterations needed to achieve tolerance

ε = 10^{- 12}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 12}

)), where

x_{0} = (- 15, - 15)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2120 < 1

.

Table 4. The number of iterations needed to achieve tolerance

ε = 10^{- 12}

(i.e, (

∥ s_{k} - s_{k - 1} ∥ < 10^{- 12}

)), where

x_{0} = (- 15, - 15)

,

λ = 1

and

∥ I - Λ^{'} (x_{0}) ∥ = 0.2120 < 1

.

Method	Iterations	Method	Iterations
Equation (5)	3	Equation (5)	3
Equation (50), $k = 1$	9	Equation (55), $k = 1$	12
Equation (51), $k = 2$	6	Equation (55), $k = 2$	8
Equation (52), $k = 3$	5	Equation (55), $k = 3$	7
Equation (53), $k = 4$	4	Equation (55), $k = 4$	6
Equation (54), $k = 5$	4	Equation (55), $k = 5$	5

Table 5. The computational order of convergence and the approximated computational order of convergence, where

x_{0} = (- 15, - 15)

,

ε = 10^{- 12}

.

Table 5. The computational order of convergence and the approximated computational order of convergence, where

x_{0} = (- 15, - 15)

,

ε = 10^{- 12}

.

Method	COC	ACOC
Equation (5)	1.0	NA
Equation (50), $k = 1$	0.977468	0.969404
Equation (51), $k = 2$	0.943378	0.897736
Equation (52), $k = 3$	0.867463	0.816881
Equation (53), $k = 4$	0.710293	0.264330
Equation (54), $k = 5$	0.654524	0.304212

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Hybrid Chebyshev-Type Methods for Solving Nonlinear Equations

Abstract

1. Introduction

2. Local Analysis of Method (5)

3. Semi-Local Analysis of Method (5)

4. Local Analysis of Method (6)

5. Semi-Local Analysis for Method (6)

6. Numerical Examples

7. Concluding Remarks

Author Contributions

Funding

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics