Symmetric-Type Multi-Step Difference Methods for Solving Nonlinear Equations

Ioannis K. Argyros; Stepan Shakhno; Samundra Regmi; Halyna Yarmola; Michael I. Argyros

doi:10.3390/sym16030330

,

and

¹

Department of Computing and Mathematical Sciences, Cameron University, Lawton, OK 73505, USA

²

Department of Theory of Optimal Processes, Ivan Franko National University of Lviv, Universytetska Str. 1, 79000 Lviv, Ukraine

³

Department of Mathematics, University of Houston, Houston, TX 77205, USA

⁴

Department of Computational Mathematics, Ivan Franko National University of Lviv, Universytetska Str. 1, 79000 Lviv, Ukraine

Symmetry2024, 16(3), 330;https://doi.org/10.3390/sym16030330

This article belongs to the Special Issue Advances in Mathematical Models and Partial Differential Equations

Version Notes

Order Reprints

Abstract

Symmetric-type methods (STM) without derivatives have been used extensively to solve nonlinear equations in various spaces. In particular, multi-step STMs of a higher order of convergence are very useful. By freezing the divided differences in the methods and using a weight operator a method is generated using m steps (m a natural number) of convergence order 2 m. This method avoids a large increase in the number of operator evaluations. However, there are several problems with the conditions used to show the convergence: the existence of high order derivatives is assumed, which are not in the method; there are no a priori results for the error distances or information on the uniqueness of the solutions. Therefore, the earlier studies cannot guarantee the convergence of the method to solve nondifferentiable equations. However, the method may converge to the solution. Thus, the convergence conditions can be weakened. These problems arise since the convergence order is determined using the Taylor series which requires the existence of high-order derivatives which are not present in the method, and they may not even exist. These concerns are our motivation for authoring this article. Moreover, the novelty of this article is that all the aforementioned problems are addressed positively, and by using conditions only related to the divided differences in the method. Furthermore, a more challenging and important semi-local analysis of convergence is presented utilizing majorizing sequences in combination with the concept of the generalized continuity of the divided difference involved. The convergence is also extended from the Euclidean to the Banach space. We have chosen to demonstrate our technique in the present method. But it can be used in other studies using the Taylor series to show the convergence of the method. The applicability of other single- or multi-step methods using the inverses of linear operators with or without derivatives can also be extended with the same methodology along the same lines. Several examples are provided to test the theoretical results and validate the performance of the method.

Keywords:

convergence; Banach spaces; Fréchet derivative; divided difference; iterative method; Taylor expansion series; high-order derivative

1. Introduction

Let E stand for Banach space and

D \subseteq E

for an open set. Suppose that

P : D \to E

is a continuous operator. A plethora of applications are thus reduced to solving the equation

P (x) = 0 .

(1)

A solution

x^{*} \in D

to this equation is needed in closed form. But this is attainable only in special situations. That is why most solution approaches are iterative when a sequence is developed that is convergent to

x^{*}

under some conditions of the operator P and the starting point

x_{0} \in D

.

The existence and uniqueness of the solution’s results are usually developed for an equation like (1). In this case, Equation (1) has solutions and so it is needed to present and analyze iterative methods (ITs).

No matter which IT is utilized, there are three concerns. First, the iterations must exist in the domain D. That is, if the IT requires the evaluation of an IT at all

x_{k}

, it must be assured that these iterates remain in the domain of the operator P. If we refer to Newton’s method, the Fréchet derivative

P^{'} (x_{k})

which is a linear operator as well as the inverse

P^{'} {(x_{k})}^{- 1}

must be well defined at all

x_{k}

. That is why we usually provide conditions that ensure that the iterates exist provided that the IT initiates from an initial point

x_{0}

. Another more challenging concern is the convergence of the sequence

{x_{k}}

, and if their limits are indeed solutions to Equation (1). A plethora of such results exists [1,2]. The first one is usually called local convergence, where we start by assuming the existence of some

x^{*} \in D

, and then we provide some neighborhoods of

x^{*}

called convergence balls such that all of the initial points are in it, and all of the iterates produced by the IT exist and converge to

x^{*}

. The second type of convergence, usually called semilocal, does not rely on the existence of the solution

x^{*}

, but it shows certain, usually difficult to verify, conditions of the operator P and the initial point

x_{0}

. However, under these conditions, the convergence of the sequence to

x^{*}

is assured. In the semilocal case, we also provide computable error estimates for the distances

∥ x_{k + 1} - x_{k} ∥

or

∥ x^{*} - x_{k} ∥

, which are not given in the local convergence theorems. However, even these estimates are usually pessimistic.

A widely used single-step method is Newton’s (NA), defined as [1,3]

x_{0} \in D, x_{k + 1} = x_{k} - P^{'} {(x_{k})}^{- 1} P (x_{k}) .

The convergence order of (NA) is two [1]. But the inversion of the derivative

P^{'} (x_{k})

is required at each step. This inversion may not be possible or may very expensive to carry out.

The modified Newton’s method [1,2] is defined by

x_{0} \in D, x_{k + 1} = x_{k} - P^{'} {(x_{0})}^{- 1} P (x_{k}) .

This method requires only the inversion of

P^{'} (x_{0})

. But the convergence order of it is only one.

To avoid the generally expensive computation of the Fréchet derivative

P^{'} (x_{k})

of the operator P and increase the convergence to be higher than one other ITs have been developed [1,3,4,5,6,7] using divided differences of order one [1,2]. The methods of chords or Regula falsi or the Secant method are some of the most used ITs for solving Equation (1). They are defined by

x_{- 1}, x_{0} \in D, x_{k + 1} = x_{k} - {[x_{k}, x_{k - 1}; P]}^{- 1} P (x_{k}),

where

L (E, E)

denotes the space for mappings of E into E for linear operators that are bounded, and

[\cdot, \cdot; P] : D \times D \to L (E, E)

is called a divided difference of order one. But the R-order of convergence is

\frac{1 + \sqrt{5}}{2}

which is larger than one but smaller than two. However, it is possible to still use divided differences and obtain a convergence order greater than

\frac{1 + \sqrt{5}}{2}

.

By utilizing the symmetric difference formula

P^{'} (x) \approx [x + h, x - h; P],

we derive the Symmetric-Steffensen-type method (SSTM)

x_{k + 1} = x_{k} - {[x_{k} + P (x_{k}), x_{k} - P (x_{k}); P]}^{- 1} P (x_{k}),

which is also of convergence order two [4,5]. But the divided difference

[\cdot, \cdot; P]

is used instead of the derivative

P^{'}

. Other Steffensen-type methods are studied in [6,7,8,9]. Generalizations of the SSTM have been suggested to increase the convergence order in combination with weight operators, and frozen derivatives.

Symmetries play a central role in the dynamics of physical systems. Quantum physics are at the core of symmetry principles. Symmetries do not only naturally appear in geometry. They appear every time that a mathematical object stays unchanged under transformations. Even and odd functions studied in calculus are examples of symmetry. Symmetric matrices or graphs are another example. Symmetries characterize the solutions of differential or integral equations. That is why it makes sense to consider iterative methods of a symmetric nature to solve such equations.

Let us revisit the family of methods defined by [5]

\begin{matrix} u_{k} & = & x_{k} + a P (x_{k}), A_{k} = [u_{k}, x_{k}; P], \\ y_{k}^{(1)} & = & x_{k} - A_{k}^{- 1} P (x_{k}), z_{k} = y_{k}^{(1)} + b P (y_{k}^{(1)}), \\ y_{k}^{(2)} & = & y_{k}^{(1)} - B (s_{k}) A_{k}^{- 1} P (y_{k}^{(1)}), s_{k} = A_{k}^{- 1} [y_{k}^{(1)}, z_{k}; P], \\ \dots \\ y_{k}^{(m - 1)} & = & y_{k}^{(m - 2)} - B (s_{k}) A_{k}^{- 1} P (y_{k}^{(m - 2)}), \\ x_{k + 1} & = & y_{k}^{(m)} = y_{k}^{(m - 1)} - B (s_{k}) A_{k}^{- 1} P (y_{k}^{(m - 1)}), \end{matrix}

(2)

where

k = 0, 1, 2, \dots

is the number of iterations; m is the number of steps;

a, b

are real numbers; and

B (s_{k})

is a linear weight operator [10,11]. The weight

B (s)

is any real matrix function that satisfies

B (0) = 1

,

B (1) = - 1

and

∥ B^{″} (0) ∥ < \infty .

The convergence order 2 m is proven in [5] using the Taylor series for

E = R^{n}

. A detailed favorable comparison with other competing methods using similar information has been carried out in [5,10,11]. This includes the computation of the CPU time. It is because of these advantages that we picked method (2) to demonstrate our technique. But, we noticed that there are also limitations to the approach [5,10,11].

Motivation

(L_{1})

Although method (2) is derivative free, Theorem 1 in [5] can be used provided that

P^{(q)}

and

q = 1, 2, \dots, 5

exist. But these derivatives do not appear in (2). Let us consider the real function

f : D \subseteq R \to R

defined by

f (t) = θ_{1} t^{7} + θ_{2} t^{6} + θ_{3} t^{4} log t

, for

t \neq 0

and

f (t) = 0

for

t = 0

provided that

θ_{1}, θ_{2}, θ_{3}

are real parameters satisfying

θ_{1} + θ_{2} = 0

and

θ_{3} \neq 0

. Choose D to be any interval containing 0 and 1. Then,

t^{*} = 1

solves the equations

f (t) = 0

. But the function

f^{(s)}

is not continuous at

t = 0 \in D

. Hence, there is no assurance from Theorem 1 in [5] that the sequence

{x_{k}}

is convergent to

t^{*} = 1

. However, this sequence converges to

t^{*} = 1

if, for example,

a = θ_{1} = 1, b = θ_{2} = - 1, B = I, m = 2

, and

x_{0} = 1.2 .

Hence, the conditions of Theorem 1 in [5] can be weakened.

(L_{2})

There are no computable a priori estimates for

∥ x^{*} - x_{k} ∥

. So, the number of iterations to be carried out to reach a desired error tolerance is not known in advance.

(L_{3})

The uniqueness of the solution is not known in a neighborhood containing it.

(L_{4})

The radius of convergence is unknown. Thus, the selection of

x_{0}

assuring the convergence of the sequence

{x_{k}}

to

x^{*}

is a very difficult task.

(L_{5})

The results in [5] are restricted by

R^{n}

.

(L_{6})

The more important and challenging semi-local analysis of the method (2) has not been studied previously.

The limitations

(L_{1}) - (L_{6})

are the motivation for writing this article. Addressing these limitations is the novelty of this paper:

Novelty

{(L_{1})}^{'}

The local analysis of convergence only uses conditions based on the operators in (2).

{(L_{2})}^{'}

The required number of iterations to reach an error tolerance is known in advance since a priori error estimates for

∥ x^{*} - x_{k} ∥

become available.

{(L_{3})}^{'}

A neighborhood of

x^{*}

is determined containing no other solution.

{(L_{4})}^{'}

A computable radius of convergence is provided, Thus, the solution of

x_{0}

becomes possible.

{(L_{5})}^{'}

The results are valid in the more Banach space.

{(L_{6})}^{'}

The semi-local analysis of convergence is provided making use of the majorizing sequence.

It is worth noting that the concerns

(L_{1})

–

(L_{6})

always appear in the study of iterative methods using the Taylor series to show convergence such as the ones in [1,3,4,5,6,8,9,10,11,12,13,14,15]. But our technique avoids the Taylor series and uses conditions only for the operators in the method. This way, the benefits

{(L_{1})}^{'}

-

{(L_{6})}^{'}

become possible.

Both types of analysis of convergence rely on the concept of the generalized continuity of the divided difference. This is how we extend the applicability of the method (2). It this worth noting that the approach in this paper may be used to extend the applicability of other methods making use of inverses of linear operators along the same lines [1,3,4,12,13,14]. A similar approach was used for studying the convergence of some methods in [15].

The divided difference in both the local and semilocal convergence analysis is controlled by a majorant real function. In particular, semilocal convergence relies on the scalar majorizing sequences being constructed a priori. The frozen derivatives replace the expensive in the general inversion of the linear operators at each substep of the iteration and still increase the convergence order. This way, the convergence of the method is governed by real functions and sequences that are easier to handle. The monotone convergence of the method in (2) has not been addressed in this paper for Banach space valued operators. But we plan to study this type of convergence in future research in the setting of Hilbert space or partially ordered topological spaces using the extension of the fixed point theory in these spaces. The results are expected to be fruitful, since it has already been established in [4,5,6,7,8,9] that STMs have advantages over existing methods.

Notice also that, due to the a priori estimates we have provided, the minimum number of iterations needed to reach a desired tolerance is known in advance (see the error estimates (7) and (8) in Theorem 1, and (19) and (20) in the Theorem 2).

The rest of the article is divided as follows: Section 2 and Section 3 deal with local followed by semilocal analyses of convergence. The numerical experimentations appear in Section 4 and the Concluding remarks in Section 5.

2. Convergence 1: Local

Some concepts related to the order of convergence of iterative methods should be mentioned.

Let

{x_{k}}

be a sequence in E which converges to

x^{*}

. Then, the convergence is of order q,

q > 1

if there exist a constant

C, C > 0

and a natural number

\bar{k}

, such that

∥ x_{k + 1} - x^{*} ∥ \leq C ∥ x_{k} - x^{*} ∥^{q} for each k \geq \bar{k},

or for

e_{k} = x_{k} - x^{*}

∥ e_{k + 1} ∥ \leq C ∥ e_{k} ∥^{q} for each k \geq \bar{k} .

The convergence is said to be linear if

q = 1

, and C exists such that

C \in (0, 1)

Moreover, the error in the

k -

iteration for

E = R^{n}

is defined by

e_{k + 1} = M e_{k}^{q} + O (e_{k}^{q + 1}),

where q is the order of convergence and M is a

q -

linear function, i.e.,

M \in L (R^{n} \times R^{n} \times \dots \times R^{n})

.

The computational efficiency of an iterative method is

Ω = q^{\frac{1}{p}}

or

Ω = \frac{log q}{p},

where q is the order of convergence and p is the computational cost per iteration. Moreover, if

x_{k - 2}, x_{k - 1}, x_{k}, x_{k + 1}

are four consecutive iterates of an iterative method approximating the solution

x^{*}

of the equation

P (x) = 0

, then the following types of convergence have been suggested in [16] and [3], respectively

δ_{1} \approx \frac{ln (\frac{∥ x_{k + 1} - x^{*} ∥}{∥ x_{k} - x^{*} ∥})}{ln (\frac{∥ x_{k} - x^{*} ∥}{∥ x_{k - 1} - x^{*} ∥})},

δ_{2} \approx \frac{ln (\frac{∥ x_{k + 1} - x_{k} ∥}{∥ x_{k} - x_{k - 1} ∥})}{ln (\frac{∥ x_{k} - x_{k - 1} ∥}{∥ x_{k - 1} - x_{k - 2} ∥})} .

The former is usually called the computational order and the latter the approximate computational order of convergence. It is worth noticing that, since the solution

x^{*}

is usually unknown, the second formula is more useful.

Define the open ball

V (x, a) = {y \in E : ∥ y - x ∥ < a}

, where

x \in E

stands for the center of the ball and

a > 0

is the radius. Moreover, define the closed ball

V [x, a] = {y \in E : ∥ y - x ∥ \leq a}

.

Let

M = [0, + \infty)

. We give the definition of the proof of the local analysis of convergence for the method (2) which relies on some conditions.

Suppose:

(H_{1})

There exist continuous as well as nondecreasing functions (CND)

f_{1} : M \to M

,

φ_{0} : M \times M \to M

such that the equation

φ_{0} (f_{1} (t), t) - 1 = 0

admits the smallest solution which is positive (SSP). Denote such a solution by

ϱ

. Set

M_{0} = [0, ϱ)

.

(H_{2})

There exists

x^{*} \in D

solving the equation

P (x) = 0

and an invertible linear operator T such that for all

x \in D

,

u = x + a P (x)

∥ u - x^{*} ∥ \leq f_{1} (∥ x - x^{*} ∥)

and

∥ T^{- 1} ([u, x; P] - T) ∥ \leq φ_{0} (∥ u - x^{*} ∥, ∥ x - x^{*} ∥) .

Set

D_{0} = V (x^{*}, ϱ) \cap D

.

The definition of

ϱ

, and the condition

(H_{2})

imply

∥ T^{- 1} ([u, x; P] - T) ∥ \leq φ_{0} (∥ u - x^{*} ∥, ∥ x - x^{*} ∥) < 1 .

Thus, the operator

A = [u, x; P]

is invertible by the Banach perturbation Lemma [2] on linear operators (see also Lemma 1).

(H_{3})

There exist CND

f_{2} : M_{0} \to M

and

φ : M_{0} \times M_{0} \to M,

φ_{1} : M_{0} \times M_{0} \times M_{0} \times M_{0} \to M

and

φ_{2} : M_{0} \to M

, such that, for each

v, x, y \in D_{0}, u = x + a P (x), z = y + b P (y)

,

A = [u, x; P]

,

s = A^{- 1} [y, z; P]

, and provided that

B (s)

is a weight function

∥ T^{- 1} ([u, x; P] - [x, x^{*}; P]) ∥ \leq φ (∥ u - x^{*} ∥, ∥ x - x^{*} ∥),

∥ z - x^{*} ∥ \leq f_{2} (∥ x - x^{*} ∥),

∥ I - B (s) A^{- 1} [v, x^{*}; P] ∥ \leq φ_{1} (∥ x - x^{*} ∥, ∥ u - x^{*} ∥, ∥ z - x^{*} ∥, ∥ v - x^{*} ∥),

and

∥ T^{- 1} ([x, x^{*}; P] - T) ∥ \leq φ_{2} (∥ x - x^{*} ∥ .

Define the functions

h_{i}, i = 1, 2, \dots, m

on the interval

M_{0}

by

h_{1} (t) = \frac{φ (f_{1} (t), t)}{1 - φ_{0} (f_{1} (t), t)},

and for

j = 2, \dots, m

h_{j} (t) = φ_{1} (t, h_{j - 1} (t) t, f_{1} (t), f_{2} (t)) h_{j - 1} (t) .

(H_{4})

The equations

h_{j} (t) - 1 = 0

admit SPS in the interval

M_{0}

. Thus, we denote such solutions with

s_{j}

, respectively.

Set

s^{*} = min {s_{j}} and M^{*} = [0, s^{*}) .

(3)

These definitions assure that for each

t \in M^{*}

0 \leq φ_{0} (f_{1} (t), t) < 1

(4)

and

0 \leq h_{j} (t) < 1 .

(5)

(H_{5})

V [x^{*}, s^{*}] \subset D .

Remark 1.

(1) The functions

f_{1}

and

f_{2}

can be defined as follows:

f_{1} (t) = (∥ I + a T ∥ + | a | ∥ T ∥ φ_{2} (t)) t

and

f_{2} (t) = (∥ I + b T ∥ + | b | ∥ T ∥ φ_{2} (h_{1} (t) t)) h_{1} (t) t .

The justification for these choices follows, in turn, from the calculations:

\begin{matrix} u - x^{*} = x - x^{*} + a T T^{- 1} [x, x^{*}; P]) (x - x^{*}) \\ = (I + a T T^{- 1} ([x, x^{*}; P]) (x - x^{*}) \\ = (I + a T T^{- 1} ([x, x^{*}; P] - T + T) (x - x^{*}) \\ = [(I + a T) + a T T^{- 1} ([x, x^{*}; P] - T)] (x - x^{*}) . \end{matrix}

Thus, it follows that

∥ u - x^{*} ∥ \leq (∥ I + a T ∥ + | a | ∥ T ∥ φ_{2} (∥ x - x^{*} ∥)) ∥ x - x^{*} ∥,

which justifies the choice of the function

f_{1}

.

Similarly, we can write

z - x^{*} = [(I + b T) + b T T^{- 1} ([y, x^{*}; P] - T)] (y - x^{*})

leading to

∥ z - x^{*} ∥ \leq [∥ I + b T ∥ + | b | ∥ T ∥ φ_{2} (∥ y - x^{*} ∥)] ∥ y - x^{*} ∥,

which justifies the choice of the function

f_{2}

.

(2) A popular pick for

T = P^{'} (x^{*}) .

But this implies the invertibility of the function

P^{'}

. Moreover,

x^{*}

is a simple solution in this case. We do not assume the invertability of

P^{'} (x^{*})

and it is not necessarily implied by any of the conditions.

This way, method (2) can be employed to approximate solutions for the equation

P (x) = 0

which are not necessarily simple [5]. Other choices are possible (see also the Numerical Section).

The following is a useful lemma [2].

Lemma 1.

Let Q be a linear operator satisfying

∥ Q ∥ < 1

. Then, the linear operator

I - Q

is invertible and

{∥ (I - Q)}^{- 1} ∥ \leq \frac{1}{1 - ∥ Q ∥} .

Recall that

Q_{1}

is an approximate inverse of Q is

∥ I - Q_{1} A ∥ < 1

, provided that

Q_{1}

is also a linear operator. Moreover, in this case, the following hold Q and

Q_{1}

are both invertible and

∥ Q^{- 1} ∥ \leq \frac{∥ Q_{1} ∥}{1 - ∥ I - Q_{1} Q ∥},

and

∥ Q_{1}^{- 1} ∥ \leq \frac{∥ Q ∥}{1 - ∥ I - Q_{1} Q ∥} .

This result is also mentioned as the Banach lemma [2].

Next, the local analysis of convergence for the method (2) uses the conditions

(H_{1})

–

(H_{5})

as well as the preceding notation.

Theorem 1.

Suppose that the conditions

(H_{1})

–

(H_{5})

hold and pick

x_{0} \in V (x^{*}, s^{*}) - {x^{*}}

. Then, the following assertions hold

{x_{k}} \subset V (x^{*}, s^{*}),

(6)

∥ y_{k}^{(1)} - x^{*} ∥ \leq h_{1} (∥ x_{k} - x^{*} ∥) ∥ x_{k} - x^{*} ∥ \leq ∥ x_{k} - x^{*} ∥ < s^{*},

(7)

for

j = 2, \dots, m

∥ y_{k}^{(j)} - x^{*} ∥ \leq h_{j} (∥ x_{k} - x^{*} ∥) ∥ x_{k} - x^{*} ∥ \leq ∥ x_{k} - x^{*} ∥,

(8)

and

lim_{k \to \infty} x_{k} = x^{*} .

(9)

In particular for

j = m

∥ x_{k + 1} - x^{*} ∥ = ∥ y_{m} - x^{*} ∥ \leq h_{m} (∥ x_{k} - x^{*} ∥) ∥ x_{k} - x^{*} ∥ \leq c ∥ x_{k} - x^{*} ∥,

(10)

where

c = h_{m} (∥ x_{0} - x^{*} ∥) \in [0, 1)

.

Proof.

Mathematical induction shall establish these assertions. According to the hypothesis,

x_{0} \in V (x^{*}, s^{*}) - {x^{*}} \subset V (x^{*}, s^{*}),

so the assertions (6) holds if

k = 0

. The application of conditions

(H_{1})

and

(H_{2})

give, in turn,

\begin{matrix} ∥ T^{- 1} (A_{0} - T) ∥ & \leq & φ_{0} (∥ u_{0} - x^{*} ∥, ∥ x_{0} - x^{*} ∥) \\ \leq & φ_{0} (f_{1} (∥ x_{0} - x^{*} ∥), ∥ x_{0} - x^{*} ∥) \leq φ_{0} (f_{1} (s^{*}), s^{*}) < 1 . \end{matrix}

Thus,

A_{0}^{- 1}

exists, and

∥ A_{0}^{- 1} T ∥ \leq \frac{1}{1 - φ_{0} (f_{1} (∥ x_{0} - x^{*} ∥), ∥ x_{0} - x^{*} ∥)} .

(11)

It also follows that the iterate

y_{0}^{(1)}

is well defined by the first substep of the method (2). We can also write

y_{0}^{(1)} - x^{*} = x_{0} - x^{*} - A_{0}^{- 1} P (x_{0}) = A_{0}^{- 1} (A_{0} - [x_{0}, x^{*}; P]) (x_{0} - x^{*})

leading, using (3), (11), and

(H_{3})

, to

\begin{matrix} ∥ y_{0}^{(1)} - x^{*} ∥ & \leq & \frac{φ (f_{1} (∥ x_{0} - x^{*} ∥), ∥ x_{0} - x^{*} ∥) ∥ x_{0} - x^{*} ∥}{1 - φ_{0} (f_{1} (∥ x_{0} - x^{*} ∥), ∥ x_{0} - x^{*} ∥)} \\ \leq & h_{1} (∥ x_{0} - x^{*} ∥) ∥ x_{0} - x^{*} ∥ \leq ∥ x_{0} - x^{*} ∥ < s^{*} . \end{matrix}

(12)

Hence, the iterate

y_{0}^{(1)} \in V (x^{*}, s^{*})

, and the assertion (7) holds provided that

k = 0

.

Concerning the rest of the substeps in method (2), since the iterates

y_{0}^{(2)}, \dots, y_{0}^{(j)}

are well defined, we obtain, in turn,

\begin{matrix} y_{0}^{(j)} - x^{*} & = & y_{0}^{(j - 1)} - x^{*} - B (s_{0}) A_{0}^{- 1} [y_{0}^{(j - 1)}, x^{*}; P] (y_{0}^{(j - 1)} - x^{*}) \\ = & (I - B (s_{0}) A_{0}^{- 1} [y_{0}^{(j - 1)}, x^{*}; P]) (y_{0}^{(j - 1)} - x^{*}), \end{matrix}

from which we deduce

\begin{matrix} ∥ y_{0}^{(j)} - x^{*} ∥ & \leq & φ_{1} (∥ x_{0} - x^{*} ∥, ∥ y_{0}^{(j - 1)} - x^{*} ∥, ∥ u_{0} - x^{*} ∥, ∥ z_{0} - x^{*} ∥) ∥ y_{0}^{(j - 1)} - x^{*} ∥ \\ \leq & h_{j} (∥ x_{0} - x^{*} ∥) ∥ x_{0} - x^{*} ∥ \leq ∥ x_{0} - x^{*} ∥ . \end{matrix}

(13)

The induction for assertion (8) is completed for

k = 0

. In particular, for

j = m

, we obtain

\begin{matrix} ∥ x_{1} - x^{*} ∥ & = & ∥ y_{0}^{(m)} - x^{*} ∥ \leq h_{m} (∥ x_{0} - x^{*} ∥) ∥ x_{0} - x^{*} ∥ \\ \leq & c ∥ x_{0} - x^{*} ∥ \leq ∥ x_{0} - x^{*} ∥, \end{matrix}

(14)

resulting in (6) and (10) provided that

k = 1

. The inductions for (6)–(8) and (10) are completed for

k = 0

. But these calculations can be repeated if

x_{i}

replaces

x_{0}

in the preceding estimations.

Therefore, we have as in (14) that

∥ x_{k + 1} - x^{*} ∥ \leq c ∥ x_{k} - x^{*} ∥ \leq c^{k + 1} ∥ x_{0} - x^{*} ∥ < ∥ x_{0} - x^{*} ∥ < s^{*},

(15)

which implies that the iterate

x_{k + 1} \in V (x^{*}, s^{*})

, and

lim_{k \to \infty} x_{k} = x^{*} .

The uniqueness of the solution

x^{*}

is discussed in the following result.

Proposition 1.

Suppose that the condition

(H_{2})

holds in the ball

V (x^{*}, ϱ_{1})

for some

ϱ_{1} > 0

; there exists

ϱ_{2} \geq ϱ_{1}

such that the last condition in

(H_{4})

holds and

φ_{2} (ϱ_{2}) < 1 .

(16)

Set

D_{1} = V [x^{*}, ϱ_{2}] \cap D .

Then, the equation

P (x) = 0

is uniquely solvable by

x^{*}

in the region

D_{1}

.

Proof.

Suppose that there exists

w^{*} \in D_{1}

, solving the equation

P (x) = 0

, and

w^{*} \neq x^{*}

, Then, define the divided difference

L_{1} = [w^{*}, x^{*}; P]

. Then, the condition

(H_{2})

and (16) imply

∥ T^{- 1} (L_{1} - T) ∥ \leq φ_{2} (∥ w^{*} - x^{*} ∥) \leq φ_{2} (ϱ_{2}) < 1 .

Hence,

L_{1}^{- 1}

exists. Finally, from the identity

w^{*} - x^{*} = L_{1}^{- 1} (P (w^{*}) - P (x^{*})) = L_{1}^{- 1} (0) = 0

, concluding that

w^{*} = x^{*}

. □

Remark 2.

It is clear that

ϱ_{1} = s^{*}

provided that all the conditions of Theorem 1 hold in Proposition 1.

3. Convergence 2: Semi-Local

The semi-local analysis of convergence relies on majorizing sequences and similar computations. But the solution

x^{*}

and the function “

φ

” are exchanged by

x_{0}

, and the function “

ψ

”, respectively.

Recall that, a sequence

{s_{k}} \subseteq [0, + \infty)

for which

∥ x_{k + 1} - x_{k} ∥ \leq s_{k + 1} - s_{k}, k = 0, 1, 2, \dots

holds is majorizing for

{x_{k}}

. Moreover, suppose that

lim_{k \to + \infty} s_{k} = s^{*} < + \infty

exist.

Then, it follows that

lim_{k \to + \infty} x_{k} = x^{*}

and

∥ x^{*} - x_{k} ∥ \leq s^{*} - s_{k}, k = 0, 1, 2, \dots

Therefore, the study of the convergence of the sequence

{x_{k}}

is reduced to the study of the scalar sequence

{s_{k}}

[2].

Suppose:

(C_{1})

There exist CND

g_{1} : M \to M, ψ_{0} : M \times M \to M

such that the equation

ψ_{0} (g_{1} (t), t) - 1 = 0

has a SPS. Denote such a solution by

p_{0}

. Set

N = [0, p_{0}) .

There exist CND

ψ : N \times N \times N \to M, g_{2} : N \to M

,

ψ_{1} : N \times N \times N \times N \to M

and

ψ_{2} : N \times N \to M

.

Define the scalar sequences for

α_{0}^{(0)} = 0

, some

α_{0}^{(1)} = 0

,

i = 1, 2, \dots, m

and each

k = 0, 1, 2, \dots

by

\begin{matrix} b_{k}^{(i)} & = & ψ (α_{k}^{(0)}, α_{k}^{(i)}, g_{1} (α_{k}^{(0)})) (α_{k}^{(i)} - α_{k}^{(0)}), \\ γ_{k}^{(i)} & = & ψ_{1} (α_{k}^{(0)}, α_{k}^{(i)}, g_{1} (α_{k}^{(0)}), g_{2} (α_{k}^{(0)})), \\ α_{k}^{(i + 1)} & = & α_{k}^{(i)} + γ_{k}^{(i)} b_{k}^{(i)}, \\ b_{k}^{(m - 2)} & = & (1 + ψ_{2} (α_{k}^{(1)}, α_{k}^{(m - 2)})) (α_{k}^{(m - 2)} - α_{k}^{(1)}) + b_{k}^{(i)}, \\ α_{k}^{(m - 1)} & = & α_{k}^{(m - 2)} + γ_{k}^{(m - 2)} b_{k}^{(m - 2)}, \\ α_{k + 1}^{(0)} & = & α_{k}^{(m - 1)} + γ_{k}^{(m - 1)} b_{k}^{(m - 1)}, \\ δ_{k + 1} & = & (1 + ψ_{2} (α_{k + 1}^{(0)}, α_{k}^{(0)})) (α_{k + 1}^{(0)} - α_{k}^{(0)}) \\ + (1 + ψ_{2} (α_{k + 1}^{(0)}, g_{2} (α_{k}^{(0)})) (α_{k + 1}^{(1)} - α_{k + 1}^{(0)}) \end{matrix}

(17)

and

α_{k + 1}^{(1)} = α_{k + 1} + \frac{δ_{k + 1}}{1 - ψ_{0} (g_{1} (α_{k + 1}^{(0)}), α_{k + 1}^{(0)})} .

This sequence is shown to be majorizing for

{x_{k}}

in Theorem 2. But first, a convergence condition is required for this sequence.

(C_{2})

There exist a parameter

p \in [0, p_{0})

such that for each

i = 0, 1, \dots, m

,

k = 0, 1, 2, \dots

ψ_{2} (g_{1} (α_{k}^{(0)}), α_{k}^{(0)}) < 1

and

α_{k}^{(i)} < p .

This condition and the formula (17) imply that the sequence

{α_{k}^{(i)}}

is nonnegative, nondecreasing, and bounded from above by

p_{0}

. Consequently, the sequence

{α_{k}^{(i)}}

is convergent to its least upper bound which is a unique number. Denote such a number with

α^{*}

.

As in the local analysis, the scalar parameters and sequences relate to the functions in method (2).

(C_{3})

A point

x_{0} \in D

and an invertible function T exist, such that, for each

x \in D

,

u = x + a P (x)

,

z = x + b P (x)

,

∥ u - x_{0} ∥ \leq g_{1} (∥ x - x_{0} ∥), ∥ z - x_{0} ∥) \leq g_{2} (∥ x - x_{0} ∥),

and

∥ T^{- 1} ([u, x; P] - T) ∥ \leq ψ_{0} (∥ u - x_{0} ∥, ∥ x - x_{0} ∥) .

Set

D_{2} = V (x_{0}, p_{0}) \cap D

.

This condition and the definition of

p_{0}

imply, for

x = x_{0}

, that

∥ T^{- 1} ([u_{0}, x_{0}; P] - T) ∥ \leq ψ_{0} (g_{1} (∥ x_{0} - x^{*} ∥), 0) \leq ψ_{0} (g_{1} (p_{0}), 0) < 1 .

Thus,

A_{0} \neq 0

. Hence, we can take

α_{0}^{(1)} \geq ∥ A_{0}^{- 1} P (x_{0}) ∥ .

(C_{4})

For each

x \in D_{2}, u = x + a P (x), y = x - A^{- 1} P (x), v = y + b P (y),

∥ z - x_{0} ∥ \leq g_{1} (∥ x - x_{0} ∥), ∥ v - x_{0} ∥ \leq g_{2} (∥ x - x_{0} ∥),

∥ T^{- 1} ([u, x; P] - [v, x; P]) ∥ \leq ψ (∥ x - x_{0} ∥, ∥ u - x_{0} ∥, | v - x_{0} ∥),

∥ B (s) A^{- 1} T ∥ \leq ψ_{1} (∥ x - x_{0} ∥, ∥ u - x_{0} ∥, ∥ z - x_{0} ∥, ∥ v - x_{0} ∥),

and

∥ T^{- 1} ([x, w; P] - T) ∥ \leq ψ_{2} (∥ x - x_{0} ∥, ∥ w - x_{0} ∥),

and

(C_{5})

V [x_{0}, α^{*}] \subset D

.

Remark 3.

(i)

The functions

g_{1}

and

g_{2}

can be chosen to be

g_{1} (t) = (∥ I + a T ∥ + | a | ∥ T ∥ ψ_{2} (t, 0) t) + | a | ∥ T ∥ ∥ P (x_{0}) ∥

and

g_{2} (t) = (∥ I + b T ∥ + | b | ∥ T ∥ ψ_{2} (t, 0) t) + | b | ∥ T ∥ ∥ P (x_{0}) ∥ .

As in the local analysis, the motivational calculations are:

\begin{matrix} u - x_{0} = x - x_{0} + a (P (x) - P (x_{0}) + P (x_{0})) \\ = [(I + a T) + a T T^{- 1} ([x, x_{0}; P] - T)] (x - x_{0}) + a P (x_{0}), \end{matrix}

so

∥ u - x_{0} ∥ \leq [∥ I + a T ∥ + | a | ∥ T ∥ ψ_{2} (∥ x - x_{0} ∥, 0)] ∥ x - x_{0} ∥ + | a | ∥ P (x_{0}) ∥,

justifying the choice of the function

g_{1}

.

Similarly

z - x_{0} = y_{k}^{(1)} - x_{0} + b T T^{- 1} (P (y_{k}^{(1)}) - P (x_{0}) + P (x_{0}));

thus,

∥ z_{k} - x_{0} ∥ \leq (∥ I + b T ∥ + | b | ∥ T ∥ ψ_{2} (∥ y_{k}^{(1)} - x_{0} ∥, 0)) ∥ y_{k}^{(1)} - x_{0} ∥ + | b | ∥ T ∥ ∥ P (x_{0} ∥,

which justifies the choice of the function

g_{2}

.

(i i)

A popular choice for

T = P^{'} (x_{0})

. The rest of the comments are omitted as they are similar to the ones in Remark 1.

The semilocal analysis of convergence is provided in the next result.

Theorem 2.

Suppose that the conditions

(C_{1})

-

(C_{5})

hold. Then, the sequence

{x_{k}}

is well defined in

V (x_{0}, α^{*})

, remains in

V (x_{0}, α^{*})

for each

k = 0, 1, 2, \dots .

and is convergent to a solution

x^{*} \in V [x_{0}, α^{*}]

on the equation

P (x) = 0

such that

∥ x^{*} - x_{k} ∥ \leq α^{*} - α_{k}^{(0)} .

(18)

Proof.

As in the local analysis, mathematical induction is utilized to show the assertions

∥ y_{k}^{(1)} - x_{k} ∥ \leq α_{k}^{(1)} - α_{k}^{(0)},

(19)

and

∥ y_{k}^{(i + 1)} - y_{k}^{(i)} ∥ \leq α_{k}^{(i + 1)} - α_{k}^{(i)} .

(20)

The assertion (19) holds if

k = 0

since

∥ y_{0}^{(1)} - x_{0} ∥ \leq ∥ A_{0}^{- 1} P (x_{0}) ∥ \leq α_{0}^{(1)} = α_{0}^{(1)} - α_{0}^{(0)} < α^{*},

and the iterate

y_{0}^{(1)} \in V (x_{0}, α^{*})

. Then, we have the estimates

P (y_{k}^{(1)}) = P (y_{k}^{(1)}) - P (x_{k}) - A_{k} (y_{k}^{(1)} - x_{k}) = ([y_{k}^{(1)}, x_{k}; P] - A_{k}) (y_{k}^{(1)} - x_{k}) .

So,

\begin{matrix} ∥ T^{- 1} P (y_{k}^{(1)}) ∥ & = & ψ (∥ x_{x} - x_{0} ∥, ∥ y_{k}^{(1)} - x_{0} ∥, ∥ u_{k} - x_{0} ∥) ∥ y_{k}^{(1)} - x_{k} ∥ \\ \leq & ψ (α_{k}^{(0)}, α_{k}^{(1)}, g_{1} (α_{k}^{(0)})) (α_{k}^{(1)} - α_{k}^{(0)}) = b_{k}^{(0)}, \end{matrix}

y_{k}^{(2)} - y_{k}^{(1)} = B (s_{k}) A_{k}^{- 1} P (y_{k}^{(1)}),

\begin{matrix} ∥ y_{k}^{(2)} - y_{k}^{(1)} ∥ & \leq & ∥ B (s_{k}) A_{k}^{- 1} T ∥ ∥ T^{- 1} P (y_{k}^{(1)}) ∥ \\ \leq & ψ_{1} (∥ x_{k} - x_{0} ∥, ∥ z_{k} - x_{0} ∥, ∥ y_{k}^{(1)} - x_{0} ∥, ∥ u_{k} - x_{0} ∥) b_{k}^{(1)} \\ \leq & γ_{k}^{(1)} b_{k}^{(1)} = α_{k}^{(2)} - α_{k}^{(1)} \end{matrix}

and

\begin{matrix} ∥ y_{k}^{(2)} - x_{0} ∥ & \leq & ∥ y_{k}^{(2)} - y_{k}^{(1)} ∥ + ∥ y_{k}^{(1)} - x_{0} ∥ \\ = & α_{k}^{(2)} - α_{k}^{(1)} + α_{k}^{(1)} - α_{k}^{(0)} = α_{k}^{(2)} < α^{*} . \end{matrix}

Hence, the iterate

y_{k}^{(2)} \in V (x_{0}, α^{*})

and the assertion (20) hold for

i = 1

.

Then, we can write

\begin{matrix} P (y_{k}^{(m - 2)}) & = & P (y_{k}^{(m - 2)}) - P (y_{k}^{(1)}) + P (y_{k}^{(1)}) \\ = & [y_{k}^{(m - 2)}, y_{k}^{(1)}; P] (y_{k}^{(m - 2)} - y_{k}^{(1)}) + P (y_{k}^{(1)}); \end{matrix}

thus,

\begin{matrix} ∥ T^{- 1} P (y_{k}^{(m - 2)}) ∥ & \leq & ∥ T^{- 1} ([y_{k}^{(m - 2)}, y_{k}^{(1)}; P] - T + T) (y_{k}^{(m - 2)} - y_{k}^{(1)}) ∥ + ∥ T^{- 1} P (y_{k}^{(1)}) ∥ \\ \leq & (1 + ψ_{2} (α_{k}^{(1)}, α_{k}^{(m - 2)})) (α_{k}^{(m - 2)} - α_{k}^{(1)}) + b_{k}^{(1)} = b_{k}^{(m - 2)} \end{matrix}

leading to

\begin{matrix} ∥ y_{k}^{(m - 1)} - y_{k}^{(m - 2)} ∥ & \leq & ψ_{2} (∥ x_{k} - x_{0} ∥, ∥ u_{k} - x_{0} ∥, ∥ z_{k} - x_{0} ∥, ∥ y_{k}^{(m - 2)} - x_{0} ∥) α_{k}^{(m - 2)} \\ = & α_{k}^{(m - 1)} - α_{k}^{(m - 2)}, \end{matrix}

\begin{matrix} ∥ y_{k}^{(m - 1)} - x_{0} ∥ & \leq & ∥ y_{k}^{(m - 1)} - y_{k}^{(m - 2)} ∥ + ∥ y_{k}^{(m - 2)} - x_{0} ∥ \\ \leq & α_{k}^{(m - 1)} - α_{k}^{(m - 2)} + α_{k}^{(m - 2)} - α_{0}^{(0)} = α_{k}^{(m - 1)} < α^{*}, \end{matrix}

\begin{matrix} ∥ x_{k + 1} - y_{k}^{(m - 1)} ∥ & = & ∥ y_{k}^{(m)} - y_{k}^{(m - 1)} ∥ \\ \leq & ψ_{1} (∥ x_{k} - x_{0} ∥, ∥ z_{k} - x_{0} ∥, ∥ u_{k} - x_{0} ∥, ∥ y_{k}^{(m - 1)} - x_{0} ∥) λ_{k}^{(m - 1)} \\ = & α_{k}^{(m)} - α_{k}^{(m - 1)} = α_{k + 1}^{(0)} - α_{k}^{(m - 1)} \end{matrix}

and

\begin{matrix} ∥ x_{k + 1} - x_{0} ∥ & \leq & ∥ x_{k + 1} - y_{k}^{(m - 1)} ∥ + ∥ y_{k}^{(m - 1)} - x_{0} ∥ \\ \leq & α_{k + 1}^{(0)} - α_{k}^{(m - 1)} + α_{k}^{(m - 1)} - α_{0}^{(0)} = α_{k + 1}^{(0)} < α^{*} . \end{matrix}

Hence, the iterates

y_{k}^{(i)} \in V (x_{0}, α^{*})

and the iteration (20) holds.

Then, we can also write

\begin{matrix} P (x_{k + 1}) & = & P (x_{k + 1}) - P (x_{k}) - A_{k + 1} (y_{k + 1}^{(1)} - x_{k + 1}) \\ = & P (x_{k + 1}) - P (x_{k}) - [x_{k + 1}, x_{k}; P] (x_{k + 1} - x_{k}) \\ + & [x_{k + 1}, x_{k}; P] (x_{k + 1} - x_{k}) - A_{k + 1} (y_{k + 1}^{(1)} - x_{k + 1}) \\ = & [x_{k + 1}, x_{k}; P] (x_{k + 1} - x_{k}) - A_{k + 1} (y_{k + 1}^{(1)} - x_{k + 1}) . \end{matrix}

But

∥ T^{- 1} ([x_{k + 1}, x_{k}; P] - T + T) ∥ \leq 1 + ψ_{2} (α_{k + 1}^{(0)}, α_{k}^{(0)})

and similarly

∥ T^{- 1} (A_{k + 1} - T + T) ∥ \leq 1 + ψ_{2} (α_{k + 1}^{(0)}, g_{2} (α_{k + 1}^{(0)})),

leading to

\begin{matrix} ∥ T^{- 1} P (x_{k + 1}) ∥ & \leq & (1 + ψ_{2} (α_{k + 1}^{(0)}, α_{k}^{(0)})) (α_{k + 1}^{(0)} - α_{k}^{(0)}) \\ + & (1 + ψ_{2} (α_{k + 1}^{(0)}, g_{2} (α_{k + 1}^{(0)}))) (α_{k + 1}^{(1)} - α_{k + 1}) = δ_{k + 1}, \end{matrix}

(21)

so

\begin{matrix} ∥ y_{k + 1}^{(1)} - x_{k + 1} ∥ & \leq & ∥ A_{k + 1}^{- 1} T ∥ ∥ T^{- 1} P (x_{k + 1}) ∥ \\ \leq & \frac{δ_{k + 1}}{1 - ψ_{0} (g_{1} (α_{k + 1}^{(0)}), α_{k + 1}^{(0)}))} = α_{k + 1}^{(1)} - α_{k + 1} \end{matrix}

and

\begin{matrix} ∥ y_{k + 1}^{(1)} - x_{0} ∥ & \leq & ∥ y_{k + 1}^{(1)} - x_{k + 1} ∥ + ∥ x_{k + 1} - x_{0} ∥ \\ \leq & α_{k + 1}^{(1)} - α_{k + 1}^{(0)} + α_{k + 1}^{(0)} - α_{k}^{(0)} = α_{k + 1}^{(1)} < α^{*} . \end{matrix}

Hence, the induction for the assertions (19) and (20) is completed and all the iterates of the method (2) belong in

V (x_{0}, α^{*})

. The condition

(C_{2})

implies that the sequence

{α_{k}^{(i)}}

is complete as it is convergent to

α^{*}

. Then, using (19) and (20), the sequence

{x_{k}}

is also complete in E and, as such, it is convergent to some

x^{*} \in V [x_{0}, x^{*}]

. Then, by letting

k \to + \infty

in (21) and the continuity of P, we deduce that

P (x^{*}) = 0

. Finally, the assertion (18) follows from the estimate

∥ x_{k + j} - x_{k} ∥ \leq α_{k + j} - α_{k}

by letting

j \to + \infty

. □

The uniqueness of the solution follows.

Proposition 2.

Suppose that there exists a solution

w_{0} \in V (x_{0}, ϱ_{3})

to the equation

P (x) = 0

for some

ϱ_{3} > 0

;

the last condition in

(C_{4})

holds in the ball

V (x_{0}, ϱ_{3})

and

ϱ_{4} > ϱ_{3}

exists, such that

ψ_{2} (ϱ_{3}, ϱ_{4}) < 1 .

(22)

Set

D_{3} = V [x_{0}, ϱ_{4}] \cap D

.

Then, the only solution to the equation

P (x) = 0

in the region

D_{3}

is

w_{0}

.

Proof.

Suppose that

w \in D_{3}

exists, solving the equation

P (x) = 0

and satisfying

w \neq w_{0}

. Define

T_{2} = [w_{0}, w; P]

. It follows from the last condition in

(C_{4})

and (22) that

∥ T^{- 1} (T_{2} - T) ∥ \leq ψ_{2} (∥ w_{0} - x_{0} ∥, ∥ w - x_{0} ∥) \leq ψ_{2} (ϱ_{3}, ϱ_{4}) < 1,

so,

T_{2}^{- 1}

exists. Then, similarly to Proposition 1, from the identity

w_{0} - w = T_{2}^{- 1} (P (w_{0}) - P (w)) = T_{2}^{- 1} (0) = 0,

we conclude

w = w_{0}

. □

Remark 4.

(1)

Under all the conditions of Theorem 2, take

w_{0} - x^{*}

and

ϱ_{3} = α^{*}

in Proposition 2.

(2)

The limit

α^{*}

can be replaced by

p_{0}

in Theorem 2.

4. Numerical Experiments

In this section, we present numerical examples that confirm the local theoretical results and show the results of testing the method on systems of nonlinear equations. The calculations are carried out in GNU Octave 7.3.0. Systems of nonlinear algebraic and transcendental equations arise as a result of applying the difference method for solving boundary value problems or the quadrature method for solving integral equations.

Let us compute the radii of convergence for the Algorithm 1 for different values of the real parameter a and b. For this, consider the following nonlinear equation.

Algorithm 1: The algorithm from method (2) for solving the system of nonlinear equation consists of the following steps:

1. Select the starting approximation

x_{0}

, real parameters a, b and the tolerance

ε

.
2. For

k = 0, 1, 2, \dots

while

∥ x_{k + 1} - x_{k} ∥ \geq ε

or (and)

∥ F (x_{k + 1}) ∥ \geq ε

do:
2.1. calculate

P (x_{k})

;
2.2. calculate

u_{k} = x_{k} + a P (x_{k})

;
2.3. calculate

A_{k} = [u_{k}, x_{k}; P]

;
2.4. calculate

y_{k}^{(1)} = x_{k} - A_{k}^{- 1} P (x_{k})

;
if

m \geq 2

then
2.5. calculate

z_{k} = y_{k}^{(1)} + b P (y_{k}^{(1)})

;
2.6. calculate

s_{k} = A_{k}^{- 1} [y_{k}^{(1)}, z_{k}; P]

and

B (s_{k})

;
2.7. calculate

L_{k} = B (s_{k}) A_{k}^{- 1}

;
2.8. for

j = 2, \dots, m

2.8.1. calculate

P (y_{k}^{(j - 1)})

;
2.8.2. calculate

y_{k}^{(j)} = y_{k}^{(j - 1)} - L_{k} P (y_{k}^{(j - 1)})

;
2.9. set

x_{k + 1} = y_{k}^{(m)}

.

Example 1.

Let

E = R

,

D = V (x^{*}, 0.5)

,

c \in [0, 1)

P (x) = x^{3} - c = 0

and the exact solutions is

x^{*} = c^{1 / 3}

.

Let

T = P^{'} (x^{*})

and

c = 0.9

. Then, for the function

P (x)

we have

P^{'} (x) = 3 x^{2}

and

[x, y; P] = x^{2} + x y + y^{2}

. To define functions

f_{1}

and

f_{2}

we use the following equalities:

u - x^{*} = x - x^{*} + a [x, x^{*}; P] (x - x^{*}) = (1 + a [x, x^{*}; P]) (x - x^{*})

and

z - x^{*} = y - x^{*} + b [y, x^{*}; P] (y - x^{*}) = (1 + b [y, x^{*}; P]) (y - x^{*}),

respectively. For

m = 2

we obtain the following radii:

for

a = b = 0.1

,

s^{*} = min {0.1909, 0.053751} = 0.053751

;

for

a = b = 0.5

,

s^{*} = min {0.088357, 0.030085} = 0.030085

;

for

a = b = 1

,

s^{*} = min {0.048933, 0.018782} = 0.018782

.

Now let us analyze the behavior of the method depending on the choice of function

B (s)

. We considered the following cases:

(a): If $B (s) = I - (s - I) + (s - I) (s - I)$ , then this method was considered in [5];
(b): If $B (s) = I$ , then we have a multi-step Steffensen-type method.

Example 2.

Consider the system of n equations

\sum_{j = 1}^{n} x_{j} + e^{x_{i}} - 1 = 0, i = 1, \dots, n .

Here

E = {I R}^{n}

,

D \subseteq R^{n}

and the exact solution

x^{*} = {(0, \dots, 0)}^{T}

.

Let us choose

n = 10

,

m = 2

, and starting approximation

x_{0} = {(2, \dots, 2)}^{T}

. We use the following stopping criterion for an iterative process

∥ x_{k + 1} - x_{k} ∥ \leq 10^{- 10} .

Here

∥ ∥

denotes the Euclidean norm.

Figure 1 and Figure 2 show the change in the correction’s norm at each iteration. The results are given for different values of parameters a and b.

Figure 1. Example 2: norm of correction at each iteration.

Figure 2. Example 2: norm of correction at each iteration.

The number of iterations of the method (2) differs little for the considered function

B (s)

. However, the following differences can be noted. Method (2)-(a) converges no slower than method (2)-(b); however, the computational complexity of one iteration is higher by

O (n^{3})

operations. Starting from a certain iteration, the correction rate of the method (2)-(a) decreases faster than for the method (2)-(b). In addition, the study of the considered method on different examples showed that it is advisable to choose parameters a and b close to zero. In this case, method (2) converges for a larger number of initial approximations.

Example 3.

Consider the boundary value problem

\{\begin{matrix} y^{″} (t) - y^{'} (t) t g (t) + \frac{2 y^{2} (t)}{sin (t)} = 0, 0 < t < \frac{π}{2}, \\ y (0) = 0, y (π / 2) = 1 . \end{matrix}

One of the exact solutions is

y_{*} (t) = sin (t)

. Denote

υ_{i} \approx y (t_{i})

,

i = 0, \dots, n + 1

, where

t_{i} = i h

and

h = \frac{π}{2 (n + 1)}

. Using the approximation for the first and second-order derivatives

υ_{i}^{″} \approx \frac{υ_{i - 1} - 2 υ_{i} + υ_{i + 1}}{h^{2}}, υ_{i}^{″} \approx \frac{υ_{i + 1} - υ_{i - 1}}{2 h}, i = 1, \dots, n,

we obtain the following system of the nonlinear equations

\begin{matrix} P_{i} (x) & = & - 2 υ_{i} + υ_{i + 1} - \frac{h}{2} υ_{i + 1} t g (t_{i}) + \frac{2 h^{2} υ_{i}^{2}}{sin (t_{i})} = 0, i = 1, \\ P_{i} (x) & = & υ_{i - 1} - 2 υ_{i} + υ_{i + 1} - \frac{h}{2} (υ_{i + 1} - υ_{i - 1}) t g (t_{i}) + \frac{2 h^{2} υ_{i}^{2}}{sin (t_{i})} = 0, i = 2, \dots, n - 1, \\ P_{i} (x) & = & υ_{i + 1} - 2 υ_{i} + 1 - \frac{h}{2} (1 - υ_{i - 1}) t g (t_{i}) + \frac{2 h^{2} υ_{i}^{2}}{sin (t_{i})} = 0, i = n \end{matrix}

with

x = {(υ_{1}, \dots, υ_{n})}^{T}

. Let

a = b = 0.1

,

n = 19

and the initial approximation be

x_{0, i} = sin t_{i} + 0.5

,

i = 1, \dots, n

. We use

∥ F (x_{k}) ∥ \leq 10^{- 10}

as the stopping criteria for this problem. The norm of residuals at each iteration is presented in Table 1.

Table 1. Results for Example 3.

5. Conclusions

A new methodology is developed to deal with methods using derivatives not related to divided differences or derivatives in the proof of convergence.

In this article, using this methodology, we positively address concerns limiting the applicability of methods whose convergence is based on the Taylor series. Other limitations such as the lack of information on the uniqueness of a solution and a priori estimates of

∥ x^{*} - x_{k} ∥

or

∥ x_{k + 1} - x_{k} ∥

are also addressed in this article under weak conditions.

The methodology is demonstrated on Symmetric-Steffensen-type multi-step and derivative-free methods using frozen divided differences. But it can be used on other single- as well as multi-step methods as long as they use linear operators which are invertible.

In our future research we plan to utilize this methodology in other methods [1,2,4,6,7,8,9,10,11,12,13,14,15,16].

Author Contributions

Conceptualization, I.K.A., S.S., S.R., H.Y. and M.I.A.; methodology, I.K.A., S.S., S.R., H.Y. and M.I.A.; software, I.K.A., S.S., S.R., H.Y. and M.I.A.; validation, I.K.A., S.S., S.R., H.Y. and M.I.A.; formal analysis, I.K.A., S.S., S.R., H.Y. and M.I.A.; investigation, I.K.A., S.S., S.R., H.Y. and M.I.A.; resources, I.K.A., S.S., S.R., H.Y. and M.I.A.; data curation, I.K.A., S.S., S.R., H.Y. and M.I.A.; writing—original draft preparation, I.K.A., S.S., S.R., H.Y. and M.I.A.; writing—review and editing, I.K.A., S.S., S.R., H.Y. and M.I.A.; visualization, I.K.A., S.S., S.R., H.Y. and M.I.A.; supervision, I.K.A., S.S., S.R., H.Y. and M.I.A.; project administration, I.K.A., S.S., S.R., H.Y. and M.I.A. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Traub, J.F. Iterative Methods for the Solution of Equations; Prentice-Hall: Upper Saddle River, NJ, USA, 1964. [Google Scholar]
Potra, F.; Pták, V. Nondiscrete Induction and Iterative Processes; Pitman Publishing: Lanham, MD, USA, 1984. [Google Scholar]
Cordero, A.; Torregrosa, J.R. Variants of Newton’s method using fifth-order quadrature formulas. Appl. Math. Comput. 2007, 190, 686–698. [Google Scholar] [CrossRef]
Amat, S.; Ezquerro, J.; Hernández, M. On a Steffensen-like method for solving nonlinear equations. Calcolo 2016, 53, 171–188. [Google Scholar] [CrossRef]
Cordero, A.; Villalba, E.G.; Torregrosa, J.R.; Triguero-Navarro, P. Introducing memory to a family of multi-step multidimensional iterative methods with weight function. Expo. Math. 2023, 42, 398–417. [Google Scholar] [CrossRef]
Alarcón, V.; Amat, S.; Busquier, S.; López, D. Steffensen’s type method in Banach spaces with applications on boundary-value problems. J. Comput. Appl. Math. 2008, 216, 243–250. [Google Scholar] [CrossRef]
Amat, S.; Argyros, I.; Busquier, S.; Hernández-Verón, M.; Magreñán, A.; Martínez, E. A multistep Steffensen-type method for solving nonlinear systems of equations. Math. Methods Appl. Sci. 2020, 43, 7518–7536. [Google Scholar] [CrossRef]
Bhalla, S.; Kumar, S.; Argyros, I.; Behl, R.; Motsa, S. High-order modification of Stefensen’s method for solving system of nonlinear equations. Comput. Appl. Math. 2018, 37, 1913–1940. [Google Scholar] [CrossRef]
Narang, M.; Bhatia, S.; Alshomrani, A.S.; Kanwar, V. General efficient class of Steffensen type methods with memory for solving systems of nonlinear equations. J. Comput. Appl. Math. 2019, 352, 23–39. [Google Scholar] [CrossRef]
Chicharro, F.I.; Cordero, A.; Garrido, N.; Torregrosa, J.R. Stability and applicability of iterative methods with memory. J. Math. Chem. 2019, 57, 1282–1300. [Google Scholar] [CrossRef]
Chun, C.; Neta, B.; Kozdon, J.; Scott, M. Choosing weight functions in iterative methods for simple roots. Appl. Math. Comput. 2014, 227, 788–800. [Google Scholar] [CrossRef]
George, S. On convergence of regularized modified Newton’s method for nonlinear ill-posed problems. J. Inv. Ill-Posed Probl. 2010, 18, 133–146. [Google Scholar] [CrossRef]
George, S.; Nair, M.T. An a posteriory parameter choice for simplified regularization of ill-posed problems. Inter. Equat. Oper. Th. 1993, 16, 392–399. [Google Scholar] [CrossRef]
Kung, H.T.; Traub, J.F. Optimal order of one-point and multi-point iteration. J. Assoc. Comput. Mach. 1974, 21, 643–651. [Google Scholar] [CrossRef]
Argyros, I.K.; Shakhno, S. Extended local convergence for the combined Newton-Kurchatov method under the generalized Lipschitz conditions. Mathematics 2019, 7, 207. [Google Scholar] [CrossRef]
Weeracoon, S.; Ferando, T. A variant of Newton’s method with accelerated third-order convergence. Appl. Math. Lett. 2000, 13, 87–93. [Google Scholar] [CrossRef]

Figure 1. Example 2: norm of correction at each iteration.

Figure 2. Example 2: norm of correction at each iteration.

Table 1. Results for Example 3.

k	Method 2-(a)	Method 2-(b)
1	7.9038 $\times 10^{- 3}$	2.1947 $\times 10^{- 2}$
2	1.9599 $\times 10^{- 5}$	1.1784 $\times 10^{- 4}$
3	4.5432 $\times 10^{- 13}$	2.0348 $\times 10^{- 8}$
4		3.0205 $\times 10^{- 16}$

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Symmetric-Type Multi-Step Difference Methods for Solving Nonlinear Equations

Abstract

1. Introduction

2. Convergence 1: Local

3. Convergence 2: Semi-Local

4. Numerical Experiments

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

References

Article Metrics

Citations

Article Access Statistics