Stability Analysis of Jacobian-Free Newton’s Iterative Method

Amiri, Abdolreza; Cordero, Alicia; Darvishi, Mohammad Taghi; Torregrosa, Juan R.

doi:10.3390/a12110236

Open AccessArticle

Stability Analysis of Jacobian-Free Newton’s Iterative Method

¹

Department of Mathematics, Faculty of Science, Razi University, 67149 Kermanshah, Iran

²

Institute for Multidisciplinary Mathematics, Universitat Politècnica de València, 46022 València, Spain

^*

Author to whom correspondence should be addressed.

Algorithms 2019, 12(11), 236; https://doi.org/10.3390/a12110236

Submission received: 3 October 2019 / Revised: 30 October 2019 / Accepted: 2 November 2019 / Published: 6 November 2019

Download

Browse Figures

Versions Notes

Abstract

It is well known that scalar iterative methods with derivatives are highly more stable than their derivative-free partners, understanding the term stability as a measure of the wideness of the set of converging initial estimations. In multivariate case, multidimensional dynamical analysis allows us to afford this task and it is made on different Jacobian-free variants of Newton’s method, whose estimations of the Jacobian matrix have increasing order. The respective basins of attraction and the number of fixed and critical points give us valuable information in this sense.

Keywords:

nonlinear system of equations; iterative method; Jacobian-free scheme; basin of attraction

1. Introduction

Let

F (x) = 0

,

F : D \subseteq R^{n} ⟶ R^{n}

, be a system of nonlinear equations. Usually, this kind of problems can not be solved analytically and the approach to a solution is made by means of iterative techniques. The best known iterative algorithm is Newton’s method, with second order of convergence and iterative expression

\begin{matrix} x^{(k + 1)} = x^{(k)} - {[F^{'} (x^{(k)})]}^{- 1} F (x^{(k)}), k = 0, 1, \dots \end{matrix}

(1)

from an estimation

x^{(0)} \in D

. This iterative scheme needs the evaluation of the nonlinear function and its associate Jacobian matrix at each iteration. However, sometimes, the size of the system or the specific properties of the problem do not allow to evaluate the Jacobian matrix

F^{'} (x)

, or even its calculation at each iteration (for example, if F is an error function); in these cases, some approximations of the Jacobian matrix can be used. The most usual one is a divided difference matrix, that is, a linear operator

[x^{(k)}, y^{(k)}; F]

satisfying condition (see [1,2])

\begin{matrix} [x, y; F] (x - y) = F (x) - F (y) . \end{matrix}

(2)

In this case, when

F^{'} (x^{(k)})

in Newton’s method is replaced by

[x^{(k)}, y^{(k)}; F]

, where

y = x^{(k)} + F (x^{(k)})

, we obtain the so-called Steffensen’s method [1], also with order of convergence two. To compute in practice the elements of the divided difference operator, the following first-order divided difference operator

\begin{matrix} {[x^{(k)}, y^{(k)}; F]}_{i, j}^{1} = \frac{F_{i} (y_{1}, y_{2}, \dots, y_{j - 1}, y_{j}, x_{j + 1}, \dots, x_{n}) - F_{i} (y_{1}, y_{2}, \dots, y_{j - 1}, x_{j}, x_{j + 1}, \dots, x_{n})}{y_{j} - x_{j}}, \end{matrix}

(3)

or the symmetric second-order one,

\begin{matrix} {[x^{(k)}, y^{(k)}; F]}_{i, j}^{2} = (F_{i} (y_{1}, y_{2}, \dots, y_{j - 1}, y_{j}, x_{j + 1}, \dots, x_{n}) - F_{i} (y_{1}, y_{2}, \dots, y_{j - 1}, x_{j}, x_{j + 1}, \dots, x_{n}) \\ + F_{i} (x_{1}, x_{2}, \dots, x_{j - 1}, x_{j}, y_{j + 1}, \dots, y_{n}) - F_{i} (x_{1}, x_{2}, \dots, x_{j - 1}, y_{j}, y_{j + 1}, \dots, y_{n})) / (2 (y_{j} - x_{j})), \end{matrix}

(4)

are proposed in [3].

Let us remark that operator (4) is symmetric and it can be used to evaluate the divided difference even when the problem is nonsymmetric. However, the number of evaluations of the scalar functions in the computation of (4) is higher than those in (3). Moreover, when divided difference (3) is used as an approximation for the Jacobian matrices appearing in any iterative method, then usually the iterative procedure does not preserve its order of convergence.

The authors in [4] proposed to replace

y = x + F (x)

in (2) by

y = x + Γ (x)

, being

Γ (x) = {(f_{1} {(x)}^{m}, f_{2} {(x)}^{m}, \dots, f_{n} {(x)}^{m})}^{T}

, with

m \in N

. Then, divided difference

\begin{matrix} [x, x + Γ (x); F] Γ (x) = F (x) - F (x + Γ (x)), \end{matrix}

(5)

becomes an approximation of

F^{'} (x)

of order m. It was also shown that, by choosing a suitable value of m, the order of convergence of any iterative method can be preserved with a reduced computational cost. So, the Jacobian-free variants of Newton’s scheme that we analyze hold the second order of convergence of the original method.

Our aim is to see if, further on the order of convergence of the method, the use of different divided differences to replace the Jacobian matrix in the iterative expression of Newton’s method, can affect the dependence on initial estimations of the modified scheme to converge.

By using Taylor expansion of the divided difference operator (5), authors in [4] proved the following results.

Theorem 1.

(See [4]) Let F be a nonlinear operator

F : D \subseteq R^{n} ⟶ R^{n}

with coordinate functions

f_{i}

,

i = 1, 2, \dots, n

and

m \in N

such that

m \geq 1

. Let us consider the divided difference operator

[x + Γ (x), x; F]

, where

Γ (x) = {(f_{1} {(x)}^{m}, f_{2} {(x)}^{m}, \dots, f_{n} {(x)}^{m})}^{T}

, then the order of the divided difference

[x + Γ (x), x; F]

as an approximation of the Jacobian matrix

F^{'} (x)

is m.

Corollary 1.

(See [4]) Under the same assumptions as in Theorem 1, the order of the central divided difference operator

\begin{matrix} [x + Γ (x), x - Γ (x); F] \end{matrix}

(6)

is

2 m

as an approximation of

F^{'} (x)

, being

Γ (x) = {(f_{1} {(x)}^{m}, f_{2} {(x)}^{m}, \dots, f_{n} {(x)}^{m})}^{T}

.

Based on these results, in [4] it was presented a new technique to transform iterative schemes for solving nonlinear systems into Jacobian-free ones, preserving the order of convergence in all cases. The key fact of this new approach is the mth power of the coordinate functions of

F (x)

, that needs different values depending on the order of convergence of the first step of the iterative method. This general procedure was checked, both theoretical and numerically, showing the preservation of the order of convergence and very precise results when the appropriate values of m were employed.

Also the authors in [5] showed that the order of the approximation of

F^{'} (x)

might be improved (in terms of efficiency) by means of the Richardson extrapolation. It can be seen in the following result.

Lemma 1.

(See [5]) Divided difference

\begin{matrix} R [x, x + Γ (x); F] : = \frac{1}{3} (2^{2} [x + \frac{1}{2} Γ (x), x - \frac{1}{2} Γ (x); F] - [x + Γ (x), x - Γ (x); F]), \end{matrix}

(7)

which is obtained by Richardson extrapolation of (6) is an approximation of order

4 m

of

F^{'} (x)

.

Although the design and convergence analysis of iterative methods for solving nonlinear problems is a successful area of research in the last decades, it has been recently that the study of their stability has become usual (see, for example, [6,7,8,9,10]). So, when a method is presented, not only its order of convergence and efficiency are important, but also its dependance on the initial estimations used to converge. This is known as the stability analysis of the iterative scheme.

The study of the stability of an iterative procedure has been mostly made by using techniques from complex discrete dynamics, that are very useful in the scalar case. Nevertheless, it frequently does not provide enough information when systems of nonlinear equations must be solved. This is the reason why the authors in [11] applied by first time real multidimensional discrete dynamics in order to analyze the performance of vectorial iterative methods on polynomial systems. In this way, it was possible to conclude about their stability properties: their dependence on the initial estimation used and the simplicity or complexity of the sets of convergent initial guesses (known as Fatou set) and their boundaries (Julia set). These procedure have been employed in the last years to analyze new and existing vectorial iterative schemes, see for instance [5,12,13,14]. We are going to use these techniques in this paper to the vectorial rational functions obtained by applying different Jacobian-free variants of Newton’s method on several low-degree polynomial systems. These vectorial rational functions are called also multidimensional fixed point functions in the literature.

The polynomial systems used in this study are defined by the nonlinear functions:

\begin{matrix} q (x) = \{\begin{matrix} q_{1} (x) = x_{1}^{2} - 1 \\ q_{2} (x) = x_{2}^{2} - 1 \end{matrix}, r (x) = \{\begin{matrix} r_{1} (x) = x_{1}^{6} - 1 \\ r_{2} (x) = x_{2}^{6} - 1 \end{matrix}, p (x) = \{\begin{matrix} p_{1} (x) = x_{1}^{2} - x_{2} - 1 \\ p_{2} (x) = x_{2}^{2} - x_{1} - 1 \end{matrix} . \end{matrix}

By using uncoupled systems as

p (x) = 0

and

r (x) = 0

and coupled as

p (x) = 0

, we can generalize the performance of the proposed methods to another nonlinear systems. Moreover, let us remark that these results can be obtained by using similar systems with size

n > 2

but we analyze the case

n = 2

in order to use two-dimensional plots to visualize the analytical findings.

In the next section, the dynamical behavior of the fixed point functions of Jacobian-free versions of Newton’s method applied on

q (x)

,

r (x)

and

p (x)

are studied when forward, central and Richardson extrapolation-type of divided differences are used. To get this aim, some dynamical concepts must be introduced.

Definition 1.

(See [11]) Let

G : R^{n} ⟶ R^{n}

be a vectorial function. The orbit of the point

x^{(0)} \in R^{n}

is defined as the set of successive images of

x^{(0)}

by the vectorial function,

{x^{(0)}, G (x^{(0)}), \dots, G^{m} (x^{(0)}), \dots}

.

The dynamical behavior of the orbit of a point of

R^{n}

can be classified depending on its asymptotic behavior. In this way, a point

x^{*} \in R^{n}

is a fixed point of G if

G (x^{*}) = x^{*}

.

The following results are well known results in discrete dynamics and in this paper we use them to study the stability of nonlinear operators.

Theorem 2.

(See [15]) Let

G : R^{n} \to R^{n}

be

C^{2}

. Assume that

x^{*}

is a k-periodic point,

k \geq 1

. Let

λ_{1}, λ_{2}, \dots, λ_{n}

be the eigenvalues of

G^{'} (x^{*})

.

(a): If all eigenvalues $λ_{j}$ have $| λ_{j} | < 1$ , then $x^{*}$ is attracting.
(b): If one eigenvalue $| λ_{j_{0}} | > 1$ , $j_{0} \in {1, 2, \dots, n}$ then $x^{*}$ is unstable, that is, a repelling or a saddle point.
(c): If all eigenvalues $λ_{j}$ have $| λ_{j} | > 1$ , then $x^{*}$ is repelling.

Also, a fixed point is called hyperbolic if for all eigenvalues

λ_{j}

of

G^{'} (x^{*})

, we have

| λ_{j} | \neq 1

. If there exists an eigenvalue such that

| λ_{j} | < 1

and an eigenvalue that

| λ_{i} | > 1

the hyperbolic point is called saddle point.

Let us note that, the entries of

G^{'} (x^{*})

are the partial derivatives of each coordinate function of the vectorial rational operator that defines the iterative scheme. When the calculation of spectrum of

G^{'} (x^{*})

is difficult the following result which is consistent with the previous theorem, can be used.

Proposition 1.

(See [11]) Let

x^{*}

be a fixed point of G then,

(a): If $∣ \frac{\partial g_{i} (x^{*})}{\partial x_{j}} ∣ < \frac{1}{n}$ for all $i, j \in {1, \dots, n}$ , then $x^{*} \in R^{n}$ is attracting.
(b): If $∣ \frac{\partial g_{i} (x^{*})}{\partial x_{j}} ∣ = 0$ for all $i, j \in {1, \dots, n}$ , then $x^{*} \in R^{n}$ is superattracting.
(c): If $∣ \frac{\partial g_{i} (x^{*})}{\partial x_{j}} ∣ > \frac{1}{n}$ for all $i, j \in {1, \dots, n}$ , then $x^{*} \in R^{n}$ $x^{*}$ is unstable and lies at the Julia set.

In this paper, we only use Theorem 2 to investigate the stability of the fixed points. Let us consider an iterative method for finding the roots of a nonlinear systems

F (x) = 0

. This generates a multidimensional fixed point operator

G (x)

. A fixed point

x^{*}

of

G (x)

is called a strange fixed point if it is not a root of the nonlinear function

F (x)

. The basin of attraction of

x^{*}

(which may be a root of

F (x)

or a strange fixed point) is the set of pre-images of any order such that

\begin{matrix} A (x^{*}) = {x^{(0)} \in R^{n} : G^{m} (x^{(0)}) \to x^{*}, m \to \infty} . \end{matrix}

Definition 2.

A point

x \in R^{n}

is a critical point of

G (x)

if the eigenvalues

λ_{j}

of

G^{'} (x)

are null for all

j = 1, 2, \dots, n

.

The critical points play an important role in this study since a classical result of Julia and Fatou establishes that, in the connected component of any basin of attraction including an attracting fixed point, there is always at least a critical point.

As it is obvious, a superattracting fixed point is also a critical point. A critical point that is not a root of function

F (x)

is called free critical point.

The motivation of this work is to analyze the stability of the Jacobian-free variants of Newton’s method for the most simple nonlinear equations. Certainly, it is known that, in general, divided differences are less stable than Jacobian matrices, but we study how the increasing of the precision in the estimation of the Jacobian matrix affects to the stability of the methods and in the wideness of the basins of attraction of the roots.

2. Jacobian-Free Variants of Newton’s Method

In this section, we study the dynamical properties of Jacobian-free Newton’s method when different divided differences are used. To get this purpose we analyze the dynamical concepts on polynomial systems

q (x) = 0

,

r (x) = 0

and

p (x) = 0

. The dynamical concepts on two dimensional systems can be extended to an n-dimensional case (see [11] to notice how the dynamics of a multidimensional iterative method can be analyzed), so for visualizing graphically the analytical results we investigate the two-dimensional case. From now on, the modified Newton’s scheme which results from replacing forward divided difference (5) instead of Jacobian matrix in Newton’s method, is denoted by FMN_m, for

m = 1, 2, \dots

. In a similar way, when central divided difference (6) is used to replace Jacobian matrix in Newton’s procedure, the resulting modified schemes are denoted by CMN_m, for

m = 1, 2, \dots

. Also, the modified Newton’s method obtained by using divided difference (7) is denoted by RMN_m, for

m = 1, 2, \dots

.

Let us remark that Newton’s method has quadratic convergence and, by using the mentioned approximations of the Jacobian matrix, this order is preserved, even in case

m = 1

(in this case, the scheme is known as Steffensen’s method).

We use proposed families FMN_m, CMN_m and RMN_m on the polynomial systems

q (x) = 0

,

r (x) = 0

and

p (x) = 0

. In the following sections, the coordinate functions of the different classes of iterative methods, joint with their fixed and critical points are summarized.

2.1. Second-Degree Polynomial System $q (x) = 0$

Proposition 2.

The coordinate functions of the fixed point operator

λ^{1, m} (x)

associated to FMN_m for

m = 1, 2, \dots

on polynomial system

q (x) = 0

are

\begin{matrix} λ_{j}^{1, m} (x) = x_{j} - \frac{q_{j} (x)}{2 x_{j} + q_{j} {(x)}^{m}}, j = 1, 2 . \end{matrix}

Moreover,

(a): For $m = 1, 2, \dots$ , the only fixed points are the roots of $q (x)$ , $(- 1, - 1)$ , $(- 1, 1)$ , $(1, - 1)$ and $(1, 1)$ , that are also superattracting. There is no strange fixed point in this case.
(b): The components of free critical points $c_{λ^{1, m}}^{n} = (k, l)$ are roots of $2 q_{j} (x) + (2 + 2 m) x_{j} q_{j} {(x)}^{m} + q_{j} {(x)}^{(2 m)} = 0$ , provided that k and l are not equal to 1 and $- 1$ , simultaneously.

Remark 1.

Except for

m = 1

, that is a case with 12 free critical points, for

m > 1

there exist 32 free critical points of the fixed point operator associated to FMN_m. In particular, free critical points for the fixed point function

λ^{1, m} (x)

,

m = 1, 2, \dots, 6

are

\begin{matrix} (k, l), k, l \in {- 3.73205, - 0.267949, - 1, 1}, for m = 1, \\ (k, l), k, l \in {- 2.11724, - 1.13867, 0.191305, 0.760289, 1, - 1}, for m = 2, \\ (k, l), k, l \in {- 1.85429, - 1.2072, - 0.614069, - 0.14308, 1, - 1}, for m = 3, \\ (k, l), k, l \in {- 1.74245, - 1.24306, 0.112834, 0.541487, - 1, 1}, for m = 4, \\ (k, l), k, l \in {- 1.67929, - 1.26615, - 0.496325, - 0.0927087, 1, - 1}, for m = 5, \\ (k, l), k, l \in {- 1.63822, - 1.28266, 0.0785163, 0.464271, 1, - 1}, for m = 6, \end{matrix}

provided that k and l are not equal to 1 and

- 1

, simultaneously.

Figure 1 shows the dynamical behavior of fixed point function

λ^{1, m} (x)

for

m = 1, 2, \dots, 6

. These figures have been obtained by the routines described in [16]. To draw them, a mesh of

400 \times 400

points has been used, 200 was the maximum number of iterations involved and

10^{- 3}

the tolerance used as stopping criterium. In this paper, we have used a white star to show the roots of the nonlinear polynomial system and a white square for the free critical points. Figure 1 shows that, as greater m is, the wideness of basins of attraction decreases, in spite of having a better approximation of the Jacobian matrix. The color assigned to each of the basin of attraction corresponds to one of the roots of the nonlinear system. The black area shows the no convergence in the maximum number of iterations, or divergence.

To see the behavior of the vectorial function in the black area of dynamical planes, we visualize the orbit of the rational function corresponding to the starting point

(- 3, - 3)

after 200 iterations. This orbit appears as a yellow circle per each iterate and yellow lines between each pair of consecutive iterates. In Figure 1a, corresponding to

m = 1

, its value in the 200th iteration is

(- 213.0141, - 213.0141)

. Figure 1b, which corresponds to

m = 2

, shows lower rate of divergence (or convergence to infinity), being its value at the 200th iteration

(- 8.6818, - 8.6818)

. This effect is higher by increasing m but, for

m \geq 3

it is observed that

\frac{∥ λ^{1, m} (x^{(0)}) - λ^{1, m} (x^{(200)}) ∥}{∥ λ^{1, m + 1} (x^{(0)}) - λ^{1, m + 1} (x^{(200)}) ∥} \approx 1 .

The vectors on the top of Figure 1c–f, corresponds to the last iterate with the starting point

x^{(0)} = (- 3, - 3)

.

Proposition 3.

The coordinates of the fixed point operator

λ^{2, m} (x)

associated to CMN_m, for

m = 1, 2, \dots

, and RMN_m, for

m = 1, 2, \dots

, on polynomial system

q (x) = 0

are

\begin{matrix} λ_{j}^{2} (x) = \frac{1 + x_{j}^{2}}{2 x_{j}}, j = 1, 2, \end{matrix}

which are the same components of the fixed point function of Newton’s method on

q (x)

.

Since

q (x)

is a 2nd-degree polynomial system, the approximations of order equal or higher than 2 of Jacobian matrix are exact and the fixed point function of the CMN_m and RMN_m methods coincide with that of Newton’s method. In this case,

(- 1, - 1)

,

(- 1, 1)

,

(1, - 1)

and

(1, 1)

are superattracting fixed points and there are no strange fixed points or free critical points. Figure 2 shows the resulting dynamical plane, that coincides with that of Newton’s method.

2.2. Sixth-Degree Polynomial System $r (x) = 0$

The corresponding results about FMN_m, CMN_m and RMN_m classes applied on sixth-degree polynomial system

r (x) = 0

are summarized in the following propositions. They resume, in each case, the iteration function, the fixed and critical points.

Proposition 4.

The coordinate functions of the fixed point operator

h^{1, m} (x)

associated to FMN_m on

r (x)

, for

m = 1, 2, \dots

, are

\begin{matrix} h_{j}^{1, m} (x) = x_{j} - \frac{r_{j} {(x)}^{1 + m}}{- x_{j}^{6} + {(x_{j} + r_{j} {(x)}^{m})}^{6}}, j = 1, 2 . \end{matrix}

Moreover,

(a): For $m \geq 1$ the fixed points are $(- 1, - 1)$ , $(- 1, 1)$ , $(1, - 1)$ and $(1, 1)$ that are also superattracting. There are no strange fixed points in this case.
(b): The coordinates of free critical points $c_{h^{1, m}}^{n} = (k, l)$ are the roots of the polynomial

$\begin{matrix} {(x_{j}^{6} - {(x_{j} + r_{j} {(x)}^{m})}^{6})}^{2} & + & r_{j} {(x)}^{1 + m} (- 6 x_{j}^{5} + 6 (1 + 6 m x_{j}^{5} r_{j} {(x)}^{- 1 + m}) {(x_{j} + r_{j} {(x)}^{m})}^{5}) \\ + & 6 (1 + m) x_{j}^{5} r_{j} {(x)}^{m} (x_{j}^{6} - {(x_{j} + r_{j} {(x)}^{m})}^{6}), \end{matrix}$

for $j = 1, 2$ and $m = 1, 2, \dots$ , provided that k and l are not equal to $- 1$ and 1 simultaneously.

Remark 2.

Except for

m = 1

, in which case that the fixed point function

h^{1, m} (x)

has 12 free critical points, the multidimensional rational function FMN_m, for

m = 2, 3, \dots

, has 32 free critical points. The free critical points in this case are

\begin{matrix} (k, l), k, l \in {- 1.3191, - 0.290716, - 1, 1} for m = 1, \\ (k, l), k, l \in {- 1.20592, - 1.01655, 0.289432, 0.978576, - 1, 1} for m = 2, \\ (k, l), k, l \in {- 1.17593, - 1.03983, - 0.934077, - 0.288186, - 1, 1} for m = 3, \\ (k, l), k, l \in {- 1.16205, - 1.0543, 0.286976, 0.897504, - 1, 1} for m = 4, \\ (k, l), k, l \in {- 1.15401, - 1.064, - 0.868592, - 0.285801, - 1, 1} for m = 5, \\ (k, l), k, l \in {- 1.14876, - 1.07101, 0.284659, 0.845166, - 1, 1} for m = 6, \end{matrix}

provided that k and l are not equal to

- 1

and 1 simultaneously.

Figure 3 shows the dynamical planes of the fixed point function

h^{1, m} (x)

for

m = 1, 2, \dots, 6

. Inside the black areas, there are regions where the orbits tend toward the poles of the rational function (those points that make the denominator of the elements of the rational function null). In fact, the orbits of points in these areas reach very close to the roots, so the vectorial rational function at these points become numerically singular. Figure 3a,b show two points in these black areas.

Figure 4 and Figure 5 show details of the dynamical planes of FMN₂ and FMN₃ methods. These figures show the difference between the basins of attraction of an odd member with an even member of the family FMN_m. Because of the symmetry, the transpose of the basin of attraction of

(1, - 1)

coincide with that of

(- 1, 1)

, so we only depicted one of them.

The analysis for the cases of central divided differences is made in the following result.

Proposition 5.

The coordinate functions of the fixed point operator

h^{2, m} (x)

associated to CMN_m, for

m = 1, 2, \dots

, on

r (x)

are

\begin{matrix} h_{j}^{2, m} (x) = x_{j} - \frac{r_{j} (x)}{6 x_{j}^{5} + 20 x_{j}^{3} r_{j} {(x)}^{2 m} + 6 x_{j} r_{j} {(x)}^{4 m}}, j = 1, 2 . \end{matrix}

Moreover,

(a): The fixed points are $(- 1, - 1)$ , $(1, - 1)$ , $(- 1, 1)$ and $(1, 1)$ that are also superattracting and there are not strange fixed points.
(b): The components of free critical points $c_{h^{2, m}}^{n} = (k, l)$ in this case are the roots of the polynomial $P_{h^{2, m}}^{j} (x)$ , provided that $Q_{h^{2, m}}^{j} \neq 0$ , being

$\begin{matrix} P_{h^{2, m}}^{j} (x) & = & 15 x_{j}^{10} + 30 (3 + 4 m) x_{j}^{8} r_{j} {(x)}^{2 m} - 3 r_{j} {(x)}^{4 m} + (221 + 72 m) x_{j}^{6} r_{j} {(x)}^{4 m} \\ + 6 x_{j}^{2} r_{j} {(x)}^{2 m} (- 5 + 3 r_{j} {(x)}^{6 m}) + 15 x_{j}^{4} (- 1 + 8 r_{j} {(x)}^{6 m}), j = 1, 2, \\ Q_{h^{2, m}}^{j} & = & 2 {(3 x_{j}^{5} + 10 x_{j}^{3} r_{j} {(x)}^{2 m} + 3 x_{j} r_{j} {(x)}^{4 m})}^{2}, j = 1, 2, \end{matrix}$

where k and l are not simultaneously equal to 1 and $- 1$ .

Remark 3.

The fixed point rational function

h^{2, m} (x)

, for all m, has 32 free critical points. The free critical points in this case are

\begin{matrix} (k, l), k, l \in {- 0.424661, 0.424661, - 0.984594, 0.984594, 1, - 1} for m = 1 . \\ (k, l), k, l \in {- 0.417373, 0.417373, - 0.913322, 0.913322, 1, - 1} for m = 2 . \\ (k, l), k, l \in {- 0.410772, 0.410772, - 0.864585, 0.864585, 1, - 1} for m = 3 . \\ (k, l), k, l \in {- 0.404767, 0.404767, - 0.830398, 0.830398, - 1, 1} for m = 4 . \\ (k, l), k, l \in {- 0.399279, 0.399279, - 0.804535, 0.804535, 1, - 1} for m = 5 . \\ (k, l), k, l \in {- 0.394237, 0.394237, - 0.783908, 0.783908, 1, - 1} for m = 6 . \end{matrix}

provided that k and l are not equal to 1 and

- 1

simultaneously.

Central divided differences show very stable behavior. All the free critical points are inside the basins of attraction of the roots. This means that it is not possible other performance than convergence to the roots (or divergence). In fact, the basins of attraction of the roots in this case are much greater than those in the forward divided difference. Black areas near to the boundaries of the basins of attraction are regions of slow convergence or divergence. Figure 6a,b shows the slow convergence of the point

(- 1.5, 1.5)

towards (−1,1); let us observe that, as for

m = 1

point

(- 1.5, 1.5)

is closer to the boundary so its speed is higher than that for

m = 2

. The behavior of function

h^{2, m} (x)

in black area among the basins of attraction and near the axis x and y are similar for

m = 1, 2, \dots

. In fact by choosing points in this area the iterative function is slowly divergent (see Figure 6c,d).

Let us remark that, although this scheme is quite stable, the basins of attraction of the roots are much smaller than those of Newton’s method on

r (x)

, where there are not black regions.

Finally, the following result gives us information about the stability of the sixth-degree system when higher-order estimations of the Jacobian are made.

Proposition 6.

The coordinate functions of the fixed point operator

h^{3, m} (x)

associated to RMN_m on

r (x)

, for

m = 1, 2, \dots

, are

\begin{matrix} h_{j}^{3, m} (x) = x_{j} - \frac{2 r_{j} (x)}{12 x_{j}^{5} - 3 x_{j} r_{j} {(x)}^{4 m}}, j = 1, 2 . \end{matrix}

(a): For $m = 1, 2, \dots$ , the fixed points are $(- 1, - 1)$ , $(- 1, 1)$ , $(1, - 1)$ and $(1, 1)$ that are also superattracting and there are not strange fixed points in this case.
(b): The components of free critical points $c_{h^{3, m}}^{n} = (k, l)$ are the roots of polynomial

$\begin{matrix} P_{h^{3, m}}^{j} (x) = - 40 x_{j}^{4} + 40 x_{j}^{10} + 2 r_{j} {(x)}^{4 m} - 2 (7 + 24 m) x_{j}^{6} r_{j} {(x)}^{4 m} + 3 x_{j}^{2} r_{j} {(x)}^{8 m}, \end{matrix}$

for $j = 1, 2$ and $m = 1, 2, \dots$ , provided that $- 4 x_{j}^{5} + x_{j} r_{j} {(x)}^{4 m} \neq 0$ and k and l are not simultaneously equal to 1 and $- 1$ .

Remark 4.

The fixed point function

h^{3, m} (x)

on

r (x)

for all m has 60 free critical points. The free critical points, for

m = 1, 2, \dots, 6

, are

\begin{matrix} (k, l), k, l \in {- 0.467908, 0.467908, - 1.1047, 1.1047, - 1.23916, 1.23916, - 1, 1} for m = 1, \\ (k, l), k, l \in {- 0.446836, 0.446836, - 1.10721, 1.10721, - 1.18013, 1.18013, - 1, 1} for m = 2, \\ (k, l), k, l \in {- 0.431784, 0.431784, - 1.16201, 1.16201, - 1.10964, 1.10964, - 1, 1} for m = 3, \\ (k, l), k, l \in {- 0.420071, 0.420071, - 1.15298, 1.15298, - 1.11137, 1.11137, - 1, 1} for m = 4, \\ (k, l), k, l \in {- 0.41049, 0.41049, - 1.1475, 1.1475, - 1.11265, 1.11265, - 1, 1} for m = 5, \\ (k, l), k, l \in {- 0.402394, 0.402394, - 1.14379, 1.14379, - 1.11364, 1.11364, - 1, 1} for m = 6, \end{matrix}

provided that k and l are not equal to 1 and

- 1

, simultaneously.

In this case, we obtain approximations of order

4, 8, 12, \dots

for

m = 1, 2, 3, \dots

, respectively, but nevertheless their basins of attraction are smaller than those of central divided difference (and of course of Newton’s method). Figure 7 shows the dynamical planes for

m = 1, 2, 3, 4

. As in the previous cases, in the black area the convergence is very slow (or it diverges).

2.3. Second-Degree Polynomial System $p (x) = 0$

The results about FMN_m, CMN_m and RMN_m classes applied on second-degree polynomial system

p (x) = 0

are stated in the following propositions. They resume, in each case, the iteration function, the fixed and critical points.

Proposition 7.

The coordinate functions of the fixed point operator

k^{1, m} (x)

associated to FMN_m on

p (x)

, for

m = 1, 2, \dots

, are

\begin{matrix} k_{1}^{1, m} (x) = x_{1} - \frac{- 1 - x_{1} + x_{2}^{2}}{- 1 + \frac{(- x_{1}^{2} + {(x_{1} + {(1 - x_{1}^{2} + x_{2})}^{m})}^{2}) (- x_{2}^{2} + {(x_{2} + {(1 + x_{1} - x_{2}^{2})}^{m})}^{2})}{{(1 - x_{1}^{2} + x_{2})}^{m} {(1 + x_{1} - x_{2}^{2})}^{m}}} \\ - \frac{(- 1 + x_{1}^{2} - x_{2}) (- x_{2}^{2} + {(x_{2} + {(1 + x_{1} - x_{2}^{2})}^{m})}^{2})}{{(1 + x_{1} - x_{2}^{2})}^{m} (- 1 + \frac{(- x_{1}^{2} + {(x_{1} + {(1 - x_{1}^{2} + x_{2})}^{m})}^{2}) (- x_{2}^{2} + {(x_{2} + {(1 + x_{1} - x_{2}^{2})}^{m})}^{2})}{{(1 - x_{1}^{2} + x_{2})}^{m} {(1 + x_{1} - x_{2}^{2})}^{m}})}, m = 1, 2, \dots \\ k_{2}^{1, m} (x) = x_{2} - \frac{- 1 + x_{1}^{2} - x_{2}}{- 1 + \frac{(- x_{1}^{2} + {(x_{1} + {(1 - x_{1}^{2} + x_{2})}^{m})}^{2}) (- x_{2}^{2} + {(x_{2} + {(1 + x_{1} - x_{2}^{2})}^{m})}^{2})}{{(1 - x_{1}^{2} + x_{2})}^{m} {(1 + x_{1} - x_{2}^{2})}^{m}}} \\ - \frac{(- 1 - x_{1} + x_{2}^{2}) (- x_{1}^{2} + {(x_{1} + {(1 - x_{1}^{2} + x_{2})}^{m})}^{2})}{{(1 - x_{1}^{2} + x_{2})}^{m} (- 1 + \frac{(- x_{1}^{2} + {(x_{1} + {(1 - x_{1}^{2} + x_{2})}^{m})}^{2}) (- x_{2}^{2} + {(x_{2} + {(1 + x_{1} - x_{2}^{2})}^{m})}^{2})}{{(1 - x_{1}^{2} + x_{2})}^{m} {(1 + x_{1} - x_{2}^{2})}^{m}})}, m = 1, 2, \dots . \end{matrix}

Moreover,

(a): For $m \geq 1$ the only fixed points are the roots $(- 1, 0)$ , $(0, - 1)$ , $(\frac{1}{2} (1 - \sqrt{5}), \frac{1}{2} (1 - \sqrt{5}))$ and $(\frac{1}{2} (1 + \sqrt{5}), \frac{1}{2} (1 + \sqrt{5}))$ , that are superattracting.
(b): For $m = 1$ , the free critical points are $(- 0.632026, - 0.460233)$ , $(- 0.460233, - 0.632026)$ , $(- 0.203287, 0.294659)$ and $(0.294659, - 0.203287)$ .

The calculation of free critical points of

k^{1, m} (x)

for

m > 1

is very complicated hence, so it has been provided only for

m = 1

. Figure 8 shows the dynamical planes of the fixed point function

k^{1, m} (x)

for

m = 1, 2, 3, 4

. Similar results to polynomial system

q (x) = 0

and

r (x) = 0

have been obtained in this case, as by increasing m, the wideness of the basins of attraction decreases. To study the behavior of the vectorial function in the black area of dynamical planes, we visualize the diverging orbits of

k^{1, m} (x)

with the starting point

(0, - 3)

, after 200 iterations. The vectors on the top of the Figure 8a–d, show the last vectorial iterate with the starting point

(0, - 3)

.

The following proposition shows, that using CMN_m, for

m = 1, 2, \dots

, and RMN_m, for

m = 1, 2, \dots

, on polynomial system

p (x) = 0

, we obtain, as was the case for the system

q (x) = 0

, the same fixed point function as the classical Newton’s method.

Proposition 8.

The coordinates of the fixed point operator

k^{2, m} (x)

(which is denoted by

k^{2} (x)

) associated to CMN_m for

m = 1, 2, \dots

and RMN_m for

m = 1, 2, \dots

on polynomial system

p (x)

are

\begin{matrix} k_{1}^{2} (x) = \frac{1 + 2 (1 + x_{1}^{2}) x_{2} + x_{2}^{2}}{- 1 + 4 x_{1} 4 x_{2}}, \\ k_{2}^{2} (x) = \frac{1 + x_{1}^{2} + 2 x_{1} (1 + x_{2}^{2})}{- 1 + 4 x_{1} 4 x_{2}}, \end{matrix}

which are the same components of the fixed point function of Newton’s method on

p (x)

. Moreover, the fixed points of

k^{2} (x)

are the roots

(- 1, 0)

,

(0, - 1)

,

(\frac{1}{2} (1 - \sqrt{5}), \frac{1}{2} (1 - \sqrt{5}))

and

(\frac{1}{2} (1 + \sqrt{5}), \frac{1}{2} (1 + \sqrt{5}))

that are superattracting. There are no strange fixed points nor free critical points in this case.

Figure 9 shows the dynamical plane of the fixed point function

k^{2} (x)

, corresponding to methods CMN_m and RMN_m for any

m = 1, 2, \dots

as well as their original partner, Newton’s scheme.

3. Conclusions

In this paper, several new Jacobian-free Newton’s method have been introduced, by using forward, central and Richardson divided differences based on an element-by-element power of the nonlinear function

F (x)

,

G (x) = {(f_{1} {(x)}^{m}, f_{2} {(x)}^{m}, \dots, f_{n} {(x)}^{m})}^{T}

. As far as we know, these Jacobian-free variants of Newton’s method have not been analyzed until now. We conclude that better estimations do not always involve greater stability. In fact, the best scheme in terms of numerical efficiency and wideness of the sets of converging initial points, is CMN_m. This central differences method does not need to calculate and evaluate the Jacobian matrix as Newton’s method and provides similar basins of attraction. Although Richardson’s method reaches good results of convergence, has a computational cost that discourages its use.

Author Contributions

The contribution of the authors to this manuscript can be defined as: conceptualization, A.C. and J.R.T.; methodology, M.T.D.; software, A.A.; formal analysis, M.T.D.; investigation, A.A.; writing—original draft preparation, A.A.; writing—review and editing, A.C. and J.R.T.; supervision, A.C. and J.R.T.

Funding

This research was partially supported by Spanish Ministerio de Ciencia, Innovación y Universidades PGC2018-095896-B-C22 and Generalitat Valenciana PROMETEO/2016/089.

Acknowledgments

The authors would like to thank the anonymous reviewers for their constructive comments and suggestions that have improved the final version of this manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Ortega, J.M.; Rheinboldt, W.C. Iterative Solution of Nonlinear Equations in Several Variables; Academic Press: New York, NY, USA, 1970. [Google Scholar]
Traub, J.F. Iterative Methods for the Solution of Equations; Chelsea Publishing Company: New York, NY, USA, 1982. [Google Scholar]
Grau-Sánchez, M.; Noguera, M.; Amat, S. On the approximation of derivatives using divided difference operators preserving the local convergence order of iterative methods. J. Comput. Appl. Math. 2013, 237, 363–372. [Google Scholar] [CrossRef]
Amiri, A.R.; Cordero, A.; Darvishi, M.T.; Torregrosa, J.R. Preserving the order of convergence: Low-complexity Jacobian-free iterative schemes for solving nonlinear systems. J. Comput. Appl. Math. 2018, 337, 87–97. [Google Scholar] [CrossRef]
Amiri, A.R.; Cordero, A.; Darvishi, M.T.; Torregrosa, J.R. Stability analysis of Jacobian-free iterative methods for solving nonlinear systems by using families of mth power divided differences. J. Math. Chem. 2019, 57, 1344–1373. [Google Scholar] [CrossRef]
Amat, S.; Busquier, S.; Plaza, S. Review of some iterative root–finding methods from a dynamical point of view. Scientia 2004, 10, 3–35. [Google Scholar]
Neta, B.; Chun, C.; Scott, M. Basins of attraction for optimal eighth order methods to find simple roots of nonlinear equations. Appl. Math. Comput. 2014, 227, 567–592. [Google Scholar] [CrossRef]
Geum, Y.H.; Kim, Y.I.; Neta, B. A class of two-point sixth-order multiple-zero finders of modified double-Newton type and their dynamics. Appl. Math. Comput. 2015, 270, 387–400. [Google Scholar] [CrossRef]
Amat, S.; Busquier, S.; Magreñán, Á.A. Reducing Chaos and Bifurcations in Newton-Type Methods. In Abstract and Applied Analysis; Hindawi: London, UK, 2013. [Google Scholar]
Magreñán, Á.A.; Argyros, I.K. A Contemporary Study of Iterative Methods: Convergence, Dynamics and Applications; Academic Press: New York, NY, USA, 2019. [Google Scholar]
Cordero, A.; Soleymani, F.; Torregrosa, J.R. Dynamical analysis of iterative methods for nonlinear systems or how to deal with the dimension? Appl. Math. Comput. 2014, 244, 398–412. [Google Scholar] [CrossRef]
Sharma, J.R.; Sharma, R.; Bahl, A. An improved Newton-Traub composition for solving systems of nonlinear equations. Appl. Math. Comput. 2016, 290, 98–110. [Google Scholar]
García Calcines, J.M.; Gutiérrez, J.M.; Hernández Paricio, L.J. Rivas Rodríguez, M.T. Graphical representations for the homogeneous bivariate Newton’s method. Appl. Math. Comput. 2015, 269, 988–1006. [Google Scholar]
Cordero, A.; Maimó, J.G.; Torregrosa, J.R.; Vassileva, M.P. Multidimensional stability analysis of a family of biparametric iterative methods. J. Math. Chem. 2017, 55, 1461–1480. [Google Scholar] [CrossRef][Green Version]
Robinson, R.C. An Introduction to Dynamical Systems, Continuous and Discrete; American Mathematical Society: Providence, RI, USA, 2012. [Google Scholar]
Chicharro, F.I.; Cordero, A.; Torregrosa, J.R. Drawing dynamical and parameter planes of iterative families and methods. Sci. World J. 2013, 2013. [Google Scholar] [CrossRef] [PubMed]

Figure 1. Dynamical plane for FMN_m,

m = 1, 2, \dots, 6

on

q (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 1. Dynamical plane for FMN_m,

m = 1, 2, \dots, 6

on

q (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 2. Dynamical plane of CMN_m, RMN_m and Newton’s method on

q (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars).

Figure 2. Dynamical plane of CMN_m, RMN_m and Newton’s method on

q (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars).

Figure 3. Dynamical plane of FMN_m method for

m = 1, 2, \dots, 6

on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 3. Dynamical plane of FMN_m method for

m = 1, 2, \dots, 6

on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 4. Basins of attraction of

(- 1, - 1)

,

(- 1, 1)

and

(1, 1)

for FMN₂ method on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 4. Basins of attraction of

(- 1, - 1)

,

(- 1, 1)

and

(1, 1)

for FMN₂ method on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 5. Basins of attraction of

(- 1, - 1)

,

(1, - 1)

and

(1, 1)

for FMN₃ method on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 5. Basins of attraction of

(- 1, - 1)

,

(1, - 1)

and

(1, 1)

for FMN₃ method on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 6. Dynamical plane for CMN_m method,

m = 1, 2, 3, 4

on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 6. Dynamical plane for CMN_m method,

m = 1, 2, 3, 4

on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 7. Dynamical plane for RMN_m method,

m = 1, 2, 3, 4

on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 7. Dynamical plane for RMN_m method,

m = 1, 2, 3, 4

on

r (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 8. Dynamical plane for FMN_m method,

m = 1, 2, 3, 4

on

p (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 8. Dynamical plane for FMN_m method,

m = 1, 2, 3, 4

on

p (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 9. Dynamical plane of CMN_m, RMN_m and Newton’s method on

p (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

Figure 9. Dynamical plane of CMN_m, RMN_m and Newton’s method on

p (x)

. Green, orange, red and blue areas are the basins of attraction of the roots of

q (x)

(marked as white stars); black area denotes divergence and white squares are free critical points.

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Amiri, A.; Cordero, A.; Darvishi, M.T.; Torregrosa, J.R. Stability Analysis of Jacobian-Free Newton’s Iterative Method. Algorithms 2019, 12, 236. https://doi.org/10.3390/a12110236

AMA Style

Amiri A, Cordero A, Darvishi MT, Torregrosa JR. Stability Analysis of Jacobian-Free Newton’s Iterative Method. Algorithms. 2019; 12(11):236. https://doi.org/10.3390/a12110236

Chicago/Turabian Style

Amiri, Abdolreza, Alicia Cordero, Mohammad Taghi Darvishi, and Juan R. Torregrosa. 2019. "Stability Analysis of Jacobian-Free Newton’s Iterative Method" Algorithms 12, no. 11: 236. https://doi.org/10.3390/a12110236

APA Style

Amiri, A., Cordero, A., Darvishi, M. T., & Torregrosa, J. R. (2019). Stability Analysis of Jacobian-Free Newton’s Iterative Method. Algorithms, 12(11), 236. https://doi.org/10.3390/a12110236

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Stability Analysis of Jacobian-Free Newton’s Iterative Method

Abstract

1. Introduction

2. Jacobian-Free Variants of Newton’s Method

2.1. Second-Degree Polynomial System $q (x) = 0$

2.2. Sixth-Degree Polynomial System $r (x) = 0$

2.3. Second-Degree Polynomial System $p (x) = 0$

3. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Stability Analysis of Jacobian-Free Newton’s Iterative Method

Abstract

1. Introduction

2. Jacobian-Free Variants of Newton’s Method

2.1. Second-Degree Polynomial System q ( x ) = 0

2.2. Sixth-Degree Polynomial System r ( x ) = 0

2.3. Second-Degree Polynomial System p ( x ) = 0

3. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

2.1. Second-Degree Polynomial System $q (x) = 0$

2.2. Sixth-Degree Polynomial System $r (x) = 0$

2.3. Second-Degree Polynomial System $p (x) = 0$