Next Article in Journal
Discounted Optimal Stopping of a Brownian Bridge, with Application to American Options under Pinning
Next Article in Special Issue
Memory in a New Variant of King’s Family for Solving Nonlinear Systems
Previous Article in Journal
Regularity for Semilinear Neutral Hyperbolic Equations with Cosine Families
Previous Article in Special Issue
Two Iterative Methods with Memory Constructed by the Method of Inverse Interpolation and Their Dynamics
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Purely Iterative Algorithms for Newton’s Maps and General Convergence

1
Departamento de Matemática Aplicada y Estadística, Universidad Politécnica de Cartagena, 30202 Cartagena, Spain
2
Facultad de Ciencias, Universidad de Valparaíso, Valparaíso 2340000, Chile
3
CIMFAV and Institute of Mathematical Engineering, Universidad de Valparaíso, General Cruz 222, Valparaíso 2340000, Chile
4
Departamento de Matemáticas y Computación, Universidad de La Rioja, 26006 Logroño, Spain
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Mathematics 2020, 8(7), 1158; https://doi.org/10.3390/math8071158
Submission received: 19 May 2020 / Revised: 6 July 2020 / Accepted: 6 July 2020 / Published: 15 July 2020
(This article belongs to the Special Issue Iterative Methods for Solving Nonlinear Equations and Systems 2020)

Abstract

:
The aim of this paper is to study the local dynamical behaviour of a broad class of purely iterative algorithms for Newton’s maps. In particular, we describe the nature and stability of fixed points and provide a type of scaling theorem. Based on those results, we apply a rigidity theorem in order to study the parameter space of cubic polynomials, for a large class of new root finding algorithms. Finally, we study the relations between critical points and the parameter space.

1. Introduction

The computation of solutions for equations of the form
Ψ ( x ) = α ,
is a classic problem that arises in different areas of mathematics and in particular in numerical analysis. Here Ψ : C C is a complex function, and usually it is assumed that α = 0 . Due to the dependence on the space where the equation is defined, and where possible solutions are acting, it is ambitious to expect a unified theory that provides the exact, or even approximate solutions to this class of equations. Also, depending on the objective that is being addressed, solving an equation as above can be very different in nature, as well as, the techniques used to solve it. For instance, Picard–Lindelöf’s theorem (see for example [1]) on existence and uniqueness of solutions of ordinary differential equations, and fundamental theorem of Algebra in complex analysis. On the other hand, if we turn our attention to explicit solutions, then the problem becomes even more difficult.
Consider a complex polynomial f and
N f = I d f f
the classical Newton’s method. In this case, higher–order methods have been extensively used and studied in order to approach the equation f ( z ) = 0 . The iterative function N f defines a rational map on the extended complex plane (Riemann sphere) C ¯ = C { } . The simple roots of the equation f ( z ) = 0 , or in other words, the roots of the equation f ( z ) = 0 that are not roots of the derivative f ( z ) , are super–attracting fixed points of N f , that is, let ζ be a simple root of f ( z ) , then N f ( ζ ) = ζ and N f ( ζ ) = 0 . For a review of the dynamics of Newton’s method, see for instance [2,3].
More generally, P o l y d and R a t k denote the space of polynomials of degree d and the space of rational functions of degree k, respectively. By a root–finding algorithm or root–finding method it is meant a rational map T f : P o l y d R a t k , such that the roots of the polynomial map f are attracting fixed points of T f . A root–finding algorithm T f has order σ , if the local degree of T f in every simple root of f is σ .
In this paper we study the dynamical aspects of
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) b 0 + b 1 N f ( z ) ,
where a 0 , a 1 , a 2 , b 0 and b 1 are real numbers. Depending on those parameters, this family is of order 2, 3, 4 or 5. Also, this family can be viewed as a generalization of c –iterative functions (for a definition see Example 5 below).
In [4] C. Mcmullen proved a rigidity theorem that implies that a purely iterative root finding algorithm generally convergent for cubic polynomials, is conformally conjugate to a generating map. Applying this result, J. Hawkins in [5] was able to obtain an explicit expression for rational maps which are generating, and so it is natural to ask which of these rational maps T f are generating maps. We use that rigidity result in order to show that over the space of cubic polynomials, those maps T f that generate a generally convergent algorithm are restricted to Halley’s method applied to the cubic polyomial.
The paper is organized as follows. Section 2 contains some basic notions of the classic theory of complex dynamics. In addition to establish the notation and main examples, Section 3 contains the definition of purely iterative iterative algorithm for Newton’s maps, that will be used throughout the article. Section 4 is devoted to the study of the nature of fixed points. In Section 5 we study the order of convergence of T f , and in Section 6 we provide the results about Scaling theorems. We provide the result concerning maps that generates generally convergent root finding algorithms for cubic polynomials in Section 7. In Section 8 we provide the relation between critical points and parameter space. The last Section summarize the conclusion.

2. Basic Notions in Complex Dynamics

We recall the reader so see [6] or [7,8,9,10,11,12,13,14,15,16] to obtain some basic notions of the classic theory of Fatou-Julia of complex dynamics which appear in (as a reference of the Fatou– Julia theory see for instance P. Blanchard [17] and J. Milnor [18]). Here we show a small summary: Let
R ( z ) = P ( z ) Q ( z )
be a rational map of the extended complex plane into itself, where P and Q are polynomials with no common factors.
  • A point ζ is called a fixed point of R if R ( ζ ) = ζ , and the multiplier of R at a fixed point ζ is the complex number λ ( ζ ) = R ( ζ ) .
  • Depending on the value of the mulitplier, a fixed pointcan be superattracting ( λ ( ζ ) = 0 ), attracting( 0 < | λ ( ζ ) | < 1 ), repelling( | λ ( ζ ) | > 1 ), indifferent ( | λ ( ζ ) | = 1 )
  • Let z 0 be a fixed point of R n which is not a fixed point of R j , for any j with 0 < j < n . We say that orb ( z 0 ) = { z 0 , R ( z 0 ) , , R n 1 ( z 0 ) } is a cycle of lengthn or simply an n–cycle. Note that orb ( z j ) = orb ( z 0 ) for any z j orb ( z 0 ) , and R acts as a permutation on orb ( z 0 ) .
  • The multiplier of an n cycle is the complex number λ ( orb ( z 0 ) ) = ( R n ) ( z 0 ) .
  • At each point z j of the cycle, the derivative ( R n ) has the same value.
  • An n cycle { z 0 , z 1 , , z n 1 } is said to be attracting, repelling, indifferent, depending the value of the associated multiplier (same conditions than in the fixed points).
  • The Julia set of a rational map R, denoted J ( R ) , is the closure of the set of repelling periodic points. Its complement is the Fatou set F ( R ) . If z 0 is an attracting fixed point of R, then the convergence region B ( z 0 ) is contained in the Fatou set and J ( R ) = B ( z 0 ) , where denotes the topological boundary.

3. Definitions and Notations

Now we recall the definition of purely iterative algorithms due to S. Smale in [19]. Let P d be the space of all polynomials of degree less than or equal to d . For every k 1 , define the space J k = C k + 2 and the map
j : C × P d J k
given by
j ( z , f ) = ( z , f ( z ) , f ( z ) , , f [ k ] ( z ) ) ,
where f [ k ] denotes the kth derivative of f. Let F : J k C be the rational map defined as
F ( z , ξ 0 , , ξ k ) = z P ( z , ξ 0 , , ξ k ) Q ( z , ξ 0 , , ξ k ) ,
where P and Q are polynomials in k + 2 variables z , ξ 0 , , ξ k , with no common factors. A purely iterative algorithm is a rational endomorphism T ^ f : C ¯ C ¯ that depends on f P d and takes the form
T ^ f ( z ) = F ( j ( z , f ( z ) ) ) ,
for a rational map as in (2).
Consider a modification of the preceding definition. Let R a t d be the space consisting of the rational maps of degree less than or equal to d . Define a subset V R a t d as
V = { R R a t d : finite   fixed   points   α   of   R   are   simple ,   1 1 R ( α ) N , R ( ) = and R ( ) = d / ( d 1 ) } .
Since Newton’s method applied to z d 1 is a rational map that satisfies the conditions in V , we conclude that V , for every d 2 .
As above, define
^ : C ¯ × V J k
by
^ ( z , R ) = ( z , R ( z ) , R ( z ) , , R [ k ] ( z ) ) .
Let G : J k C be the rational map defined as
G ( z , ξ 0 , , ξ k ) = z P ( z , ξ 0 , , ξ k ) Q ( z , ξ 0 , , ξ k ) ,
where P and Q are polynomials in k + 2 variables with no common factors.
We define a rational endomorphism T f : C ¯ C ¯ , depending on R R a t d , by
T f ( z ) = G ( ^ ( z , R ) ) ,
where R V .
In [6] it is proved the following.
Theorem 1.
For every G : J k C defined as before, there exists a complex polynomial f of degree d such that for every R V , R = N f , where N f is the Newton method. Also, there exists a linear space H of dimension d + 1 such that V is contained in R a t d H .
Theorem 1 motivates the following definition.
Definition 1.
Let S f : C ¯ C ¯ be the rational endomorphism depending on f P d , given by
S f ( z ) = G ( ^ ( z , N f ( z ) ) )
where N f is Newton’s map applied to f, and G is defined by the Formula (4). A rational endomorphism S f as above will be called a purely iterative algorithm for Newton’s maps.
Remark 1.
Note that the degree of the polynomials P and Q in (4) does not depend on the degree of f.
In this paper we consider the family of purely iterative algorithm for Newton’s maps given by the Formula (5), with
G ( z , ξ 0 , ξ 1 ) = z P ( z , ξ 0 , ξ 1 ) Q ( z , ξ 0 , ξ 1 ) , P ( z , ξ 0 , ξ 1 ) = ( z ξ 0 ) ( a 0 + a 1 ξ 1 + a 2 ξ 1 2 )
and
Q ( z , ξ 0 , ξ 1 ) = b 0 + b 1 ξ 1 ,
where a 0 , a 1 , a 2 , b 0 and b 1 are real numbers. Then the family is given by
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) b 0 + b 1 N f ( z ) ,
which is exactly the Formula (1) above.
Example 1.
The family of Purely iterative algorithms for Newton’s maps (1) include several important families of root–finding algorithms.
1. 
Newton’s method is obtained by taking a 0 = b 0 = 1 , a 1 = a 2 = b 1 = 0 . Indeed, in this case
P ( z , ξ 0 , ξ 1 ) = z ξ 0 a n d Q ( z , ξ 0 , ξ 1 ) = 1 .
Hence T f ( z ) = z ( z N f ( z ) ) = N f ( z ) . This method has been briefly studied in the last decades [20].
2. 
Halley’s method is obtained by considering a 0 = 2 , a 1 = 0 , a 2 = 0 , b 0 = 2 and b 1 = 1 . Indeed,
T f ( z ) = H f ( z ) = z 2 z N f ( z ) 2 N f ( z ) .
Therefore,
P ( z , ξ 0 , ξ 1 ) = 2 ( z ξ 0 ) a n d Q ( z , ξ 0 , ξ 1 ) = 2 ξ 1 .
For a study of dynamical and numerical properties of Halley’s method, see for instance [21,22].
3. 
Whittaker’s iterative method also known as convex acceleration of Whittaker’s method (see [23,24]), is an iterative map of order of convergence two given by
W f ( z ) = z 1 2 ( z N f ( z ) ) 2 N f ( z ) .
Thus, according with (4) and (5), Whittaker’s method is a purely iterative algorithm for Newton’s maps when considering k = 1 , and the polynomials
P ( z , ξ 0 , ξ 1 ) = ( z ξ 0 ) 1 ξ 1 2 a n d Q ( z , ξ 0 , ξ 1 ) 1 .
4. 
Newton’s method for multiple roots is obtained by considering a 0 = b 0 = 1 , a 1 = a 2 = 0 and b 1 = 1 . Indeed,
T f ( z ) = M f ( z ) = z z N f ( z ) 1 N f ( z ) .
Note that
P ( z , ξ 0 , ξ 1 ) = z ξ 0 a n d Q ( z , ξ 0 , ξ 1 ) = 1 ξ 1 .
This method has been studied by several authors. See for example [25,26] and more recently [27,28].
5. 
The following method, that may be new and it is denoted by S H 2 f , is a modification of the super–Halley method(for a study of this method see for instance [29]). This is given by the formula
S H 2 f ( z ) = z 1 2 z N f ( z ) 3 N f ( z ) 1 N f ( z ) .
Consider the polynomials
P ( z , ξ 0 , ξ 1 ) = 1 2 ( z ξ 0 ) ( 3 ξ 1 ) a n d Q ( z , ξ 0 , ξ 1 ) = 1 ξ 1 .
Again, it follows from (4) and (5) that S H 2 f is a purely iterative algorithm for Newton’s maps.
6. 
More generally, considering a 0 = 2 , a 1 = 1 2 θ , a 2 = 0 , b 0 = 2 and b 1 = 2 θ we obtain the following third-order family studied in [30,31].
M f , θ ( z ) = z ( z N f ( z ) ) 1 + N f ( z ) 2 ( 1 θ N f ( z ) ) .
In this case
P ( z , ξ 0 , ξ 1 ) = ( z ξ 0 ) ( 2 ( 1 2 θ ξ 0 ) ) a n d Q ( z , ξ 0 , ξ 1 ) = 2 2 θ ξ 1 .
7. 
The following family of iterative functions represents Newton’s method, Chebyshev’s iterative function, Halley’s method, Super-Halley, c–iterative function (considering θ = 0 below) and Chebyshev-Halley family, among others. See for instance [22,29,30,31,32,33,34,35,36,37,38]. The family of iterative methods given by
M f , θ , c ( z ) = z ( z N f ( z ) ) 1 + N f ( z ) 2 ( 1 θ N f ( z ) ) + c ( N f ( z ) ) 2 ,
where θ and c are complex parameters conveniently chosen, form a family of purely iterative algorithms for Newton’s maps. Indeed, this follows by considering the polynomials
P ( z , ξ 0 , ξ 1 ) = ( z ξ 0 ) [ 2 ( 1 θ ξ 1 ) + ξ 1 + 2 c ξ 1 ( 1 θ ξ 1 ) = ( z ξ 0 ) ( 2 + ξ 1 ( 1 2 θ + 2 c ) 2 c θ ξ 1 2 )
and
Q ( z , ξ 0 , ξ 1 ) = 2 2 θ ξ 1 ,
in (4) and (5).
It is clear that a 0 = 2 , a 1 = 1 2 ( c + θ ) , a 2 = 2 c θ , b 0 = 2 and b 1 = 2 θ .
Remark 2.
Note that a purely iterative algorithm for Newton’s maps may not be a root–finding algorithm. For instance, by considering the polynomial f ( z ) = ( z 1 ) 2 ( z + 1 ) , and the purely iterative Newton’s map defined by
P ( z , ξ 0 , ξ 1 ) = ( z ξ 0 ) ( 1 + ξ 1 + ξ 1 2 )
and
Q ( z , ξ 0 , ξ 1 ) = 1 + b 1 ξ 1 ,
where b 1 < 2 , it follows that root 1 is repelling for the associated rational map. In fact, in this case
T f ( z ) = z ( z N f ( z ) ) ( 1 + N f ( z ) + ( N f ( z ) ) 2 ) 1 + b 1 N f ( z ) .
Note that T f ( 1 ) = 1 . Since b 1 < 2 then 16 + 8 b 1 < 0 . Therefore
T f ( 1 ) = 1 14 16 + 8 b 1 > 1 .
As a consequence the root 1 is a repelling fixed point.

4. The Nature of Fixed Points

In order to ensure that T f be a root–finding algorithm (see Remark 2), some restrictions over the choice of the real parameters a 0 , a 1 , a 2 , b 0 and b 1 , are required. Let m 1 be an integer and define
λ m = 1 l m
where,
l m = a m 2 b m + a 2 m 2 ( m ( b 0 + b 1 ) b 1 )
and a = a 0 + a 1 + a 2 and b = a 1 + 2 a 2 .
Theorem 2.
Suppose that
0 < λ m < 1
for every m 1 . Let f : C C be a complex polynomial of degree d 2 . Denote by α i its zeros and by m i 1 their multiplicities. Then T f defined in (1) is a root finding algorithm. Moreover,
(a) 
Each root α i of multiplicity m i 1 is an attracting fixed point for T f with multiplier λ m i = 1 l m i . Assuming that a 0 = b 0 , we have that every simple root is a superattracting fixed point for T f .
(b) 
T f has a repelling fixed point at ∞ with multiplier λ d 1 .
(c) 
If a 1 = a 2 = 0 , b 0 , b 1 0 and a 0 / b 1 < 0 then the extraneous fixed points of T f are the zeros of f which are not zeros of f. More precisely, if β is a zero of order n 2 of f , then it is a repelling fixed point of T f with multiplier
1 a 0 b 1 ( n 1 ) .
Remark 3.
If a 0 = 0 , then by Formula (6) we have that λ 1 = 1 , that is, the simple roots of a complex polynomial are parabolic fixed points for T f . In this case T f cannot be a root–finding algorithm. So, from now on, a 0 0 .
Proof. 
(a) First note that the factor ( z N f ( z ) ) in (1) implies that T f ( α i ) = α i for every i. If f has a zero α of multiplicity m , then α is a (super)attracting fixed point of Newton’s method with multiplier ( m 1 ) / m . Thus
N f ( z ) = α + ( z α ) m 1 m + O ( z α ) 2 .
It follows that
z N f ( z ) = 1 m ( z α ) + O ( z α ) 2 , N f ( z ) = m 1 m + O ( z α ) and ( N f ( z ) ) 2 = m 1 m 2 + O ( z α ) .
Consequently,
T f ( z ) = α + ( z α ) ( z α ) ( m 2 a 0 + a 1 ( m 1 ) m + a 2 ( m 1 ) 2 ) m 2 ( m b 0 + b 1 ( m 1 ) ) + O ( z α ) 2 = α + 1 a m 2 b m + a 2 m 2 ( m ( b 0 + b 1 ) b 1 ) ( z α ) + O ( z α ) 2 = α + 1 l m ( z α ) + O ( z α ) 2
Consequently, α is an attracting fixed point with multiplier 0 < 1 l m < 1 . By supposing that a 0 = b 0 , we have that l 1 = 1 , which implies that α is a superattracting fixed point.
( b ) Note that the degree d polynomial f can be written as
f ( z ) = μ z d 1 + O 1 | z | ,
for some μ C . Indeed, if f ( z ) = a d z d + a d 1 z d 1 + + a 1 z + a 0 , then
f ( z ) = a d z d 1 + a d 1 a d z + + a 1 a d z d 1 + a 0 a d z d .
Therefore, when | z | tends to , it follows that f a 0 z d , and we may write Newton’s method applied to the polynomial f as
N f ( z ) = d 1 d z + O ( 1 ) and N f ( z ) = d 1 d + O ( 1 ) .
By constructing the Formula (1), this implies that
z N f ( z ) = 1 d 1 d z + O ( 1 ) = 1 d z + O ( 1 ) ,
and so,
T f ( z ) = a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 b 0 + b 1 N f ( z ) = a 0 + a 1 ( d 1 ) d + a 2 ( d 1 ) 2 d 2 + O ( 1 ) b 0 + b 1 ( d 1 ) d + O ( 1 ) .
Thus,
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) b 0 + b 1 N f ( z ) = z 1 d z + O ( 1 ) a 0 + a 1 ( d 1 ) d + a 2 ( d 1 ) 2 d 2 + O ( 1 ) b 0 + b 1 ( d 1 ) d + O ( 1 ) = 1 a 0 d 2 + a 1 d 2 a 1 d + a 2 d 2 2 a 2 d + a 2 d 2 ( b 0 d + b 1 d b 1 ) z + O ( 1 ) = 1 a d 2 b d + a 2 d 2 ( d ( b 0 + b 1 ) b 1 ) z + O ( 1 ) = 1 l d z + O ( 1 ) = λ d z + O ( 1 )
( c ) Suppose that β is a zero of order n 2 of f which is not a zero of f. Then β is a pole of order n 2 for the map b 0 + b 1 N f , that is,
b 0 + b 1 N f ( z ) = λ ( z β ) n + O | z β | ( n 1 ) .
This implies that
N f ( z ) = λ b 1 ( 1 n ) ( z β ) n 1 + O | z β | ( n 2 )
and consequently,
T f ( z ) = z a 0 ( z N f ( z ) ) b 0 + b 1 N f ( z ) = z a 0 ( z [ λ / ( b 1 ( 1 n ) ( z β ) n 1 ) + O | z β | ( n 2 ) ] ) λ / ( z β ) n + O | z β | ( n 1 ) = β + ( z β ) + a 0 ( z β ) b 1 ( 1 n ) + O ( z β ) 2 = β + ( z β ) 1 a 0 b 1 ( n 1 ) + O ( z β ) 2 ,
provided m > 1 . Since b 1 0 and a 0 / b 1 < 0 , the quantity 1 a 0 b 1 ( n 1 ) is greater than one, and the proof is complete. □
Example 2.
Now we give some examples of Theorem 2:
1. 
Since Newton’s method is given by considering a 0 = b 0 = 1 , a 1 = a 2 = b 1 = 0 , then a = 1 and b = 1 . Hence l m = 1 / m and λ m = 1 1 / m . Thus the condition (7) is satisfied for every integer m 1 .
2. 
Halley’s method is obtained with a = 2 , a 2 = 0 , b = 0 , b 0 = 2 and b 1 = 1 . Then, l m = 2 / ( m + 1 ) and λ m = ( m 1 ) / ( m + 1 ) . Thus, the condition (7) is satisfied for every integer m 1 . In this case, repelling fixed point has multiplier of the form ( m + 1 ) / ( m 1 ) , provided m > 1 .
3. 
In Remark 2 was considered an example of purely iterative algorithm for Newton’s maps, that is not a root-finding algorithm. In this case (7) is not satisfied when b 1 < 2 . Indeed, l 2 = 14 / ( 16 + 8 b 1 ) and λ 2 = 1 14 / ( 16 + 8 b 1 ) > 1 , and so the condition (7) is not satisfied.
4. 
The root finding algorithm S H 2 has order of convergence 3 and does not satisfy a 0 = b 0 . In this case λ m = 2 / ( m + 1 ) .
The following table summarizes the examples (1)
a 0 a 1 a 2 b 0 b 1 λ m
N f 1 0 0 1 0 1 1 / m
M f 1 0 0 1 1 0
H f 2 0 0 2 1 ( m 1 ) / ( m + 1 )
S H 2 f 3 1 0 2 2 1 / ( 2 m )
W f 2 1 0 2 0 1 ( m + 1 ) / ( 2 m 2 )

5. Order of Convergence

This section will describe the order of convergence of T f defined in (1). In this section, N f [ n ] denote the nth derivative of Newton’s method.
Lemma 1.
Consider T f as a root finding algorithm applied to a degree d polynomial f. Then:
1. 
If a 0 = b 0 , then T f is at least of order 2.
2. 
If a 0 = b 0 and b 1 = a 1 a 0 2 , then T f is at least of order 3.
3. 
If a 0 = b 0 , b 1 = a 1 a 0 2 , a 1 = 2 a 2 and N f [ 3 ] ( α ) = 0 for every simple root α of f, then T f is at least of order 4.
4. 
If condition in ( 3 ) is satisfied and additionally a 1 = 0 and N f [ 4 ] ( α ) = 0 for every simple root α of f, then T f has order 5.
Proof. 
Recall that a 0 0 . Let α be a simple root of a polynomial f. Since Newton’s method is an order 2 root–finding algorithm, it follows that
N f ( z ) = α + μ ( z α ) 2 + O ( z α ) 3 and N f ( z ) = 2 μ ( z α ) + O ( z α ) 2 ,
for some μ 0 . Hence,
z N f ( z ) = ( z α ) μ ( z α ) 2 + O ( z α ) 3 , ( N f ( z ) ) 2 = 4 μ 2 ( z α ) 2 + O ( z α ) 3 .
Consequently,
a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 b 0 + b 1 N f ( z ) = a 0 + 2 μ a 1 ( z α ) + O ( z α ) 2 b 0 + 2 μ b 1 ( z α ) + O ( z α ) 2 .
Now to prove part ( 1 ) write the Formula (1) as
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) b 0 + b 1 N f ( z ) = z ( z α ) μ ( z α ) 2 + O ( z α ) 3 a 0 + 2 μ a 1 ( z α ) + O ( z α ) 2 b 0 + 2 μ b 1 ( z α ) + O ( z α ) 2 = α + 1 a 0 b 0 ( z α ) + O ( z α ) 2 .
Thus, if a 0 = b 0 we have that T f ( α ) = α , T f ( α ) = 0 and so T f is a root–finding algorithm of order at least 2, which proves part ( 1 ) .
In order to prove part ( 2 ) consider that a 0 = b 0 . Thus
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) a 0 + b 1 N f ( z ) = z ( z α ) μ ( z α ) 2 + O ( z α ) 3 a 0 + 2 μ a 1 ( z α ) + O ( z α ) 2 a 0 + 2 μ b 1 ( z α ) + O ( z α ) 2 = α + μ 2 a 1 + a 0 + 2 b 1 a 0 ( z α ) 2 + O ( z α ) 3 .
where μ 0 . Hence, if b 1 = a 1 a 0 2 , then 2 a 1 + a 0 + 2 b 1 = 0 and we conclude that T f ( α ) = α , T f ( α ) = 0 , T f ( α ) = 0 , and so T f is a root–finding algorithm of order at least 3, which proves part ( 2 ) .
Now we will prove part ( 3 ) by similar computations as above. For this, consider the Taylor expansion of Newton’s method around the simple root α of the polynomial f,
N f ( z ) = α + μ 1 ( z α ) 2 + μ 2 ( z α ) 3 + O ( z α ) 4 .
where μ 1 0 . Hence, N f ( z ) = 2 μ 1 ( z α ) + 3 μ 2 ( z α ) 2 + O ( z α ) 3 and combining those computations with the fact that b 0 = a 0 , b 1 = a 1 a 0 2 , and so
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) a 0 + b 1 N f ( z ) = α μ 2 a 0 4 μ 1 2 a 1 + 8 a 2 μ 1 2 2 a 0 ( z α ) 3 + O ( z α ) 4 . = α μ 2 a 0 + 4 μ 1 2 ( a 1 + 2 a 2 ) 2 a 0 ( z α ) 3 + O ( z α ) 4 .
Since N f [ 3 ] ( α ) = 6 μ 2 = 0 , which implies that μ 2 = 0 . Thus if a 1 = 2 a 2 , we conclude that T f ( α ) = α , T f ( α ) = 0 , T f ( α ) = 0 , T f ( α ) = 0 , and so T f is a root–finding algorithm of order at least 4. This conclude part ( 3 ) .
Finally to prove part ( 4 ) , consider the Taylor expansion of Newton’s method around the simple root α of the polynomial f,
N f ( z ) = α + μ 1 ( z α ) 2 + μ 2 ( z α ) 3 + μ 3 ( z α ) 4 + O ( z α ) 5 .
where μ 1 0 . Hence, N f ( z ) = 2 μ 1 ( z α ) + 3 μ 2 ( z α ) 2 + 4 μ 3 ( z α ) 3 + O ( z α ) 4 , and combining those computations with the hypothesis if b 0 = a 0 , b 1 = a 1 a 0 2 and a 1 = 2 a 2 , implies that
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) a 0 + b 1 N f ( z ) = α + 2 μ 1 3 a 1 μ 3 a 0 a 0 ( z α ) 4 + O ( z α ) 5 .
Since N f [ 4 ] ( α ) = 24 μ 3 = 0 , we have that μ 3 = 0 . Additionally, by supposing that a 1 = 0 , it follows that T f ( α ) = α , T f ( α ) = 0 , T f ( α ) = 0 , T f ( α ) = 0 , T f [ 4 ] ( α ) = 0 , and so T f is a root–finding algorithm of order at least 5. This conclude part ( 4 ) , and the proof of the Lemma.
 □
Remark 4.
Note that in the Lemma above, while conditions in part ( 1 ) and ( 2 ) are easy to check, in the opposite, part ( 3 ) and ( 4 ) are harder to verify.
The following corollary characterizes the Newton’s method for Multiple roots. See part 4 . in Example 1.
Corollary 1.
Let f be a complex polynomial and denote by α i its roots. Suppose that T f is a root finding algorithm with order of convergence equal to two and the order does not depend on the multiplicity of the roots α i . Then T f is the Newton’s multiple for multiple roots.
Proof. 
By part (a) of Theorem 2 we have that
T f ( z ) = α + 1 l m ( z α ) + O ( z α ) 2 .
Since the order of convergence of the root finding algorithm T f is 2, then for every root α i we have that λ 1 = 1 l m = 0 and a 0 = b 0 for all m N . This implies that
a 2 m a 1 + 2 a 2 + m 2 a 0 + a 1 + a 2 + b 1 m 3 a 0 + b 1 = 0
for every m N , if and only if
b 1 = a 0 , and a 1 = a 2 = 0 .
Therefore
T f ( z ) = z ( z N f ( z ) ) a 0 a 0 a 0 N f ( z ) = z z N f ( z ) 1 N f ( z ) .
This concludes the proof. □
Example 3.
The following example show three examples to find an approximation of the roots of
f z = 2 z 3 4 z 2 5 z 3 ,
which are 1 2 + 1 2 i , 1 2 1 2 i and 3 . Note that for simple roots and order higher than two, we have that l m = 1 . Suppose the first two parts of Lemma 1, that is,
b 0 = a 0 , b 1 = a 1 a 0 2 .
Thus, (1) takes the form
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) a 0 + a 1 a 0 2 N f ( z ) .
Suppose, in addition that a 0 = 2 , a 1 = 1 and a 2 = 0 . Therefore
T f ( z ) = z 1 2 ( z N f ( z ) ) ( 2 + N f ( z ) ) .
After some calculations, we obtain
T f ( z ) = 120 z 7 416 z 6 + 360 z 5 + 228 z 4 690 z 3 72 z 2 + 306 z + 111 6 z 2 8 z 5 3 .
Now suppose that a 0 = 4 and a 1 = 2 . Then for
f z = 2 z 3 4 z 2 5 z 3 ,
and so,
T f ( z ) = z ( z N f ( z ) ) 2 N f ( z ) 2 2 N f ( z ) ,
which implies
T f ( z ) = 24 z 7 64 z 6 + 216 z 5 + 28 z 4 578 z 3 + 108 z 2 + 294 z + 39 72 z 6 288 z 5 + 388 z 4 + 360 z 3 762 z 2 388 z 5 .
Finally consider
b 0 = a 0 , b 1 = a 1 a 0 2 , a 2 = a 0 4 + a 1 2 , a n d a 1 = 3 4 a 0 .
Note that this set of parameters gives convergence of order 3. Then
T f ( z ) = z ( z N f ( z ) ) 8 6 N f ( z ) ( N f ( z ) ) 2 2 4 5 N f ( z ) .
The following table show the iterations of order three, where
z n + 1 = T f ( z n ) ,
and z 0 = 1 + i .
( 9 ) ( 10 ) ( 11 )
z 0 1 + i 1 + i 1 + i
z 1 0.190 57 0.053 58 i 0.125 02 + 0.837 42 i 0.09 039 0 + 0.636 79
z 2 1.475 6 0.378 19 i 0.490 27 + 0.507 84 i 0.455 09 + 0.485 78 i
z 5 0.767 42 0.233 97 i 0.49999 + 0.5 i 0.500 02 + 0.499 98 i
z 4 0.305 28 0.508 72 i 0.5 + 0.5 i 0.5 + 0.5 i

6. Conjugacy Classes of the Schemes

We next prove an extension of the Scaling Theorem for purely iterative algorithms for Newton’s maps. Let R 1 , R 2 : C ¯ C ¯ be two rational maps. Then R 1 and R 2 are conjugated if there exists a Möbius transformation T : C ¯ C ¯ such that R 1 T ( z ) = T R 2 ( z ) for all z.
Conjugacy plays a central role in the understanding of the behavior of classes of maps under iteration in the following sense. Suppose that we wish to describe both, the quantitative and the qualitative behaviors of the map z T f ( z ) , where T f ( z ) is an iterative function resulting from an iterative method z n + 1 = Φ ( z n ) .
Let f be an arbitrary analytic function. Since conjugacy preserves fixed points, cycles and their character (whether (super)attracting, or repelling, or indifferent), and their basins of attraction, it is a worthwhile idea to try to construct a parameterized family or families consisting of polynomials f a , as simple as possible so that, for a suitable choice of the complex parameter a , there may exists a conjugacy between T f ( z ) and T f a ( z ) .
In order to describe the conjugacy classes of T f , recall a next useful result (see [39], Section 5, Theorem 1).
Theorem 3
(The Scaling Theorem for Newton’s method). Let f ( z ) be an analytic function on the Riemann sphere, and let A ( z ) = α z + β , with α 0 , be an affine map. If g ( z ) = f A ( z ) , then A N g A 1 ( z ) = N f ( z ) , that is, N f is analytically conjugated to N g by A.
Theorem 4
(Scaling Theorem). Assume that k = 1 in (2), that is,
F ( z , ξ 0 , ξ 1 ) = z P ( z , ξ 0 , ξ 1 ) Q ( z , ξ 0 , ξ 1 )
where
P ( z , ξ 0 , ξ 1 ) = ( z ξ 0 ) ( a 0 + a 1 ξ 1 + a 2 ξ 1 2 + + a m ξ 1 m ) , Q ( z , ξ 0 , ξ 1 ) = ( b 0 + b 1 ξ 1 + b 2 ξ 1 2 + + b n ξ 1 n )
and m , n 1 .
Let f and g be polynomials, and let A be the affine map A ( z ) = α z + β , with α 0 . Consider the purely iterative algorithms
T f ( z ) = F ( ^ ( z , N f ( z ) ) ) and T g ( z ) = F ( ^ ( z , N g ( z ) ) ) .
If g ( z ) = f A ( z ) , then T f A = A T g , that is, T f is analytically conjugated to T g by A.
Proof. 
Assume that there exists a constant λ C such that g ( z ) = λ f A ( z ) . According to Theorem 3, we have A ( N g ( z ) ) = α N g ( z ) + β = N f ( A ( z ) ) . Hence
N g ( z ) = N f ( α z + β ) = N f ( A ( z ) ) .
This yields
T f ( A ( z ) ) = F ( ^ ( A ( z ) , N f ( A ( z ) ) ) ) = A ( z ) P ( A ( z ) , N f ( z ) , N f ( A ( z ) ) ) Q ( A ( z ) , N f ( z ) , N f ( A ( z ) ) ) = A ( z ) ( A ( z ) N f ( A ( z ) ) ) ( a 0 + a 1 N f ( A ( z ) ) + + a m N f ( A ( z ) ) m ) ( b 0 + b 1 N f ( A ( z ) ) + b 2 N f ( A ( z ) ) 2 + + b m N f ( A ( z ) ) m ) = α z + β ( α z + β A ( N g ( z ) ) ) ( a 0 + a 1 N g ( z ) + + a m N g ( z ) m ) ) ( b 0 + b 1 N g ( z ) + b 2 N g ( z ) 2 + + b m N g ( z ) m ) = α z + β α ( z N g ( z ) ) ( a 0 + a 1 N g ( z ) + + a m N g ( z ) m ) ( b 0 + b 1 N g ( z ) + b 2 N g ( z ) 2 + + b m N g ( z ) m ) = α z ( z N g ( z ) ) ( a 0 + a 1 N g ( z ) + + a m N g ( z ) m ) ( b 0 + b 1 N g ( z ) + b 2 N g ( z ) 2 + + b m N g ( z ) m ) + β = A ( T g ( z ) )
which completes the proof. □

7. Methods Generally Convergent for Cubic Polynomials

A purely iterative rational root-finding algorithm T f is generally convergent if T f n ( z ) converge to the roots of the polynomial f for almost all complex polynomials of degree d 2 and almost all initial conditions. C. Mcmullen proved that if d > 3 , then there is no possibility to find a generally convergent root–finding algorithm. Moreover, he proved the following result:
Theorem 5
([4]). Every generally convergent algorithm for cubic polynomials is obtained by specifying a rational map R in such a way that
1. 
R is convergent for p ( z ) = z 3 1 .
2. 
A u t ( R ) contains those Möbius maps that permutes the roots of unity.
Moreover the generated algorithm has the form
R c = ϕ c R ϕ c 1
where ϕ c is a Möbius transformation that associate the roots of unity to the points 1, 1 2 ( 1 1 4 c ) and 1 2 ( 1 + 1 4 c ) .
So, the following definition is natural: a rational map R generates a generally convergent algorithm if it is convergent for the cubic polynomial representing the roots of unity and its associated automorphism group contain the Möbius transformations which commutes the roots of unity.
As an example, consider Halley’s method applied to the family of cubic polynomials f λ ( z ) = z 3 + ( λ 1 ) z λ . Hawkins’s theorem implies that if a rational map R generates a generally convergent algorithm, then zero is a fixed point of R. Hence, the condition H f ( 0 ) = 0 implies that λ = 0 or λ = 1 . Also the group of automorphisms must contain the Möbius transformations that permutes the roots of unity, then λ cannot be 0. Thus, λ = 1 and we obtain Halley’s method applied to z 3 1 .
McMullen’s theorem tells us how to generate a generally convergent iterative algorithm by finding the map R. The following question is natural: When T f contain rational maps which are generating of generally convergent algorithms?
The following theorem is due to J. Hawkins (Theorem 1 [5]) and describes explicitly the rational maps which are generating of generally convergent algorithms for cubic polynomials according to their degree.
Theorem 6.
If R generates a generally convergent root–finding algorithm for cubic polynomials, then there exist constants α 0 , α 1 , , α k 0 , such that R has the following form:
R ( z ) = z ( α 0 + α 1 z 3 + + α k z 3 k ) α k + α k 1 z 3 + + α 0 z 3 k .
Using this theorem it is proved that over the space of cubic polynomials, those maps T f that generates a generally convergent algorithm are restricted to Halley’s method applied to the cubic polyomial.
Theorem 7.
Let f be a cubic polynomial. Suppose that
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) b 0 + b 1 N f ( z )
where a 0 , a 1 , a 2 , b 0 and b 1 are real numbers, generates a generally convergent root–finding algorithm for cubic polynomials. Then T f is the Halley method applied to z 3 1 .
Proof. 
Since T f satisfies the Scaling theorem, then in the case of cubic polynomials, we can restrict to f λ ( z ) = z 3 + ( λ 1 ) z λ . From Theorem 6 we see that 0 is always a fixed point of the generating map. This imposes restrictions on the values of a 0 , a 1 , a 2 , b 0 and b 1 .
If λ 1 , then, since N f λ ( 0 ) = λ / ( λ 1 ) and N f λ ( 0 ) = 0 , we have that T f ( 0 ) = λ a 0 / ( λ 1 ) = 0 which implies two cases: λ = 0 or a 0 = 0 . Also the case λ = 1 must be treated separately. Remark 3 shows that a 0 cannot be 0. Consequently the cases are:
  • λ = 1
  • λ = 0
Case (1). If λ = 1 , and T f generates a generally convergent root–finding algorithm for cubic polynomials then a 1 = a 2 = 0 and b 1 0 , otherwise 0 would be a pole of T f . In fact, in this case f 1 ( z ) = z 3 1 , and in a neighborhood of zero we have N f 1 ( z ) = γ z 2 + O ( z ) , with γ 0 . Thus
T f 1 ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 2 ( N f ( z ) ) 2 ) b 0 + b 1 N f ( z ) = z + γ 1 z 2 + O ( z ) γ 2 z 3 + O ( 1 ) = γ 3 z 5 + γ 4 z 2 + O ( z )
where γ 1 , γ 2 , γ 3 and γ 4 are non–zero constants. This implies that 0 is a pole of order 5 for T f 1 . Moreover,
lim z 0 z 5 T f 1 ( z ) = 2 a 2 9 b 1 = γ 3
Hence, if a 2 = 0 y b 1 0 , then we have
lim z 0 z 2 T f 1 ( z ) = a 1 3 b 1 = γ 4 ,
and then 0 is a pole of order 2. Therefore, if a 1 = a 2 = 0 and b 1 0 , we are able to remove the poles of T f 1 and consequently T f 1 has a fixed point at 0. Thus the map must have the form
T f 1 ( z ) = z ( ( a 0 + 3 b 0 + 2 b 1 ) z 3 + ( 2 b 1 + a 0 ) ) ( 3 b 0 + 2 b 1 ) z 3 2 b 1 .
According to Theorem 6, Formula (12) must also satisfy the equation
a 0 = 4 b 1 + 3 b 0
in order to T f 1 generates a generally convergent root–finding algorithm for cubic polynomials.
Since the roots of unity are superattracting fixed points, we have
T f 1 ( 1 ) = b 0 a 0 b 0 = 0 ,
which impose the relation a 0 = b 0 . Together with Formula (8), it follows that b 0 = 2 b 1 . As a consequence we have
T f 1 ( z ) = z ( z 3 + 2 ) 2 z 3 + 1 ,
which is exactly Halley’s method applied to z 3 1 .
Case (2). If λ = 0 , it follows that 0 is a fixed point of T f , however it cannot generate a generally convergent method. Indeed, note that
T f 0 ( 1 ) = T f 0 ( 0 ) = b 0 a 0 b 0 = 0 .
Hence, in order to obtain that 1 to be a superattracting fixed point, we must have that b 0 = a 0 . Furthermore, this implies that 0 is a superattracting fixed point, that is, there exists an open subset of the plane which belongs to the basin of attraction of 0. Therefore the rational map T f 0 cannot generate a generally convergent root finding algorithm for cubic polynomials, and the proof is complete. □

8. Dynamical Study of the Fourth-Order Family

In this section we will study the complex dynamics of the family (1) when b 0 = a 0 , b 1 = a 1 a 0 2 and a 2 = a 0 4 + a 1 2 . In this case, according to Lemma 1 T f is of order 4. The fixed point operator associated to this family of methods, on a nonlinear function f ( z ) is
T f ( z ) = z ( z N f ( z ) ) ( a 0 + a 1 N f ( z ) + a 0 4 + a 1 2 ( N f ( z ) ) 2 ) a 0 + a 1 a 0 2 N f ( z ) ,
where N f ( z ) = z f ( z ) f ( z ) . Note that in this case T f depends only on a 0 and a 1 .
By applying this operator on a generic polynomial p ( z ) = ( z a ) ( z b ) , and by using the Möebius map h ( z ) = z a z b , whose properties are
( i ) h = 1 , ( ii ) h a = 0 , ( iii ) h b = ,
the rational operator associated to the family of iterative schemes is finally
G ( z , a 0 , a 1 ) = z 4 3 a 0 + 4 a 1 + 3 a 0 z + 2 a 1 z + a 0 z 2 a 0 + 3 a 0 z + 2 a 1 z + 3 a 0 z 2 + 4 a 1 z 2 .
It is easy to see that this family of methods has at least order of convergence 4. We have seen that for the special case a 1 = 3 4 a 0 the family have fifth order of convergence and the family has the form
G ( z ) = z 5 ( 3 + 2 z ) 2 + 3 z .

8.1. Study of the Fixed Points and Their Stability

It is clear that z = 0 and z = are fixed points of G ( z , a 0 , a 1 ) which are related to the root a and b respectively. Now, we focus our the attention on the extraneous fixed points (those points which are fixed points of T f and are not solutions of the equation f ( z ) = 0 ). First of all, we notice that z = 1 is an extraneous fixed point, which is associated with the original convergence to infinity. Moreover, there are also another two strange fixed points which correspond to the roots of the polynomial
q ( z ) = 6 a 0 2 + 8 a 0 a 1 + 9 a 0 2 z + 16 a 0 a 1 z + 12 a 1 2 z + 6 a 0 2 z 2 + 8 a 0 a 1 z 2 ,
whose analytical expression, depending on a 0 and a 1 , are:
e x 1 ( a 0 , a 1 ) = 3 a 0 2 a 1 3 a 0 2 4 a 0 a 1 + 4 a 1 2 2 a 0 , e x 1 ( a 0 , a 1 ) = 3 a 0 2 a 1 + 3 a 0 2 4 a 0 a 1 + 4 a 1 2 2 a 0 ,
There exist relations between the extraneous fixed points and they are described in the following result.
Lemma 2.
The number of simple extraneous fixed points of G ( z , a 0 , a 1 ) is three, except in the following cases:
(i) 
If a 1 = a 0 2 , then e x 1 ( a 0 , a 1 ) = e x 2 ( a 0 , a 1 ) = 1 that is not a fixed point, so there is only one extraneous fixed point.
(ii) 
If a 1 = 3 a 0 2 , then e x 1 ( a 0 , a 1 ) = e x 2 ( a 0 , a 1 ) = 3 that is not a fixed point, so there is only one extraneous fixed point.
(iii) 
If a 1 = 3 a 0 4 , then e x 1 ( a 0 , a 1 ) = e x 2 ( a 0 , a 1 ) = 0 that is a fixed point related to the root a, so there are is only one extraneous fixed point.
(iv) 
If a 1 = 7 a 0 6 , then e x 1 ( a 0 , a 1 ) = e x 2 ( a 0 , a 1 ) = 1 that is an extraneous fixed point, so there is only one extraneous fixed points.
Related to the stability of that extraneous fixed points, the first derivative of G ( z , a 0 , a 1 ) must be calculated
G z , a 0 , a 1 = 2 z 3 ( 1 + z ) 2 6 a 0 2 + 8 a 0 a 1 + 9 a 0 2 z + 16 a 0 a 1 z + 12 a 1 2 z + 6 a 0 2 z 2 + 8 a 0 a 1 z 2 a 0 + 3 a 0 z + 2 a 1 z + 3 a 0 z 2 + 4 a 1 z 2 2 .
Taking into account the form of the derivative, it is immediate that the origin and ∞ are superattractive fixed points for every value of a 0 and a 1 .
The stability of the other fixed points is more complicated and will be shown in a separate way. First of all, focussing the attention in the extraneous fixed point z = 1 , which is related to the original convergence to ∞, and the following result can be shown.
G ( 1 , a 0 , a 1 ) = 8 ( 3 a 0 + 2 a 1 ) 7 a 0 + 6 a 1 ,
Related to the stability of the extraneous fixed point z = 1 we have the following result.
Lemma 3.
The behavior of z = 1 is the following:
(i) 
If a 0 = 2 a 1 3 , then z = 1 is a superattracting fixed point.
(ii) 
If a 1 < 0 and 10 a 1 17 < a 0 < 2 a 1 3 or 2 a 1 3 < a 0 < 22 a 1 31 . Then, z = 1 is attracting.
(iii) 
If a 1 > 0 and 22 a 1 31 < a 0 < 2 a 1 3 or 2 a 1 3 < a 0 < 10 a 1 17 . Then, z = 1 is attracting.
(iv) 
If a 0 = 10 a 1 17 and a 1 0 , then z = 1 is an indifferent fixed point.
(v) 
If a 0 = 22 a 1 31 and a 1 0 , then z = 1 is an indifferent fixed point.
In the rest, of the cases z = 1 is repelling.
Due to the complexity of the stability function of each one of the extraneous fixed points
G ( e x 1 ( a 0 , a 1 ) , a 0 , a 1 ) = 1 2 a 0 2 ( a 0 + 2 a 1 ) ( 7 a 0 + 6 a 1 ) 27 a 0 4 + 6 a 0 2 a 1 7 a 1 5 ( 3 a 0 2 a 1 ) ( a 0 + 2 a 1 ) 18 a 0 3 5 a 1 + ( 3 a 0 2 a 1 ) ( a 0 + 2 a 1 ) 12 a 1 3 2 a 1 + ( 3 a 0 2 a 1 ) ( a 0 + 2 a 1 ) 2 a 0 a 1 2 20 a 1 + 13 ( 3 a 0 2 a 1 ) ( a 0 + 2 a 1 )
and
G ( e x 2 ( a 0 , a 1 ) , a 0 , a 1 ) = 1 2 a 0 2 ( a 0 + 2 a 1 ) ( 7 a 0 + 6 a 1 ) 27 a 0 4 + 6 a 0 2 a 1 7 a 1 5 ( 3 a 0 2 a 1 ) ( a 0 + 2 a 1 ) + 18 a 0 3 5 a 1 + ( 3 a 0 2 a 1 ) ( a 0 + 2 a 1 ) 12 a 1 3 2 a 1 + ( 3 a 0 2 a 1 ) ( a 0 + 2 a 1 ) 2 a 0 a 1 2 20 a 1 + 13 ( 3 a 0 2 a 1 ) ( a 0 + 2 a 1 )
to characterize its domain analytically is not affordable. We will use the graphical tools of software Mathematica in order to obtain the regions of stability of each of them, in the complex plane.
In Figure 1, the stability region of z = 1 can be observed and in Figure 2 and Figure 3, the region of stability of e x 1 ( a 0 , a 1 ) and e x 2 ( a 0 , a 1 ) are shown. These stability regions are drawn in 3D, since we study the behavior of the derivative G which depends on two parameters, so we need three axes to observe it. Taking into account these regions the following result summarize the behavior of the extraneous fixed points. These Figures are important, as it can be seen in [40,41,42] due to the fact that they give light about the stability of the extraneous fixed points, if there is no region with attracting behaviour of them, they won’t have any problematic behaviour.
As a conclusion we can remark that the number and the stability of the fixed points depend on the parameters a 0 and a 1 .

8.2. Study of the Critical Points and Parameter Spaces

In this section, we compute the critical points and we show the parameter spaces associated to the free critical points. It is well known that there is at least one critical point associated with each invariant Fatou component. The critical points of the family are the solutions of is G ( z , a 0 , a 1 ) = 0 , where
G ( z , a 0 , a 1 ) = 2 z 3 ( 1 + z ) 2 6 a 0 2 + 8 a 0 a 1 + 9 a 0 2 z + 16 a 0 a 1 z + 12 a 1 2 z + 6 a 0 2 z 2 + 8 a 0 a 1 z 2 a 0 + 3 a 0 z + 2 a 1 z + 3 a 0 z 2 + 4 a 1 z 2 2 .
By solving this equation, it is clear that z = 0 and z = are critical points, which are related to the roots of the polynomial p ( z ) and they have associated their own Fatou component. Moreover, there exist critical points no related to the roots, these points are called free critical points. Their expressions are:
c r 0 = 1 ,
c r 1 ( a 0 , a 1 ) = 9 a 0 2 16 a 0 a 1 12 a 1 2 3 21 a 0 4 32 a 0 3 a 1 + 72 a 0 2 a 1 2 + 128 a 0 a 1 3 + 48 a 1 4 4 3 a 0 2 + 4 a 0 a 1
and
c r 2 ( a 0 , a 1 ) = 9 a 0 2 16 a 0 a 1 12 a 1 2 + 3 21 a 0 4 32 a 0 3 a 1 + 72 a 0 2 a 1 2 + 128 a 0 a 1 3 + 48 a 1 4 4 3 a 0 2 + 4 a 0 a 1
The relations between the free critical points are described in the following result.
Lemma 4.
(a) 
If a 1 = a 0 2 or a 1 = a 0 2
(i) c r 1 = c r 2 = 1 .
(b) 
If a 1 = 3 a 0 2 or a 1 = 7 a 0 6
(i) c r 1 = c r 2 = 1 .
(c) 
For other values of a 0 and a 1
(i)The family has 3 free critical points.
Moreover, it is clear that for every value of a 0 and a 1 , c r 1 ( a 0 , a 1 ) = 1 c r 2 ( a 0 , a 1 )
It is easy to see that z = 1 is a pre-periodic point as it is the pre-image of the fixed point related to the convergence to infinity, z = 1 , and the other free critical points are conjugated c r 1 ( a 0 , a 1 ) = 1 / c r 2 ( a 0 , a 1 ) . So, there are only two independent free critical points and only one is not pre-periodic. Without loss of generality, we consider in this paper the free critical point c r 1 ( a 0 , a 1 ) . In order to find the best members of the family in terms of stability, the parameter space corresponding to this independent free critical point will be shown.
The study of the orbits of the critical points gives rise about the dynamical behavior of an iterative method. More precisely, to determinate if there exists any attracting extraneous fixed point or periodic orbit, the following question must be answered: For which values of the parameters, the orbits of the free critical points are attracting periodic orbits? In order to answer this question we are going to draw the parameter space but our main problem is that we have 2 free parameters a 0 , a 1 . In order to avoid this problem, we are going to use a a variant of the algorithm that appears in [43] and similar, in which we will consider the horizontal axis as the possible real values of a 0 and the vertical one as the possible values of a 1 . When the critical point is used as an initial estimation, for each value of the parameter, the color of the point tell us about the place it has converged to: to a fixed point, to an attracting periodic orbit or even the infinity.
In Figure 4, the parameter space associated to c r 1 ( a 0 , a 1 ) is shown. The algorithm to draw this parameter speace is similar to the one used in [44]: A point is painted in cyan if considerint z 0 = c r 1 ( a 0 , a 1 ) the iteration converges to 0 (which is related to one root), in magenta the convergence to ∞ (which is related to the other root) and in yellow appear the points which iteratations converges to 1 (which is related to ∞). Other colors used are:red for theconvergence to a extraneous fixed points and other colors, including black, for cycles.
Now, we are going to show these anomalies using dynamical planes where the convergence to 0, after a maximum of 2000 iterations and with a tolerance of 10 6 appear in magenta, in cyan it appears the convergence to ∞, after a maximum of 2000 iterations and with a tolerance of 10 6 and in black the zones with no convergence to the roots. First of all, in Figure 5 and Figure 6 the dynamical planes associated with the values of a 0 a 1 for which there is no convergence problems, are shown. As a consequence, those selections of pair of values are a good choice since all points converge to the roots of the original equation.
Then, focussing the attention in the region shown in Figure 4 it is evident that there exist members of the family with complicated behavior. In Figure 7, the dynamical planes of a member of the family with regions of convergence to any of the extraneous fixed points is shown. In this case, there exist regions of points which iterations do not converge to any of the roots of the original equations, so these values are not a good choice.
On the other hand, in Figure 4 and Figure 8, the dynamical planes of a member of the family with regions of convergence to z = 1 , related to ∞ is shown, in which we observe that there exist. In this case, there exist regions of points which iterations do not converge to any of the roots of the original equations, so these values are not a good choice.
Finally, in Figure 9 and Figure 10, some dynamical planes of members of the family with convergence to different attracting cycles are shown.
If we choose as a particular value of a 0 = 5 and a 1 = 0.1375 we can observe the existence of a periodic orbit of period 3
{ x 1 = 0.97769 0.210052 i , x 2 = 0.75091 0.660405 i , x 3 = 0.763289 0.646058 i } ,
moreover, this orbit is attracting as
| G ( x 1 ) G ( x 2 ) G ( x 3 ) | = 0.265145 < 1 .
Sharkovsky’s Theorem [31], states that the existence of orbits of period 3, guaranties orbits of any period.

9. Conclusions

This article discusses purely iterative algorithms for Newton’s maps T f , given by the Formula (1), and that were proposed in [6]. This family represents a large class of root finding algorithms, including the best known, and those of high order of convergence. Depending on the parameters a 0 , a 1 , a 2 , b 0 and b 1 , in general the family T f may not define a root finding algorithm. To avoid this difficulty, we achieved a characterization in terms of those parameters, so that it is effectively a root finding algorithm. The scaling theorem has the advantage of reducing the parameter space in dimension, and it is useful for plotting the parameter space in low dimension, among other things. We give a classification of extraneous fixed points and indifferent fixed points of T f , in terms of the parameters a 0 , a 1 , a 2 , b 0 and b 1 . Then, we use those results and Hawkins’s theorem to conclude that over the family T f , the rational map that generates generally convergent root finding algorithms, is the Halley’s method applied to cubic polynomials. This shows that rigidity is even stronger, and is not obtained only in terms of the conjugation.

Author Contributions

All authors have contributed equally. All authors have read and agreed to the published version of the manuscript.

Funding

S.A. was partially supported by Séneca 20928/PI/18 and by MINECO PID2019-108336GB-I00. G.H.was partially supported by REDES 190071, 180151 and 180018 from ANID Chile. Á.A.M. was partially supported by Séneca 20928/PI/18 and by MINECO PGC2018-095896-B-C21.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Hartman, P. Ordinary Differential Equations; John Wiley and Sons, Inc.: New York, NY, USA; London, UK; Sydney, Australia, 1964; p. xiv+612. [Google Scholar]
  2. Blanchard, P. The dynamics of Newton’s method, Complex dynamical systems. Proc. Symp. Appl. Math. Am. Math. Soc. 1994, 49, 139–154. [Google Scholar]
  3. Haeseler, F.V.; Peitgen, H.-O. Newton’s Method and Dynamical Systems; Peitgen, H.-O., Ed.; Kluwer Academic Publishers: Dordrecht, The Netherlands, 1989; pp. 3–58. [Google Scholar]
  4. McMullen, C. Families of rational maps and iterative root-finding algorithms. Ann. Math. 1987, 125, 467–493. [Google Scholar] [CrossRef] [Green Version]
  5. Hawkins, J. McMullen’s root-finding algorithm for cubic polynomials. Proc. Am. Math. Soc. 2002, 130, 2583–2592. [Google Scholar] [CrossRef]
  6. Honorato, G.; Plaza, S. Dynamical aspects of some convex acceleration methods as purely iterative algorithm for Newton’s maps. Appl. Math. Comput. 2015, 251, 507–520. [Google Scholar] [CrossRef]
  7. Arney, D.C.; Robinson, B.T. Exhibiting chaos and fractals with a microcomputer. Comput. Math. Appl. 1990, 19, 1–11. [Google Scholar] [CrossRef] [Green Version]
  8. Buff, X.; Henriksen, C. König’s root-finding algorithms. Nonlinearity 2003, 16, 989–1015. [Google Scholar] [CrossRef]
  9. Crane, E. Mean value conjectures for rational maps. Complex Var. Elliptic Equ. 2006, 51, 41–550. [Google Scholar] [CrossRef]
  10. Curry, J.H.; Garnett, L.; Sullivan, D. On the iteration of a rational function: Computer experiment with Newton’s method. Commun. Math. Phys. 1983, 91, 267–277. [Google Scholar] [CrossRef]
  11. Drakopoulos, V. On the additional fixed points of Schröder iteration functions associated with a one–parameter family of cubic polynomials. Comput. Graph. 1998, 22, 629–634. [Google Scholar] [CrossRef]
  12. Honorato, G.; Plaza, S.; Romero, N. Dynamics of a higher-order family of iterative methods. J. Complex. 2011, 27, 221–229. [Google Scholar] [CrossRef] [Green Version]
  13. Kneisl, K. Julia sets for the super-Newton method, Cauchy’s method, and Halley’s method. Chaos 2001, 11, 359–370. [Google Scholar] [CrossRef] [PubMed]
  14. Rückert, J.; Schleicher, D. On Newton’s method for entire functions. J. Lond. Math. Soc. 2007, 75, 659–676. [Google Scholar] [CrossRef] [Green Version]
  15. Sarría, Í.; González, R.; González-Castaño, A.; Magreñán, Á.A.; Orcos, L. A pedagogical tool based on the development of a computer application to improve learning in advanced mathematics. Rev. Esp. Pedagog. 2019, 77, 457–485. [Google Scholar]
  16. Vrscay, E. Julia sets and Mandelbrot–like sets associated with higher order Schröder rational iteration functions: A computer assisted study. Math. Comput. 1986, 46, 151–169. [Google Scholar]
  17. Blanchard, P. Complex Analytic Dynamics on the Riemann Sphere. Bull. AMS 1984, 11, 85–141. [Google Scholar] [CrossRef] [Green Version]
  18. Milnor, J. Dynamics in One Complex Variable: Introductory Lectures, 3rd ed.; Princeton U. Press: Princeton, NJ, USA, 2006. [Google Scholar]
  19. Smale, S. On the efficiency of algorithms of analysis. Bull. AMS 1985, 13, 87–121. [Google Scholar] [CrossRef]
  20. Amorós, C.; Argyros, I.K.; González, D.; Magreñán, A.A.; Regmi, S.; Sarría, Í. New improvement of the domain of parameters for newton’s method. Mathematics 2020, 8, 103. [Google Scholar] [CrossRef] [Green Version]
  21. Vrscay, E.; Gilbert, W. Extraneous fixed points, basin boundary and chaotic dynamics for Schröder and König rational iteration functions. Numer. Math. 1988, 52, 1–16. [Google Scholar] [CrossRef]
  22. Varona, J.L. Graphic and numerical comparison between iterative methods. Math. Intell. 2002, 24, 37–46. [Google Scholar] [CrossRef]
  23. Whittaker, E.T. A formula for the solution of algebraic and transcendental equations. Proc. Edinburgh Math. Soc. 1918, 36, 103–106. [Google Scholar] [CrossRef] [Green Version]
  24. Ezquerro, J.A.; Hernández, M.A. On a convex acceleration of Newton’s method. J. Optim. Theory Appl. 1999, 100, 311–326. [Google Scholar] [CrossRef]
  25. Schröder, E. On infinitely many algorithms for solving equations. Math. Annal. 1870, 2, 317–365. [Google Scholar] [CrossRef] [Green Version]
  26. Schröder, E. Ueber iterirte Functionen. Math. Ann. 1870, 3, 296–322. (In German) [Google Scholar] [CrossRef]
  27. Pomentale, T. A class of iterative methods for holomorphic functions. Numer. Math. 1971, 18, 193–203. [Google Scholar] [CrossRef]
  28. Gilbert, W.J. Newton’s method for multiple roots. Comput. Graph. 1994, 18, 227–229. [Google Scholar] [CrossRef]
  29. Gutiérrez, J.M.; Hernández, M.A. An acceleration of Newton’s method: Super-Halley method. Appl. Math. Comput. 2001, 117, 223–239. [Google Scholar] [CrossRef]
  30. Gutiérrez, J.M.; Hernández, M.A. A family of Chebyshev-Halley type methods in Banach spaces. Bull. Aust. Math. Soc. 1997, 55, 113–130. [Google Scholar] [CrossRef] [Green Version]
  31. Cordero, A.; Torregrosa, J.R.; Vindel, P. Dynamics of a family of Chebyshev Halley type methods. Appl. Math. Comput. 2013, 219, 8568–8583. [Google Scholar] [CrossRef] [Green Version]
  32. Amat, S.; Busquier, S.; Plaza, S. Review of some iterative root-finding methods from a dynamical point of view. Sci. Ser. A Math. Sci. 2004, 10, 3–35. [Google Scholar]
  33. Amat, S.; Busquier, S.; Gutiérrez, J.M. Geometric constructions of iterative functions to solve nonlinear equations. J. Comput. Appl. Math. 2003, 157, 197–205. [Google Scholar] [CrossRef] [Green Version]
  34. Amorós, C.; Argyros, I.K.; Magreñán, Á.A.; Regmi, S.; González, R.; Sicilia, J.A. Extending the applicability of Stirling’s method. Mathematics 2020, 8, 35. [Google Scholar] [CrossRef] [Green Version]
  35. Argyros, I.K.; Chen, D. Results on the Chebyshev method in Banach spaces. Proyecciones 1993, 12, 119–128. [Google Scholar] [CrossRef]
  36. Drakopoulos, V.; Argyropoulos, N.; Böhm, A. Generalized computation of Schröder iteration functions to motivated families of Julia and Mandelbrot–like sets. SIAM J. Numer. Anal. 1999, 36, 417–435. [Google Scholar] [CrossRef]
  37. Hernández, M.A.; Salanova, M.A. A family of Chebyshev-Halley type methods. Int. J. Comput. Math. 1993, 47, 59–63. [Google Scholar] [CrossRef]
  38. Moysi, A.; Argyros, I.K.; Regmi, S.; González, D.; Magreñán, Á.A.; Sicilia, J.A. Convergence and dynamics of a higher-order method. Symmetry 2020, 12, 420. [Google Scholar] [CrossRef] [Green Version]
  39. Plaza, S. Conjugacy classes of some numerical methods. Proyecciones 2001, 20, 1–17. [Google Scholar] [CrossRef] [Green Version]
  40. Argyros, I.K.; Magreñán, Á.A. Iterative Methods and Their Dynamics with Applications: A Contemporary Study; CRC Press: Boca Raton, FL, USA; Taylor & Francis Group: Boca Raton, FL, USA, 2017. [Google Scholar]
  41. Magreñán, Á.A.; Argyros, I.K. A Contemporary Study of Iterative Methods: Convergence, Dynamics and Applications; Elsevier: Amsterdam, The Netherlands, 2017. [Google Scholar]
  42. Magrenán, Á.A. Different anomalies in a Jarratt family of iterative root-finding methods. Appl. Math. Comput. 2014, 233, 29–38. [Google Scholar]
  43. Magreñán, Á.A. A new tool to study real and complex dynamics: The convergence plane. Appl. Math. Comput. 2014, 248, 215–224. [Google Scholar]
  44. Amat, S.; Argyros, I.K.; Busquier, S.; Magreñán, Á.A. Local convergence and the dynamics of a two-point four parameter Jarratt-like method under weak conditions. Numer. Algorithms 2017, 74, 371–391. [Google Scholar] [CrossRef]
Figure 1. Stability region of z = 1 .
Figure 1. Stability region of z = 1 .
Mathematics 08 01158 g001
Figure 2. Stability region of e x 1 ( a 0 , a 1 ) .
Figure 2. Stability region of e x 1 ( a 0 , a 1 ) .
Mathematics 08 01158 g002
Figure 3. Stability region of e x 2 ( a 0 , a 1 ) .
Figure 3. Stability region of e x 2 ( a 0 , a 1 ) .
Mathematics 08 01158 g003
Figure 4. Parameter space associated to the free critical point c r 1 ( a 0 , a 1 ) .
Figure 4. Parameter space associated to the free critical point c r 1 ( a 0 , a 1 ) .
Mathematics 08 01158 g004
Figure 5. Basins of attraction associated to the member of the family a 0 = 5 and a 1 = 4 .
Figure 5. Basins of attraction associated to the member of the family a 0 = 5 and a 1 = 4 .
Mathematics 08 01158 g005
Figure 6. Basins of attraction associated to the member of the family a 0 = 1 and a 1 = 3 / 4 . This member has fifth order of convergence.
Figure 6. Basins of attraction associated to the member of the family a 0 = 1 and a 1 = 3 / 4 . This member has fifth order of convergence.
Mathematics 08 01158 g006
Figure 7. Basins of attraction associated to the member of the family a 0 = 3.25 and a 1 = 4.5 .
Figure 7. Basins of attraction associated to the member of the family a 0 = 3.25 and a 1 = 4.5 .
Mathematics 08 01158 g007
Figure 8. Basins of attraction associated to the member of the family a 0 = 2.75 and a 1 = 4.75 .
Figure 8. Basins of attraction associated to the member of the family a 0 = 2.75 and a 1 = 4.75 .
Mathematics 08 01158 g008
Figure 9. Basins of attraction associated to the member of the family a 0 = 3.6 and a 1 = 4.75 . The black zones are related to the convergence of a 2-cycle.
Figure 9. Basins of attraction associated to the member of the family a 0 = 3.6 and a 1 = 4.75 . The black zones are related to the convergence of a 2-cycle.
Mathematics 08 01158 g009
Figure 10. Basins of attraction associated to the member of the family a 0 = 5 and a 1 = 0.1375 . The black zones in this case correspond with zones of convergence to a 3-cycle.
Figure 10. Basins of attraction associated to the member of the family a 0 = 5 and a 1 = 0.1375 . The black zones in this case correspond with zones of convergence to a 3-cycle.
Mathematics 08 01158 g010

Share and Cite

MDPI and ACS Style

Amat, S.; Castro, R.; Honorato, G.; Magreñán, Á.A. Purely Iterative Algorithms for Newton’s Maps and General Convergence. Mathematics 2020, 8, 1158. https://doi.org/10.3390/math8071158

AMA Style

Amat S, Castro R, Honorato G, Magreñán ÁA. Purely Iterative Algorithms for Newton’s Maps and General Convergence. Mathematics. 2020; 8(7):1158. https://doi.org/10.3390/math8071158

Chicago/Turabian Style

Amat, Sergio, Rodrigo Castro, Gerardo Honorato, and Á. A. Magreñán. 2020. "Purely Iterative Algorithms for Newton’s Maps and General Convergence" Mathematics 8, no. 7: 1158. https://doi.org/10.3390/math8071158

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop