Next Article in Journal
Real and Complex Potentials as Solutions to Planar Inverse Problem of Newtonian Dynamics
Previous Article in Journal
Fixed Point Theorems for Set-Valued Contractions in Metric Spaces
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Analysis of Traub’s Method for Cubic Polynomials

Department of Applied Mathematics, Naval Postgraduate School, Monterey, CA 93943, USA
Axioms 2024, 13(2), 87; https://doi.org/10.3390/axioms13020087
Submission received: 3 January 2024 / Revised: 22 January 2024 / Accepted: 25 January 2024 / Published: 27 January 2024
(This article belongs to the Section Mathematical Analysis)

Abstract

:
The dynamical analysis of Kurchatov’s scheme is extended to Traub’s method. The difference here is that Traub’s method requires two additional starting points. Therefore, the map is three-dimensional instead of 2-D. We obtain a complete description of the dynamical planes and show that the method is stable for cubic polynomials.

1. Introduction

The solution to a single nonlinear equation,
f ( x ) = 0 ,
can be found in applied science and engineering. For example, the Colebrook equation [1] is used to find the friction factor. Most numerical solution methods are based on Newton’s scheme; i.e., starting with an initial guess x 0 for the root ξ , we create a sequence { x n } :
x n + 1 = x n f ( x n ) f ( x n ) .
The convergence is quadratic, that is,
| x n + 1 ξ | C 2 | x n ξ | 2 .
To increase the order of convergence, p, one has to include higher derivatives, such as in Halley’s scheme [2], using first and second derivatives to achieve cubic convergence. In order to avoid higher derivatives, one can use multipoint methods; see, e.g., Traub [3] or Petković et al. [4].
Derivative-free one-step methods are either linear (such as Picard), superlinear (such as secant), or even quadratic, such as Steffensen’s method [5] or Kurchatov [6,7]. Clearly, we can construct multipoint methods that are derivative-free. Most of those are based on Steffensen’s method to achieve the highest possible order for the least number of function evaluations; see, for example, the fourth-order derivative-free methods in [8,9].
In a recent article, Neta [10] shows that there is a better choice for a first step, even though it is NOT second order. Traub’s method [3], given by
x n + 1 = x n f ( x n ) ( f ( x n 2 ) f ( x n ) ) ( x n 2 x n ) f ( x n 2 ) f ( x n 1 ) x n 2 x n 1 + f ( x n 1 ) f ( x n ) x n 1 x n ,
is of the order 1.839, and it runs faster and has better dynamics than several other derivative-free methods; see Neta [11], where several methods were compared to solve the discretized nonlinear diffusion model with source terms. Clearly, one cannot obtain optimal methods (see Kung and Traub [12]) this way. Kung and Traub [12] conjectured that multipoint methods without memory using d function evaluations could have an order no larger than 2 d 1 . The efficiency index I is defined as p 1 / d . Thus, an optimal method of the order p = 8 has an efficiency index of I = 8 1 / 4 1.6817 , and an optimal method of the order p = 4 has an efficiency index of I = 4 1 / 3 1.5874 , which is better than Newton’s method for which I = 2 1.4142 . The efficiency index of the optimal method cannot reach a value of 2. In fact, realistically, one uses methods of the order 8 at most. For high-order derivative-free methods based on Steffensen’s method as the first step, see Zhanlav and Otgondorj [13] and references there. Such methods are especially useful when the derivative is very expensive to evaluate and, of course, when the function is nondifferentiable.
If we denote x n 2 = w and x n 1 = z , then the Traub algorithm can be studied from a dynamical point of view as a three-dimensional map:
T : w z x > z x x f ( x ) ( f ( w ) f ( x ) ) w x f ( w ) f ( z ) w z + f ( z ) f ( x ) z x .
Our study extends the results of Garijo and Jarque [14] for the secant method and of Campos et al. [7] for Kurchatov’s scheme. Both methods require only one memory point.
For the cubic polynomial
p 3 ( x ) = x ( x 1 ) ( x a ) ,
the map becomes
T a : w z x > z x N ( w , z , x ) D ( w , z , x ) ,
where
N ( w , z , x ) = ( x 2 ( a w z + 1 ) x w z ) x ,
and
D ( w , z , x ) = 2 x 2 ( 2 a w z + 2 ) x w z + a .
The set of nondefinition of T a is given by
δ T a = ( w , z , x ) R 3 / 2 x 2 ( 2 a w z + 2 ) x w z + a = 0 .
Let
E T = R 3 \ n 0 T a n ( δ T a ) .
Points in E T define a natural subset of R 3 , where all iterates of T a are well defined, and so T a : E T E T defines a smooth dynamical system.
The surface D = 0 is a quadric surface. In general, a quadric surface [15] is given by
A 1 x 2 + A 2 w 2 + A 3 z 2 + A 4 x z + A 5 x w + A 6 w z + A 7 x + A 8 w + A 9 z + A 10 = 0 .
By means of rotation and translation, we can obtain one of these forms:
A x 2 + B w 2 + C z 2 + J = 0 A x 2 + B w 2 + I z = 0 .
In our case, A 1 = 2 ,   A 2 = A 3 = A 8 = A 9 = 0 ,   A 4 = A 5 = 1 ,   A 6 = 1 ,   A 7 = 2 ( a + 1 ) , and A 10 = a . If we take the following transformation to a new coordinate system ξ η λ :
x = ξ + 3 η + 2 λ , w = a 2 ξ + b 2 η + c 2 λ , z = 2 η + λ ,
we obtain three equations for the unknown whose solution is a 2 = 4 ,   b 2 = 18 ,   c 2 = 13 , and the surface becomes
6 ξ 2 + 6 η 2 3 λ 2 2 ( a + 1 ) ξ 6 ( a + 1 ) η 4 ( a + 1 ) λ + a = 0 .
Now we perform a translation to move the origin:
Ξ = ξ a + 1 6 , H = η a + 1 2 , Λ = λ + 2 3 ( a + 1 ) .
The equation becomes
Ξ 2 + H 2 1 2 Λ 2 = 1 6 ( a + 1 3 ( a + 1 ) 2 ) .
This surface is a hyperboloid of one sheet. The axis of symmetry is the Λ axis. The trace of the surface can be obtained by taking Λ as constant. If Λ = Λ 0 , then the equation of the trace is
Ξ 2 + H 2 = 1 6 ( a + 1 3 ( a + 1 ) 2 + 3 Λ 0 2 ) .
This is a circle centered at Ξ = 0 , H = 0 . The radius is
r ( a ) = 1 6 ( a + ( a + 1 ) 2 3 + 3 Λ 0 2 ) .
Clearly, the minimum value of the radius is when Λ 0 = 0 and a = 1 2 . At that point, the radius is 1 24 ; see Figure 1, Figure 2 and Figure 3.
Plotting the surface δ T a (10) (see Figure 1, Figure 2 and Figure 3), we see a hyperboloid of one sheet for three different values of a. Notice how the radius changes as a function of the value of a.
The origin in the Ξ H Λ coordinate system corresponds to x = w = z = a + 1 3 .
If x = 0 , then both (8) and (9) vanish. The limit as x tending toward zero is zero, since the numerator is a polynomial in x of a degree higher than the denominator. This is called a focal point. A focal point Q is called simple if the tangent planes to the surfaces N and D are not parallel. This can be expressed by the condition
N × D 0 at   Q .
We now evaluate the gradient of the surfaces N ( w , z , x ) and D ( w , z , x ) at x = 0
N w = x 2 z x , N z = x 2 w x , N x = 3 x 2 2 ( a + 1 w z ) x w z , D w = x z , D z = x w , D x = 4 x ( 2 a + 2 w z ) .
Evaluating the derivatives at x = 0 and then substituting in (18), we have
N × D = | w z | w 2 + z 2 = | a | w 2 + z 2 0 .
The surface D ( w , z , x ) = 0 can be described as follows: If x = 0 (on the w z coordinate plane), we have the hyperbola w z = a . If x = z (a slanted plane through the w coordinate axis), then x = z = α 1 , 2 and w is free, that is, lines through the given points. Similarly, if x = w , then x = w = α 1 , 2 and we have lines. If x = w = z , we obtain the two points:
α 1 , 2 = ( a + 1 ) ± a 2 a + 1 3 .
If w = z again, we obtain a hyperbola:
2 x 2 + 2 ( a + 1 ) x 2 x w + w 2 a = 0 .
If x = α 1 , 2 , then D = 0 but N 0 . Similarly, if x = z or x = w , N 0 . If w = z , then when D = 0 , we can see if there is a point on the hyperbola (20) that will satisfy N = 0 .
Since x 0 and w = z , we have the set of equations
x 2 ( a + 1 ) x w 2 + 2 w x = 0 , 2 x 2 + 2 ( a + 1 ) x + w 2 2 w x = a .
Adding the two equation, we have
x 2 + ( a + 1 ) x = a ,
whose two solutions are x = a and x = 1 . For x = a , we obtain w = z = a ± a ( a 1 ) . For x = 1 , we obtain w = z = 1 ± 1 a .
The fixed points of the map,
T a ( w 0 , z 0 , x 0 ) = ( w 0 , z 0 , x 0 ) ,
are
w = z , z = x , x = ( x 2 ( a w z + 1 ) x w z ) x 2 x 2 ( 2 a w z + 2 ) x w z + a ,
or
x ( 2 x 2 ( 2 a 2 x + 2 ) x x 2 + a ) = x ( x 2 ( a + 1 2 x ) x x 2 ) .
One solution is x = 0 , and the other two are the solution of
x 2 ( a + 1 ) x + a = 0 ,
which are x = 1 and x = a . The fixed points coincide with the roots of p 3 ( x ) .
In order to study their stability, we build the Jacobian matrix
J T a ( w , z , x ) = 0 1 0 0 0 1 G w G z G x ,
where G ( w , z , x ) = x ( x 2 ( a + 1 w z ) x w z ) ( 2 x 2 ( 2 a + 2 w z ) x w z + a ) and the partial derivatives are
G w = 1 4 x ( x 1 ) ( x z ) ( a x ) ( x 2 + ( a w / 2 z / 2 + 1 ) x + w z / 2 a / 2 ) 2 ,
G z = 1 4 x ( x 1 ) ( w x ) ( a x ) ( x 2 + ( a w / 2 z / 2 + 1 ) x + w z / 2 a / 2 ) 2 ,
G x = 1 4 ( 2 x 4 + ( 4 a + 2 w + 2 z 4 ) x 3 + q 2 ( x , w , z ) x 2 q 1 ( w , z , x ) ) ( x 2 + ( a w / 2 z / 2 + 1 ) x + w z / 2 a / 2 ) 2 ,
with
q 2 ( w , z , x ) = w 2 + ( 3 a + z 3 ) w + z 2 + ( 3 a 3 ) z + 2 a 2 + 7 a + 2 ,
q 1 ( w , z , x ) = 2 ( a + 1 w z ) ( w z + a ) x + ( w z ) 2 a w z .
The partial derivatives at the roots are G w = G z = 0 and
G x = w z w z a at x = 0 , ( w 1 ) ( z 1 ) ( w 1 ) ( z 1 ) + a 1 at x = 1 , ( w a ) ( z a ) ( w a ) ( z a ) + ( 1 a ) a at x = a .
Therefore, the fixed points are attractive. In Figure 4, we plot the basins of attraction [16] of Traub’s method for the cubic polynomial (6) for three values of the parameter a, to the left of zero ( a = 1.5 ), to the right of 1 ( a = 1.8 ), and in between ( a = 0.4 ).

2. Dynamical Study of the Methods

Let us first analyze the curves or surfaces that are mapped on fixed points of the operator T a ( w , z , x ) :
T a ( w , z , x ) = ( 0 , 0 , 0 ) z = x = 0   and   G ( w , z , x ) = 0 , G ( w , 0 , 0 ) = 0 , G ( w , 0 , 0 ) = 0 a = 0 ;
therefore, T a ( w , z , x ) maps to the fixed point at the origin if and only if z = x = 0 . On the other hand,
T a ( w , z , x ) = ( 1 , 1 , 1 ) z = x = 1   and   G ( w , 1 , 1 ) = 1 , 1 a 1 a = 1 .
Therefore, T a ( w , z , x ) = ( 1 , 1 , 1 ) z = x = 1 . Finally,
T a ( w , z , x ) = ( a , a , a ) z = x = a   and   G ( w , a , a ) = a , a ( a 2 ( a + 1 w a ) a w a ) 2 a 2 ( 2 a + 2 w a ) a w a + a = a ;
then, T a ( w , z , x ) = ( a , a , a ) z = x = a and plane z = x maps T a ( w , z , x ) to its fixed points.

2.1. Focal Points and Prefocal Curves

Taking into account that focal points are those satisfying that at least one component is 0 / 0 with finite limit, the focal points of T a satisfy
x ( x 2 ( a w z + 1 ) x w z ) = 0 ,
2 x 2 ( 2 a w z + 2 ) x w z + a = 0 .
  • Therefore, if in (23) we consider x = 0 and replace it in (24), we obtain the hyperbola w z = a ; see Figure 5.
    Figure 5. Focal curve for x = 0 for various values of a.
    Figure 5. Focal curve for x = 0 for various values of a.
    Axioms 13 00087 g005
  • N = 0 also if x 2 ( a w z + 1 ) x w z = 0 ; then x 2 a x x = w z w x z x and replacing in (24), we obtain the surface in R 3 ,
    w ( z x ) z x + a = 0 .
    However, x 2 a x x = w z w x z x , so
    x 2 ( a + 1 ) x + a = 0 x = a or   x = 1 .
    If x = 1 , the focal curve is w z w z + a = 0 (Figure 6), and if x = a , w z a w a z + a = 0 (Figure 7).
Figure 6. Focal curve for x = 1 for various values of a.
Figure 6. Focal curve for x = 1 for various values of a.
Axioms 13 00087 g006
Figure 7. Focal curve for x = a for various values of a.
Figure 7. Focal curve for x = a for various values of a.
Axioms 13 00087 g007
Thus, in all cases, the focal curve is a hyperbola. Notice that in Figure 7, we plot the cases that a = 0 and a = 1 . Those plots are just the asymptotes of the hyperbolas in Figure 5 and Figure 6.
Focal   curves w z = a x = 0 , w z w z + a = 0 x = 1 , w z a ( w + z ) + a = 0 x = a .
To show that these points on the focal curves are all simple, we use (18)
N × D = | a | w 2 + z 2 0 at x = 0 , | ( w 1 ) ( 1 z ) | ( w 1 ) 2 + ( z 1 ) 2 = | a 1 | ( w 1 ) 2 + ( z 1 ) 2 0 at x = 1 , | ( w a ) ( a z ) | ( w a ) 2 + ( z a ) 2 = | a ( 1 a ) | ( w a ) 2 + ( z a ) 2 0 at x = a .
We now look at the prefocal surface. A point Q in δ T a is a prefocal point if T 3 , the third component of T a (see (7)), evaluated at Q takes the form 0 / 0 (i.e., N ( Q ) = D ( Q ) = 0 ), and there exists a smooth simple arc γ : = γ ( t ) , t ( ϵ , ϵ ) , with γ ( 0 ) = Q such that lim t 0 T 3 ( γ ) exists and it is finite. Moreover, the line L Q = ( w , z , x ) R 3 , ( w , z ) = F ( Q ) , where F ( Q ) is the first two components of T a . The line is the intersection of the two planes w = z and z = x evaluated at Q when x = a . The equations of the line are
w = w 0 + t , z = z 0 + t , x = a + t .
On δ T a , x = a (for a 0 and a 1 ) and w z a ( w + z ) + a = 0 . Given w 0 on this hyperbola, we obtain z 0 = a w 0 1 w 0 a , and the line for each w 0 is
w = w 0 + t z = a w 0 1 w 0 a + t , x = a + t .
The set of all these lines (for all possible w 0 a on the focal curve) is the prefocal surface. This surface is a cylinder, and the lines are called rulings; see the definition in [15]. In Figure 8, we show the prefocal surface for a given a value.

2.2. Inverse of Ta and Its Properties

In order to understand the global dynamics of the map T a , it is important to find the inverse map T a 1 and in which regions of the phase space the inverse is defined. Let ( w , z , x ) be a given point; then
T a 1 w z x = w = z 3 ( a + 1 w 2 x ) z 2 + ( 2 a + 2 w ) z x a x ( x z ) ( z w ) z = w x = z .
The value of w is obtained by solving x = N ( w , w , z ) D ( w , w , z ) . We now look at the signs of the following three functions:
1.
x z ;
2.
z w ;
3.
z 3 ( a w + 1 2 x ) z 2 + ( 2 a + 2 w ) x z a x .
If all three or just one of them is positive, then w > 0 ; otherwise, w < 0 . The two planes z = x and z = w and the surface defined by the cubic z 3 ( a w + 1 2 x ) z 2 + ( 2 a + 2 w ) x z a x = 0 are the boundaries of the regions. See Figure 9 for the surface defined by the cubic for two different values of the parameter a.
The three surfaces meet at three possible points: the first is z = x = w = 0 , and the other two solutions are given by solving the quadratic equation (using x = z = w )
3 x 2 + ( a + 1 ) x a = 0 .
This equation has two complex roots as long as a ( 7 48 , 7 + 48 ) . For other values of a, the real roots are
a 1 ± a 2 + 14 a + 1 6 .
Notice that unlike the secant [14] and Kurchatov’s [7] schemes, the solutions are not the three fixed points.
These surfaces form the boundary of the regions, but they are not a locus of critical points where the two inverses are defined and merged, since the denominator vanishes when x = z = w . The locus is
L C = ( w . z , x ) R 3 / x = z 3 ( a + 1 w ) z 2 2 z 2 ( 2 a + 2 w ) z + a .
Merging preimages are also in another set, denoted L C 1 , that it is included in the set of points for which the determinant of the Jacobian vanishes, i.e., x = z , or which is the same G w = 0 (see (22).) The set L C 1 = ( w . z , x ) R 3 / d e t ( J ( T a ) = 0 has critical points z = x identical to the points of intersection of the boundaries above. In the case where z = 0 , then w can take any value. In the case where z satisfies (29), then w satisfies the quadratic equation
z w 2 + ( 2 z 2 + 2 a 3 ( a + 1 ) w ) z = 2 z 3 + 4 ( a + 1 ) z 2 3 a z 2 ( a + 1 ) 2 z + 2 a ( a + 1 ) .

3. Conclusions

The oldest method with memory is probably the secant method where one replaces the slope of the tangent line in Newton’s method by the slope of the secant. This, in turn, requires the use of an additional point besides the previous one. We generalize the analysis for two methods having one memory point (namely, the secant and Kurchatov’s schemes) to a method requiring two memory points. In this paper, we chose to study Traub’s method, which received a lot of attention recently. The analysis in three dimensions is not a trivial extension of the two-dimensional analysis in the literature. However, the generalization to methods using more than two memory points is relatively simple.
We have proved that Traub’s method is completely stable for any polynomial of the degree of 3 since the only attractors for any value of the parameter are the roots of the polynomial.

Funding

This research received no external funding.

Data Availability Statement

Data is contained within the article.

Acknowledgments

The author thanks Cordero for stimulating correspondence on the topic.

Conflicts of Interest

The author declares no conflict of interest.

References

  1. Colebrook, C.F. Turbulent flows in pipes, with particular reference to the transition between the smooth and rough pipe laws. J. Inst. Civ. Eng. 1938, 11, 130. [Google Scholar] [CrossRef]
  2. Halley, E. A new, exact and easy method of finding the roots of equations generally and that without any previous reduction. Philos. Trans. R. Soc. Lond 1694, 18, 136–148. [Google Scholar]
  3. Traub, J.F. Iterative Methods for the Solution of Equations; Prentice Hall: New York, NY, USA, 1964. [Google Scholar]
  4. Petković, M.S.; Neta, B.; Petković, L.D.; Džunić, J. Multipoint Methods for the Solution of Nonlinear Equations; Elsevier: Amsterdam, The Netherlands, 2012. [Google Scholar]
  5. Steffensen, J.F. Remarks on iteration. Scand. Actuar. J. 1933, 1, 64–72. [Google Scholar] [CrossRef]
  6. Kurchatov, V.A. On a method of linear interpolation for thew solution of functional equations (Russian). In Doklady Akademii Nauk; Russian Academy of Sciences: Moscow, Russia, 1971; Volume 198, pp. 524–526. [Google Scholar]
  7. Campos, B.; Cordero, A.; Torregrosa, J.R.; Vindel, P. Dynamical analysis of an iterative method with memory on a family of third-degree polynomials. AIMS Math. 2022, 7, 6445–6466. [Google Scholar] [CrossRef]
  8. Ren, H.; Wu, Q.; Bi, W. A class of two-step Steffensen type methods with fourth-order convergence. Appl. Math. Comput. 2009, 209, 206–210. [Google Scholar] [CrossRef]
  9. Sharma, J.R.; Arora, H. An efficient derivative-free iterative method for solving systems of nonlinear equations. Appl. Anal. Discret. Math. 2013, 7, 390–403. [Google Scholar] [CrossRef]
  10. Neta, B. Basin attractors for derivative-free methods to find simple roots of nonlinear equations. J. Numer. Anal. Approx. Theory 2020, 49, 177–189. [Google Scholar] [CrossRef]
  11. Neta, B. Comparison of several numerical solvers for a discretized nonlinear diffusion model with source terms. Georgian Math. J. 2023. Accepted for Publication. [Google Scholar] [CrossRef]
  12. Kung, H.T.; Traub, J.F. Optimal orderof one-point and multipoint iteration. J. Assoc. Comput. Math. 1974, 21, 634–651. [Google Scholar] [CrossRef]
  13. Zhanlav, T.; Otgondorj, K. Comparison of some optimal derivative-free three-point iterations. J. Numer. Anal. Approx. Theory 2020, 49, 76–90. [Google Scholar] [CrossRef]
  14. Garijo, A.; Jarque, X. Global dynamics of the real secant method. Nonlinearity 2019, 32, 4557–4578. [Google Scholar] [CrossRef]
  15. Stewart, J. Calculus, Early Transcendentals; Brooks/Cole: Pacific Grove, CA, USA, 1995. [Google Scholar]
  16. Stewart, B.D. Attractor Basins of Various Root-Finding Methods. Master’s Thesis, Naval Postgraduate School, Department of Applied Mathematics, Monterey, CA, USA, 2001. [Google Scholar]
Figure 1. Surface on which D = 0 for a = 1.5 . The minimum value of the radius is r = 19 / 72 .
Figure 1. Surface on which D = 0 for a = 1.5 . The minimum value of the radius is r = 19 / 72 .
Axioms 13 00087 g001
Figure 2. Surface on which D = 0 for a = 0.5 . The minimum value of the radius is r = 1 / 24 .
Figure 2. Surface on which D = 0 for a = 0.5 . The minimum value of the radius is r = 1 / 24 .
Axioms 13 00087 g002
Figure 3. Surface on which D = 0 for a = 2.0 . The minimum value of the radius is r = 1 / 6 .
Figure 3. Surface on which D = 0 for a = 2.0 . The minimum value of the radius is r = 1 / 6 .
Axioms 13 00087 g003
Figure 4. Dynamical planes of Traub’s method for different values of the parameter a.
Figure 4. Dynamical planes of Traub’s method for different values of the parameter a.
Axioms 13 00087 g004
Figure 8. Prefocal surfaces of Traub’s method for different values of the parameter a.
Figure 8. Prefocal surfaces of Traub’s method for different values of the parameter a.
Axioms 13 00087 g008
Figure 9. One of the boundaries of the region for different values of the parameter a. The other two boundaries are the planes x = z and w = z .
Figure 9. One of the boundaries of the region for different values of the parameter a. The other two boundaries are the planes x = z and w = z .
Axioms 13 00087 g009
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Neta, B. Analysis of Traub’s Method for Cubic Polynomials. Axioms 2024, 13, 87. https://doi.org/10.3390/axioms13020087

AMA Style

Neta B. Analysis of Traub’s Method for Cubic Polynomials. Axioms. 2024; 13(2):87. https://doi.org/10.3390/axioms13020087

Chicago/Turabian Style

Neta, Beny. 2024. "Analysis of Traub’s Method for Cubic Polynomials" Axioms 13, no. 2: 87. https://doi.org/10.3390/axioms13020087

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop