Abstract
Many problems in scientific research are reduced to a nonlinear equation by mathematical means of modeling. The solutions of such equations are found mostly iteratively. Then, the convergence order is routinely shown using Taylor formulas, which in turn make sufficient assumptions about derivatives which are not present in the iterative method at hand. This technique restricts the usage of the method which may converge even if these assumptions, which are not also necessary, hold. The utilization of these methods can be extended under less restrictive conditions. This new paper contributes in this direction, since the convergence is established by assumptions restricted exclusively on the functions present on the method. Although the technique is demonstrated on a two-step Traub-type method with usually symmetric parameters and weight functions, due to its generality it can be extended to other methods defined on the real line or more abstract spaces. Numerical experimentation complement and further validate the theory.
MSC:
65G99; 65H10
1. Introduction
A nonlinear equation or system of nonlinear equations is commonly used to solve a wide range of applied science problems from several domains, including economics, engineering, applied mathematics, health science, physics, and chemistry. An example of such a system is given below:
where [1,2,3,4,5]. The exact solution of Equation (1) is almost nonexistent. Therefore, we have to obtain the solutions to such equations by relying on iterative methods.
A popular technique for finding approximations to the roots of a nonlinear equation is Newton’s method, which has a quadratic order of convergence for simple roots. Newton’s method considers an initial point; the method uses the function’s value and its derivative to iteratively refine the approximation. This process is repeated until we reach the desired level of accuracy. Newton’s method is renowned for its rapid convergence, especially when the initial point is close to the required root. With the advancement of computer algebra, many researchers have proposed modified and extended versions of Newton’s method.
Specifically, we define the following two-step method for each by
where and are parameters, are non-vanishing functions for each point in their domain, and is any function. In addition, we assume that and in the whole study.
There exist specializations of the method (2). We have listed some of them.
- Newton’s method (order two) [1,6,7,8]
Set , in (2) to obtain
- Sharma’s Method (order three) [9]
Set , , i.e., , any and to obtain
- Traub–Kou Method (order three) [9,10,11,12,13]
Set , , i.e., , any and to obtain
- Traub–Potra–Pták Method (order three) [14,15,16]
Take , , i.e., , any and to obtain
- A New Fourth Order Method
Set , , to obtain
The fourth-order of this method is shown in Section 4.
Taylor expansion approaches are mostly used to find the convergence order. Generally, the proofs require high-order derivatives.
However, there are other problems under the Taylor series approach.
- Motivation
- (P1)
- We consider an academic but motivational example that contradicts the above factors. We choose and the function H on the interval by if , and , if , where are parameters satisfying . It is straightforward to see that solves the equation . However, the third derivative of function is not continuous at . However, the results utilizing Taylor series which require the continuity of or even higher cannot guarantee convergence to it.
- (P2)
- There are no prior computable error estimates for . Therefore, the number of iterations required to achieve a specified error tolerance cannot be determined in advance.
- (P3)
- No computable neighborhood of does not contain another solution to (1).
- (P4)
- The choice of to assure convergence of these method to is very difficult or impossible, since the radius of convergence is not usually determined.
- (P5)
- The convergence of iterative methods is established only for the real line.
- (P6)
- For the majority of these techniques, the challenging semi-local examination of convergence is not mentioned in the earlier studies.
Hence, the problem – constitute the motivation for writing the paper. These issues are addressed positively in this study. In particular, we have, respectively:
- Novelty
- The local convergence is assured using on the derivatives on the method, i.e.,
- A prior error estimates those which are computable, and are provided.Thus, the number of iterations required to achieve the tolerance is known in advance.
- A computable neighbourhood of is given that contains no other solution.
- The radius of convergence can be computed.
- The convergence is assured on S.
- Real majorizing sequences are utilised to establish the convergence [1,2,3]. The derivative is controlled using generalised continuity [1].
The approach is applied to the method (2). However, it can be used on other methods not necessarily defined in S, but in more general spaces such as Hilbert, Euclidean or Banach, along the same lines [2,3,4,5,6,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24]. Therefore, we can expand the usages of these methods for solving Equation (1) in cases not covered before. That is, – constitute the novelty of the paper.
The rest of the paper is organized as follows: Local and semi-local analyses are presented in Section 2 and Section 3, respectively. The convergence order of the method is discussed in Section 4. Examples are provided in Section 5, and conclusions are drawn in Section 6. It is also worth noting that (2) unifies the study of methods. Moreover, their convergence study is carried out under the same conditions. Consequently, a direct comparison between these methods becomes possible.
2. Convergence 1: Local Analysis
Some real functions defined on interval play a role in analysis. Suppose:
- (E1)
- There is a function that is continuous and non-decreasing (CD), such that the function has zeros in the interval . Let the smallest such zero be denoted as . Define .
- (E2)
- There exists a CD function such that for , defined byThe function has a , which is denoted by .
- (E3)
- Functions , are given as:andare such that functions and have in interval , which are denoted by and , respectively.Let and .Let , , and . Define the functions and by
- (E4)
- The function has a , which is denoted by . SetThen, we have for eachandThe functions and relate to the derivative on the method (2).
- (E5)
- Let .The notation is used to denote an open real interval with center x and of radius . Moreover, the symbol is the closure of interval .
- (E6)
- For each , we haveand
- (E7)
- .
Remark 1.
- (i)
- (ii)
- is the usual but not the most appropriate (necessarily) selection. However, in this case, is a simple solution of Equation (1). In Theorem 1, we do not assume is a simple solution. So, the method (2) is also useful to find solutions of multiplicity greater than the one provided that . The preceding notation, together with conditions –, is used in the local analysis of convergence for method (2). Related work on other methods for Banach space valued operators can be found in [1,7].
Theorem 1.
Suppose that the conditions – hold. Then, if , the sequence is well defined in the interval , such that the following items hold
where , (for and .
Proof.
The items (14)–(16) are shown using induction on n. By hypothesis , the item (14) clearly holds if , since . Pick . It follows (8), (9) and in turn that
So, by perturbation lemma due to Banach standard and
Then, notice that iterate is defined by first substep of method (2) if , since and
Thus, item (15) holds if , and the iterate .
Then, we show and . We obtain in turn if :
where and we use
Thus, we obtain
so,
Similarly, for
we have
It follows by (20) and (21) that the iterate is well-defined by the second substep of method (2) if , and
We need an estimate on the norm of . Notice that
leading to
An interval is found that contains only as solution.
Proposition 1.
Suppose there exist , such that condition holds in the interval and , such that
Set . Then, the only solution of Equation (1) in the set is .
Proof.
Let with . Set . The application of condition on the interval and (25) imply
So, . Then, from approximations
it follows . □
Remark 2.
Note that if all the conditions of Theorem 1 are satisfied in Proposition 1, we can set .
3. Convergence 2: Semi-Local Analysis
The calculations are similar to those in the previous section, but the roles of the solution and the functions and are now taken by the starting point and the functions and , respectively.
Suppose:
- (M1)
- There exists a CD function so that has a which is denoted by . Set .
- (M2)
- There exists a CD function .It is convenient to set and .Define the sequences and for , and each byThe following conditions assure the convergence of sequences and .
- (M3)
- There exists , such that for eachThis condition together with (28) imply and there exists such that . It is well known that is the unique least upper bound of the sequence . As in the local analysis, the functions and are related to .
- (M4)
- There exists and a non zero number L, such that for eachSet if we take in this condition and use the definition ofThus, , and the iterate exists. So, we can take
- (M5)
- For eachand
- (M6)
- .
Remark 3.
A popular solution for . (See also the explanations in Remark 1).
The semi-local analysis of convergence follows, as in the local case, in next set of results.
Theorem 2.
Suppose that the conditions – hold. Then, the sequence is well-defined and the following items hold
and exists as a solution of the equation , such that
Proof.
Induction shall establish the items (29)–(31). Clearly, (29) holds if . The choice of and the first substep of the method (2) imply
Thus, the item (30) holds if and the iterate . Now, we need the estimates that correspond to the local analysis
and for
and similarly,
Moreover, by subtracting first, in the second substep we obtain
The following estimates are also needed:
leading to
Furthermore, the first substep gives
So, we have
Consequently, we obtain
and
The uniqueness of the solution of Equation (1) is established in the next set of results.
Proposition 2.
Suppose there exists a solution of equation for some , the condition holds the interval and there exist , such that
Set . Then, the equation has only one solution in .
Proof.
Let with . Define . By the hypothesis and expression (38); we obtain in turn
so . Then, from approximation , we conclude that . □
Remark 4.
- (i)
- The limit point can be exchanged by in condition .
- (ii)
- In the Proposition 2, assume and under all conditions –.
4. The Convergence Order of Method (2)
In this section, we show that the convergence order of the method (2) is four, using Taylor series expansions [2,3].
Theorem 3.
Proof.
Set and and , . First, we expand about using Taylor series to obtain in turn
so
Consequently, we have
and
Similarly, we obtain
Substituting these formulas in (7), we obtain
Hence, this ends the proof. □
Remark 5.
Notice that in Theorem 3, the function H must be at least five times differentiable. Therefore, Theorem 3 is not applicable to solve the equation given in the Introduction using method (7).
5. Numerical Experiments
We performed a computational analysis to demonstrate the practical significance of our theoretical results. To this end, we selected four numerical problems to demonstrate our computational findings. These numerical examples are categorized into two groups based on convergence analysis: semi-local area convergence (SLAC) and local area convergence (LAC). The SLAC examples illustrate how the method behaves in a broader region around the initial guess, while the LAC examples focus on the convergence properties in a more restricted, localized area. This distinction helps us to better understand the method’s effectiveness in different ways.
- Local area convergence
We investigated local area convergence (LAC) using the first problem as a case study. Detailed information about this example is provided in (1). The radii of convergence are presented in Table 1.
- Semi-local area convergence
On the other hand, the remaining examples were selected to explore SLAC. The numerical results for semi-local convergence based on the well-known Planck’s radiation problem (2) are presented in Table 2. Table 3 provides the numerical results for the semi-local convergence of the continuous stirred tank reactor (CSTR), described in example (3), which is another well-known problem in applied science. Finally, the numerical results for semi-local convergence based on the blood rheology model (4) are shown in Table 4.
Furthermore, we also calculated the computational order of convergence (COC) for each example. The COC provides us a measure of how quickly the iterative method approaches the required solution. This can be determined using the following formulas:
or the approximate computational order of convergence [17,18] by:
The termination criteria for the program are defined as follows: (i) and (ii) , where . All computations were carried out using Mathematica 11 with multi-precision arithmetic. The specifications of the computer used for programming are as follows:
- Device Name: HP
- Edition: Windows 10 Enterprise
- Version: 22H2
- Installed RAM: 8.00 GB (7.89 GB usable)
- OS Build: 19045.2006
- Processor: Intel(R) Core(TM) i7-4790 CPU @ 3.60 GHz
- System type: 64-bit operating system, ×64-based processor
5.1. Examples for LAC
To illustrate the theoretical findings of local convergence, which are provided in Section 2, we select the Example (1).
Example 1.
Let and . Define the function H on for by
Then, for , where solves the equation . Moreover, we have
So, . Consequently, the conditions and hold, if we choose
The radii of convergence are given in Table 1.
Table 1.
Radii of method (7) for Example (1).
5.2. Examples for SLAC
We consider three examples, (2) through (4), to demonstrate the theoretical results of semi-local convergence proposed in Section 3. The methods under comparison are Newton’s method (3), Sharma’s method (4), the Potra–Pták method (6), and the method (7), which we refer to as (NM), (SM), (PM), and (OM), respectively.
Example 2. Planck’s radiation problem:
The spectral density of electromagnetic radiation emitted by a black body at a specific temperature and in thermal equilibrium is determined using Planck’s radiation equation [19], which is expressed as follows:
where and c denote the black-body’s absolute temperature, radiation wavelength, Boltzmann constant, Planck’s constant, and the speed of light in a vacuum, respectively. To determine the wavelength y that maximizes the energy density , we set . This results in the following formula:
By setting , we have
The required value of the root is . Using this root, the wavelength y can be determined from the equation . The Planck problem was tested with an initial guess of , and the computational results are shown in Table 2.
Table 2.
Convergence pattern of various approaches to Planck’s radiation problem (2).
Table 2.
Convergence pattern of various approaches to Planck’s radiation problem (2).
| Methods | n | CPU Timing | |||
|---|---|---|---|---|---|
| NM | 7 | 2.0000 | 0.0034916 | ||
| SM | 5 | 3.0000 | 0.0106235 | ||
| PM | 5 | 3.0000 | 0.0081417 | ||
| OM | 4 | 4.0000 | 0.0152859 |
Example 3. Continuous stirred tank reactor (CSTR):
We assume an isothermal continuous stirred tank reactor (CSTR) (for details, see [20]). The components and represent the feed rates to and , respectively. The reaction scheme in the reactor is then as follows:
While creating a basic model for feedback control systems, Douglas [21] also studied the aforementioned model. Then, he transformed it into the subsequent mathematical expression:
where is the gain of the proportional controller. The expression (42) is balanced for negative real values of . Specifically, by setting , we obtain
The zeros of the function correspond to the poles of the open-loop transfer function. The function has four zeros: . We select as the desired zero and choose as the initial point for . The computational results are presented in Table 3.
Table 3.
Convergence pattern of various approaches to CSTR problem (3).
Table 3.
Convergence pattern of various approaches to CSTR problem (3).
| Methods | n | CPU Timing | |||
|---|---|---|---|---|---|
| NM | 9 | 2.0000 | 0.0004944 | ||
| SM | 6 | 3.0000 | 0.0006272 | ||
| PM | 7 | 3.0000 | 0.000563 | ||
| OM | 5 | 4.0000 | 0.0008959 |
Example 4. Blood rheology model:
Finally, we examine the physical and flow properties of blood using the blood rheology model [22]. Blood is called a Caisson fluid because it doesn’t follow Newtonian fluid rules. According to the Caisson fluid model, simple fluids move through tubes in a way that creates a speed difference between the walls and the center, where there is minimal deformation. We study the plug flow of Caisson fluids using the following mathematical expression:
We then set to calculate the flow rate reduction, which simplifies to the following nonlinear equation:
By adopting the proposed methods, we determined the desired zero of the function to be and chose as the starting point for . The computed results are shown in Table 4.
Table 4.
Convergence pattern of various approaches to blood rheology model (4).
Table 4.
Convergence pattern of various approaches to blood rheology model (4).
| Methods | n | CPU Timing | |||
|---|---|---|---|---|---|
| NM | 7 | 2.0000 | 0.001264 | ||
| SM | 5 | 3.0000 | 0.0032786 | ||
| PM | 5 | 3.0000 | 0.0022388 | ||
| OM | 4 | 4.0000 | 0.0049327 |
6. Concluding Remarks
The two-step Traub-type iterative method (2) with parameters and weight functions is used as a sample to demonstrate a new convergence technique, which avoids the Taylor series. Method (2) unifies a plethora of other popular iterative methods [2,3,4,5,9,10,11,12,13,14,15,16,17,18,19,20,21,22,23,24]. The usage of method (2) is crucial in cases involving the analytic form of the solution of expression (1). It is hard to find in, for example, the case of fractional order, for a pseudo-hyperbolic telegraph equation [1,19,20,21,22]. Other advantages of this technique include weaker sufficient convergence conditions, information on the number of solutions in a certain interval, and a priori upper error estimates on . Further, the semi-local convergence analysis of the iterative approaches was proposed. Moreover, our theoretical aspects are also supported by the numerical results. The generality of this techniques makes it applicable on other methods defined on the real line, the complex plane, or more abstract spaces such as Hilbert or Banach. This is intended to be the direction of our future area of research [4,5,7,8,12,17,18,23,24]. It is also worth noting that the aforementioned references included methods with inverse of linear operators making them suitable for the application of our approach, since they have also used the Taylor series. Consequently, they have the same drawbacks –.
Author Contributions
Conceptualization, R.B. and I.K.A.; methodology, R.B. and I.K.A.; software, R.B. and I.K.A.; validation, R.B. and I.K.A., formal analysis, R.B. and I.K.A.; investigation, R.B. and I.K.A.; resources, R.B. and I.K.A.; data curation, R.B. and I.K.A.; writing—original draft preparation, R.B. and I.K.A.; writing—review and editing, R.B. and I.K.A.; visualization, R.B. and I.K.A.; supervision, R.B. and I.K.A. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
Data are contained within the article.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Argyros, I.K. The Theory and Applications of Iteration Methods, 2nd ed.; Engineering Series; CRC Press: Boca Raton, FL, USA, 2022. [Google Scholar]
- Ostrowski, A.M. Solution of Equations and System of Equations; Academic Press: New York, NY, USA, 1960. [Google Scholar]
- Traub, J.F. Iterative Methods for the Solution of Equations; Prentice Hall: Hoboken, NJ, USA, 1964. [Google Scholar]
- Jain, P. Steffensen type methods for solving non-linear equations. Appl. Math. Comput. 2007, 194, 527–533. [Google Scholar] [CrossRef]
- Padilla, J.J.; Chicharro, F.; Cordero, A.; Torregrosa, J.R. Parametric family of root-finding iterative methods: Fractals of the basins of attraction. Fractal Fract. 2022, 6, 572. [Google Scholar] [CrossRef]
- Abbasbandy, S. Improving Newton-Raphson method for nonlinear equations by modified decomposition method. Appl. Math. Comput. 2003, 145, 887–893. [Google Scholar] [CrossRef]
- Behl, R.; Argyros, I.K.; Alharbi, S. A One-Parameter Family of Methods with a Higher Order of Convergence for Equations in a Banach Space. Mathematics 2024, 12, 1278. [Google Scholar] [CrossRef]
- Behl, R.; Argyros, I.K. Extended High Convergence Compositions for Solving Nonlinear Equations in Banach Space. Int. J. Comput. Methods 2024, 21, 2350021. [Google Scholar] [CrossRef]
- Sharma, J.R. A composite third order Newton Steffensen method for solving nonlinear equations. Appl. Math. Comput. 2005, 169, 242–246. [Google Scholar] [CrossRef]
- Frontini, M.; Sormani, E. Some variant of Newton’s method with third order convergence. J. Comput. Appl. Math. 2003, 140, 419–426. [Google Scholar] [CrossRef]
- Homeier, H.H.H. On Newton type method with cubic convergence. J. Comput. Appl. Math. 2005, 176, 425–432. [Google Scholar] [CrossRef]
- King, R. A family of fourth order methods for nonlinear equations. SIAM J. Numer. Anal. 1973, 10, 876–879. [Google Scholar] [CrossRef]
- Basto, M.; Semiao, V.; Calheiros, F.L. A new iterative method to compute nonlinear equations. Appl. Math. Comput. 2006, 173, 468–483. [Google Scholar] [CrossRef]
- Jarratt, P. Some efficient fourth order multipoint methods for solving equations. BIT 1969, 9, 119–124. [Google Scholar] [CrossRef]
- Jarratt, P. Some fourth order methods for non linear equations. Math. Comput. 1966, 20, 434–437. [Google Scholar] [CrossRef]
- Weerakoon, S.; Fernando, T.G.I. A variant of Newton’s method with accelerated third-order convergence. Appl. Math. Lett. 2000, 13, 87–93. [Google Scholar] [CrossRef]
- Candelario, G.; Cordero, A.; Torregrosa, J.R.; Vassileva, M.P. Generalized conformable fractional Newton-type method for solving nonlinear systems. Numer. Algorithms 2023, 93, 1171–1208. [Google Scholar] [CrossRef]
- Capdevila, R.R.; Cordero, A.; Torregrosa, J.R. Isonormal surfaces: A new tool for the multidimensional dynamical analysis of iterative methods for solving nonlinear systems. Math. Methods Appl. Sci. 2022, 45, 3360–3375. [Google Scholar] [CrossRef]
- Bradie, B. A Friendly Introduction to Numerical Analysis; Pearson Education Inc.: New Delhi, India, 2006. [Google Scholar]
- Constantinides, A.; Mostoufi, N. Numerical Methods for Chemical Engineers with MATLAB Applications; Prentice Hall: New Jersey, NJ, USA, 1999. [Google Scholar]
- Douglas, J.M. Process Dynamics and Control; Prentice Hall: New Jersey, NJ, USA, 1972; Volume 2. [Google Scholar]
- Fournier, R.L. Basic Transport Phenomena in Biomedical Engineering; Taylor & Francis: New York, NY, USA, 2007. [Google Scholar]
- Behl, R.; Argyros, I.K. On the solution of generalized Banach space valued equations. Mathematics 2022, 10, 132. [Google Scholar] [CrossRef]
- Cordero, A.; Leonardo-Sepúlveda, M.A.; Torregrosa, J.R.; Vassileva, M.P. Increasing in three units the order of convergence of iterative methods for solving nonlinear systems. Math. Comput. Simul. 2024, 223, 509–522. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).