Abstract
We develop a new Kantorovich-like convergence analysis of Newton-type methods to solve nonsingular and singular nonlinear equations in Banach spaces. The outer or generalized inverses are exchanged by a finite sum of linear operators making the implementation of these methods easier than in earlier studies. The analysis uses relaxed generalized continuity of the derivatives of operators involved required to control the derivative and on real majorizing sequences. The same approach can also be implemented on other iterative methods with inverses. The examples complement the theory by verifying the convergence conditions and demonstrating the performance of the methods.
Keywords:
outer inverse; generalized inverse; Banach space; Newton-type method; convergence; Hilbert space MSC:
65J15; 65H10; 90C30; 90C53; 49M15
1. Introduction
Let denote Banach spaces, and let be the space of linear and continuous operators from to Newton-type methods (NTMs) [1]
have been used to solve the equation
Here, the operator is a differentiable operator in the Fréchet sense, approximates Moreover, stands for an outer inverse (OI) of , i.e.,
A plethora of applications in optimization such as penalization problems, minimax problems, and goal programming are formulated as (2) using Mathematical Modelling [2,3,4,5,6,7,8,9,10,11,12,13,14,15].
Method (2) specializes to the Gauss–Newton method (GNM) for solving nonlinear least squares problems, the generalized NTM for undetermined systems, and an NTM for ill-posed Equations [16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36].
As an example of (1) and (2), let and stand for Hilbert spaces. Then, consider the task of finding a local minimum of
where Then, the GNM is defined by
to solve
where is the Moore–Penrose inverse [5,15,31], and is the adjoint of the linear operator (see also the Remark 1).
Ben-Israel [5,17] utilized the conditions
and
for all in a neighborhood of . He also used these conditions with [5,17]. These results are not semilocal since they require information about or . Moreover, if , they require conditions not required in the Kantorovich theory [1,16,33,37,38,39,40,41]. Later Deuflhard and Heindl [39], Haussler [29], and Yamamoto [1] gave Kantorovich-type theorems for the GNM like (4) using convergence conditions involving either OI of Moore–Penrose inverses:
for each . This condition is strong and does not hold in concrete examples (see Section 4 in [31]). A Kantorovich-like result with generalized inverses can be found in [42] without (6). However, it was assumed that and are finite-dimensional and , where denotes the range of a linear operator D. Other drawbacks of the earlier works are that only properties of OI are used. That is and the projectional properties of and . However, the stability and perturbation bounds for OI are not given for method (1). However, this was accomplished through the elegant work of Nashed and Chen [31]. This work reduces to the Kantorovich theory for (2) when is replaced by without additional conditions. Later works on the convergence analysis using Lipchit-type conditions, particularly for the Newton–Gauss method (5), can be found in [10,24] and the references therein.
Next, we address the problems with the implementation of method (1) which constitutes the motivation for this paper. Let and let be an OI of .
Suppose is an OI. A criterion for to be an OI is (see Lemma 2). Then, (2) becomes
But the main problem with the implementation of (2) of (7) still remains. This problem requires the invertibility of This inversion can be avoided. Let m be a fixed natural number. Define the operators
and
Then, we can consider the replacement of (7) given as
By letting in the definition of L, we have that
Thus, it is worth studying the convergence of (8) instead of (2), since we avoid the inversion of the operator . Let us provide examples of possible choices for the operator . First, consider the case when the operator is invertible. Moreover, let is a positive integer, and J denotes the Jacobian of the operator F. Then, choose in the semi-local convergence case of in the local case, where is assumed to be a solution of the equation The selection has been used in [43,44], for In the setting of a Banach space for the operator can be chosen to be (semi-local case) or (local case). Numerous selections for connected to OI or generalized inverses (GI) can also be found in [5,11,13,31] and the references therein. Other selections for are also possible provided that they satisfy the convergence conditions () and () of Section 3. The convergence analysis relies on the relaxed generalized continuity used to control the derivative and majorizing sequences for the iterates (see also Section 2). The results in this article specialize immediately to solve nonsingular equations if is replaced by
2. Preliminaries
We reproduce standard results on OI and GI to make the article as self-contained as possible. More properties can be found in [5,11,12,13,31]. Let . An operator is said to be an inner inverse (II) of if , and an OI of if . It is well known that II and bounded OI always exist. The zero is always an OI. So, we consider only nonzero outer inverses. Suppose the operator B is either an inverse or an OI of . Then, and are linear idempotents (algebraic projectors). Suppose that B is an inverse of E, then for , and . Consequently, the following decompositions hold and . Since, B is an OI of if and only if is an inverse of B, it follows that and . If B is an inner and an outer inverse of , then B is called a GI of . Moreover, there exists a unique GI satisfying and where P is a given projection of into and Q a given projection of into . In the special case when and are Hilbert spaces, and are orthogonal projections. Then, is the Moore–Penrose inverse of .
We need the following six auxiliary Lemmas, the proof of which can be found in Section 2 in [31].
Lemma 1.
Let . If is a bounded OI of Then, the following assertions hold
and
Lemma 2
(Banach Perturbation-Like Lemma). Let and be an OI of Δ. Let also be such that . Then, is a bounded OI of B so that ,
and
Lemma 3.
Let and let and be OI of Δ, and B, respectively. Then, .
Lemma 4.
Let . Suppose that and have topological decompositions . Let stand for the GI of Δ connected to these decompositions. Let B satisfy , and sends to . Then, the following assertions hold:
and
where .
Lemma 5.
Let and be the GI as given in the Lemma 4. Let satisfy and . Then, the conclusions of the Lemma 4 hold and .
Lemma 6.
Let and let be bounded GI of Δ. Let satisfy . Let . Then, is a GI of dim dim and codim codim .
Define the parameter
We need some estimates.
Lemma 7.
Let and let be an OI of Δ. Let with . Then, is a bounded OI of . Moreover, the following estimates hold.
the operator and
where are as defined in the introduction.
Proof.
The operator is a bounded OI of by Lemma 2 for . Moreover, we have in turn by the definition of L:
by the choice of r. It is followed by the Lemma 2 that and . □
3. Semi-Local Convergence
The convergence of the method (8) is shown using scalar majorizing sequences.
Definition 1.
Let be a sequence in . Then, real sequence satisfying
is called a majorizing sequence for . If the sequence converges, then also converges, and for and , we have
Therefore, the convergence of the sequence relates to that of .
Let .
Some conditions are required in the convergence of (8).
Suppose that
- There exists parameters a point having outer inverse such that .
- There exists function which is continuous and nondecreasing such that the equation admits a smallest positive solution . Take .
- There exists functions , . Define the real sequence for aswhereThe sequence is proven to be majorizing for (see Theorem 1). However, some conditions for this sequence are needed first.
- There exists such that for all .By this condition and (12) that and there exists such that .The functions and connect to the operators on the method (8).
- for all . Set , where . We shall also denote by the closure of .
- and for all
- The equation has a smallest solution in , where . Denote such solution by and
- .
Next, the convergence is established for (8).
Theorem 1.
Suppose that the conditions – hold. Then, the sequence produced by the method (8) converges to a unique solution of the equation . Moreover, the following assertion holds
Proof.
Mathematical induction on n shall establish the estimates
Since
the assertion (13) holds if and . It follows by (9) (for ) and Lemma 7 that is an outer inverse of and and . Suppose that for and . Then, we have
and . Hence, by the Lemma 3, it follows that
Then, by the method (8)
But, we have by the definition of L
So,
where we also used by the definition of H. Using the induction hypotheses and the conditions , (14), (15), (11), we have in turn that
Hence, by (8), Lemma 7 and (17)
and
The induction is completed. Thus, we have for any n
and
is an OI of . The sequence is complete as convergent and majorizes . So, the sequence is also complete in Then, it is convergent to a . By the definition
and
Thus, solves . Using Lemma 2, we have for all . So, and from Lemma 1, we obtain , so . Thus, for all n. Suppose that solves the equation . Then, we have and , for all . Then, (11), as in (16) and using
where . Therefore, we conclude . Finally, from (14) and the triangle inequality, we obtain for
By letting in (19), we show the assertion (13). □
Remark 1.
- (i)
- The results of the Theorem 1 specialize for the Newton method with OI defined by , for solving Equation (5). Simply, take and .
- (ii)
- Under the conditions , further suppose that the operator sends to provided that for , the inverse ofThen, by Lemma 4, is a GI. Thus, the proof of Theorem 1 establishes the convergence of method (8) for GI.
- (iii)
4. Examples
The example considers method (4) with for the case , which is independent of . It is also compared with method (8) for . In this case, the methods (4) and (8) become Newton’s method
and
respectively.
We shall solve the system
If Then, we can write
Consequently, we obtain
Example 1.
Method (8′): Set and We have that
Example 2.
Method (8′): Set, and It follows that
Example 3.
Method (8′): Set and Then, we have
Example 4.
Method (8′): Set and Then, we have
Example 5.
Method (8′): Set and Then, we have
Example 6.
Method (8′): Set and It follows that
Definition 2.
Let be a sequence. Then, the computational order of convergence (COC) is for [4]
Definition 3.
Let be a sequence. Then, the approximate computational order of convergence (ACOC) is for
The Table 1, Table 2, Table 3, Table 4 and Table 5 demonstrate that the cheaper-to-implement method (8) is behaving the same as Newton’s method for a large enough
Table 1.
Iterations to obtain error tolerance for initial point where .
Table 2.
Iterations to obtain error tolerance of for initial point where .
Table 3.
Iterations to obtain error tolerance of , for where .
Table 4.
Iterations to obtain error tolerance of , for where .
Table 5.
COC versus ACOC with , .
5. Conclusions
We developed a semi-local Kantorovich-like analysis of Newton-type methods for solving singular nonlinear operator equations using outer or generalized inverses. These methods do not use inverses as in earlier studies but a sum of operators. This sum converges to the inverse and makes the implementation of these methods easier than the ones using inverses. The analysis of the methods relies on the concept of generalized continuity for the operators involved and majorizing sequences. Examples complement the theory. Due to its generality, this article’s technique can be applied on other method with inverses along the same lines [6,14,19,32,39,43,45,46,47,48,49]. It is worth noting that the method (8) should be used for sufficiently small Otherwise, if m is very large, it may be as expensive to implement as method (4).
Author Contributions
Conceptualization, I.K.A., S.G., S.R. and M.I.A.; Methodology, I.K.A., S.G., S.R. and M.I.A.; Software, I.K.A., S.G., S.R. and M.I.A.; Validation, I.K.A., S.G., S.R. and M.I.A.; Formal analysis, I.K.A., S.G., S.R. and M.I.A.; Investigation, I.K.A., S.G., S.R. and M.I.A.; Resources, I.K.A., S.G., S.R. and M.I.A.; Data curation, I.K.A., S.G., S.R. and M.I.A.; Writing—original draft, I.K.A., S.G., S.R. and M.I.A.; Writing—review & editing, I.K.A., S.G., S.R. and M.I.A.; Visualization, I.K.A., S.G., S.R. and M.I.A.; Supervision, I.K.A., S.G., S.R. and M.I.A.; Project administration, I.K.A., S.G., S.R. and M.I.A.; Funding acquisition, I.K.A., S.G., S.R. and M.I.A. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Data Availability Statement
Data is contained within the article.
Acknowledgments
We would like to express our sincere graduate to Mykhailo Havdiak.
Conflicts of Interest
The authors declare no conflicts of interest.
References
- Yamamoto, T. A convergence theorem for Newton-like algorithms in Banach spaces. Numer. Math. 1987, 51, 545–557. [Google Scholar] [CrossRef]
- Adly, S.; Ngai, H.V.; Nguyen, V.V. Newton’s method for solving generalized equations: Kantorovich’s and Smale’s approaches. J. Math. Anal. Appl. 2016, 439, 396–418. [Google Scholar] [CrossRef]
- Artacho, F.J.A.; Dontchev, M.; Gaydu, A.L.; Geoffroy, M.H.; Veliov, V.M. Metric regularity of Newton’s iteration. SIAM J. Control Optim. 2011, 49, 339–362. [Google Scholar] [CrossRef]
- Argyros, I.K. The Theory and Applications of Iteration Methods with Applications, Engineering Series, 2nd ed.; CRC Press, Taylor and Francis Publ.: Boca Raton, FL, USA, 2022. [Google Scholar]
- Ben-Israel, A. On applications of generalized inverses in nonlinear analysis. In Theory and Application of Generalized Inverses of Matrices; Boullion, T.L., Odell, P.L., Eds.; Mathematics Series 4; Texas Technological College: Lubbock, TX, USA, 1968; pp. 183–202. [Google Scholar]
- Cibulka, R.; Dontchev, A.; Preininger, J.; Roubal, T.; Veliov, V. Kantorovich-Type Theorems for Generalized Equations, Research Report 2015–16; Vienna University of Technology: Vienna, Austria, 2015. [Google Scholar]
- Dokov, S.P.; Dontchev, A.L. Robinson’s Strong Regularity Implies Robust Local Convergence of Newton’s Method. In Optimal Control (Gainesville, FL, 1997), Applied Optimization; Kluwer: Dordrecht, The Netherlands, 1998; Volume 15, pp. 116–129. [Google Scholar]
- Deuflhard, P. Newton Algorithms for Nonlinear Problems: Affine Invariance and Adaptive Algorithms; Springer Series in Computational Mathematics; Springer: Berlin/Heidelberg, Germany, 2004; Volume 35. [Google Scholar]
- Kelley, C.T. Solving Nonlinear Equations with Iterative Methods, Solvers and Examples in Julia, Fundamentals of Algorithms; SIAM: Philadelphia, PA, USA, 2023. [Google Scholar]
- Li, C.; Ng, K.F. Convergence analysis of the Gauss–Newton method for convex inclusion and convex-composite optimization problems. J. Math. Anal. Appl. 2012, 389, 469–485. [Google Scholar] [CrossRef]
- Nashed, M.Z. On the perturbation theory for generalized inverse operators in Banach spaces. In Functional Analysis Methods in Numerical Analysis; Nashed, M.Z., Ed.; Lecture Notes in Mathematics; Springer: Berlin/Heidelberg, Germany; New York, NY, USA, 1979; Volume 701, pp. 180–195. [Google Scholar]
- Nashed, M.Z. Inner, outer, and generalized inverses in Banach and Hilbert spaces. Nemer. Funct. Anal. Optim. 1987, 9, 261–325. [Google Scholar] [CrossRef]
- Nashed, M.Z. A new approach to classification and regularization of ill-posed operator equations. In Inverse and Ill-Posed Problems; Engl, H.W., Groetsch, C.W., Eds.; Notes and reports in mathematics in science and engineering; Academic Press: New York, NY, USA, 1987; Volume 4, pp. 53–75. [Google Scholar]
- Pollock, S.; Rebholz, L.G. Anderson acceleration for contraction and noncontractive operators. IMA Numer. Anal. 2021, 41, 2841–2872. [Google Scholar] [CrossRef]
- Rockafellar, R.T. Convex Analysis; Princeton Mathematics Series; Princeton University Press: Princeton, NJ, USA, 1970; Volume 28. [Google Scholar]
- Argyros, I.K.; George, S. On a unified convergence analysis for Newton-type methods solving generalized equations with the Aubin property. J. Complex. 2024, 81, 101817. [Google Scholar] [CrossRef]
- Ben-Israel, A.; Greville, T.N.E. Generalized Inverses: Theory and Applications; John Wiley and Sons: New York, NY, USA, 1974. [Google Scholar]
- Bian, W.; Chen, X.; Kelley, C.T. Anderson acceleration for a class of nonsmooth fixed-point problems. SIAM J. Sci. Comp. 2021, 43, S1–S20. [Google Scholar] [CrossRef]
- Canes, E.; Kremlin, G.; Levitt, A. Convergence analysis of direct minimization and self-consistent iterations. SIAM J. Sci. Comp. 2021, 42, 243–274. [Google Scholar]
- Dontchev, A.L. Local analysis of a Newton-type method based on partial linearization. In The Mathematics of Numerical Analysis (Park City, UT, 1995), Lectures in Applied Mathematics; AMS: Providence, RI, USA, 1996; Volume 32, pp. 295–306. [Google Scholar]
- Dontchev, A.L.; Rockafellar, R.T. Newton’s method for generalized equations: Asequential implicit function theorem. Math. Program. 2010, 123, 139–159. [Google Scholar] [CrossRef]
- Dontchev, A.L.; Rockafellar, R.T. Convergence of inexact Newton methods for generalized equations. Math. Program. 2013, 139, 115–137. [Google Scholar] [CrossRef]
- Ferreira, O.P. A robust semi-local convergence analysis of Newtons method for cone inclusionproblems in Banach spaces under affine invariant majorant condition. J. Comput. Appl. Math. 2015, 279, 318–335. [Google Scholar] [CrossRef]
- Ferreira, O.P.; Goncalves, M.L.N.; Oliveira, P.R. Local convergence analysis of the Gauss–Newton method under a majorant condition. J. Complex. 2011, 27, 111–125. [Google Scholar] [CrossRef]
- Ferreira, O.P.; Goncalves, M.L.N.; Oliveira, P.R. Convergence of the Gauss–Newton method for convex composite optimization under a majorant condition. SIAM J. Optim. 2013, 23, 1757–1783. [Google Scholar] [CrossRef]
- Ferreira, O.P.; Silva, G.N. Inexact Newton’s Method to Nonlinear Functions with Valuesin a Cone. arXiv 2015, arXiv:1510.01947. [Google Scholar]
- Ferreira, O.P.; Silva, G.N. Local Convergence Analysis of Newton’s Method for Solving Strongly Regular Generalized Equations. arXiv 2016, arXiv:1604.04568. [Google Scholar] [CrossRef]
- Ferreira, O.P.; Svaiter, B.F. Kantorovich’s majorants principle for Newton’s method. Comput. Optim. Appl. 2009, 42, 213–229. [Google Scholar] [CrossRef]
- Háussler, W.M. A Kantorovich-type convergence analysis for the Gauss–Newton–Method. Numer. Math. 1986, 48, 119–125. [Google Scholar] [CrossRef]
- Josephy, N. Newton’s Method for Generalized Equations and the PIES Energy Model; University of Wisconsin-Madison: Madison, WI, USA, 1979. [Google Scholar]
- Nashed, M.Z.; Chen, X. Convergence of Newton-like methods for singular operators using outer inverses. Numer. Math. 1993, 66, 235–257. [Google Scholar] [CrossRef]
- Proinov, P.D. New general convergence theory for iterative processes and its applications to Newton–Kantarovich type theorems. J. Complex. 2010, 25, 3–42. [Google Scholar] [CrossRef]
- Rheinboldt, W.C. A unified convergence theory for a class of iterative process. SIAM J. Numer. Anal. 1968, 5, 42–63. [Google Scholar] [CrossRef]
- Robinson, S.M. Extension of Newton’s method to nonlinear functions with values in a cone. Numer. Math. 1972, 19, 341–347. [Google Scholar] [CrossRef]
- Robinson, S.M. Normed convex processes. Trans. Amer. Math. Soc. 1972, 174, 127–140. [Google Scholar] [CrossRef]
- Robinson, S.M. Strongly regular generalized equations. Math. Oper. Res. 1980, 5, 43–62. [Google Scholar] [CrossRef]
- Argyros, I.K.; George, S.; Shakhno, S.; Regmi, S.; Havdiak, M.; Argyros, M.I. Asymptotically Newton–Type Methods without Inverses for Solving Equations. Mathematics 2024, 12, 1069. [Google Scholar] [CrossRef]
- Argyros, I.K.; George, S.; Regmi, S.; Argyros, C.I. Hybrid Newton-like Inverse Free Algorithms for Solving Nonlinear Equations. Algorithms 2024, 17, 154. [Google Scholar] [CrossRef]
- Deuflhard, P.; Heindl, G. Affine invariant convergence theorems for Newton’s method and extensions to related methods. SIAM J. Numer. Anal. 1979, 16, 1–10. [Google Scholar] [CrossRef]
- Kantorovich, L.V.; Akilov, G. Functional Analysis in Normed Spaces; Fizmatgiz: Moscow, Russia, 1959. [Google Scholar]
- Silva, G.N. Kantorovich’s theorem on Newton’s method for solving generalized equationsunder the majorant condition. Appl. Math. Comput. 2016, 286, 178–188. [Google Scholar]
- Yamamoto, T. Uniqueness of the solution in a Kantorovich-type theorem of Haussler for the Gauss–Newton Method. Jpn. J. Appl. Math. 1989, 6, 77–81. [Google Scholar] [CrossRef]
- Ezquerro, J.A.; Gutierrez, J.M.; Hernandez, M.A.; Romero, N.; Rubio, M.J. The Newton algorithm: From Newton to Kantorovich. Gac. R. Soc. Mat. Esp. 2010, 13, 53–76. (In Spanish) [Google Scholar]
- Ezquerro, J.A.; Hernandez-Veron, M.A. Domain of global convergence for Newton’s algorithm from auxiliary points. Appl. Math. Lett. 2018, 85, 48–56. [Google Scholar] [CrossRef]
- Catinas, E. The inexact, inexact perturbed, and quasi-Newton algorithms are equivalent models. Math. Comp. 2005, 74, 291–301. [Google Scholar] [CrossRef]
- Candelarion, G.; Cordero, A.; Torregrosa, J.R.; Vassileva, M.P. Generalized conformable fractional Newton-type method for solving nonlinear systems. Numer. Algorithms 2023, 93, 1171–1208. [Google Scholar] [CrossRef]
- Erfanifar, R.; Hajariah, M.A. A new multi-step method for solving nonlinear systems with high efficiency indices. Numer. Algorithms 2024. [Google Scholar] [CrossRef]
- Gutierrez, J.M.; Hernandez, M.A. Newtons method under weak Kantorovich conditions. IMA J. Numer. Anal. 2000, 20, 521–532. [Google Scholar] [CrossRef]
- Singh, S. A third order iterative algorithm for inversion of cumulative beta distribution. Numer. Algor. 2023, 94, 1331–1353. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).