Abstract
We present recent advances in the analysis of nonlinear problems involving singular (degenerate) operators. The results are obtained within the framework of p-regularity theory, which has been successfully developed over the past four decades. We illustrate the theory with applications to degenerate problems in various areas of mathematics, including optimization and differential equations. In particular, we address the problem of describing the tangent cone to the solution set of nonlinear equations in singular cases. The structure of p-factor operators is used to propose optimality conditions and to construct novel numerical methods for solving degenerate nonlinear equations and optimization problems. The numerical methods presented in this paper represent the first approaches targeting solutions to degenerate problems such as the Van der Pol differential equation, boundary-value problems with small parameters, and partial differential equations where Poincaré’s method of small parameters fails. Additionally, these methods may be extended to nonlinear degenerate dynamical systems and other related problems.
1. Introduction
Many fundamental results in nonlinear analysis and classical numerical methods in Banach spaces X and Y rely on the regularity of the mapping at a given point . The regularity of a Fréchet differentiable mapping F is commonly understood as the surjectivity of its Fréchet derivative . However, a growing number of applications in areas such as partial differential equations, control theory, and optimization require the development of special approaches to deal with nonregular problems.
We present the theory of p-regularity, which originated in the 1980s with the aim of providing constructive tools for the analysis of nonregular problems. To date, the theory of p-regularity has found successful applications in various contexts and different areas of mathematics, as discussed in numerous papers. This paper highlights the most distinguished applications of the theory of p-regularity, with the goal of reviewing important results and indicating potential and promising directions for its future development and applications.
The theory of p-regularity, also known as higher-order regularity theory, offers a framework for studying nonlinear problems in situations where regularity assumptions are not satisfied. It focuses on utilizing higher-order derivatives to analyze and understand the behavior of mappings having first-order derivatives that are not onto or lack regularity.
Definition 1
(cf. Definition 1.16 of [1]). Let be a continuously differentiable mapping from an open set of a Banach space X into a Banach space Y. A vector is called a regular point of F if maps X onto the entire space Y, expressed as If , we refer to as a singular (nonregular, irregular, degenerate) point of F.
1.1. Recollection of the Fundamental Results in the Regular Case
Regularity is a common assumption in many fundamental results of real and functional analysis, such as the inverse function theorem and the Implicit Function Theorem (IFT). In this section, we revisit some of these results. The theorems presented in this section have their roots in the following classical result.
Theorem 1
(Banach open mapping principle [2], see also [3]). Let X and Y be Banach spaces. For any linear and bounded single-valued mapping , the following properties are equivalent:
- (a)
- A is surjective.
- (b)
- A is open.
- (c)
- There is a constant , such that for all there exists with and .
The Banach open mapping principle can be extended to nonlinear mappings in various ways. One such result is stated in the next theorem. Let denote the open ball in X with center at x and radius t.
Theorem 2
(Graves’ Theorem [4]). Let X and Y be Banach spaces, and let be a continuous function with . Let A be a linear operator from X onto Y, and let be the corresponding constant from Theorem 1. Suppose that there exists a constant with and , such that
for all . Then, the equation has a solution whenever .
Note that the assumption of differentiability of F at 0 is not made, as the concept of a strictly differentiable function was introduced several years after the publication of Graves’ work [4]. Instead, the surjectivity of the operator A is used. The proof of Theorem 2, along with its reformulation in terms of a strictly differentiable function F and related discussions, can be found in [5]. For historical remarks, refer to [6].
Both the inverse function theorem and the Implicit Function Theorem can be deduced from Theorem 3 below; see Theorem 1.20 in Section 1.2 of [1] for details. We should also mention that certain variants of the inverse function theorem can be derived directly from the Implicit Function Theorem; see Section 4.2 for details.
To state Theorem 3, let us recall the definition of the Banach constant, denoted by , for a bounded linear operator A between Banach spaces X and Y, as given in [1]:
Theorem 3
(Lyusternik–Graves Theorem, [1]). Let X and Y be Banach spaces. Suppose that is strictly differentiable and regular at . Then, for any positive , there exists an such that
whenever and .
For a thorough analysis of numerous consequences of Theorem 3, we refer the reader to the paper written by Dmitruk, Milyutin, and Osmolovskii [7], where the theorem is called the “generalized Lyusternik theorem”. In the monographs by Dontchev [3], Ioffe [1], and Dontchev and Rockafellar [8], Theorem 3 is also called the “Lyusternik-Graves theorem”. From a more general point of view, the theorem is treated by Dontchev and Frankowska in [9,10].
One of the consequences of Theorem 3 is the description of tangent vectors to the level set of a continuously differentiable mapping F at regular points (see Section 4.1 below).
1.2. Generalizations
Since the early 1970s, due to theoretical interests and an increasing number of involved economic and industrial applications, a vast body of literature has been devoted to relaxing the surjectivity assumption of the derivative in the fundamental results, some of which are given above, while maintaining as much of their conclusions as possible. It is beyond the scope of this paper to provide an exhaustive survey of the existing generalizations of the theorems stated above. For the purpose of this paper, we can distinguish between generalizations exploiting higher-order derivatives (see, e.g., Frankowska [11]) and generalizations that attempt to relax the surjectivity assumption of the derivative without referring to higher-order derivatives (see, e.g., Ekeland [12], Hamilton [13], Bednarczuk, Leśniewski, and Rutkowski [14]). The theory of p-regularity belongs to the first group of generalizations where higher-order derivatives are involved.
In this manuscript, we present the main concepts and results of the p-regularity theory, which has been developing successfully for the last forty years. One of the main goals of the theory of p-regularity is to replace the operator of the first derivative, which is not surjective, by a special mapping that is onto. Nonlinear mappings analyzed within the framework of p-regularity theory are those for which the derivatives up to the order are not surjective at a given point , where . The main concept of the p-regularity theory is the construction of the p-factor-operator, which is surjective at the point (see Definition 4). The special definition and the property of surjectivity of the p-factor operator lead to generalizations of the fundamental results of analysis, including IFT and some classical numerical methods. The p-factor operator is defined in such a constructive way that it efficiently replaces the nonsurjective first derivative in a variety of situations. The structure of the p-factor-operator is used as a basis for analyzing nonregular problems and for constructing numerical methods for solving degenerate nonlinear equations and optimization problems. We discuss these generalizations in this paper.
There are many publications that focus on the case of and use a 2-factor operator in a variety of applications. In this work, we consider a more general case of and do not make some additional assumptions introduced and required in the publications of other authors.
In the framework of metric spaces, the concept related to the problems discussed in the present paper and attempting to generalize the classical results given above is the concept of metric regularity; see, for example, [1,3]. For a function f acting between Banach spaces X and Y and being strictly differentiable at the given point , Corollary 5.3 in [3] complements Theorem 5.1 in [3]. It concludes that metric regularity at for is equivalent to surjectivity of the Fréchet derivative of f at . In the case when , this is the same as the nonsingularity of the Jacobian matrix .
The theory of p-regularity and the apparatus of p-factor operators make it possible to create new methods in computational mathematics to solve nonlinear problems of mathematical physics, such as the Van der Pol differential equation, boundary-value problems with a small parameter, Partial Differential Equations (PDEs) where Poincaré’s method of small parameters fails, nonlinear degenerate dynamical systems, and others. This is associated with the proposed fundamentally new design (after Newton) of a numerical method for solving essentially (degenerate) problems, which is described in this paper. Moreover, the proposed approach will allow us to construct a new type of difference scheme in computational mathematics for solving problems of nonlinear mathematical physics that are stable and converge quickly to a solution.
This pertains to the numerical solution of nonlinear equations such as the nonlinear heat equation, Burgers equation, Korteweg–de Vries equation, Navier–Stokes equation, etc. It also enables us to reorganize Numerical Analysis in a novel way, with solutions obtained being related to the mentioned problems. All of this is applicable to the emerging prospects for developing new technologies and designs in Computational Mathematics for solving problems and models related to Artificial Intelligence, optimization problems, dynamical systems, optimal control problems, etc. New opportunities are emerging for modeling and researching neural networks and creating new architectures for supercomputing.
It is important to highlight the development of fundamentally new and innovative methods in computational mathematics. The resulting schemes were far from any previous designs, emerging from years of research into the structure of degeneracy—specifically, the structure of degenerate mappings and solution sets of degenerate systems. Analyzing these structures requires methods that differ significantly from those used in the analysis of linear problems, leading to entirely new forms of mathematical objects, such as the p-factor-operator: where . In general form, this operator is formally introduced in Definition 4.
In degenerate problems, where the first derivative operator is not surjective, the p-factor-operator serves as its replacement. At the same time, this research has uncovered a previously hidden nonlinear world with unexpectedly rich diversity (cf. Theorem 4). The range of possible new methods is remarkably broad, yet it follows a structured framework that remains stable under small changes—similar to regularization techniques. These methods can also be adjusted based on the specific problem being solved.
Moreover, recent studies revealed that the so-called ill-posed problems and essentially nonlinear ones are locally equivalent. This finding suggests that many important problems, such as inverse problems, can be solved using the p-factor method or p-factor regularization. This represents a new and promising direction in both theoretical mathematics and practical applications.
1.3. Aims and Scope
The main focus of this work is on analyzing and solving nonlinear equations of the form
and optimization problems of the form
where and are sufficiently smooth mappings, and X and Y are Banach spaces. Many interesting applied nonlinear problems can be written in one of these forms.
Nonlinear mappings F and problems of the form (3) and (4) can be divided into two classes, called regular (or nonsingular) and singular (or degenerate). The classification depends on the mapping F, which is either regular (that is, is onto for a given ) or singular (that is, if ) is not onto). Roughly speaking, regular mappings are those for which the Implicit Function Theorem arguments can be applied, and singular problems are those for which they cannot, at least, be directly applied.
The purpose of this paper is to give an overview of methods and tools of the p-regularity theory, and to show how they can be applied to analyze and develop methods for solving singular (irregular, degenerate) nonlinear equations and equality-constrained optimization problems. The development of the theory of p-regularity started in approximately 1983–1984, with the concept of p-regularity introduced by Tret’yakov in [15,16].
One of the main results of the theory of p-regularity is a detailed description of the structure of the zero set of a nonregular nonlinear mapping . It is interesting to note that there have been several examples in the history of mathematics when fundamental results were obtained independently in the same general time period. One such example related to the theory of p-regularity concerns theorems about the structure of the zero sets of an irregular mapping satisfying a special higher-order regularity condition. The result that we are referring to was simultaneously obtained by Buchner, Marsden and Schecter [17] and Tretyakov [16]. The approaches proposed in [17] and in [16] are the same. The difference is in motivation and the context for the main result in both papers. In [17], the structure of the zero set around a point where the derivative is not surjective was studied in the context of bifurcation theory. Theorem 1.3 in [17] is referred to as a blowing-up result. In Fink and Rheinboldt [18], it was noted that Theorem 1.3 in [17] was a powerful generalization of the Morse Lemma, and some interesting counterexamples for a naive approach to the Morse Lemma were found. The same theorem derived by Tretyakov [16] is one of the main results for the p-regularity theory. The result led to various theoretical developments and applications of the theory to nonregular (or degenerate) problems in many areas of mathematics. We should note that the results and constructions introduced by Marsden and Tret’yakov are the same in the completely degenerate case.
This paper is organized as follows. We discuss essential nonlinearity and singular mappings in Section 2. We then recall the main concepts and definitions of the p-regularity theory in Section 3. We discuss some classical results of analysis and methods for solving nonlinear problems via the p-regularity theory in Section 4. In each subsection, we focus on singular problems that illustrate that the classical results are not necessarily satisfied in the nonregular case. We present generalizations of the same classical results, which were derived during the last forty years using the constructions and definitions of the p-regularity theory.
In this manuscript, we consider a variety of applications. We start Section 4 with the Lyusternik theorem in Section 4.1. The Lyusternik theorem plays an important role in the description of the solution sets of nonlinear equations and feasible sets of optimization problems in the regular case. However, the classical Lyusternik theorem might not hold if mapping F is singular at a given point . The first generalization of the classical Lyusternik theorem for p-regular mappings was derived and proved simultaneously in [15,17]. It can be applied to describe the zero set of a p-regular mapping. Representation Theorem and Morse Lemma are also presented in Section 4.1. We continue with the consideration of the Implicit Function Theorem in Section 4.2. Numerous books and papers, such as [13,19], discuss the classical Implicit Function Theorem. However, the classical version of the theorem is not applicable when a mapping is not regular, meaning that is not onto for some , where the index y denotes the partial derivative with respect to the variable y (for a more detailed explanation of the notation, see the “General Notation” below). We present a generalization of the Implicit Function Theorem for nonregular mappings in Section 4.2. In Section 4.3, we cover the p-factor Newton’s method for solving nonlinear Equation (3) and finding critical points of an unconstrained optimization problem. Optimality conditions for equality-constrained optimization problems and Lagrange multiplier theorems for the regular and degenerate cases are considered in Section 4.4. The modified Lagrange function method for 2-regular problems is covered in Section 4.5. Singular problems of the calculus of variations and optimality conditions for p-regular problems of the calculus of variations are considered in Section 4.6. The existence of solutions to nonlinear equations in regular and degenerate cases is covered in Section 4.7. The second-order nonlinear ordinary differential equations with boundary conditions are presented in Section 4.8. Newton interpolation polynomials and the p-factor interpolation method are considered in Section 4.9. We make some concluding remarks in Section 5.
General Notation
Let be the space of all continuous linear operators from X to Y, and for a given linear operator , let us denote its kernel and image by and , respectively. Also, denotes the adjoint of , where and denote the dual spaces of X and Y, respectively.
Let p be a natural number and let be a continuous symmetric p-multilinear mapping. The p-form associated with is the map defined by
for , where all instances of x in the expression are the same. Alternatively, we may simply view as a homogeneous polynomial of degree p with . Therefore, the space of continuous homogeneous polynomials of degree p is denoted by .
If is of class , then its derivative at a point is a continuous linear operator . The second derivative is a bilinear operator on and can be viewed as a mapping from X to . See ([20], Chapter VIII) for further details.
If is of class , we denote by the pth-order derivative of F at a given point . This is a symmetric p-multilinear map from to Y. The associated p-form, also called the pth–order mapping, is defined as
In particular, for , we have
Furthermore, for a given p-multilinear map, we introduce the following key notation for the p-kernel of the pth-order mapping:
Here, h represents elements of X that are repeatedly applied in the multilinear mapping. This set is also referred to as the locus of .
When is a continuously differentiable mapping, we use the notation to denote the partial derivative of F with respect to y. Specifically, for , the operator is defined as the Fréchet derivative of F with respect to y, satisfying:
where and is a continuous linear operator.
2. Essential Nonlinearity and Singular Mappings
Let be Banach spaces. Let be a mapping in , where W is a neighborhood of a point . According to Definition 1, a mapping F is called regular at , if
The following lemma on the local representation of a regular mapping holds.
Lemma 1
(Lemma 1, Section 1.3.3 of [21]). Let X and Y be Banach spaces, let W be a neighborhood of a point , and let be of class . If F is regular at , then there exist a neighborhood U of 0, a neighborhood , and a diffeomorphism , such that:
- 1.
- ,
- 2.
- for all ,
- 3.
- (the identity mapping on X).
Lemma 1 states that the diffeomorphism transforms F locally into an affine mapping:
In other words, provides a local reparametrization under which F takes an affine form in U. This result is also known as the local “trivialization theorem” (Theorem 1.26 of [1]).
If the regularity condition (5) is not satisfied, then, in general, F cannot be locally linearized because such a diffeomorphism does not exist.
There exist many mappings that do not admit local linearization. The concept of essentially nonlinear mappings, introduced in [22], provides a formal framework for describing such cases.
Definition 2.
Let V be a neighborhood of a point in X, and let be a neighborhood of 0. A mapping , where , is said to be essentially nonlinear at if there exists a perturbation of the form
such that there does not exist any nondegenerate transformation , , satisfying , , and such that Equation (6) holds with φ and .
We say that as if For example, if , then is .
Definition 3.
We say that the mapping is singular (or degenerate) at if it fails to be regular; that is, if its derivative is not onto:
The following Theorem 4, which establishes the relationship between the two notions of essential nonlinearity and singularity, was derived as Theorem 2.3 in [22]. We provide its proof here to complete our development.
Theorem 4.
Suppose V is a neighborhood of a given point and are Banach spaces. Suppose is in . If , then F is essentially nonlinear at the point if and only if F is singular at .
Proof.
Suppose F is singular at the point and . Since , there exists a nonzero element , such that
Thus, . Since is linear, we can assume .
Assume on the contrary that F is not essentially nonlinear at . Define the mapping by
where
By Definition 2, with defined in (9), there exist a neighborhood of 0 and a nondegenerate transformation , , such that , , and (6) holds with and :
for all .
Since and , it follows from (10) that
However, using , , and , we obtain
where . Thus, for small x,
Taking into account that for any , along with Equation (12) and the fact that , we conclude that
This contradicts (11), and therefore F is essentially nonlinear at .
To prove the converse, suppose that F is essentially nonlinear at but not singular; that is, suppose F is regular at this point.
By the persistence of the regularity condition, for any perturbation
where , the map remains regular at , and . Hence, by Lemma 1, can be written as
where and . This contradicts the definition of the essential nonlinearity of the mapping F. □
Under additional splitting assumptions, which are not made here, the representation (14) would be a standard consequence of the IFT, as in, for example, ([23], §2.5).
3. Elements of -Regularity Theory
For the purpose of describing essentially nonlinear problems, a concept of p-regularity was introduced by Tret’yakov [15,16,24] using the notion of a p-factor operator. In this section, we introduce the main definitions of the p-regularity theory, as presented, for example, in [21,22,24].
Let X and Y be Banach spaces. Suppose that is a -class mapping that is singular (nonregular) at a given point . We construct the p-factor operator under the assumption that the space Y can be decomposed into the (topological) direct sum
where is the closure of the image of the first derivative of F evaluated at . To define the remaining spaces, let and let be a closed complementary subspace to , that is, if exists. Next, let be the projection operator onto along . Define as the closed linear span of the projection of the quadratic map image:
More generally, define inductively as follows:
where is a choice of a closed complementary subspace for with respect to Y, and is the projection operator onto along for . Finally, let The order p is the minimum number for which (15) holds. In particular, for , we have . When Y is a Hilbert space, there exists a complementary subspace to , namely the orthogonal subspace .
Remark 1.
The subspaces in assumption (15) can be replaced, in further considerations, by subspaces constructed using the so-called factorization procedure. Specifically, we define
as before. However, instead of , we use the space , called the quotient (or factor) space. Note that the quotient space is itself a Banach space (see, e.g., [25]). Moreover, if decomposition (15) holds, then is isomorphic to . For simplicity of presentation, we continue to use assumption (15).
Define the following mappings (see Tret’yakov [24]):
where is the projection operator onto along with respect to Y for . Recall that is the projection onto along (or parallel to) if .
In our notation, denotes the k-th derivative of at . By the construction of the subspaces , we have
We define a mapping F as completely degenerate up to order p if
Remark 2.
With all the notation established above, we are now ready to define the p-factor operator.
Definition 4.
For a fixed vector , and mappings , defined in (16), the linear operator ,
is called the p-factor operator. Alternatively, the following equivalent form can be used:
Note that when F is regular at , meaning , we have . In this case, the p-factor operator reduces to the operator of the first derivative: for any .
For , the p-factor-operator (19) takes the form
or, equivalently, for where and . In view of (17), the construction of the operator (and in general) is closely tied to the decomposition of the image space (15). The idea is to use higher-order derivatives of F up to order p to obtain (15).
In particular, for and given by (20), we seek those that ensure the equality , where is the complementary space to .
If a mapping F is completely degenerate up to order p, meaning that (18) holds, and , then the p-factor operator simplifies to .
Recall that a bounded linear operator between Banach spaces X and Y is called Fredholm if the kernel of T has finite dimension and the image of T is a closed subspace of finite codimension in Y (see, for example ([26], Chapter 4) and [27]).
Hence, in the case of a Fredholm operator , the subspace has a complementary finite-dimensional subspace such that .
With the p-factor operator defined in (19), we are now ready to state a few definitions of various types of p-regularity for a -class mapping .
Definition 5.
We say that the mapping is p-regular at a given point along an element if
Remark 3.
The condition of p-regularity of the mapping F at the point along is equivalent to the following condition:
where In particular, when , we have , and condition (21) reduces to , which follows from elementary algebra.
We also define the k-kernel of the kth-order mapping as follows:
Definition 6.
We say the mapping F is p-regular at if it is p-regular along any h from the set
where the i-kernel of the ith-order mapping is defined in (22).
For a linear surjective operator between Banach spaces, we denote its right inverse by (see [28]). Therefore, and we have
We define the norm of by
We say that is bounded if
Definition 7.
A mapping is called strongly p-regular at a point if there exists such that
where is the right inverse operator of and
The following examples illustrate the construction of the p-factor operator for the cases and .
Example 1.
Consider the mapping defined by
Let . Then, the Jacobian is singular (degenerate) at Hence, . Let and To construct the 2-factor operator, we use the projection matrices
According to Equation (16), the mappings and have the form
Then
and
Hence, for , the 2-factor operator is defined by
It can be verified that the 2-factor operator is surjective whenever .
In this example, we have
This result implies that . Hence, according to Definition 5, the mapping F is 2-regular at along any with , but it is not 2-regular at . As we observe, it may happen that F is 2-regular along some but . Therefore, a given mapping F may fail to be 2-regular with respect to all , .
Example 2.
Case . Consider mapping defined by
With , we obtain
Then, with ,
In this example,
To construct the 3-factor operator, we use the projection matrices
Then, using Equation (16), we define and as follows:
By the definition of the 3-factor-operator, we obtain
For , the 3-factor operator takes the form
and .
For , the 3-factor operator takes the form
and .
Now, using (22), we determine the elements in the kernels by solving the following equations:
Thus, we obtain
and
Finally, one can verify that
4. Singular Problems and Classical Results via the -Regularity Theory
4.1. Lyusternik Theorem and Description of Solution Sets
The Lyusternik theorem plays an important role in describing solution sets of nonlinear equations and feasible sets of optimization problems in the regular case. This theorem has practical applications across various fields. It is particularly important in the study of optimization and variational problems. By characterizing the tangent cone, the theorem provides valuable information about critical points and the behavior of solutions in their vicinity. In control theory, the Lyusternik theorem can be used to analyze the stability and controllability of nonlinear control systems. By examining the tangent cone, one can gain insights into system behavior near critical points and determine the conditions necessary for stability and controllability. The Lyusternik theorem is also useful in the development and analysis of optimization algorithms, such as gradient-based methods. By characterizing the tangent cone, the theorem helps in designing efficient algorithms and understanding their convergence properties.
These are just a few examples of the practical applications of the Lyusternik theorem. Its insights into the tangent cone are valuable in many areas, including optimization, control theory, partial differential equations, and geometry, providing a deeper understanding of the behavior of solutions and critical points in a variety of mathematical problems.
Consider a nonlinear mapping , where U is a neighborhood of a point . We are interested in the description of the set :
This notation highlights the fact that we will focus our attention on It is useful to recall the following definition of tangent vectors and tangent cones (see, for instance, [29]).
Definition 8.
Let M be a subset of a Banach space X. A vector is said to be tangent to the set M at a point if there exist and a mapping , , such that
and
The set of vectors tangent to M at the point is called the tangent cone to the set M at , and is denoted by .
4.1.1. Lyusternik Theorem in the Regular Case
In the regular case, the Lyusternik theorem (see [30]) can be formulated as follows.
Theorem 5
(Lyusternik theorem). Let X and Y be Banach spaces, and let U be a neighborhood of in X. Assume that is Fréchet differentiable on U, and that its derivative is continuous at . Suppose further that F is regular at .
If F is singular at , then in some problems we may have , as illustrated in the following example.
Example 3.
Let , and let . Define the mapping by
Then, the derivative of F is given by Evaluating at , we obtain , , and . Calculating
we conclude that .
Example 4.
Let be defined as Then, the set
consists only of the zero function. The derivative of F, given by , is surjective, where I is the identity operator on . Moreover, we have
Example 5.
Let and define . To calculate , it is enough to apply Lyusternik’s theorem with given by
Using the trigonometric addition formulas, we obtain
The first term on the right-hand side approaches 0 as . In the second term, we use the fact that approaches 1 as
Therefore, the derivative of F at is given by
which is surjective onto . By Theorem 5, we conclude that
The problem of describing solution sets in more general settings (e.g., nonlinear systems of inequalities) is approached qualitatively using metric regularity [31,32] and geometric derivability [33].
4.1.2. A Generalization of the Lyusternik Theorem
Consider the problem of describing the set in the nonregular case. As demonstrated in Example 3, the classical Lyusternik theorem 5 may not hold when F is singular at , so that .
The first generalization of the classical Lyusternik theorem for p-regular mappings was independently derived and proved in [15,17]; see also [21]. This generalization can be used to describe the zero set of a p-regular mapping.
Theorem 6
(Generalized Lyusternik theorem, [15]). Let X and Y be Banach spaces, and let U be a neighborhood of a point . Assume that is a p–times continuously Fréchet differentiable mapping on U. Assume also that F is p-regular at .
The problem of describing the tangent cone to the solution set of a nonlinear Equation (3) with a singular mapping F has also been studied in other papers (see, for example, [16,34]).
Example 6.
To illustrate the statement of Theorem 6, define mapping by
Consider A straightforward computation shows that , and
Also,
Let (or ), then Hence, the mapping is 2-regular at . Then the statement of Theorem 6 in this example reduces to
or
The next theorem presents another version of Theorem 6, which was formulated in [24] (see also [17,21] for additional results along these lines). To state the result, we denote by , the distance function from a point to a set M:
Theorem 7.
Let X and Y be Banach spaces, and let U be a neighborhood of a point . Assume that is a p-times continuously Fréchet differentiable mapping in U. Assume also that F is strongly p-regular at . Then, there exist a neighborhood of , a mapping , and constants and , such that for all the following holds:
where are given by (16), and
For the proof, see [21].
4.1.3. Representation Theorem
The Representation Theorem is used in nonlinear analysis and is relevant to the study of the local behavior and representation of a mapping F around a special point . It also guarantees the existence of certain auxiliary mappings that have desirable properties and relate to the given mapping F and its local representation in some neighborhood of .
The Representation Theorem can be used, for example, in the study of optimization problems, particularly in constrained optimization. It helps in establishing the existence of critical points and characterizing their properties, which is essential for finding optimal solutions. The theorem is also relevant to variational methods, partial differential equations, and other areas of mathematical analysis. Moreover, it is useful in various numerical methods and computational techniques for approximating solutions of equations. Its versatility and utility stem from its ability to provide insights into the local behavior and representations of mappings near critical points, with wide-ranging applications in mathematical analysis and optimization. Its versatility and usefulness stem from its ability to provide insights into the local behavior and representations of mappings around critical points, which has wide-ranging applications in mathematical analysis and optimization.
To simplify the presentation of the next result, we state it for the case of the completely degenerate mapping F, defined in (18). Recall that in this case, , and the p-factor operator can be simplified to .
Theorem 8
([22]). Let X and Y be Banach spaces, and let V be a neighborhood of in X. Suppose that is of class , and that for . Also assume the existence of a constant , such that
Then, there exist a neighborhood U of 0 in X, a neighborhood V of in X, and mappings and , such that φ and ψ are Fréchet-differentiable at 0 and , respectively, and the following hold:
- 1.
- ;
- 2.
- for all ;
- 3.
- for all ;
- 4.
- .
All assumptions of Theorem 8 are satisfied, for example, by the mapping
where , . See also [35] for additional work on the representation theorem.
4.1.4. Morse Lemma
The Morse Lemma is another fundamental result in analysis that relates the behavior of a smooth function near a nondegenerate critical point to the local structure of its level sets. The Morse Lemma has several important applications in various areas of mathematics.
The Morse Lemma is used in differential geometry to analyze the behavior of geodesics and study the geometry of manifolds. By considering a function that measures the length or energy of curves on a manifold, the Morse Lemma allows us to understand the critical points of this function and their geometric implications. It provides insights into the existence, stability, and bifurcations of geodesics on a manifold.
The Morse Lemma has important applications in optimization and control theory, where it is used to analyze the behavior of objective functions and control systems near critical points. It helps characterize the local behavior of optimal solutions and understand stability properties. The Lemma can be employed to find critical points, perform sensitivity analysis, and study bifurcations in optimization problems and dynamical systems.
The Morse Lemma is also utilized in singularity theory, which focuses on the properties and classification of singular points or critical points of differentiable mappings. It provides a framework for understanding the local behavior of singularities and the ways in which their structure may change under small perturbations. The Lemma plays a key role in the classification and analysis of singular points and their stability.
The most interesting formulation of the Morse Lemma in the finite-dimensional case is given in the following lemma.
Lemma 2
(Morse Lemma). Let , and let be a function of class , such that and the Hessian is not degenerate. Then, in a neighborhood V of , there exist a curvelinear coordinate system and an integer number , such that
for all .
Proof.
Without loss of generality, we can assume that Hessian matrix is diagonal:
where for some number between 0 and n, the first columns have 1 on the main diagonal, and the other columns have . Otherwise, changing the basis, we can transform the Hessian to be a diagonal matrix.
Then, in this case,
Note that if the assumptions of the Morse Lemma hold, then the assumptions of the representation Theorem 8 are satisfied with and . Hence, there exists a mapping , such that
where and . It follows that . Note that if , then , and if , then we obtain a contradiction. Now, we can apply the statement of the representation Theorem 8 to the mapping to get the statement of Morse Lemma 2. □
See additional work on Morse Lemma in [36].
4.2. Implicit Function Theorem
In this section, we consider the equation , where and X, Y, and Z are Banach spaces. Let be a given point in that satisfies . We are interested in the existence of a mapping defined in a neighborhood , such that is a solution of the equation near the given point . This mapping should satisfy the following conditions:
4.2.1. Implicit Function Theorem in the Regular Case
In the case when is a continuously differentiable mapping, we denote its (Fréchet) derivative with respect to y at a point by .
In the case when F is regular at a point , meaning is onto, the classical Implicit Function Theorem (IFT) guarantees the existence of a smooth mapping defined in a neighborhood , such that (30) holds and for all x in , where There are numerous books and papers devoted to the IFT, including [13,19]. Various formulations of the standard IFT exist, and Theorem 9 presents one such statement.
Theorem 9
(Implicit Function Theorem). Let X and Y be Banach Spaces. Assume that is continuously Fréchet differentiable at , , and that Then, there exist constants , a sufficiently small , and a function such that, for , the following holds:
The situation changes when the mapping F is degenerate (nonregular) at ; that is, when is not onto. In this case, the classical IFT cannot be applied to guarantee the (local) existence of a solution . The importance of examining this situation arises from the need to solve various nonlinear problems, many of which, as shown in [22], are singular (degenerate).
4.2.2. Implicit Function Theorem in the Degenerate Case
In this section, we focus on the case when mapping is not regular; that is, when is not onto.
As an example, consider mapping , , where with some . If , then and , so the mapping F is not surjective. The classical IFT is not applicable in this case. However, there exists mapping , such that . Moreover, , and, by (31), the following inequality holds with :
Thus, while the conditions of a standard implicit function are not satisfied in the example, the statement similar to (31) holds. The example serves as a motivation and illustration for the p-order IFT. To our knowledge, the first generalization of the IFT for nonregular mappings was formulated in [24]. Generalizations of the IFT for 2-regular mappings were obtained in [21,36]. We will present a few versions of the IFT for p-regular mappings in this section.
To simplify the presentation, we begin with Theorem 10, which is stated in Euclidean spaces. A slight modification of this theorem was derived in [21]. To formulate the theorem, we first need to define the operator related to the mapping . To do so, and similarly to the mappings introduced in Section 3, we define the following mappings (see [24]):
where is the projection operator onto along with respect to Z for . The definition of is similar to the definition of the subspaces in Section 3.
Now we are ready to present the definition of the linear operator , which is similar to the operator defined in (19). Since the construction of the p factor-operators are similar, we retain the same notation to keep the presentation clear and consistent. For a fixed vector , and mappings defined in (16), the linear operator is given by
where indicates that all derivatives are taken with respect to the same variable y, which belongs to Y.
Before stating Theorem 10, we introduce some additional notation that will be used:
- In the expression , r represents the total order of differentiation, where differentiation is performed q times with respect to x and times with respect to y.
- While the notation appears in the definition (33) of the linear operator , the expression signifies that all components of the derivative are equal to zero.
- The subscript notation (q-times) indicates partial differentiation with respect to the first variable x performed q times.
- For , the notation represents the function value itself.
Theorem 10
(Implicit Function Theorem [22]). Suppose that X, Y, and Z are Euclidean spaces, and let W be a neighborhood of a point in . Assume that is of class . Suppose and there exists a neighborhood in X, such that the following conditions hold:
- The Singularity Condition:
- The pth Order Regularity Condition at the Point :
- The Banach Condition:There exists a constant such that, for any with , the following holds:
- The Elliptic Condition with respect to x:There exists a constant such thatfor all and for all .
If conditions 1 to 4 are satisfied, then for any , there exist and such that , and there is a map satisfying:
- (a)
- ;
- (b)
- for all ;
- (c)
- for all .
The alternative version of the IFT for nonregular mappings, presented as Theorem 11, was proved in [37]. Before stating the theorem, we introduce the following definition (Definition 2.3 in [38]).
Definition 9.
The mapping is called uniformly p-regular over a set M in Y if
where
Additionally, we define the mapping by
where
Under the assumption that , we also introduce the corresponding inverse multivalued operator :
where , .
Theorem 11
(The pth-order IFT). Let X, Y and Z be Banach spaces, and let and be sufficiently small neighborhoods of and , respectively. Suppose that and . Assume that the mappings , , introduced in Equation (32), satisfy the following conditions:
- (1)
- The Singularity Condition:
- (2)
- The p-Factor Approximation Condition:There exists a sufficiently small such that, for all , the following holds:
- (3)
- The Banach Condition:There exists a nonempty open set in X such that for any sufficiently small γ, the intersection of the set with the ball is not empty and . Moreover, for , there exist and a constant c such that and
- (4)
- The Uniform p-Regularity Condition:The mapping is uniformly p-regular over the set
If conditions 1 to 4 are satisfied, then there exists a constant , a sufficiently small , and a mapping such that the following hold for :
There are generalizations of IFT for nonregular mappings derived by other authors. Some examples include a generalization of the IFT and its application to a parametric linear time-optimal control problem presented in [39], generalized IFT applied to ordinary differential equations in [40], and IFT for 2-regular mappings in [41,42].
4.3. Newton’s Method
4.3.1. Classical Newton’s Method for Nonlinear Equations and Unconstrained Optimization Problems
Consider the problem of solving the nonlinear Equation (3), where is sufficiently smooth, so that for some Let be a solution of (3), that is, . Assume that mapping F is singular at the point .
In the finite dimensional case, when , , and , the singularity of F at means that the Jacobian of F is singular at , as in the following example.
Example 7
([43]). Consider function from Example 1, defined by
where is a solution to Equation (3) and is singular (degenerate) at the point
Consider a sufficiently small and some initial point . The classical Newton method is defined by
If in this example, we obtain
Then
If , then does not exist and, hence, method (35) is not applicable. Even in the case when exists, method (35) might be diverging. As an example, consider point , where t is sufficiently small. Then
and, for a sufficiently small values of t, when For instance, if , then and the method (35) is diverging.
For the overview of the existing approaches to Newton-like methods for singular operators, see, e.g., [44].
Now we consider Newton’s method for finding critical points of an unconstrained optimization problem:
where . The classical Newton’s method applied to problem (36) has the form
As an example, consider minimization of function f given by (see [43]). One of the critical points of the function f is . Let where . Then
and Hence, does not exist, so Newton’s method (37) is not applicable.
4.3.2. The p-Factor Newton’s Method
In this section, we describe a method for solving nonlinear Equation (3), where and the matrix is singular at the solution point (see [43]). The proposed method is based on the construction of the p-factor operator.
There are various publications describing the p-factor-method for solving degenerate nonlinear systems and nonregular optimization problems. Some examples are given in [43,45,46].
Let . Similarly to the definitions in Section 3, now we define by and define the projection as the projection of Y onto the orthogonal complementary subspace of in Y. Similarly, we can define as
Continuing in the same way for each we obtain and
Let h be a fixed vector such that and mapping F is p-regular at the solution along vector h. Let matrices , be defined as follows:
for all
We assume that is a solution of . Now, instead of , consider
The assumption of p-regularity of the mapping F at the solution along the vector h implies that the p-factor matrix given by
is not singular. Hence, and
Then, the p-factor Newton method can be defined as
The following theorem provides conditions that ensure the quadratic convergence of the p-factor Newton method (39).
Theorem 12
([43]). Let , and let be a solution of . Assume that there exists a vector , , such that the p-factor matrix defined in Equation (38) is not singular. Then, for any (with sufficiently small) and for the sequence generated by the method in Equation (39), the following inequality holds for some constant :
In the case of , the p-factor Newton method (39) reduces to the following:
where is the orthogonal projection onto , and the vector h is chosen such that the 2-factor matrix
is not singular. This condition is equivalent to F being 2-regular at along h. In this case, the equation
is satisfied at . Note that (42) implies that is a locally unique solution of (3).
The 2-factor Newton method presented here can be applied to solve the equation in Example 7. Specifically, instead of using the iterative procedure (35), the 2-factor Newton method given by (41) should be used.
Example 8
([43]). Consider the following problem
where is defined by . If , then , and for , we have It can be shown that F is 3-regular at along .
Namely, according to the previous definitions, in this example,
Then, the following matrix is nonsingular:
Consider the 3-factor method:
Let Then
4.4. Optimality Conditions for Equality-Constrained Optimization Problems
In this section, we consider optimization problem (4):
where is a sufficiently smooth function and is a sufficiently smooth mapping from a Banach space X to a Banach space Y.
4.4.1. Optimality Conditions: Lagrange Multiplier Theorem
There is an extensive body of literature discussing optimality conditions for regular constrained optimization problems, which are problems that satisfy certain constraint qualifications. One notable reference on this topic is Chapter 3 of the book [47].
The classical optimality conditions state that if is a regular solution of Problem (4), then there exists a Lagrange multiplier in the form of a constant vector , such that
where denotes the adjoint of , and and denote the dual spaces of X and Y, respectively.
The situation changes in the degenerate case when the derivative is not surjective. In such cases, the classical optimality conditions in the form of Equation (43) do not hold, as illustrated in the following example.
4.4.2. Optimality Conditions for p-Regular Optimization Problems
In this section, we will focus on the case when the equality constraints defined by mapping are not regular at a solution of the problem (4). We define the p-factor-Lagrange function as
where , for , and the mappings are defined in (16). Note that the p-factor-Lagrange function is a generalization of the classical Lagrange function and reduces to it in the regular case.
The development of optimality conditions for nonregular problems has become an active area of research (see [16,48,49,50,51] and references therein).
To state the sufficient conditions in Theorem 13, we also introduce an alternative version of the p-factor-Lagrange function, which is defined as follows:
To state optimality conditions for p-regular optimization problems, we use the definition of strong p-regularity at given in Definition 7. We also use the set defined in Equation (23), and the operator defined in Equation (19).
Theorem 13
([16], necessary and sufficient conditions for optimality). Assume that X and Y are Banach spaces, U is a neighborhood of a point in X, is a twice continuously Fréchet differentiable function in U, and is a -times Fréchet differentiable mapping in U.
Necessary conditions for optimality.
Assume that for an element , the set is closed in . Suppose that F is p-regular at the point along the vector . If is a local minimizer of problem (4), then there exist multipliers such that the partial derivative of the function with respect to x, denoted by , satisfies
Sufficient conditions for optimality.
Assume that the set is closed in for every , and that Assume also that the mapping F is strongly p-regular at Suppose that there exist a constant and a multiplier such that Equation (47) is satisfied, and that the second-order partial derivative of the function (defined in (46)) with respect to x, denoted by , satisfies
for every Then is a strict local minimizer of the problem Equation (4).
Example 10.
In this example, we continue with the analysis of problem (44) from Section 4.4.1. It can be verified that the point is a local minimizer of Equation (44). In Example 6, we showed that the mapping is 2-regular at along the vector .
In this example, the 2-factor Lagrange function , defined in (45) for , is given by
where , , and Substituting the given expressions, we obtain the following form:
Solving the equation
we obtain the following system:
Substituting , we obtain and . Hence, the function defined in (46), takes the following form in this example:
Recall that the set was determined in Example 6 as
4.5. Modified Lagrangian Function Method
4.5.1. The Problem
Consider the following constrained optimization problem:
where is an objective function, and are constraint functions. The goal is to find a vector such that is minimized while satisfying all constraints. To solve this, we introduce the modified Lagrangian function , which incorporates both the objective function and the constraints (see, e.g., [45,52,53]):
where . This modified Lagrangian function transforms the nonlinear optimization problem into a system of nonlinear equations.
Define the mapping by
where , and .
Consider the equation
Let be the Jacobian matrix of the mapping . Then, the Jacobian matrix of the mapping is given by
Define the set consisting of active constraints, and the set
consisting of weakly active constraints, and the set , consisting of strongly active constraints.
Recall that the Strict Complementary Condition (SCQ) means that, for each index , one and only one of and is equal to zero. If is a solution of Problem (52), and for some index j, both and , then the set is nonempty, and the SCQ fails. Consequently, is a degenerate matrix. Example 11 illustrates this situation.
4.5.2. Modified Lagrange Function Method for 2-Regular Problems
In this section, we consider the constrained optimization problem (49) with the modified Lagrangian function defined in (50). We focus on the nonregular case when the Jacobian matrix defined in (53) is singular at the solution of (52).
We will show that the mapping defined in (51) is 2-regular at .
Without loss of generality, assume that , so that and for all . Additionally, we assume that . Introduce the notation . Then, the rows of matrix with the numbers from the th to the th contain only zeros. Define the vector as follows
where is an s-dimensional all-one row vector.
The following result is well known.
Lemma 3
([45]). Let an matrix V and an matrix Q satisfy the properties:
- 1.
- Q has linearly independent columns, and
- 2.
- for all .
Assume also that is a full-rank diagonal matrix. Then, the matrix defined by
is a nonsingular matrix.
The Linear Independence Constraint Qualification (LICQ) holds for the optimization problem (49) if the gradients of active constraints are linearly independent.
The second-order sufficient optimality condition holds if there exists such that
for all that satisfy the conditions
Lemma 4
The proof of Lemma 4 can be derived from Lemma 3.
Indeed, if is a diagonal matrix with as the j-th diagonal entry,
and
then , where matrix is defined in (57). Lemma 4 implies that the 2-factor Newton method is given by
and it can be applied to solve the system (52), where G is defined in (51). As a result, we have the following theorem.
Theorem 14
([45]). Let be a solution to (49) and , for . Assume that the LICQ and the second-order sufficient optimality conditions (58) are satisfied at the point . Then, there exists a sufficiently small open ball , where , such that the estimate
holds for the method (59), where and is a constant independent of k.
In addition, there are other publications where a modified Lagrange function is used in various contexts, such as [54,55]. Higher-order analysis of optimality conditions has been performed in [56].
4.6. Calculus of Variations
The methods of the calculus of variations are widely used to solve many problems in physics and classical mechanics. However, since the classical approach cannot be directly applied to many of these problems, there is a need to extend or reformulate classical theorems to accommodate irregular cases. Over the years, various types of irregular problems in the calculus of variations have been extensively studied in both mathematics and its applications (see, e.g., [1,3,32,57,58,59,60,61,62]).
4.6.1. Singular Problems of Calculus of Variations
In this section, we consider the following Lagrange problem, which involves finding a curve , such that (see [63]):
subject to the subsidiary conditions:
where
and
We assume that all mappings and their partial derivatives are continuous with respect to and . We denote by a solution to Problem (60) and (61). While each of x, , and each component of x is a function of t, (e.g., , and ), we do not write this explicitly in order to avoid over complicated notation.
In the regular case, when the Euler–Lagrange necessary conditions are satisfied and take the form (see, e.g., [60,64]):
Let Then
In the singular case, when we can only guarantee that the following equation is satisfied:
where In this case, might be equal to 0, which results in no constructive conditions for the description or finding .
Example 12
([63]). Consider the following problem of finding a curve such that
subject to
where , and
Here,
The solution of Problem (64) and (65) is and is singular. Indeed, using the differentiation rules in functional spaces, we obtain
Introducing the notation and using the methods of differential equations, one can show that the mapping with boundary conditions is not surjective. Indeed, for satisfying
the equation does not have a solution.
With , the corresponding Euler–Lagrange equations in this case are as follows:
Unfortunately, we cannot guarantee that . For , we obtain a series of spurious solutions to the problem (64) and (65):
The derivation of the solutions (66) is based on standard techniques, so we are omitting the technical details from the paper.
4.6.2. Optimality Conditions for p-Regular Problems of Calculus of Variations
To formulate optimality conditions for the problem (60) and (61) in the singular case, we define the p-factor Euler–Lagrange function by
where
Functions , are determined for the mapping in a way that is similar to how functions are defined for the mapping in Equation (16). Namely,
Let
where
Theorem 15
The proof of Theorem 15 is similar to the one for the singular isoperimetric problem in [65].
We now go back to Example 12 for further consideration. The mapping is 2-regular at along . This means that in this problem .
Consider the following equation
The equation is equivalent to the system of Euler–Lagrange equations
One can verify that the following “false solutions” of (64) and (65),
do not satisfy the system (68) if This implies that
are not solutions to the two-factor Euler–Lagrange Equation (67) from Theorem 15. Therefore, the only solution to Example 12 is Indeed, the two-factor Euler–Lagrange equation in this case has the following form:
This system has the solution and ,
4.7. Existence of Solutions to Nonlinear Equations
This section addresses the existence of a solution to an equation of the form (3), , in the neighbourhood of a chosen point . A very general setting is considered, where the function F maps from a Banach space X to a Banach space Y, and the assumptions pertain to the properties of its derivatives in the neighborhood under consideration. This is one of the classical problems of nonlinear analysis, with many important applications, especially in the theory of differential equations (cf. [66,67,68]).
One well-known method for addressing this problem is Newton’s method (see [69]). The solution is obtained as the limit of a recursively defined sequence of approximations. This method is applied in the proof of the first theorem in this section. In particular, the existence of the inverse operator to the derivative of the function at a chosen point is assumed.
The next theorem presented is more general and uses the p-factor construction of the operator for functions of class . A certain limitation of this construction is the assumption of the existence of continuous projections onto subspaces of Y corresponding to successive orders of the derivatives of the function F.
4.7.1. Existence of Solutions to Nonlinear Equations in the Regular Case
Let X and Y be Banach spaces. Consider a mapping and a problem of existence of a point such that We know that equation is solvable and has a solution when the operator is surjective [27,70]. A modified version of the following theorem was given in [70].
Theorem 16.
Let X and Y be Banach spaces, and let , and let Assume and for some constant . Suppose that the derivative is invertible and there exist constants and such that . If, moreover, the following conditions are satisfied:
- 1.
- 2.
- 3.
- ,
then the equation has a solution
If the first derivative of F at is not surjective, then Theorem 16 cannot be applied. Consider, for example, a mapping defined by
Note that if , the assumptions of Theorem 16 are not satisfied, but the equation still has a solution .
4.7.2. Existence of Solutions to Nonlinear Equations in the Singular Case
In this section, we continue considering the problem introduced in Section 4.7.1. Specifically, let X and Y be Banach spaces, and let . Assume that for some . We are interested in the existence of a solution to the equation in some open ball of such that . Most of the work in solving this problem focuses on Newton’s method or its modifications, under the assumption that is regular (see, e.g., [71]).
Now, consider the degenerate case where is not regular. The focus here is on finding a small constant such that the neighborhood contains a solution to the equation . We introduce the following notation and assumptions for some :
The following theorem was proved in [72].
Theorem 17.
Let X and Y be Banach spaces, and let be of class Assume that there exists with , such that F is a p-regular mapping at along
Assume also that there exists where such that the following inequalities hold:
- 1.
- 2.
Then the equation has a solution , where is a fixed point such that
Recall that if our focus is on finding a radius such that the open ball contains a solution of , then Theorem 17 implies that . For example, we can take
As an example of singular nonlinear equation, we consider the problem of existence of local nontrivial solutions of the Boundary Value Problem (BVP) for the ordinary differential equation
with the boundary conditions
which is degenerate at . Here, and are given functions such that
Remark 4.
Recall that the operator is defined in (19). The surjectivity of the operator for any implies the p-regularity condition of the mapping F at the point (by the definition). It is also equivalent to the following inequality with a vector h such that :
4.8. Differential Equations
4.8.1. Nonlinear Boundary-Value Problem
The nonlinear BVP analyzed in this section has the form
with boundary conditions
where , , and g is a function from to , satisfying
We are interested in the problem of the existence of a solution to the BVP (74) and (75) for given functions and .
Introduce the notation
and regard F as a mapping where
and We can rewrite Equation (74) as
The assumptions (75) and (76) imply that is a solution of (78): . Without loss of generality, we restrict our attention to a neighborhood of the point . The problem of existence of a solution to the BVP (74) and (75) for a given function is equivalent to the problem of existence of an implicit function , such that and
If and the mapping F is regular at —that is, if the partial derivative of F with respect to y, denoted , is a surjective linear operator—then the classical IFT 9 guarantees the existence of a smooth mapping defined on a neighborhood of such that and . In this case, the operator is given by
since .
However, the situation changes in the nonregular case. Consider, for example, the BVP
which has no solution. To see this, multiply both sides of the equation by and integrate from 0 to . The left-hand side, after integration by parts, evaluates to zero, while the right-hand side is nonzero. In this example, the operator is not surjective, and, therefore, the classical Implicit Function Theorem does not apply to guarantee the existence of a solution to Equation (78).
4.8.2. Nonlinear Boundary-Value Problem in the Nonregular Case
We consider the boundary-value problem (74) and (75) in the nonregular case, using the definitions and notation introduced in Section 4.8.1. Our analysis is restricted to a neighborhood of the point ,
As shown in Section 4.8.1, the operator is not surjective. In this case, we apply the pth-order Implicit Function Theorem 11 with to derive conditions for the existence of an implicit function , and, consequently, for the existence of a solution to the BVP (74) and (75).
To apply Theorem 11, we first introduce some auxiliary spaces and functions for the mapping , in accordance with Section 4.2.2.
By the definition of the operator in (80), its image is the set of all , such that there exists satisfying
The general solution of (81) has the form:
Substituting the boundary conditions yields and
Hence,
and, as expected, . The kernel of is defined by the boundary value problem
whose solution is , with . Therefore, .
Let be a closed complementary subspace to . Then, and the projection operator is defined as
Next, define the mappings and by
For , the 2-factor-operator has the form:
where is a function.
Example 13.
Consider the following nonlinear BVP:
where , , , v is a constant, and , with X, Y and Z defined above.
We now verify that all conditions of the pth-order Implicit Function Theorem 11 are satisfied for the mapping with a sufficiently small and . Note that is a solution of the homogeneous BVP corresponding to (86), so that .
For , Condition 1 of Theorem 11 holds for F due to the structure of the mapping , as well as and introduced in (84).
Condition 2 (the 2-factor-approximation) depends only on the properties of the mapping and reduces to the existence of a sufficiently small and a neighborhood of such that for all ,
and
Both inequalities hold, and hence Condition 2 is satisfied.
Condition 3 is equivalent to the existence of a neighborhood such that for some , there exists a function and such that
Problem (87) has an explicit solution
which exists only for . Then, Condition 3 reduces to verifying that there exists a constant such that
This inequality is equivalent to
which is satisfied, for instance, by taking .
To verify Condition 4, we observe that with and h given by (88), the set consists of a single element The operator , defined in (85), with takes the form:
which is surjective, and therefore Condition 4 is satisfied.
Having verified all four conditions of Theorem 11, we conclude that there exists a solution to the BVP (86), satisfying
4.9. Interpolation by Polynomials
In this section, we consider one of the newest applications of the p-regularity theory. There are many books on numerical analysis and numerical methods where the topics of interpolation and polynomial approximation are described in detail (see, for example, [73,74]).
4.9.1. Newton Interpolation Polynomial
Let f be and consider the equation
where . For some , define the points , , as follows:
Let
The problem of interpolation is to find a polynomial of degree at most n such that , , and that gives a good approximation of the function .
Let be sufficiently small and assume that , where is a constant. Assume that the equation has a solution , and the equation has a solution . Our goal is to use the interpolation polynomial and its solution to obtain the -accuracy of the solution , in the sense that
where is a constant. In the regular case, this can be obtained by using, for example, the Newton interpolation polynomial with
Recall that the Newton interpolation polynomial of degree n, related to the data points
is defined by
where
The coefficients are called divided differences and are defined using the following relations:
where
In the following example, we consider a nonlinear function , which is not regular at a solution of the equation , and investigate whether a solution of the equation provides the desired accuracy (89) for the solution of , assuming that holds for a sufficiently small .
Example 14.
Consider the function . The solution of the equation is . The function is singular at up to the second order because for The goal in this example is to investigate whether the estimate (89) is satisfied when using the interpolation polynomial and a solution of to approximate the solution of . Using the equations given above with , we obtain
where the coefficients and are determined by using Equation (91).
Let be sufficiently small and consider the segment . The interpolation points are and . Calculating the coefficients, we obtain
Hence, the interpolation polynomial has the form
Moreover,
for a sufficiently small ε.
The solution of the equation is
which is not satisfactory from the approximation accuracy point of view, since
and the desired accuracy (89) is not obtained.
Thus, in the degenerate case, contrary to the regular case, while we have the required accuracy of the approximation for the function , the accuracy of the solution is only of the order ε, and not .
4.9.2. The p-Factor Interpolation Method
In this section, we demonstrate that the desired accuracy (89) for the solution of the equation in the degenerate case can be achieved by using the p-factor interpolation polynomial, rather than the classical Newton interpolation polynomial, to obtain an approximate solution of .
Let be a function that is singular at a point .
For some , we associate f with its corresponding p-factor function , defined as
where , . Similarly to the Newton interpolation method, we construct the p-factor interpolation polynomial using the function as follows:
where the functions are defined in the same way as in (90), and the coefficients , for , are given by
Theorem 18.
Let the equation has a solution . Assume that is p-regular along at the point . Suppose that is the Newton interpolation polynomial for the associated function , constructed with a sufficiently small interpolation step .
Then, the equation has a solution such that
where is an independent constant.
We omit the proof, as it is similar to the proof of convergence of the classical iterative Newton method.
As in the previous sections, we say that a function is p-regular along at the point if there is a natural number such that
Note that if , the definition of a p-regular function f reduces to the standard definition of a regular function, and the p-factor interpolation polynomial coincides with the classical Newton interpolation polynomial .
Example 15.
We will apply the p-factor interpolation method to the function from Example 14. Define the function for and as
and consider the p-factor interpolation polynomial . Using the same interval as in Example 14, the interpolation points are and . The coefficients are given by
and
Thus, the p-factor interpolation polynomial is
Hence, for a sufficiently small ε, we have
Solving the equation , we obtain
Therefore,
an we thus obtain the desired -accuracy stated in estimate (89) for the solution of the equation .
Let us now compare the use of the classical polynomial with the p-factor interpolation polynomial in approximating the solution of the equation , for the function f from Example 14. As mentioned earlier, the polynomial is a good approximation for the function in the sense that
However, the solution of does not yield the desired accuracy for , since
and the target accuracy of order is not achieved.
In contrast, the p-factor interpolation polynomial approximates with order :
Thus, using the p-factor interpolation polynomial , we achieve the desired accuracy for the solution of . Specifically, as shown above, the solution of satisfies estimate (89):
This level of accuracy could not be achieved using the classical interpolation polynomial .
5. Conclusions
In this paper, we described various applications of the theory of p-regularity, including the generalization of the Lyusternik and Implicit Function theorems, the Newton method, optimality conditions for equality and inequality constraints, calculus of variations, and the solvability of nonlinear equations.
We should note that we did not cover all areas where the results of the theory can be applied. In addition, there are other areas of mathematics where the theory of p-regularity (or p-factor-analysis) has not yet been applied. For example, we did not provide examples of applying the theory to the analysis of the existence of solutions for singular nonlinear partial differential equations, such as the Burger’s nonlinear equation, the Laplace nonlinear differential equation, and others. We also did not cover results related to the existence of solutions depending on a parameter for the Van der Pol differential equation, the Duffing equation, and others. Other results not covered in this paper include examples of applying the theory of p-regularity for the analysis of nonlinear dynamical systems, optimality conditions for optimal control problems in the nonregular (degenerate) case. Based on the theory of p-regularity, we can develop the theory of so-called p-convexity, which can be effective for the analysis of nonlinear problems. Additional information can be found in other studies by the authors.
Author Contributions
Conceptualization, O.B., A.P. and A.A.T.; Methodology, E.B., O.B., A.P. and A.A.T.; Validation, O.B., K.L., A.P. and A.A.T.; Formal analysis, E.B., O.B., K.L., A.P. and A.A.T.; Writing—original draft, E.B., O.B., K.L., A.P. and A.A.T.; Writing—review and editing, E.B., O.B., K.L. and A.A.T. All authors have read and agreed to the published version of the manuscript.
Funding
This research received no external funding.
Institutional Review Board Statement
Not applicable.
Data Availability Statement
The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author(s).
Acknowledgments
We would like to thank the reviewers for their careful and detailed reading of our manuscript. We are sincerely grateful for their constructive comments, insightful suggestions, and the time and effort they dedicated to evaluating our work. Their feedback has been invaluable in helping us improve the clarity, quality, and overall presentation of this paper.
Conflicts of Interest
The authors declare no conflict of interest.
References
- Ioffe, A.D. Variational Analysis of Regular Mappings. Theory and Applications, 1st ed.; Springer: Berlin/Heidelberg, Germany, 2017. [Google Scholar]
- Banach, S. Théorie des Opérations Linéaires; Monografie Matematyczne, Warszawa 1932: English Translation; North Holland: Amsterdam, The Netherlands, 1987. [Google Scholar]
- Dontchev, A.L. Lectures on variational analysis. In Applied Mathematical Sciences; Springer: Cham, Switzerland, 2021; Volume 205, p. xii+219. [Google Scholar] [CrossRef]
- Graves, L. Some mapping theorems. Duke Math. J. 1950, 17, 111–114. [Google Scholar] [CrossRef]
- Dontchev, A.L. The Graves theorem revisited. J. Convex Anal. 1996, 3, 45–54. [Google Scholar]
- Dontchev, A.L.; Bauschke, H.H.; Burachik, R.; Luke, D.R. The Inverse Function Theorems of L. M. Graves. In Splitting Algorithms, Modern Operator Theory, and Applications; Springer International Publishing: Berlin/Heidelberg, Germany, 2019. [Google Scholar] [CrossRef]
- Dmitruk, A.V.; Milyutin, A.A.; Osmolovskii, N.P. Lyusternik’s Theorem and the Theory of Extrema. Russ. Math. Surv. 1980, 35, 11–51. [Google Scholar] [CrossRef]
- Dontchev, A.L.; Rockafellar, R.T. Implicit Functions and Solution Mappings, 2nd ed.; Springer Series in Operations Research and Financial Engineering; Springer: New York, NY, USA, 2014. [Google Scholar] [CrossRef]
- Dontchev, A.L.; Frankowska, H. Lyusternik-Graves theorem and fixed points I. Proc. Am. Math. Soc. 2011, 139, 521–534. [Google Scholar] [CrossRef]
- Dontchev, A.L.; Frankowska, H. Lyusternik-Graves theorem and fixed points II. J. Convex Anal. 2012, 19, 955–973. [Google Scholar]
- Frankowska, H. High order inverse function theorems. Ann. Inst. H. Poincaré C Anal. Non Linéaire 1989, 6, 283–303. [Google Scholar] [CrossRef]
- Ekeland, I. An inverse function theorem in Fréchet spaces. Ann. Inst. H. Poincaré C Anal. Non Linéaire 2011, 28, 91–105. [Google Scholar] [CrossRef]
- Hamilton, R. The inverse function theorem of Nash and Moser. Bull. Am. Math. Soc. 1982, 7, 65–222. [Google Scholar] [CrossRef]
- Bednarczuk, E.M.; Leśniewski, K.W.; Rutkowski, K.E. On tangent cone to systems of inequalities and equations in Banach spaces under relaxed constant rank condition. ESAIM Control Optim. Calc. Var. 2021, 27, 22. [Google Scholar] [CrossRef]
- Tret’yakov, A.A. Necessary conditions for optimality of p-th order. In Control and Optimization; Moscow State University: Moscow, Russia, 1983; pp. 28–35. [Google Scholar]
- Tret’yakov, A.A. Necessary and sufficient conditions for optimality of p-th order. USSR Comput. Math. Math. Phys. 1984, 24, 123–127. [Google Scholar] [CrossRef]
- Buchner, M.; Marsden, J.; Schecter, S. Applications of the blowing-up construction and algebraic geometry to bifurcation problems. J. Differ. Equ. 1983, 48, 404–433. [Google Scholar] [CrossRef]
- Fink, J.; Rheinboldt, W. A geometric framework for the numerical study of singular points. SIAM J. Numer. Anal. 1987, 24, 618–633. [Google Scholar] [CrossRef]
- Krantz, S.; Parks, H. The Implicit Function Theorem: History, Theory, and Applications; Birkha¨user: Boston, MA, USA; Basel, Switzerland; Berlin, Germany, 2002. [Google Scholar]
- Dieudonné, J. Foundations of Modern Analysis; Pure and Applied Mathematics; Academic Press: New York, NY USA; London, UK, 1969; Volume 10-I. [Google Scholar]
- Izmailov, A.F.; Tret’yakov, A.A. Factor-Analysis of Nonlinear Mappings, 1st ed.; Nauka: Moscow, Russia, 1994. [Google Scholar]
- Tret’yakov, A.A.; Marsden, J.E. Factor–analysis of nonlinear mappings: p-regularity theory. Commun. Pure Appl. Anal. 2003, 2, 425–445. [Google Scholar]
- Abraham, R.; Marsden, J.E.; Ratiu, T. Manifolds, Tensor Analysis and Applications; Springer: New York, NY, USA, 1988. [Google Scholar]
- Tret’yakov, A.A. The implicit function theorem in degenerate problems. Russ. Math. Surv. 1987, 42, 179–180. [Google Scholar] [CrossRef]
- Alekseev, V.M.; Tihomirov, V.M.; Fomnin, S.V. Optimal Control; Nauka: Moscow, Russia, 1979. [Google Scholar]
- Bühler, T.; Salamon, D.A. Functional Analysis; Springer: Berlin, Germany, 1991. [Google Scholar]
- Kantorovitch, L.V.; Akilov, G.P. Functional Analysis; Pergamon Press: Oxford, UK, 1982. [Google Scholar]
- Ioffe, A.D.; Tihomirov, V.M. Theory of extremal problems. Stud. Math. Its Appl. 1979, 6, 1–460. [Google Scholar]
- Clarke, F.H. Optimization and Nonsmooth Analysis; Wiley: New York, NY, USA, 1983. [Google Scholar]
- Liusternik, L.A.; Sobolev, V.J. Elements of Functional Analysis; Hindustan Publishing: Delhi, India, 1961. [Google Scholar]
- Dontchev, A.L.; Levis, A.S.; Rockafellar, R.T. The radius of metric regularity. Trans. Am. Math. Soc. 2002, 355, 493–517. [Google Scholar] [CrossRef]
- Ioffe, A.D. Metric regularity and subdifferential calculus. Variational Analysis of Regular Mappings. Theory and Applications. Russ. Math. Surv. 2000, 55, 501–558. [Google Scholar] [CrossRef]
- Sekiguchi, Y.; Takahashi, W. Tangent and normal vectors to feasible regions with geometrically derivable sets. In Scientiae Mathematicae Online; Japanese Association of Mathematical Sciences: Sakai, Osaka, Japan, 2006; pp. 517–527. [Google Scholar]
- Ledzewicz, U.; Schättler, H. A higher order generalization of the Lusternik theorem. Nonlinear Anal. 1998, 34, 793–815. [Google Scholar] [CrossRef]
- Izmailov, A.F. Theorems on the representation of nonlinear mapping families and implicit function theorems. Math. Notes 2000, 67, 45–54. [Google Scholar] [CrossRef]
- Izmailov, A.F. Some generalizations of the Morse lemma. Tr. Mat. Inst. Steklov (Proc. Steklov Inst. Math.) 1998, 220, 142–156. [Google Scholar]
- Brezhneva, O.A.; Tret’yakov, A.A.; Marsden, J.E. Higher-order implicit function theorems and degenerate nonlinear boundary-value problems. Commun. Pure Appl. Anal. 2008, 7, 293–315. [Google Scholar] [CrossRef]
- Randrianantoanina, B.; Randrianantoanina, N. (Eds.) Banach Spaces and Their Applications in Analysis. In Honor of Nigel Kalton’s 60th Birthday; De Gruyter: Berlin, Germany; Boston, MA, USA, 2007. [Google Scholar] [CrossRef]
- Kostina, E.; Kostyukova, O. Generalized implicit function theorem and its application to parametric optimal control problems. J. Math. Anal. Appl. 2006, 320, 736–756. [Google Scholar] [CrossRef]
- Arutyunov, A.V.; Zhukovskiy, S.E. Nonlocal Generalized Implicit Function Theorems in Hilbert Spaces. Differ. Equ. 2020, 56, 1525–1538. [Google Scholar] [CrossRef]
- Arutyunov, A.V. On Implicit Function Theorems at Abnormal Points. Proc. Steklov Inst. Math. 2010, 271, 18–27. [Google Scholar] [CrossRef]
- Arutyunov, A.V.; Salikhova, K. Implicit Function Theorem in a Neighborhood of an Abnormal Point. Proc. Steklov Inst. Math. 2021, 315, 19–26. [Google Scholar] [CrossRef]
- Szczepanik, E.; Prusińska, A.; Tret’yakov, A.A. The p-factor method for nonlinear optimization. Schedae Informaticae 2012, 21, 143–159. [Google Scholar]
- Buhmiler, S.; Krejič, N.; Lužanin, Z. Practical quasi-Newton algorithms for singular nonlinear systems. Numer. Algor. 2010, 55, 481–502. [Google Scholar] [CrossRef]
- Brezhneva, O.A.; Evtushenko, Y.G.; Tret’yakov, A.A. The 2-factor-method with a modified Lagrange function for degenerate constrained optimization problems. Dokl. Math. 2006, 73, 384–387. [Google Scholar] [CrossRef]
- Candelario, G.; Cordero, A.; Torregrosa, J.R.; Vassileva, M.P. Generalized conformable fractional Newton-type method for solving nonlinear systems. Numer. Algor. 2023, 93, 1171–1208. [Google Scholar] [CrossRef]
- Bonnans, J.F.; Shapiro, A. Perturbation Analysis of Optimization Problems; Springer: New York, NY, USA; Berlin/Heidelberg, Germany, 2000. [Google Scholar]
- Avakov, E.R.; MagarilIlyaev, G.G.; Tikhomirov, V.M. The level-set of a smooth mapping in a neighborhood of a singular point, and zeros of a quadratic mapping. Russ. Math. Surv. 2013, 68, 401–433. [Google Scholar] [CrossRef]
- Brezhneva, O.A.; Tret’yakov, A.A. Optimality conditions for degenerate extremum problems with equality constraints. SIAM J. Control Optim. 2003, 42, 723–745. [Google Scholar] [CrossRef]
- Constantin, E. Necessary conditions for weak minima and for strict minima of order two in nonsmooth constrained multiobjective optimization. J. Glob. Optim. 2021, 80, 177–193. [Google Scholar] [CrossRef]
- Gfrerer, H. Second-order necessary conditions for nonlinear optimization problems with abstract constraints: The degenerate case. SIAM J. Optim. 2007, 18, 589–612. [Google Scholar] [CrossRef]
- Evtushenko, Y.G. Generalized Lagrange multiplier technique for nonlinear programming. J. Optim. Theory Appl. 1977, 21, 121–135. [Google Scholar] [CrossRef]
- Bertsekas, D.P. Nonlinear Programming; Athena Scientific: Belmont, MA, USA, 2016. [Google Scholar]
- Antipin, A.S.; Vasilieva, O.O. Dynamic method of multipliers in terminal control. Comput. Math. Math. Phys. 2015, 55, 766–787. [Google Scholar] [CrossRef]
- Zhiltsov, A.V.; Namm, R.V. The Lagrange multiplier method in a finite-dimensional convex programming problem. Dalnevost. Mat. Zh. 2015, 15, 53–60. [Google Scholar]
- Brezhneva, O.A.; Tret’yakov, A.A. When the Karush–Kuhn–Tucker Theorem Fails: Constraint Qualifications and Higher-Order Optimality Conditions for Degenerate Optimization Problems. J. Optim. Theory Appl. 2017, 174, 367–387. [Google Scholar] [CrossRef]
- Bradley, J.S.; Everitt, W.N. Inequalities Associated with Regular and Singular Problems in the Calculus of Variations. Trans. Am. Math. Soc. 1973, 182, 303–321. [Google Scholar] [CrossRef]
- Konjik, S.; Kunzinger, M.; Oberguggenberger, M. Foundations of the calculus of variations in generalized function algebras. Acta Appl. Math. 2008, 103, 169–199. [Google Scholar] [CrossRef]
- Lecke, A.; Baglini, L.L.; Giordano, P. The classical theory of calculus of variations for generalized functions. Adv. Nonlinear Anal. 2019, 8, 779–808. [Google Scholar] [CrossRef]
- Gelfand M., F.S.V. Calculus of Variations; Prentice-Hall: Englewood Cliffs, NJ, USA, 1963. [Google Scholar]
- Sivaloganathan, J. Singular minimisers in the Calculus of Variations: A degenerate form of cavitation. Ann. L’Institut Henri Poincaré C Anal. Non Linéaire 1992, 9, 657–681. [Google Scholar] [CrossRef]
- Tuckey, C. Nonstandard Methods in the Calculus of Variations; Pitman Research Notes in Math. Ser. 297; Longman Scientific & Technical: Harlow, UK, 1993. [Google Scholar]
- Prusińska, A.; Szczepanik, E.; Tret’yakov, A.A. p-th order optimality conditions for singular Lagrange problem in calculus of variations. Elements of p-regularity theory. In System Modeling and Optimization. CSMO 2011. IFIP Advances in Information and Communication Technology; Springer: Berlin/Heidelberg, Germany, 2013; Volume 391, pp. 528–537. [Google Scholar]
- Bellman, R.; Adomian, G. The Euler-Lagrange Equations and Characteristics; Springer: Dordrecht, The Netherlands, 1985; pp. 36–58. [Google Scholar] [CrossRef]
- Korneva, I.T.; Tret’yakov, A.A. Application of the factor-analysis to the calculus of variations. In Proceedings of Simulation and Analysis in Problems of Decision Making Theory; Computing Center of the Russian Academy of Sciences: Moscow, Russia, 2002; pp. 144–162. (In Russian) [Google Scholar]
- Cabada, A.; Aleksić, S.; Tomović, T.V.; Dimitrijević, S. Existence of Solutions of Nonlinear and Non-local Fractional Boundary Value Problems. Mediterr. J. Math. 2019, 16, 119. [Google Scholar] [CrossRef]
- Bobisud, L.E. Existence of solutions for nonlinear singular boundary value problems. Appl. Anal. 1990, 35, 43–57. [Google Scholar] [CrossRef]
- Zhang, G.; Cheng, S.S. Existence of solutions for a nonlinear system with a parameter. J. Math. Anal. Appl. 2006, 314, 311–319. [Google Scholar] [CrossRef]
- Reddien, G.W. On Newton’s method for singular problems. SIAM J. Numer. Anal. 1978, 15, 993–996. [Google Scholar] [CrossRef]
- Demidovitch, B.P.; Maron, I.A. Computational Mathematics; Nauka: Moscow, Russia, 1973. [Google Scholar]
- Moore, R.E. A Test for Existence of Solutions to Nonlinear Systems. SIAM J. Numer. Anal. 1977, 14, 611–615. [Google Scholar] [CrossRef]
- Prusińska, A.; Tret’yakov, A.A. On the Existence of Solutions to Nonlinear Equations Involving Singular Mappings with Non-zero p-Kernel. Set-Valued Anal. 2011, 19, 399–416. [Google Scholar] [CrossRef][Green Version]
- Bradie, B. A Friendly Introduction to Numerical Analysis; Pearson Prentice Hall: Upper Saddle River, NJ, USA, 2006. [Google Scholar]
- Burden, R.L.; Faires, J.D. Numerical Analysis; Thomson Brooks/Cole: Pacific Grove, CA, USA, 2005. [Google Scholar]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).