Abstract
Two sufficiency theorems for a parametric and a nonparametric problem of Bolza in optimal control are derived. The dynamics of the problems are nonlinear, the initial and final states are free, and the main results can be applied when nonlinear mixed time-state-control inequality and equality constraints are present. The deviation between admissible costs and optimal costs around the optimal control is estimated by functionals playing the role of the square of some norms.
Keywords:
calculus of variations; optimal control; mixed restraints; inequality and equality restrictions; free initial and final states; sufficient conditions; weak minima; measurable optimal controls
MSC:
49K15
1. Introduction
In this paper, we derive two new sufficiency theorems for optimal control problems posed as parametric and nonparametric problems of Bolza with nonlinear dynamics, free initial and final states, and inequality and equality mixed time-state-control constraints. The fundamental components of the sufficiency theorems of this article are a condition similar to the Pontryagin maximum principle, a hypothesis usually called the transversality condition, a crucial second order inequality arising from the original algorithm employed to prove one of the sufficiency theorems, a hypothesis related to the Legendre–Clebsch necessary condition, the positivity of a quadratic function on the cone of critical directions, and a fundamental integral Weierstrass inequality involving a function whose role is parallel to the Hamiltonian of the problem. Given an admissible process, its set of active indices of the inequality restrictions has to be piecewise constant on the underlying time interval, and the Lagrange multipliers associated with the inequality mixed constraints must be nonnegative and, in fact, zero whenever the corresponding index is inactive. The optimal control of the proposed optimal process need not be continuous but only measurable. Compare, for example, [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17], where the authors study several optimal control problems having a degree of generality very similar to the one treated in this paper and where the continuity of the optimal controls is a crucial assumption in those sufficiency theories. In the first of the sufficiency theorems of this work, the deviation between optimal costs and feasible costs is estimated by quadratic functionals, two of them playing the role of the square of the norm of the classical Banach space .
It is worth mentioning that second order sufficient conditions, as pointed out in [15], are necessary in nonlinear problems when the extremal is not unique or when an existence theorem is not applicable. In addition, sufficiency treatments have proved to be of crucial relevance in some parametric optimal control problems concerned with the analysis of stability or sensitivity, see, for example, [16,17]. In the previous references, the initial or final states are free, but they are restricted to lie in some surfaces delimited by curves; in contrast, the initial and final states of the nonparametric optimal control problem studied in this article are completely free, in the sense that they are not necessarily restricted to a parametrization, but must only belong to sets contained in the image of a surface determined by a function. On the other hand, it is worth observing that all the crucial hypotheses of the sufficiency treatment studied in this article are stated in the theorems, in contrast with other second order necessary and sufficiency theories that depend upon the verifiability of some crucial preliminary assumptions. See, for example, [18,19,20], where the necessary second order conditions for optimality depend on some previous hypotheses involving some notions of normality or regularity of a solution, or [11], where the corresponding sufficiency theory depends on the linear independence of some vectors whose role is that of the gradients of the active inequality and the equality restrictions. Finally, it is important to point out that, in [21,22], one can also find some sufficiency theories where the deviation between admissible and optimal costs around the optimal control has a quadratic growth.
The main novelties of this paper are the following. The sufficiency technique used to prove Theorem 1 is independent of the standard hypothesis of continuity of the optimal controls, an assumption imposed in almost all the sufficiency theories having a degree of generality similar to the one studied in this article. In Corollary 1, the initial and final points of the states are not only variable, but they are completely free, in the sense that they may belong to any sets that must only be contained in a manifold. The sufficiency method employed to prove one of the results of the paper does not invoke classical sufficiency tools such as bounded matrix-valued Riccati equations, Hamilton–Jacobi inequalities, generalized notions of conjugate points, the linear independence of the gradients involving the active inequality and the equality constraints, insertions of the original optimal control problem in an abstract optimization problem involving a Banach space, or certain techniques based on arguments of convexity, see [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17] for details. In the parametric sufficiency theorem of the article, if an admissible process satisfies all of its hypotheses, then it is not only a weak minimum, but the deviation between the optimal cost and the admissible costs is also estimated by functionals playing roles similar to the squares of several norms.
The article is organized as follows. In Section 2, we state the parametric optimal control problem we shall be concerned with, together with some elementary definitions, and we also pose one of the main results of the paper. In Section 3, we establish the nonparametric optimal control problem we shall be interested in, some fundamental definitions, a corollary that forms one of the crucial results of the paper, and an example illustrating how one can apply the results of the article. Section 4 is dedicated to stating three auxiliary results on which the proof of one of the theorems is based and whose proofs are referred to [23]. Section 5 is dedicated to the proof of Theorem 1. Finally, in Section 6, some conclusions and some future directions concerning open problems are briefly stated.
2. A Fundamental Theorem
Suppose an interval in is given, in which we have functions , , , , and . Set
where and . If , then and we are not concerned with statements regarding . Similarly, if , then , and we are not concerned with statements regarding .
Let be a sequence of measurable functions and let be a measurable function. We shall denote uniform convergence of to by . Similarly, strong convergence in by and weak convergence in by .
We are going to assume throughout the article that L, f and have first and second continuous derivatives with respect to x and u on . Moreover, we shall suppose that the functions l and are of class on .
Let be the space of all absolutely continuous functions mapping T to and .
Define , and keep in mind that the notation means any member . The parametric optimal control problem we shall be concerned with, denoted by (P), consists of minimizing a functional of the form
over all satisfying the constraints
The elements (* means transpose) are called parameters, the members in are called processes, and a process is feasible or admissible if it satisfies the constraints. The notation means a member .
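For orientation only, a parametric problem of Bolza of the kind described above is commonly written in the following generic shape. This is a sketch under stated assumptions and not the statement of (P) itself: the names L, f and l are taken from the text, while the endpoint maps \Psi_0, \Psi_1, the constraint functions \varphi, the index sets R and S, and the endpoints t_0, t_1 of T are hypothetical labels introduced here for illustration.

```latex
% Generic parametric Bolza problem (illustrative; symbols other than L, f, l are assumed names)
\begin{aligned}
&\text{minimize } && I(x,u,b) = l(b) + \int_{t_0}^{t_1} L\bigl(t,x(t),u(t)\bigr)\,dt \\
&\text{subject to } && \dot{x}(t) = f\bigl(t,x(t),u(t)\bigr) \quad \text{a.e. in } T=[t_0,t_1], \\
& && x(t_0) = \Psi_0(b), \qquad x(t_1) = \Psi_1(b), \\
& && \varphi_\alpha\bigl(t,x(t),u(t)\bigr) \le 0 \ (\alpha \in R), \qquad
     \varphi_\beta\bigl(t,x(t),u(t)\bigr) = 0 \ (\beta \in S).
\end{aligned}
```

In this generic shape the parameter b enters only through the endpoint cost and the endpoint maps, which is what allows free initial and final states to be encoded by a parametrization.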
The following notation will allow us to introduce the main results of this section.
• A process is a solution of if it is feasible and for all feasible processes . A feasible process is a weak minimum of (P) if it is a minimum of I with respect to the norm
that is, if, for some , for all feasible processes satisfying . In other words, if I affords a weak minimum at , then, if is admissible and it is sufficiently close to , in the sense that the quantities , and are sufficiently small, then .
• For all , define the Hamiltonian of the problem by
Given and set, for all ,
and let
• The second variation of J along in the direction , is given by
where for all ,
and the notation refers to any element in . In addition, is the second derivative of l evaluated at b.
• Define
• Given and , set
where
• For all and all , define
where
Finally, given , denote by
the set of active indices of with respect to the inequality restrictions. For all , let be the cone of all verifying
The set is the cone of critical directions with respect to , and the symbol means the derivative of .
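The notion of active indices, together with the sign and complementarity requirements on the multipliers described in the Introduction, can be illustrated numerically. The following is a minimal sketch, assuming scalar state and control, a finite time grid, and a numerical tolerance; the function names `active_indices`, `count_switches` and `multipliers_ok`, and the two amplitude constraints in the example, are hypothetical and do not come from the article. A grid check of this kind is only a sanity test and cannot prove that the active set is piecewise constant.

```python
from typing import Callable, FrozenSet, List, Sequence

# A mixed constraint is modelled as a function phi(t, x, u) -> float,
# with the inequality convention phi(t, x, u) <= 0 (an assumption of this sketch).
Constraint = Callable[[float, float, float], float]


def active_indices(t: float, x: float, u: float,
                   inequalities: Sequence[Constraint],
                   tol: float = 1e-9) -> FrozenSet[int]:
    """Indices alpha whose inequality constraint holds with equality (up to tol)."""
    return frozenset(i for i, phi in enumerate(inequalities)
                     if abs(phi(t, x, u)) <= tol)


def count_switches(active_sets: Sequence[FrozenSet[int]]) -> int:
    """Number of changes of the active set between consecutive grid points."""
    return sum(1 for a, b in zip(active_sets, active_sets[1:]) if a != b)


def multipliers_ok(mu: Sequence[float], active: FrozenSet[int],
                   tol: float = 1e-9) -> bool:
    """Check mu_alpha >= 0 for all alpha and mu_alpha = 0 for inactive alpha."""
    return all(m >= -tol and (i in active or abs(m) <= tol)
               for i, m in enumerate(mu))


# Hypothetical example: amplitude constraints 0 <= u <= 1 written as phi <= 0.
constraints: List[Constraint] = [
    lambda t, x, u: u - 1.0,   # index 0: u <= 1
    lambda t, x, u: -u,        # index 1: u >= 0
]
control = lambda t: 1.0 if t < 0.5 else 0.5   # discontinuous but measurable
grid = [i / 10 for i in range(11)]
sets = [active_indices(t, 0.0, control(t), constraints) for t in grid]
switches = count_switches(sets)
```

In this example the upper bound is active on the first half of the grid and no constraint is active afterwards, so the active set changes exactly once.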
Theorem 1 below is a crucial tool in order to obtain Corollary 1, the latter being the main result of the article. Theorem 1 gives sufficiency for weak minima of problem (P). Hypothesis (i) of Theorem 1 is the transversality condition, hypothesis (ii) is an inequality relation that was found during the original proof of Theorem 1, hypothesis (iii) is a modified version of the Legendre–Clebsch condition, hypothesis (iv) is the positivity of a quadratic integral on the cone of critical directions, and hypothesis (v) involves a Weierstrass integral inequality hypothesis. A remarkable component of Theorem 1 concerns the fact that the optimal control is not necessarily continuous but only measurable. The notation, is the second derivative of along in the direction .
Theorem 1.
Let be a feasible process. Suppose that is piecewise constant on T, that there exist , with , and , such that
and the following is verified
- (i)
- .
- (ii)
- .
- (iii)
- .
- (iv)
- for all , .
- (v)
- For all feasible with , .
Then, for some and all admissible processes satisfying ,
In particular, is a weak minimum of (P).
3. The Main Result
Suppose that we have an interval in , two sets and functions , , and . Set
where and . If , then and we are not concerned with statements regarding . Similarly, if , then and we are not concerned with statements regarding .
We are going to assume throughout this section that , g and have first and second continuous derivatives with respect to x and u on . Additionally, we suppose that the function ℓ is of class on .
Set , where as usual, denotes the space of absolutely continuous functions mapping T to and .
In this section, we shall be concerned with the nonparametric optimal control problem, denoted by , of minimizing the functional
over all pairs satisfying the restrictions
Members in A are called processes, and a process is feasible if it satisfies the restrictions.
A process solves if it is feasible and for all feasible processes . A feasible process is a weak minimum of if it is a minimum of relative to the essential supremum norm, that is, if for some , for all feasible processes satisfying .
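The closeness requirement in the definition of a weak minimum can be made concrete on sampled data. The sketch below assumes scalar-valued state and control given as Python callables and combines the two sup-distances with a maximum; whether the article's norm uses a maximum or a sum is not recoverable here, so that choice is an assumption, as are the names `sup_distance` and `in_weak_neighborhood`. A finite grid only yields a lower bound for a supremum, and an essential supremum ignores sets of measure zero, which sampling cannot detect.

```python
from typing import Callable, Sequence

Func = Callable[[float], float]


def sup_distance(x: Func, u: Func, x0: Func, u0: Func,
                 grid: Sequence[float]) -> float:
    """Grid approximation of max(sup|x - x0|, sup|u - u0|)."""
    dx = max(abs(x(t) - x0(t)) for t in grid)
    du = max(abs(u(t) - u0(t)) for t in grid)
    return max(dx, du)


def in_weak_neighborhood(x: Func, u: Func, x0: Func, u0: Func,
                         grid: Sequence[float], eps: float) -> bool:
    """True if the sampled distance between the two processes is below eps."""
    return sup_distance(x, u, x0, u0, grid) < eps


# Hypothetical processes on T = [0, 1].
grid = [i / 10 for i in range(11)]
x0 = lambda t: 0.0
u0 = lambda t: 0.0
x = lambda t: 0.1 * t
u = lambda t: 0.05
d = sup_distance(x, u, x0, u0, grid)
```

Both the state and the control must be uniformly close to the reference process for the comparison to count, which is what distinguishes a weak minimum from a strong one.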
Let be any function of class such that . Relate the nonparametric optimal control problem with the parametric optimal control problem given in Section 2, denoted by , that is, is the parametric problem defined in Section 2, with the following data; , , , , and the components of , that is, .
Lemma 1.
The following is verified:
- (i)
- is a feasible process of if and only if is a feasible process of and .
- (ii)
- If is a feasible process of , then
- (iii)
- If solves , then solves .
Proof.
This is precisely Lemma 1 of [23]. □
Corollary 1 below is an immediate consequence of Theorem 1 and Lemma 1. It gives sufficiency conditions of problem . Once again, it is worthwhile observing that the optimal control is not necessarily continuous but only measurable.
Corollary 1.
Let be any function of class such that and let be the parametric optimal control problem posed before enunciating Lemma 1. Let be a feasible process of . Suppose that is piecewise constant on T, there exist , with , and , , such that
and the following is verified:
- (i)
- .
- (ii)
- .
- (iii)
- .
- (iv)
- for all , .
- (v)
- For all feasible with , .
Then, is a weak minimum of .
Remark 1.
It is worth observing that our sufficiency theory can also be applied to isoperimetric problems of Bolza with inequality and equality constraints.
In order to illustrate this fact for the nonparametric problem studied in this section, let be functions of class in . In addition, let be functions having first and second continuous derivatives with respect to x and u on , and consider the isoperimetric nonparametric optimal control problem of minimizing
subject to
Additionally, set ,
, , where and . Here, in , there are k intervals and singletons . In , there are K singletons . Remark 1 follows from the fact that the isoperimetric optimal control problem stated above is equivalent to the nonparametric optimal control problem of minimizing
subject to
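A standard device for reducing isoperimetric constraints to endpoint constraints is to augment the state with one extra coordinate per integral, whose derivative is the corresponding integrand and whose initial value is zero; the integral constraint then becomes a condition on the final value of that coordinate. The sketch below illustrates this device with an explicit Euler scheme. It is an illustration under assumptions and may differ in detail from the reduction used in Remark 1: the names `integrate_augmented` and `endpoint_ok`, the scalar state, and the sample dynamics are all hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Dyn = Callable[[float, float, float], float]


def integrate_augmented(f: Dyn, integrands: Sequence[Dyn],
                        u: Callable[[float], float],
                        x_start: float, t0: float, t1: float,
                        steps: int = 1000) -> Tuple[float, List[float]]:
    """Explicit Euler for x' = f(t, x, u) and y_i' = integrand_i(t, x, u), y_i(t0) = 0."""
    h = (t1 - t0) / steps
    x = x_start
    y = [0.0] * len(integrands)
    for i in range(steps):
        t = t0 + i * h
        ut = u(t)
        dx = f(t, x, ut)
        dy = [g(t, x, ut) for g in integrands]  # evaluated before x is updated
        x += h * dx
        y = [yi + h * di for yi, di in zip(y, dy)]
    return x, y


def endpoint_ok(y_final: Sequence[float], k: int, tol: float = 1e-6) -> bool:
    """First k coordinates must be <= 0 (inequalities), the rest must vanish (equalities)."""
    return (all(v <= tol for v in y_final[:k])
            and all(abs(v) <= tol for v in y_final[k:]))


# Hypothetical data: x' = u, u(t) = t on [0, 1], one integrand u^2.
x_end, y_end = integrate_augmented(
    f=lambda t, x, u: u,
    integrands=[lambda t, x, u: u * u],
    u=lambda t: t,
    x_start=0.0, t0=0.0, t1=1.0,
)
```

Here the final value of the extra coordinate approximates the integral of u^2 over [0, 1], so an isoperimetric condition on that integral can be read off as a condition on the augmented final state.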
Example 1 below shows how Corollary 1 can be applied. In the former, an inequality-equality constrained optimal control problem is solved by verifying that the first order sufficiency conditions
are satisfied by an element . In addition, satisfies hypotheses (i), (ii), (iii), (iv) and (v) of Corollary 1, and hence it is a weak minimum of .
Example 1.
Consider the nonparametric optimal control problem of minimizing
over all satisfying the constraints
where
For this example, the data of the nonparametric problem are given by , , , , , , , , , , and .
It is straightforwardly verified that the functions , g, and their first and second derivatives with respect to x and u are continuous on . Moreover, the function ℓ is in .
Additionally, it is evident that the process is admissible of . Let be defined by . Clearly, is in and . The related parametric problem denoted by has the following data; , , , , , and , the components of , that is, with and .
Note that, if we set , then is feasible for . In addition, clearly, is constant on T. Let , and note that , and .
Now,
and note that
It is straightforwardly verified that, for all ,
and then satisfies the first order sufficiency hypotheses of Corollary 1. As , , , then
and hence hypothesis (i) of Corollary 1 is satisfied. In addition, one can easily verify that,
and so hypothesis (ii) of Corollary 1 is fulfilled.
Now, for all ,
and so, for all ,
which in turn implies that satisfies hypothesis (iii) of Corollary 1.
In addition, observe that, for all ,
Thus, is given by all satisfying
Moreover, note that, for all ,
and, for all ,
Therefore, for all ,
Consequently,
for all , , and so hypothesis (iv) of Corollary 1 is satisfied.
Now, observe that, if is feasible, for all ,
Therefore, if is feasible,
In addition, if is feasible,
Accordingly, for any and for any feasible with ,
Therefore, hypothesis (v) of Corollary 1 is fulfilled for any and . By Corollary 1, is a weak minimum of .
Remark 2.
The reader can find a concrete example concerning the existence of a purely measurable optimal control in which one of its components satisfies a classical type of amplitude constraints on the controls u.
Indeed, see Example 1 of [23], where one can find a concrete optimal control with
and the feasible controls satisfying the amplitude constraints
Remark 3.
It would be of interest to see how the references quoted in this article or even the sufficiency theory presented in this paper can be generalized to the more complicated situation of the discrete-time case. See, for instance, [24], where time is measured in days in order to introduce a mathematical model describing the outbreak of SARS-CoV-2 in Ireland in March–May 2020. In the above reference, the optimal control treatment appeals to piecewise constant controls and state constraints for which a theoretical analysis is not amenable, and hence a numerical approach is studied. It is worth mentioning that the optimal control model mentioned above saved lives and minimized the economic costs of the pharmaceutical interventions.
4. Auxiliary Lemmas
Now, we state three auxiliary lemmas which are going to be useful in order to prove Theorem 1. The proofs of these results are included in the proofs of Lemmas 2, 3, and 4 of [23], respectively. From now on, we are not going to relabel the subsequences of a given sequence since this fact will not modify our results.
Throughout this section, we shall assume that we are given an element and a sequence in such that
For all , define
For all , define
where
Lemma 2.
For some and some subsequence of , on T.
Lemma 3.
There exist , , and a subsequence of , such that on T. Moreover, if for all , , then on T.
Lemma 4.
Suppose that on T. Let , assume that on T, , and let be as in Lemma 2. Then,
5. Proof of Theorem 1
The proof of Theorem 1 will be split up into two lemmas. In Lemmas 5 and 6 below, we assume that all the hypotheses of Theorem 1 are satisfied. Before stating the lemmas, let us introduce some definitions.
First, note that given and , if we define by and , then
Define by
Note that the Weierstrass function of is defined by
As one readily verifies, for all and all ,
Define
We have that for all , and
where
and , are given by
We have
where
Lemma 5.
If the conclusion of Theorem 1 is false, then there exists a subsequence of feasible processes such that
Proof.
If the conclusion of Theorem 1 is false, then for all , there exists a feasible process such that
As
if is feasible, then . In addition, since
then . Therefore, (3) implies that, for all , there exists feasible with
Thus, if the conclusion of Theorem 1 is false, then, for all , there exists feasible such that
Clearly, the first relation in (4) implies that
In addition, if and only if . Then, by the second relation of (4),
Suppose for infinitely many q’s. For , we have
Denoting by the line segment in joining the points and , by the second relation of (4), by condition (i) of Theorem 1, by (6), and the mean value theorem, there exists such that
Choose an appropriate subsequence of , such that
for some with . By (5),
By (7) and (8) and condition (ii) of Theorem 1, it follows that
which contradicts (iv) of Theorem 1. Therefore, we may assume that, for all ,
□
Lemma 6.
If the conclusion of Theorem 1 is false, then hypothesis (iv) of Theorem 1 is false.
Proof.
Let be the sequence of feasible processes given in Lemma 5. Then,
Case (1): First, suppose that the sequence is bounded in .
For all , define
By Lemma 2, there exist and a subsequence of , such that on T. By Lemma 3, there exist , , and a subsequence of , such that if , then
Since the sequence is bounded in , then we may assume that there exists some such that
First, we are going to show that for ,
Observe that, for and all , we have that
By (9), (10) and (12), we obtain (11). Now, we claim that
To prove it, observe that by (2), (9) and (10),
both on T. This fact, combined with Lemma 2, implies that
Since satisfies the conditions
and by hypothesis (i) of Theorem 1, we have that
Consequently, by (1), the fact that
(15) and condition (ii) of Theorem 1,
Now, for all and ,
where
As one readily verifies,
By hypothesis (iii) of Theorem 1, we have
For all , define
where
By the fact that
and the admissibility of , on T. With this in mind, by (17) and Lemma 4,
By (16) and (18), we have
Now, let us show that . By (16), condition (v) of Theorem 1 and the fact that for all ,
With this in mind and (14), if we suppose that , then would not be positive, which is not the case, and this establishes (13). Now, let us prove that
Note that, for all ,
where
As
all on T, it follows that
By Lemma 3, on T. Then, (19) is verified.
Now, we claim that
- i.
- .
- ii.
- .
As one can easily verify, (i) and (ii) above are obtained if one simply copies the proofs from (27) to (29) of [25].
Consequently, from (11), (19), (i) and (ii), above, it follows that . This fact together with (13) contradict hypothesis (iv) of Theorem 1.
Case (2): Now, assume that the sequence is unbounded. Then,
In this case, if one copies the proofs from (31) to (38) of [25], then one obtains that for some with ,
- a.
- .
- b.
Consequently, (a) and (b) above contradict hypothesis (iv) of Theorem 1. □
6. Conclusions
In this paper, we have obtained two sufficiency theorems for a parametric and a nonparametric problem of Bolza having nonlinear dynamics, variable initial and final states and nonlinear inequality and equality mixed time-state-control constraints. The proposed optimal controls need not be continuous but only purely measurable. In the nonparametric problem, the initial and final states are not only variable but also completely free, all the crucial sufficiency hypotheses are included in the theorems and, in the parametric sufficiency theorem, the deviation around the optimal cost can be measured by a proportion of a function involving several functionals playing roles similar to the square of some norms. The algorithm of sufficiency used to prove one of the main results of the article is self-contained in the sense that it is independent of classical techniques used to obtain sufficiency for problems having a degree of generality similar to the one studied in this work. On the other hand, some future directions of research can be visualized by applying this method of sufficiency. Concretely, we conjecture that Corollary 1 can be proved directly, that is, without invoking the theorem of the parametric problem. The only issue that must be addressed is that the sets appearing in the corollary should be manifolds determined by some functions , satisfying the relations . We also conjecture that a parallel version of Corollary 1 can be derived directly with the sets being arbitrary, that is, not necessarily subsets of any manifold. Once again, another issue that possibly arises is that we have to diminish the class of admissible processes by requiring that the strategies be of class instead of being absolutely continuous on the underlying interval of time.
Funding
This research was funded by Dirección General de Asuntos del Personal Académico, DGAPA-UNAM, by the project PAPIIT-IN102220.
Data Availability Statement
Not applicable.
Acknowledgments
The author thanks the Dirección General de Asuntos del Personal Académico, Universidad Nacional Autónoma de México, for the financial support provided by the project PAPIIT-IN102220. Additionally, the author thanks the three anonymous referees for the encouraging comments made in their reviews.
Conflicts of Interest
The author declares no conflict of interest.
References
- Malanowski, K. Sufficient optimality conditions for optimal control subject to state constraints. SIAM J. Control Optim. 1997, 35, 205–227.
- Malanowski, K.; Maurer, H. Sensitivity analysis for parametric control problems with control-state constraints. Comput. Optim. Appl. 1996, 5, 253–283.
- Malanowski, K.; Maurer, H.; Pickenhain, S. Second order sufficient conditions for state-constrained optimal control problems. J. Optim. Theory Appl. 2004, 123, 595–617.
- Maurer, H. First and second order sufficient optimality conditions in mathematical programming and optimal control. In Mathematical Programming at Oberwolfach; Springer: Berlin/Heidelberg, Germany, 1981; Volume 14, pp. 163–177.
- Maurer, H. Sufficient conditions and sensitivity analysis for economic control problems. Ann. Oper. Res. 1999, 88, 3–14.
- Maurer, H.; Oberle, H.J. Second order sufficient conditions for optimal control problems with free final time: The Riccati approach. SIAM J. Control Optim. 2002, 41, 380–403.
- Maurer, H.; Pickenhain, S. Second order sufficient conditions for control problems with mixed control-state constraints. J. Optim. Theory Appl. 1995, 86, 649–667.
- Loewen, P.D. Second-order sufficiency criteria and local convexity for equivalent problems in the calculus of variations. J. Math. Anal. Appl. 1990, 146, 512–522.
- Rosenblueth, J.F. Variational conditions and conjugate points for the fixed-endpoint control problem. IMA J. Math. Control Inf. 1999, 16, 147–163.
- Osmolovskii, N.P. Second order sufficient conditions for an extremum in optimal control. Control Cybern. 2002, 31, 803–831.
- Osmolovskii, N.P. Second-order sufficient optimality conditions for control problems with linearly independent gradients of control constraints. ESAIM Control Optim. Calc. Var. 2012, 18, 452–482.
- Stefani, G.; Zezza, P.L. Optimality conditions for a constrained optimal control problem. SIAM J. Control Optim. 1996, 34, 635–659.
- Hestenes, M.R. Calculus of Variations and Optimal Control Theory; John Wiley: New York, NY, USA, 1966.
- Milyutin, A.A.; Osmolovskii, N.P. Calculus of Variations and Optimal Control; American Mathematical Society: Providence, RI, USA, 1998.
- Stefani, G.; Zezza, P.L. Constrained regular LQ-control problems. SIAM J. Control Optim. 1997, 35, 876–900.
- Maurer, H.; Pesch, H.J. Solution differentiability for parametric nonlinear control problems with control-state constraints. J. Optim. Theory Appl. 1995, 86, 285–309.
- Malanowski, K. Two norm approach in stability and sensitivity analysis of optimization and optimal control problems. Adv. Math. Sci. Appl. 1993, 2, 397–443.
- Cortez, K.L.; Rosenblueth, J.F. Normality and uniqueness of Lagrange multipliers. Discret. Contin. Dyn. Syst. 2018, 38, 3169–3188.
- Cortez, K.L.; Rosenblueth, J.F. The broken link between normality and regularity in the calculus of variations. Syst. Control Lett. 2019, 124, 27–32.
- Rosário de Pinho, M.D.; Rosenblueth, J.F. Mixed constraints in optimal control: An implicit function theorem approach. IMA J. Math. Control Inf. 2007, 24, 197–218.
- Alt, W.; Felgenhauer, U.; Seydenschwanz, M. Euler discretization for a class of nonlinear optimal control problems with control appearing linearly. Comput. Optim. Appl. 2018, 69, 825–856.
- Osmolovskii, N.P.; Veliov, V.M. Metric sub-regularity in optimal control of affine problems with free end state. ESAIM Control Optim. Calc. Var. 2019.
- Sánchez Licea, G. Sufficiency for purely essentially bounded optimal controls. Symmetry 2020, 12, 238.
- Ó Náraigh, L.; Byrne, Á. Piecewise-constant optimal control strategies for controlling the outbreak of COVID-19 in the Irish population. Math. Biosci. 2020, 330, 108496.
- Sánchez Licea, G. Sufficiency for singular trajectories in the calculus of variations. AIMS Math. 2019, 5, 111–139.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.
© 2021 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).