Abstract
The time-optimal control problem for a system consisting of two non-synchronous oscillators is considered. Each oscillator is controlled with a shared limited scalar control. The objective of the control is to accelerate the oscillatory system to a given specific position, where the first oscillator must have non-zero phase coordinates, but the second one must remain motionless at the terminal moment. For an arbitrary number of unknown switching moments that determine the optimal relay control, the necessary extremum conditions in the form of nonlinear matrix equalities are proposed. The study of the necessary/sufficient conditions of the extremum made it possible to describe the reachability set in the phase space of the first oscillator, to find an analytical form of the curve corresponding to the two-switching control class, which also separates the reachability set of the three switching-control class. The corresponding theorems are proved and the dependence of the criteria on control constraints is shown. Matrix conditions for different classes of control switchings are found. All of the obtained analytical results are numerically validated and illustrated with mathematical modeling.
Keywords:
optimal control; harmonic oscillator; Pontryagin maximum principle; limited scalar control MSC:
49N05; 49K30
1. Introduction
Problems surrounding the lack of control resources (when the dimensions of a control vector are less than the dimensions of the state space of a physically-controlled system) have widespread applications in practice. Oscillatory systems, such as electric circuits [1], mechanical systems [2,3], quantum oscillators [4,5], systems involving physical nature [6,7], and controlled with one external force, are instances of such systems. Using the oscillatory system, it is possible to investigate periodic solutions and the stability of the model of two coupled oscillators in the optics of chiral molecules [6], as well as consider the behaviors of biologically complex mechanisms [7]. Increasing the number of oscillators in these models to sufficiently large N makes it possible to study (analytically and numerically) the macro-parameters and properties of similar systems [8,9].
Thus, the Timoshenko–Ehrenfest beam motion control, which is modeled with a system of N oscillators that perform forced oscillations under the actions of scalar and limited control forces, was considered in work [2]. The time-optimal problem for a platform with attached oscillators was investigated in [3]. The maximum variation of the oscillation energy of the oscillator system for a given time was examined in [10]. Due to the difficulties in finding an analytical solution for the control problems, it is of interest to asymptotically synthesize the optimal control of a system of an arbitrary number of linear oscillators (with general limited control), as obtained in [11].
Optimal control problems were investigated using the Pontryagin maximum principle, which was formulated during the seminars on the theory of oscillations and automatic control by L.S. Pontryagin and M.A. Aizerman in the 1950s, where the problem of optimal performance was considered [12]. For systems considered in [1,5], it is often necessary for one of the subsystems to get into the required position as quickly as possible, while the others should remain at rest at the terminal moment. Thus, for a single oscillator, V.G. Boltyansky obtained the synthesis of optimal control in the time-optimal problem [13]. The optimal control problems for the systems of many oscillators with various criteria have already been studied by F.L. Chernousko in [14], where a significant complexity of obtaining an analytical solution was noted, and the problem involving two oscillators was solved for a very special case of their frequency ratio.
In this paper, we review the time-optimal control problem for two non-synchronous oscillators controlled by limited scalar impacts with initial zero-phase coordinates but different terminal ones, which are features of the problem statement that considerably complicate it.
One of the key aspects of such control problems is the study of the accessibility and controllability of the oscillatory system [15]. It can be shown with the methods of geometric control theory [16], namely: the Sussman–Djurdjevic and the Poincare theorems, as well as with the results of the classical optimal control theory, the La Salle–Conti theorem [17].
This article is structured as follows. In Section 2, the optimal control problem is formulated. In Section 3, the optimal control law is presented using the Pontryagin maximum principle. Section 4 examines the necessary and sufficient optimality conditions for different classes of control switchings. Section 5 discusses the continuity of the problem criterion on the control parameter. Section 6 illustrates the obtained results, in particular, the mathematical modeling of the set of reachability of the first oscillator, the criterion of the problem, and the time intervals of the switching controls. In the conclusion, we suggest directions for further research.
2. Optimal Control Problem Statement
A system consisting of two non-synchronous oscillators, i.e., having different natural frequencies , was considered. The control impacts the momenta of both oscillators, but not their coordinates . The motion of this linear control system is described with the following system of equations:
The system (1) can be rewritten in matrix form , where
The number of non-zero components of the control vector B is less than the dimensionality of the system; moreover, the control is limited by a value , which is a parameter of the problem:
At the initial moment, the phase states of both oscillators coincide with the origins of the phase coordinate planes, i.e., their coordinates and momenta are equal to zero. At the terminal moment, the phase coordinates of the second oscillator become zero again, while the first oscillator moves to the state , which are also parameters of the problem. The explicit forms of the initial and terminal conditions are
where is a terminal moment and T defines a transpose operation.
The total time of the system’s movement from the initial state to the terminal one is chosen as the criterion of the problem and needs to be minimized:
3. Solution of the Optimal Control Problem
To investigate and solve problems (1)–(5), the Pontryagin maximum principle (PMP) is used. The maximum principles in the canonical coordinates consist of the following statements:
- The Hamiltonian for the optimal control problem is expressed aswhere are adjoint variables and is constant.
- A Hamiltonian system, including motion equations and an adjoint system of equations:
- Maximum condition
Transversality conditions are absent because the optimal problem statement involves fixed endpoints.
The Hamiltonian (8) is linear with respect to and, thus, the optimal control is given by the bang–bang control: for and for , which can be written as
where is a switching function. The adjoint system can be rewritten in the following form
From the system (11) and can be found:
where is a constant vector. Combining the expressions (9) and (12), an explicit form of the optimal control law can be obtained
This function is uniquely defined and cannot be equal to zero on the whole interval, perhaps with the exception of the isolated points, which lead to the absence of special control modes.
4. The Necessary Extremum Conditions
For the relay control (13), two values are defined: the moment of switching , , when the control changes signs and the duration of control intervals . Here, a further designation , which means “”, is used for convenience. The optimal control with switchings and K control intervals is illustrated in Figure 1:
Figure 1.
Control intervals for optimal control .
Definition 1.
The term “-switching control class” defines an optimal solution where the optimal control has exactly switchings.
The solution to the system of motion Equation (1) with boundary conditions (3) and (4) has a general form
The structure of optimal control (13) allows to write a solution of the system that takes into account not only the number of switchings , but also the sign of the control at the first interval.
where an additional parameter k equal to 0 or 1 corresponds to the initial control and , respectively.
Remark 1.
A system (15) consisting of four equations can be used to study the three-switching control class, where four control intervals need to be determined.
The two-switching control class occurs as degeneration of other classes. For example, when the outermost interval is zeroed in on a three-switching control class, or the internal interval is zeroed in on a four-switching control class.
Lemma 1.
In the two-switching control class, the functional dependencies for the duration of the control intervals , , are valid
The proof of Lemma 1 is given in Appendix A.
To investigate control classes with more than three switchings, the necessary extremum conditions are introduced.
Theorem 1
Proof.
The switching function (10) that determines the optimal control (13) becomes zero at moments , when control switchings occur:
Equation (18) form the system consisting of equations
Remark 2.
In the case when , the condition of the existence of non-zero vector C has the form
which is equivalent to the following equation
If , there are (number of permutations) of the non-zero conditions for the vector C
Remark 3.
Theorem 1 allows finding all of the required switching moments that define vector C as the kernel of the corresponding linear mapping in (19).
This implies another necessary condition of the extremum, which is formulated in the form of the following lemma.
Lemma 2.
The optimal control with vector C obtained from (19) contains exactly zeros and, according to Definition 1, corresponds to the -switching control class.
In the next section, a theorem considering the continuity of the problem criterion from the parameter is formulated and proved.
5. Continuity of
To find sufficient conditions for the continuity of a function , the implicit function theorem is applied. Let us write out the Jacobian of a system consisting of equations of motion (15) and additional conditions (16) obtained by Theorem 1 on the necessary extremum conditions.
The equations of the joint system are denoted by , where relate to the equations of motion (15), and are responsible for the non-degeneracy conditions (20). The Jacobian corresponds to the -switching control class and consists of partial derivatives of K functions with K variables.
Let us transform the above determinant, leading the Jacobian to a simpler form. The determinant of the system will not change if the previous column is subtracted from each column, except for the first.
After multiplying the first and third rows of the resulting matrix by −1 and then replacing the sign for each even column, the transformed Jacobian has the following form
The Jacobian, according to Equation (25), for the three-switching control class is written as follows
The expression (26) has an explicit form
Expressions (21) and (26) can be compared, or rather their explicit forms (22) and (27), respectively. It turns out that with an index shift by one at , the functional parts of these expressions are equal. Expression (26) includes four control intervals in the three-switching class and Expression (21) includes five intervals in the four-switching class, where the first control interval becomes zero at the boundary value , and that is how the transition into the three-switching class occurs.
This result is derived from several interrelated observations. The first four lines of the Jacobian (25), ordered by cos, sin, and , , which obviously do not change the sign of the determinants, can be written as
and then converted to the form
The matrix includes the matrix
where the notation is introduced. For the three-switching control class , the equality is fulfilled
The last equality follows from the next lemma.
Lemma 3.
If a square matrix A of size is composed of M blocks of matrices of size , such that , and a square matrix B of size is made up of M blocks of the product of matrices , where are square matrices of size , then
Proof.
Matrix B can be written as the product of matrices , where matrix A has the form
and the auxiliary matrix containing has the form
Due to the multiplicativity of the determinant and the properties of the block–diagonal matrix, the determinant of matrix B has the form
□
According to Lemma 3, the corresponding matrices for the three-switching control class are equal
Moreover, , so the equality (31) is established.
One can see that for the three-switching control class, the condition is possible only if the necessary conditions of the extremum (23) and the equality for for the four-switching control class are fulfilled simultaneously.
The Jacobian (25) can be written via the determinants of the square matrices of size , composed of of the matrix .
The expression (35) is a decomposition of (25) by the last rows, while the first four rows are divided into corresponding minors.
For further narration, it is necessary to write down the implicit function theorem [19] for the case of the following system of equations.
which is solved with respect to the variables . The following system of functional relations (locally equivalent to system (36)) is studied.
The following designations are introduced:
A square matrix is reversible only when its determinant is non-zero.
Theorem 2
(The implicit function theorem as in [19]). If a mapping , defined in the neighborhood U of a point is such that the following requirements are met
- ,
- − reversible matrix,
then there exists a -dimensional interval where
and a mapping , such that for any point the following condition holds
and
Now, it is possible to prove the continuity, not only of but also of the criterion by the implicit function theorem applied to our problem.
Lemma 4.
For continuity of the functions for -switching control class, the following condition is required
Proof.
The implicit function theorem 2 is applied for the considered problem. Let , then
where, as before, refer to the equations of the systems (15) and (20). For the sake of the reader’s comprehension, the arguments of the functions are omitted.
The first condition of Theorem 2 is satisfied if the parameter is non-zero. For the solution of the optimal control problem, taking into account the term of Theorem 1, the second condition of Theorem 2 is fulfilled. The reversibility of requires the condition which is equivalent to . All conditions of the implicit function theorem are satisfied, so there is a -dimensional interval and continuous mapping f on this interval.
Consequently, the durations of the control intervals are continuous functions of . Since , then is a continuous function of too. □
Lemma 5
(Sufficient conditions of continuity of ). is continuous if the non-zero conditions of (23) are met in the -switching class.
Proof.
We investigate the Jacobian , which we will rewrite using the formula (35) in a form
where the total sum is divided into two parts. The set of indexes always contains a null time element that corresponds to . corresponds to all other sets without a null time element. The fulfillment of the conditions (23) leads to zeroing of the second sum in (41), due to the fact that each summand is zero. Then, it holds
Due to the linear dependence of any set consisting of four different by the condition (23), any vector except can be written as a linear combination of . Then (42) can be expressed as follows
where combines both the coefficients and the coefficients of expansion by .
Further, two cases are possible, namely , when is continuous by Lemma 4, or . In this case, if , then are linearly dependent. Moreover, if , then due to and the presence in the first column of only the first and third elements other than zero, it follows that the first and third rows match. Which again entails a linear dependence of . This means that the necessary conditions of the extremum include corresponding to and increasing the number of control switchings by one, which changes the total number of control switchings from to K. Thus, the conditions of the lemma for the number of control switchings are violated, which means that . By Lemma 4, the function in the -switching control class is continuous. □
Corollary 1.
In a given control switching class, is a right-continuous function.
Remark 4.
Corollary 2.
Lemma 6.
is a continuous function for .
Proof.
Let two adjacent control classes consist of P and Q switchings, respectively. Indeed, due to Lemma 5 and Corollary 1 in a given class of control switchings, is continuous. Let us denote the value of the criterion in each of the classes with and at , where the upper index indicates the number of control switchings, and the lower index indicates the optimal value of control switchings. Next, for , let us denote the value of the criterion in the P-switching control class with their optimal number—Q for , and vice versa , the value of the criterion in the Q-switching control class with their optimal number is P and . Then obviously the inequalities are fulfilled, and . Again, let us use Lemma 5 and Corollary 1. So and are continuous on . Then . Thus, taking into account the above inequalities, it follows that . The statement of the lemma follows from the union of all intervals of control switchings. □
Continuity of means that is continuous in the phase plane of the first oscillator on terminal positions in each direction . Then, applying the equations of characteristics (7) according to [20], where exist, we obtain
The adjoint variables are bounded, including at , so the variation of the criterion will be quite small under small variations of the terminal positions .
6. Modeling
To illustrate the aforesaid theoretical results, numerical modeling for various configurations of the (1)–(5) problem were presented. All numerical experiments were conducted via Python scripts, which were developed specifically for this problem. In particular, the SciPy library was used to find solutions to the systems of nonlinear equations, and Matplotlib was used to visualize the obtained results.
Firstly, in Section 6.1, the motion of the system to state (with optimal control (13) for control classes with different switchings) was described. In these examples, the dependencies of the optimal number of control switchings, the criterion of the problem , and others on parameter , were investigated. In Section 6.2, a range of terminal states of the first oscillator was considered, but now with a fixed value of . In this case, all the values were considered to be functions of the parameters .
6.1. Fixed Terminal Position of the First Oscillator
Control classes with three, four, and five switchings are considered. For a range of values in each control class, the solutions that satisfied Theorem 1 and minimized the criterion (5) were found.
An important step is checking that the resulting system solutions form, according to Remark 3, the control law with the number of switchings that are initially set. In other words, Lemma 2 must be fulfilled. For example, an attempt to find a solution for a three-switching control class for results in switching function containing more than three switchings that do not satisfy Lemma 2. Figure 2 shows such an example for the parameter . The points obtained by Theorem 1 are highlighted in red. An extra zero of function is marked with a red cross. The constant vector , calculated according to Remark 3, defines the switching function , which is pictured with a blue line.
Figure 2.
Switching function for .
Lemma 2 is not satisfied in this case.
However, Lemma 2 will be fulfilled for , as illustrated in Figure 3. Moreover, the optimal control law defined by is shown.
Figure 3.
Switching function and optimal control for .
Vector C in this case: The phase portraits of the first and second oscillators satisfy the given boundary conditions (44) and are presented in Figure 4.
Figure 4.
The phase portraits of the first and second oscillators.
The dependence of the criterion on the constraint is shown in Figure 5. The blue, red, and black curves show solutions for three-, four-, and five-switching control classes, which are defined by the boundaries of each control class. Figure 5 shows that is a continuous function according to Lemma 6.
Figure 5.
Dependence .
Figure 6 illustrates the dependencies of the control interval durations from . The implicit function theorem shows that the durations of control intervals are continuous on inside each class, which can be observed in the graph. In area I, which corresponds to the five-switching class, the duration of the first control interval becomes zero when the boundary of the selected class is reached. Region II corresponds to the four-switching class, where the second interval of I becomes the first interval, and it also becomes zero when it reaches the boundary . The third interval from region I becomes the second interval in II and the first interval in the three-switching region (III). If one of the internal intervals of the four-switching class is nullified, then the transition to the two-switching class takes place.
Figure 6.
Dependencies .
Figure 7 shows the dependencies of the Jacobian for different switching control classes on .
Figure 7.
Dependence .
As one can see from the graph, when the class boundaries are reached, the Jacobians become equal to zero. Moreover, the Jacobian of the four-switching class is zero when passing to the two-switching class at one point. Lemma 1 is valid for the two-switching control class. Two-, three-, and four-switching control classes for different finite states of the first oscillator () and are presented next. As before, it was checked that the obtained solutions for each final state form, according to Remark 3, the control law with the appropriate number of switchings.
6.2. The Phase Plane of the First Oscillator for Different Control Classes
Different control classes are shown in Figure 8. The red and green areas correspond to the three- and four-switching control classes with initial controls , respectively. The reflections of these regions relative to the origins lead to blue and orange regions for the three- and four-switching control classes on the initial intervals , respectively. The three-switching control classes are separated from each other by parametric curves obtained by Lemma 1. This is due to the fact that nullifying the lateral control interval in the three-switching control class also leads to the two-switching control class (orange and red curves). Zeroing the internal control interval in the ’four-switchings’ results in curves (blue and green) corresponding to the two-switching control classes at different initial controls. The yellow and gray boundary curves were obtained by moving from the four-switching class to the three-switching class by zeroing out the last control interval, with the control at the initial interval coinciding with the initial controls in the regions on either side of the boundary. The white and purple curves correspond to the zeroing of the first control interval in the four-switching class, which means the transition to the three-switching. The simultaneous nullifying of the side interval of the control constancy in the four-switching class corresponds to the blue points where the curves obtained under the conditions or intersect.
Figure 8.
Terminal points of the phase plane of the first oscillator.
Figure 9 shows the dependence of the criterion on the phase coordinates of the first oscillator. Three- and four-switching control classes are highlighted with red and green points. The colors of all curves correspond to the description in Figure 8.
Figure 9.
The dependence of the criterion on the phase coordinates of the first oscillator .
7. Conclusions
Necessary and sufficient extremum conditions are proposed for the time-optimal control problem of a system consisting of two non-synchronous oscillators, and controlled by limited scalar impact. In addition, lemmas and theorems on the continuity of the criterion by the parameter are proposed. These results can be further extended to systems of any number of oscillators. Thus, a unified mathematical apparatus for describing the set of reachability of the first oscillator for systems of many oscillators will be developed in future works. The construction of control switching curves of the first oscillator in the phase plane will allow us to develop an approach to solving the problem of the synthesis of time-optimal control for multiple oscillators.
Author Contributions
Conceptualization, A.G. and P.L.; methodology, A.G.; software, L.B. and P.L.; validation, L.B. and A.G.; formal analysis, L.B. and A.G.; investigation, L.B., A.G. and P.L.; writing—original draft preparation, L.B. and A.G.; writing—review and editing, L.B., A.G. and P.L.; visualization, L.B.; supervision, A.G.; project administration, A.G.; funding acquisition, A.G. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by a grant to support youth scientific schools of ICS RAS “Methods for trajectories optimization of controlled objects”.
Institutional Review Board Statement
Not applicable.
Informed Consent Statement
Not applicable.
Data Availability Statement
Data sharing not applicable.
Conflicts of Interest
The authors declare no conflict of interest.
Appendix A
Proof
(Proof of Lemma 1.).
From dynamic Equation (7) for any terminal position of the first oscillator and of the next equation system is valid
Let us consider separately the last two equations of the system (A1) in the follow- ing form
The first statement of Lemma 1 follows from (A4).
In the next step, the last two equations of the system (A1) are squared, summed, and then transformed. Moreover, it is noticed that and according to lemma condition.
From this follows the required equality. Lemma 1 is proved. □
References
- Mirzaei, A.; Darabi, H. Mutual pulling between two oscillators. IEEE J. Solid-State Circuits 2014, 49, 360–372. [Google Scholar] [CrossRef]
- Salobutina, E. Regimes of more and more frequent switchings in the optimal control problem of oscillations of n oscillators. J. Math. Sci. 2008, 151, 3603–3610. [Google Scholar] [CrossRef]
- Kayumov, O. Time-optimal movement of platform with oscillators. Mech. Solids 2021, 56, 1622–1637. [Google Scholar] [CrossRef]
- Ansel, Q.; Chepelianskii, A.D.; Lages, J. Enhancing quantum exchanges between two oscillators. arXiv 2022, arXiv:2207.11156v1. [Google Scholar]
- Zhao, Y.; Chen, G.H. Two oscillators in a dissipative bath. Phys. A Stat. Mech. Appl. 2003, 317, 13–40. [Google Scholar] [CrossRef]
- Li, J.; Chen, Y.; Zhao, S. Periodic solutions and stability analysis for two-coupled-oscillator structure in optics of chiral molecules. Mathematics 2022, 10, 1908. [Google Scholar] [CrossRef]
- Firippi, E.; Chaves, M. Period-control in a coupled system of two genetic oscillators for synthetic biology. IFAC-PapersOnLine 2019, 52, 70–75. [Google Scholar] [CrossRef]
- Galyaev, A.A. Scalar control of a group of free-running oscillators. Autom. Remote Control 2016, 77, 1511–1523. [Google Scholar] [CrossRef]
- Galyaev, A.A. Impact of a system of material points against an absolutely rigid obstacle: A model for its impulsive action. Autom. Remote Control 2006, 67, 856–867. [Google Scholar] [CrossRef]
- Andresen, B.; Salamon, P.; Hoffmann, K.H.; Tsirlin, A.M. Optimal processes for controllable oscillators. Autom. Remote Control 2018, 79, 2103–2113. [Google Scholar] [CrossRef]
- Fedorov, A.; Ovseevich, A. Asymptotic control theory for a system of linear oscillators. Mosc. Math. J. 2016, 16, 561–598. [Google Scholar] [CrossRef]
- Pesch, H.J.; Plail, M. The Maximum Principle of optimal control: A history of ingenious ideas and missed opportunities. Control Cybern. 2009, 38, 973–995. [Google Scholar]
- Boltyansky, V.G. Mathematical Methods of Optimal Control, 2nd ed.; Nauka: Moscow, Russia, 1969. [Google Scholar]
- Chernousko, F.L.; Akulenko, L.D.; Sokolov, B.N. Oscillation Control; Nauka: Moscow, Russia, 1980. [Google Scholar]
- Berlin, L.M.; Galyaev, A.A. Extremum conditions for constrained scalar control of two nonsynchronous oscillators in the time-optimal control problem. Dokl. Math. 2022, 106, 286–290. [Google Scholar] [CrossRef]
- Sachkov, Y.L. Introduction to Geometric Control. arXiv 2021, arXiv:1903.00211v2. [Google Scholar]
- La Salle, J.P. The Time Optimal Control Problem. In Contributions to the Theory of Nonlinear Oscillations; Princeton University Press: Princeton, NJ, USA, 1960; Volume 5, pp. 1–24. [Google Scholar]
- Berlin, L.M.; Galyaev, A.A.; Lysenko, P.V. About time-optimal control problem for system of two non-synchronous oscillators. In Proceedings of the 16th International Conference on Stability and Oscillations of Nonlinear Control Systems (Pyatnitskiy’s Conference), Moscow, Russian, 1–3 June 2022. [Google Scholar] [CrossRef]
- Zorich, V.A. Mathematical Analysis I, 2nd ed.; Springer: Berlin/Heidelberg, Germany, 2015; p. 616. [Google Scholar]
- Isaacs, R. Differential Games: A Mathematical Theory with Applications to Warfare and Pursuit, Control and Optimization; John Wiley and Sons: New York, NY, USA, 1965. [Google Scholar]
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).