Advanced Model with a Trajectory Tracking Stabilisation System and Feasible Solution of the Optimal Control Problem

Diveev, Askhat; Sofronova, Elena; Konyrbaev, Nurbek; Abdullayev, Oralbek

doi:10.3390/math12203193

Open AccessArticle

Advanced Model with a Trajectory Tracking Stabilisation System and Feasible Solution of the Optimal Control Problem

¹

Federal Research Center “Computer Science and Control” of the Russian Academy of Sciences, Vavilova Str., 44, Build. 2, 119333 Moscow, Russia

²

Institute of Engineering and Technology, Korkyt Ata Kyzylorda University, Aiteke bi Str. 29A, Kyzylorda 120014, Kazakhstan

^*

Author to whom correspondence should be addressed.

Mathematics 2024, 12(20), 3193; https://doi.org/10.3390/math12203193

Submission received: 26 August 2024 / Revised: 19 September 2024 / Accepted: 30 September 2024 / Published: 12 October 2024

(This article belongs to the Special Issue Artificial Intelligence and Mathematical Models in Robotics and Automation)

Download

Browse Figures

Versions Notes

Abstract

In this study, we consider the extended optimal control problem and search for a control function in the class of feasible functions for a real control object. Unlike the classical optimal control problem, the control function should depend on the state, not time. Therefore, the control synthesis problem for the initial-state domain should be solved, instead of the optimal control problem with one initial state. Alternatively, an optimal trajectory motion stabilisation system may be constructed. Both approaches—control and trajectory motion stabilisation system syntheses—cannot be applied to real-time control, as the task is too complex. The minimum threshold of quality criteria is searched for in the space of mathematical expression codes. Among other problems, the search space is difficult to define and the gradient is hard to determine. Therefore, the advanced control object model is used to obtain a feasible control function. The advanced model is firstly obtained before solving the optimal control problem and it already includes a trajectory motion stabilisation system; in particular, this stabilisation system is synthesised in advance at the control system design stage. When the optimal control problem appears, it is solved in real time in the classical statement, and a control function is searched for as a function of time. The advanced control object model also uses the reference model to generate the optimal trajectory. The search for the optimal control function is performed in real time and considers the synthesised stabilisation system of motion along a determined trajectory. Machine learning control via symbolic regression, namely, the network operator method, is used to directly solve the control synthesis problem. An example solution of the optimal control problem, with an advanced model moving in the environment with obstacles for a group of two mobile robots, is presented. The obtained solution is a control function for a reference model that generates a trajectory from a class of trajectories stabilised with the object’s control system.

Keywords:

machine learning control; optimal control; control synthesis; stabilisation system; symbolic regression; mobile robot

MSC:

49M25; 68W50

1. Introduction

This study is devoted to developing numerical methods for solving complex tasks in control theory, namely, the control synthesis problem. To implement the optimal control problem solution for a real control object, solving the control synthesis problem is necessary, which is a highly time-consuming computational task. Previous studies have proposed automatically solving this problem using effective machine learning (ML) methods [1].

Solving the optimal control problem in a classical statement leads to obtaining control as a time function. This control cannot be directly implemented for a real control object because the model of this optimal control problem was formulated and solved, as it is sensitive to the initial state and perturbations. To directly implement the optimal control problem solution in a real control object, a stabilisation system must be built for the control object’s motion along the optimal trajectory, in order to compensate for deviation from the optimal trajectory [2,3,4].

Trajectory tracking is a common motion control problem in which the control object moves with the time-parameterised reference [5,6]. Trajectory tracking is widely applied in autonomous driving [7,8], robotics [9], high-precision agriculture [10,11], and so on. Effective methods for trajectory tracking include model predictive control [12], backstepping [13], PID regulators [14], and neural network PID control [15], among others.

It should be noted that the stabilisation system should provide trajectory tracking in the expanded space of states, taking into account the time; otherwise, the value of the quality criterion obtained from solving the optimal control problem will change.

Building a trajectory motion stabilisation system is not any easier than solving the optimal control problem. It requires searching for a control function that depends on the control object’s state deviation from a given trajectory. For example, if this problem is defined as optimisation (i.e., the minimisation of deviation from a given trajectory), then its solution generally involves finding the structure and parameters of the stabilisation system function.

The search for mathematical expressions occurs in various tasks, such as experimental data approximation, inverse function searches, common differential equation solutions, control object model identification, control synthesis, and so on, and includes determining the mathematical expression in the following form:

g (x, q),

(1)

where

x

is a vector of variables, and

q

is a vector of parameters searched for with the structure of Expression (1).

Machine learning methods to search for mathematical expressions, including symbolic regression, are described in [1,16,17,18]. Symbolic regression methods allow us to determine mathematical expressions, including their parameters, using a computational evolutionary approach. The mathematical expressions are iteratively composed of alphabets of elementary functions, and the evolutionary technique is used to search for the most suitable equation to satisfy the given quality criteria. In our research, symbolic regression was applied to the control synthesis problem solution.

1.1. Relation to the Literature

The classical statement of the optimal control problem has been expanded [19], and additional optimal trajectory conditions have been introduced as the classical statement does not address stabilisation system construction for control object motion along the optimal trajectory, and creating such a stabilisation system or similar solutions is necessary to implement the optimal control problem solution. In the extended statement, obtaining an optimal control and trajectory in the state space is followed by creating a system that stabilises the control object motion along the obtained optimal trajectory. The obtained stabilisation system should ensure the attraction property in the vicinity of the optimal trajectory. However, a quality problem arises when synthesising a motion stabilisation system, as the criterion of reducing the motion error along a given trajectory is usually used.

The main issue in solving the extended optimal control problem is that the stabilisation system for object motion along the optimal trajectory significantly changes the mathematical model of the control object. Initially, the optimal control problem was deemed to be solved for one mathematical model of the control object. In reality, the object moving along the trajectory had another mathematical model that was overlooked at first. Another issue is that the motion stabilisation system cannot be synthesised on the on-board processor of the control object in real time. If the optimal control problem can be solved via the direct method on the on-board processor of the control object within a few seconds, then synthesis of the motion stabilisation system requires much more time due to its computational complexity.

The authors of [20] presented a universal stabilisation system for motion along a given trajectory. Initially, several trajectories were set in the state space, and one universal stabilisation system was sought, which ensured control object motion in the vicinity of all given trajectories. Furthermore, when the optimal control problem is solved, the universal stabilisation system is used. In this study, the optimal control problem is solved in real time with a possible deterioration in the stabilisation quality of the motion along the optimal trajectory, as the obtained optimal trajectory is not guaranteed to belong to the class of trajectories that the universal stabilisation system stabilises. In addition, the second problem remains. The optimal control problem is solved for a mathematical model of the control object that does not include a stabilisation system, and a real control object contains a stabilisation system.

In the work of [21], the problem of stabilising the motion along a given trajectory was considered. The trajectory was given in space in the form of straight-line segments. A reference model of motion was created to calculate the deviation from the trajectory. For this purpose, the motion speed along each segment was calculated and considered constant. The deviation of the object from the reference motion of the point along the trajectory was calculated at each segment. To solve the control synthesis problem, the network operator method was used and a stabilisation system was obtained.

1.2. Contribution

In this study, we searched for a feasible optimal control problem solution. The search space includes a class of feasible optimal trajectories.

An advanced mathematical model of the control object was used to solve the extended optimal control problem. Initially, a universal stabilisation system was built. Furthermore, the stabilisation system was input into the mathematical model of the control object. An object trajectory vector in an extended state space in the right part of the control object model was obtained, instead of a free control vector. The mathematical model of the control object includes a reference model, which contains a free control vector in the right part. As a result, the advanced control object model was obtained. The dimension of the advanced model is twice as large as the original control object model. The advanced mathematical model allowed us to solve the optimal control problem in the classical statement, in an on-board and online manner. The optimal control problem was solved for an object with a mathematical model that includes a stabilisation system for motion along a given trajectory.

The rest of this study is organised as follows: The advanced model and problem statement of the control object are presented in Section 2; a network operator method, as one of the symbolic regression methods, is outlined in Section 3; computational experiments focused on solving the optimal control, control synthesis, and stabilisation system synthesis problems for two-wheeled robots are described in Section 4, followed by a discussion in Section 5, a conclusion in Section 6, and potential avenues for future studies in Section 7.

2. Advanced Control Object Model and Problem Statement

In the classical statement of the optimal control problem, the mathematical model of the control object is given as follows:

\dot{x} = f (x, u),

(2)

where

x

is a state vector,

x \in R^{n}

,

u

is a control vector,

u \in U \subseteq R^{m}

, and

U

is a compact set that, as a rule, determines constraints on the control.

The initial state is

x (0) = x^{0} = {[x_{1}^{0} \dots x_{n}^{0}]}^{T} .

(3)

The terminal state is

x (t_{f}) = x^{f} = {[x_{1}^{f} \dots x_{n}^{f}]}^{T},

(4)

where

t_{f}

is a terminal time,

t_{f} ⩽ t^{+}

, and

t^{+}

is a given limitation.

The quality criterion in an integral form is

J_{0} = \int_{0}^{t_{f}} f_{0} (x, u) d t \to min_{u \in U} .

(5)

We created a universal stabilisation system of motion along a given trajectory to develop an advanced model. For this purpose, we formulated the problem of a universal stabilisation system synthesis. The initial state domain is given as a deviation from the specified initial state as follows:

X_{0} = {x^{0} \pm Δ},

(6)

where

Δ = {[Δ_{1} \dots Δ_{n}]}^{T}

.

A set of diverse control functions is

V^{*} = {v^{*, 1} (t), \dots, v^{*, M} (t)}, 0 ⩽ t ⩽ t^{+},

(7)

where

v^{*, j} (t) \in U, \forall t \in [0; t^{+}], j = 1, \dots, M .

(8)

The diversity of functions should be enough to consider all control object features. Functions in (7) may include simple discontinuities.

If we substitute the control functions in (7) into the right parts of the control object model (2), we can then obtain a set of program trajectories as particular solutions of the ODE system from a given initial state, as shown in the following equation:

X^{*} = {x^{*, 1} (t, x^{0}), \dots, x^{*, M} (t, x^{0})},

(9)

where

x^{*, j} (t, x^{0})

is a particular solution of the ODE system (2) from an initial state (3) with the control function

v^{*, j} (t)

in the right part.

\dot{x} = f (x, v^{*, j} (t)), j \in {1, \dots, M} .

(10)

Then, we searched for control in (2) as a function of state

u = h (x^{*, j} (t) - x) \in U, j \in {1, \dots, M} .

(11)

To minimise the following quality criterion:

J_{1} = \sum_{j = 1}^{M} \sum_{k = 0}^{2^{n} - 1} \int_{0}^{t^{+}} ∥ x^{*, j} (t) - x (t, x^{0, k}) ∥ d t \to min_{h (x^{*, j} (t) - x) \in U},

(12)

where

x^{0, k}

is a particular solution of the ODE system,

\dot{x} = f (x, h (x^{*, j} (t) - x)),

(13)

from the initial state of

x^{0, k} = x^{0} - Δ + 2 {(k)}_{2} ⊙ Δ, k = 0, \dots, 2^{n} - 1,

(14)

where

{(k)}_{2}

is a binary code of the number k of n bit, and ⊙ is a Hadamard product of vectors.

The obtained stabilisation system (11) is inserted into the mathematical model of the control object. Then, the original model (2) is added. As a result, the advanced mathematical model of the control object is obtained:

\begin{matrix} {\dot{x}}^{*} & = & f (x^{*}, u), \\ \dot{x} & = & f (x, h (x^{*} - x)) . \end{matrix}

(15)

The structure of the advanced model (15) (see Figure 1) consists of two subsystems: the reference model, which is usually realised on the on-board processor, and a model of the control object with a trajectory tracking stabilisation system.

The first subsystem is related to the open-loop control, while the second subsystem is related to the closed-loop control. Therefore, the control quality depends on the stabilisation formula

h

previously found for a class of trajectories (9) and the initial state domain (6).

For the advanced model (15), the optimal control problem can be solved in the classical statement. A control function is found as a time function, using the quality criterion of reaching the terminal state and avoiding collisions with obstacles, namely, satisfying the phase constraints in terms of control theory. Phase constraints may be static and dynamic in nature. Static phase constraints are obstacles in the environment, and dynamic phase constraints are possible collisions between interacting objects.

The two-step scheme for designing a control system based on the proposed advanced control object model is shown in Figure 2. In the first step, a universal stabilisation system for trajectory tracking is obtained using a machine-learning symbolic regression method. In the second step, an optimal control problem is solved using an evolutionary algorithm for the advanced model. The obtained solution belongs to the class of feasible control functions.

For the advanced model, it is possible to build a Hamiltonian system of differential equations for conjugate variables and use the maximum principle

H (x^{*}, x, ψ^{*}, ψ) = - f_{0} (x, u) + ψ^{* T} f (x^{*}, u) + ψ^{T} f (x, h (x^{*} - x)),

(16)

where

ψ^{*}

and

ψ

are vectors of conjugate variables as shown in the following equations:

{\dot{ψ}}^{*} = - \frac{\partial H (x^{*}, x, ψ^{*}, ψ)}{\partial x^{*}},

(17)

\dot{ψ} = - \frac{\partial H (x^{*}, x, ψ^{*}, ψ)}{\partial x} .

(18)

3. Machine Learning for Stabilisation System Synthesis

To solve the control synthesis problems (2), (11), (12), and (14) and obtain a stabilisation formula

h

for a class of trajectories (9), machine learning control via symbolic regression was used.

Symbolic regression encodes mathematical expressions as special codes and searches for the solution in the code space. The space of mathematical expression codes is not numerical, and it is rather difficult to determine the search direction as it does not have the metrics to estimate the distance between possible solutions nor a possible solution to evaluate the gradient. The convergence of symbolic regression algorithms is the main problem in this optimisation class. Let us call this class non-numeric. It contains the most difficult optimisation problems. A developed search algorithm should, at least, guarantee more efficient performance than a simple random search.

To assess the distance between two possible solutions in the space of mathematical expression codes, the concept of small variation in the mathematical expression code is defined in [22]. As most codes of mathematical expressions are integer mathematical constructions, vectors, matrices, sets, and so on, a small variation in code means the smallest number of integer changes in a mathematical construction code (i.e., their positions and values). With this small change in code, the correct code of another mathematical expression should be obtained. Any symbolic regression method has its own set of small variations.

Any mathematical expression may have an infinite number of representations if any other expression that can be added to the outlined expression is also equal to zero. However, as a rule, mathematical expressions are written in the most compact form possible, without adding identical expressions equal to zero or multiplying by mathematical expressions with identical unit values.

When searching for a mathematical expression in the code space, it is difficult to state that the found code does not contain insignificant additives. Therefore, it is necessary to limit the length of the searched code. Small variations must possess the property of completeness. This means that any mathematical expression described using symbolic regression code from a set of codes of limited length can be transformed using a finite number of small variations into any other mathematical expression with code belonging to the same set.

\tilde{Ψ} = Ψ \circ w^{1} \circ \dots \circ w^{d},

(19)

where

Ψ

is a code of initial mathematical expression,

w^{i}

is a code of small variation, d is a number of small variations,

\tilde{Ψ}

is a code of a new mathematical expression, and ∘ denotes an operator of the variation application.

We considered an example of encoding a mathematical expression and applying small variations. To code mathematical expressions, we used a graph-based structure of a network operator (for more details, see [18]). Then, we needed to code and vary the following mathematical expression:

y = e^{(- q_{1} x_{1})} cos (q_{2} x_{2} + q_{1})

(20)

where

x_{1}

,

x_{2}

are variables, and

q_{1}

,

q_{2}

are constant parameters.

To code the mathematical expression, an alphabet of unary and binary functions was used.

Unary functions include the following:

F_{1} = {f_{1, 1} (z) = z, f_{1, 2} (z) = exp (z), f_{1, 3} (z) = - z, f_{1, 4} (z) = cos (z)} .

(21)

Binary functions include the following:

F_{2} = {f_{2, 1} (z_{1}, z_{2}) = z_{1} + z_{2}, f_{2, 2} (z_{1}, z_{2}) = z_{1} \cdot z_{2}} .

(22)

The identity function

f_{1, 1} (z)

is obligatory in Equation (21). Binary functions in Equation (22) should be commutative and associative and have a unit element: 0 for addition and 1 for multiplication.

In the PC memory, the network operator is represented as the following integer matrix:

Ψ = [\begin{matrix} 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 3 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 2 & 0 & 0 & 2 \\ 0 & 0 & 0 & 0 & 0 & 2 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & 4 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 \end{matrix}] .

(23)

In the matrix, rows that have zeroes on the main diagonal are connected to source nodes (variables and parameters). Other elements on the main diagonal are numbers from (22). The elements above the main diagonal are numbers from (21). For coding and decoding algorithms, see [18].

Let us apply the principle of small variations and varied expression (20) using two small variations,

w^{1}

and

w^{2}

, as outlined in the following equation:

\tilde{Ψ} = Ψ \circ w^{1} \circ w^{2}, w^{1} = {[2 4 7 1]}^{T}, w^{2} = {[3 2 7 0]}^{T} .

(24)

The first component in the vector of variation is a code of small variation types: 0—alternation of unary function; 1—alternation of binary function; 2—addition of unary function; 3—deletion of unary function. The second and third components are numbers of rows and columns in the matrix. The fourth component is a number of the unary or binary function depending on the type of variation.

Having performed two variations, we obtained a new matrix of the network operator and, thus, a new mathematical expression as follows:

\tilde{Ψ} = [\begin{matrix} 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 3 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 1 & 1 & 0 \\ 0 & 0 & 0 & 0 & 2 & 0 & 0 & 2 \\ 0 & 0 & 0 & 0 & 0 & 2 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & 4 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 \end{matrix}]

(25)

y = e^{(- q_{1} x_{1})} cos (q_{2} x_{2} + q_{2}) .

(26)

4. Computational Experiment

A series of computational experiments were carried out.

4.1. Optimal Control Problem

We considered the optimal control problem for two-wheeled differential drive robots.

The mathematical model of control object is given as follows:

\begin{matrix} {\dot{x}}_{1}^{j} & = & 0.5 (u_{1}^{(j)} + u_{2}^{(j)}) cos (x_{3}^{(j)}), \\ {\dot{x}}_{2}^{j} & = & 0.5 (u_{1}^{(j)} + u_{2}^{(j)}) sin (x_{3}^{(j)}), \\ {\dot{x}}_{3}^{j} & = & 0.5 (u_{1}^{(j)} - u_{2}^{(j)}), \end{matrix}

(27)

where j is an index of the robot

j = 1, 2

,

x^{j} = {[x_{1}^{(j)} x_{2}^{(j)} x_{3}^{(j)}]}^{T} \in R^{n}

,

u^{j} = {[u_{1}^{(j)} u_{2}^{(j)}]}^{T} \in U \subseteq R^{m}

, and

U

is a compact set that is determined by constraints on control:

- 10 = u_{i}^{-} ⩽ u_{i}^{(j)} ⩽ u_{i}^{+} = 10, i = 1, 2 .

(28)

The initial states of both robots are given as follows:

x^{1} (0) = x^{0, 1} = {[0 0 0]}^{T}, x^{2} (0) = x^{0, 2} = {[10 10 π]}^{T} .

(29)

The terminal states are given as follows:

x^{1} (t_{f}) = x^{f, 1} = {[10 10 0]}^{T}, x^{2} (t_{f}) = x^{f, 2} = {[0 0 π]}^{T},

(30)

where

t_{f}

is a time to reach the terminal states by each of the robots,

t_{f}

is not given but limited by

t_{f} ⩽ t^{+}

, and

t^{+}

is a given limited time. If two robots reach their terminal states at different times

t_{f, 1} ⩽ t^{+}

,

t_{f, 2} ⩽ t^{+}

, then a maximal time is considered as follows:

t_{f} = max {t_{f, 1}, t_{f, 2}} .

(31)

Static phase constraints are given in the form of circular obstacles as follows:

φ_{i} (x^{j}) = r_{i} - \sqrt{{(x_{1, i} - x_{1}^{(j)})}^{2} + {(x_{2, i} - x_{2}^{(j)})}^{2}} ⩽ 0, i = 1, 2, j = 1, 2,

(32)

where

r_{1} = 2

,

r_{2} = 2

,

x_{1, 1} = 2

,

x_{2, 1} = 5

,

x_{1, 2} = 8

, and

x_{2, 2} = 5

.

Dynamic phase constraints consider collision avoidance conditions as follows:

χ (x^{1}, x^{2}) = r_{0} - \sqrt{{(x_{1}^{(1)} - x_{1}^{(2)})}^{2} + {(x_{2}^{(1)} - x_{2}^{(2)})}^{2}} ⩽ 0,

(33)

where

r_{0} = 1.5

.

It is necessary to find a control function with respect to constraints (28) in order to reach the terminal states and minimise the following quality criterion:

J_{2} = \int_{0}^{t_{f}} 1 d t = t_{f} \to min_{u^{1}, u^{2} \in U} .

(34)

Firstly, the optimal control problem is solved in the classical statement, and the control function is a function of time. For this purpose, the direct approach is used. According to this approach, phase constraints and the accuracy of reaching the terminal states are included in the quality criterion as follows:

\begin{matrix} J_{3} = t_{f} + \sum_{j = 1}^{2} (p_{1} ∥ x^{j, f} - x^{j} (t_{f, j}) ∥ + p_{2} \int_{0}^{t_{f, j}} \sum_{i = 1}^{2} ϑ (φ_{i} (x^{j})) d t) + \\ p_{3} \int_{0}^{t_{f}} ϑ (χ (x^{1}, x^{2})) d t \to min_{u^{1}, u^{2} \in U}, \end{matrix}

(35)

where

p_{1} = 1

,

p_{2} = 3

, and

p_{3} = 3

are penalty coefficients.

t_{f, j} = \{\begin{matrix} t, if t < t^{+} and ∥ x^{j, f} - x^{j} (t) ∥ ⩽ ε \\ t^{+}, otherwise \end{matrix}, j = 1, 2,

(36)

ε = 0.05

, and

t^{+} = 1.8

,

ϑ (α)

is a Heaviside function

ϑ (α) = \{\begin{matrix} 1, if α > 0 \\ 0, otherwise \end{matrix} .

(37)

The control function is found in the form of a piece-wise linear approximation of the time function. The time axis is divided into equal intervals

Δ t

and on the borders of intervals; the values of constant parameters are found for the respective control of both robots. Within the intervals, the parameter values are linked by straight lines. The control function is truncated at the top and bottom according to the specified control constraints in the following:

u_{i}^{(j)} = \{\begin{matrix} u_{i}^{+}, if u_{i}^{+} ⩽ v_{i}^{(j)} (t) \\ u_{i}^{-}, if v_{i}^{(j)} (t) ⩽ u_{i}^{-} \\ v_{i}^{(j)}, otherwise \end{matrix}, i = 1, 2, j = 1, 2,

(38)

where

v_{i}^{(j)} = ({\tilde{q}}_{i + (k - 1) m N}) + ({\tilde{q}}_{i + k m N} - {\tilde{q}}_{i + (k - 1) m N}) \frac{t - k Δ t}{Δ t},

(39)

(k - 1) Δ T ⩽ t < k Δ t

,

k = 1, \dots, L

, m is a number of components in the control vector,

m = 2

, N is a number of robots,

N = 2

,

Δ t

is a time interval,

Δ t = 0.2

, and L is a number of intervals.

L = ⌊\frac{t^{+}}{Δ t}⌋ = ⌊\frac{1.8}{0.2}⌋ = 9 .

(40)

Thus, an optimal control problem solution (27)–(30), (35) includes a vector of constant parameters

\tilde{q} = {[{\tilde{q}}_{1} \dots {\tilde{q}}_{(L + 1) m N}]}^{T} = {[{\tilde{q}}_{1} \dots {\tilde{q}}_{40}]}^{T} .

(41)

To numerically solve the optimal control problem, a hybrid evolutionary algorithm was used [23]. The algorithm includes evolutionary transformations from three evolutionary algorithms: the genetic algorithm, particle swarm optimisation, and the grey wolf optimiser.

The hybrid algorithm determined the following solution:

\begin{matrix} \tilde{q} & = & [16.7976 - 4.8917 16.9660 - 9.2093 2.5560 5.6187 14.6527 8.5727 \\ 15.7636 17.2623 19.9987 16.3321 14.7498 1.2650 - 1.2407 19.8177 \\ 14.9553 19.2749 19.9999 - 6.2499 18.0210 3.1402 15.8863 19.9450 \\ 0.7218 18.0361 16.3448 18.2807 18.2248 16.7853 3.3188 17.0318 \\ {11.5682 10.3246 11.1949 16.7666 - 19.4089 14.1827 - 5.2886 17.9824]}^{T} . \end{matrix}

(42)

Figure 3 shows the optimal trajectories of two robots. The red circles are obstacles or static phase constraints in terms of control theory. As shown in Figure 3, the found solution is almost optimal because both robots avoided collisions, satisfied the phase constraints, and reached the given terminal states. The quality criterion value of (35) was

J_{3} = 1.8003

.

The solution in the form of the control function of time cannot be implemented in real objects since the control system is open-loop. If the initial conditions change, the obtained optimal control will no longer remain optimal even in a small domain.

Figure 4 shows robots trajectories from four randomly disturbed initial states in the range of

Δ_{0} = \pm 0.2

, as shown in the following:

x_{i}^{(j)} (0) = x_{i}^{j, 0} + (2 ξ (t) - 1) Δ_{0}, i = 1, 2, 3, j = 1, 2,

(43)

where

ξ (t)

is a generator of random numbers that returns a random value from 0 to 1 at every call. Small disturbances of the initial states essentially change the trajectories. The robots violate phase constraints and do not reach terminal states. The average value of (35) in a series of 10 experiments was

J_{3} = 6.51

. To deal with this problem, we proposed to solve the optimal control problem as a control synthesis problem.

4.2. Control Synthesis Problem

In the control synthesis problem, mathematical models of two robots should be considered as a model of one control object because a control function of one robot depends on the state vector of the other.

The mathematical model of control object is

\begin{matrix} {\dot{x}}_{1} & = & 0.5 (u_{1} + u_{2}) cos (x_{3}), \\ {\dot{x}}_{2} & = & 0.5 (u_{1} + u_{2}) sin (x_{3}), \\ {\dot{x}}_{3} & = & 0.5 (u_{1} - u_{2}), \\ {\dot{x}}_{4} & = & 0.5 (u_{3} + u_{4}) cos (x_{6}), \\ {\dot{x}}_{5} & = & 0.5 (u_{3} + u_{4}) sin (x_{6}), \\ {\dot{x}}_{6} & = & 0.5 (u_{3} - u_{4}), \end{matrix}

(44)

where

x_{1}

,

x_{2}

, and

x_{3}

are coordinates of the state vector of the first robot;

x_{4}

,

x_{5}

, and

x_{6}

are coordinates of the state vector of the second robot;

u_{1}

and

u_{2}

are components of the control vector of the first robot; and

u_{3}

and

u_{4}

are components of the control vector of the second robot.

The constraints on control are

- 10 = u_{i}^{-} ⩽ u_{i} ⩽ u_{i}^{+} = 10, i = 1, \dots, m = 4 .

(45)

The terminal state is

x (t_{f}) = x^{f} = {[10 10 0 0 0 π]}^{T},

(46)

where

t_{f}

is not given but limited

t_{f} ⩽ t^{+} = 2.2

.

A set of initial states is given in the form of the

Δ

neighbourhood of the original initial state for the optimal control problem

x (0) \in X_{0} = {x^{0} \pm Δ_{0}},

(47)

where

\begin{matrix} x^{0} & = & {[0 0 0 10 10 π]}^{T}, \\ Δ_{0} & = & {[0.2 0.2 π / 18 0.2 0.2 π / 18]}^{T} . \end{matrix}

(48)

For the numerical solution of the synthesis problem, the domain of initial states is replaced by a set of points

{\tilde{X}}_{0} = {x^{0, i} : i = 1, \dots, K},

(49)

where

x^{0, i} = (x^{0} - Δ_{0}) + i ⊙ 2 Δ_{0},

(50)

i

is an integer binary vector of two binary vectors, three bits each, as shown in the following equation:

i = {[{(i - 1)}_{2} ⋮ {(i - 1)}_{2}]}^{T}, i = 1, \dots, 2^{2 n / 2} = 8,

(51)

{(i - 1)}_{2}

is a three bit binary code of the number

i - 1

, and ⊙ is a Hadamard or element-wise product of vectors.

The quality criterion is

\begin{matrix} J_{4} = \sum_{i = 1}^{K} (t_{f, i} + p_{1} ∥ x^{f} - x (t_{f, i}, x^{0, i}) ∥ + p_{2} \int_{0}^{t_{f, i}} ϑ (\tilde{χ} (x (t, x^{0, i}))) d t + \\ p_{3} \int_{0}^{t_{f}} \sum_{j = 1}^{2} (ϑ (α_{j} (x (t, x^{0, i}))) + ϑ (β_{j} (x (t, x^{0, i})))) d t) \to min_{u}, \end{matrix}

(52)

where

x (t, x^{0, i})

is a particular solution of the ODE system (44) from the initial state

x^{0, i}

,

i \in {1, \dots, 8}

, and

p_{1} = 1

,

p_{2} = 3

, and

p_{3} = 3

are penalty coefficients.

t_{f, i} = \{\begin{matrix} t, if t < t^{+} and ∥ x^{f} - x (t, x^{0, i}) ∥ ⩽ ε_{0} \\ t^{+}, otherwise \end{matrix},

(53)

ε_{0} = 0.05

,

t^{+} = 2.4

,

α_{j} (x) = r_{j} - \sqrt{{(x_{1, j} - x_{1})}^{2} + {(x_{2, j} - x_{2})}^{2}},

(54)

β_{j} (x) = r_{j} - \sqrt{{(x_{1, j} - x_{4})}^{2} + {(x_{2, j} - x_{5})}^{2}},

(55)

j = 1, 2

,

r_{1} = 2

,

x_{1, 1} = 2

,

x_{2, 1} = 5

,

r_{2} = 2

,

x_{1, 2} = 8

,

x_{2, 2} = 5

,

\tilde{χ} (x) = r_{0} - \sqrt{{(x_{1} - x_{4})}^{2} + {(x_{2} - x_{5})}^{2}},

(56)

r_{0} = 1.25

.

In the control synthesis problem, it is necessary to find a control function as a function of the state space vector as follows:

u = g (x^{f} - x) .

(57)

If the control function (57) is placed in the right part of the ODE system (44), then the system will have a particular solution from any initial state of (47) that will reach the given terminal state (46) with an optimal value of the quality criterion (52).

The control synthesis problem may be effectively solved using symbolic regression methods. Here, the network operator method was applied.

The network operator method was performed with the following parameters: an NOP matrix size of

32 \times 32

, 12 source nodes for the NOP graph, including 6 nodes for variables and 6 nodes for searched parameters, and 4 sink nodes for the output components of the control vector. The variational genetic algorithm parameters were as follows: the number of possible solutions in the initial population—1024; the number of crossover operations in one generation—128; the number of generations—128; the depth of variation—7; the number of generations between change in the basic solution—20; the number of bits for coding a parameter—16.

Computations were performed on a CPU Intel core i7, 2.8 GHz. The computational time was approx. 10 min. Any change in the obstacle position required recalculation. It could be completed offline on on-board processors but not in real time.

The following solution was obtained using the network operator method as follows:

u_{i} = g_{i} (x^{f} - x^{j}) = \{\begin{matrix} u_{i}^{+}, if u_{i}^{+} ⩽ {\tilde{g}}_{i} (x^{f} - x) \\ u_{i}^{-}, if {\tilde{g}}_{i} (x^{f} - x) ⩽ u_{i}^{-} \\ {\tilde{g}}_{i} (x^{f} - x), otherwise \end{matrix}, i = 1, 2, 3, 4,

(58)

where

{\tilde{g}}_{1} (x^{f} - x) = (F - E) ln (| F |) arctan (C + ρ_{3} (q_{6}) + μ (q_{5})) sgn (C) \sqrt{| C |},

(59)

\begin{matrix} {\tilde{g}}_{2} (x^{f} - x) = H + μ (F) + ρ_{1} (E) + ϑ (D) + ρ_{2} (q_{2} (x_{2}^{f} - x_{2})) + \\ {(q_{1} (x_{1}^{f} - x_{1}))}^{- 1} + ρ_{2} (q_{6}) + q_{1}^{- 1} + μ (x_{5}^{f} - x_{5}) + ρ_{2} (x_{1}^{f} - x_{1}), \end{matrix}

(60)

{\tilde{g}}_{3} (x^{f} - x) = {\tilde{g}}_{1} (x^{f} - x),

(61)

\begin{matrix} {\tilde{g}}_{4} (x^{f} - x) = cos ({\tilde{g}}_{3} (x^{f} - x)) + {\tilde{g}}_{2} (x^{f} - x) + {({\tilde{g}}_{1} (x^{f} - x))}^{- 1} + \\ ρ_{2} (H) + ρ_{2} (q_{4}) + ρ_{2} (x_{5}^{f} - x_{5}), \end{matrix}

(62)

A = tanh (q_{2} (x_{2}^{f} - x_{2})) q_{5} (x_{5}^{f} - x_{5}),

B = q_{6} (x_{6}^{f} - x_{6}) sgn (x_{1}^{f} - x_{1}) \sqrt{| x_{1}^{f} - x_{1} |},

C = exp (A) + q_{3} (x_{3}^{f} - x_{3}) + q_{2} (x_{2}^{f} - x_{2}) + q_{1} (x_{1}^{f} - x_{1}),

D = B + A + \sqrt[3]{q_{3} (x_{3}^{f} - x_{3})} q_{4} (x_{4}^{f} - x_{4}) + exp (x_{4}^{f} - x_{4}),

E = μ (C + ρ_{3} (q_{6}) + μ (q_{5})) D ρ_{2} (q_{6}) sgn (C) tanh (q_{1} (x_{1}^{f} - x_{1})),

F = C + ρ_{3} (q_{6}) + μ (q_{5}) + ρ_{2} (D) + cos (B),

G = E μ (C + ρ_{3} (q_{6})) tanh (q_{2} (x_{2}^{f} - x_{2})),

H = ϑ (F - E) G ρ_{3} (C + ρ_{3} (q_{6})) arctan (D) tanh (C) sgn (E) \sqrt{| E |},

μ (α) = \{\begin{matrix} α, if | α | < 1 \\ sgn (α), otherwise \end{matrix}, ρ_{1} (α) = sgn (α) (exp (| α |) - 1),

ρ_{2} (α) = sgn (α) exp (- | α |), ρ_{3} (α) = sgn (α) ln (| α | + 1),

q_{1} = 1.1709

,

q_{2} = 12.27979

,

q_{3} = 7.32666

,

q_{4} = 4.92456

,

q_{5} = 3.90015

,

q_{6} = 1.73071

.

Figure 5 shows the trajectories of two robots with the control system (58) from some initial states. None of the trajectories violate the phase constraints and they almost reach terminal states. The average value of the quality criterion (52) was

J_{4} = 2.21

.

If we solved the optimal control problem in the classical statement and then synthesised the control system for motion stabilisation along the optimal trajectory, then the control synthesis problem for motion stabilisation took too much time.

4.3. Stabilisation System Synthesis

It was proposed to first solve the synthesis problem of the universal stabilisation system. Universal means that the stabilisation system stabilises the motion along a class of trajectories. Then, the optimal control problem is solved in a class of optimal trajectories that can be stabilised using an already obtained stabilisation system of motion along the trajectory.

We considered the synthesis problem of the universal stabilisation system for motion along the given trajectories. For this purpose, we created one stabilisation system for one robot to work in tandem with the two optimal trajectories, obtained upon solving the optimal control problem and starting from two different initial states.

In the stabilisation synthesis problem, the mathematical model of one robot is considered as follows:

\begin{matrix} {\dot{x}}_{1} & = & 0.5 (u_{1} + u_{2}) cos (x_{3}), \\ {\dot{x}}_{2} & = & 0.5 (u_{1} + u_{2}) sin (x_{3}), \\ {\dot{x}}_{3} & = & 0.5 (u_{1} - u_{2}) . \end{matrix}

(63)

Control constraints are given as follows:

- 10 = u^{-} ⩽ u_{i} ⩽ u^{+} = 10, i = 1, 2 .

(64)

Two sets of initial states are given as follows:

\begin{matrix} X_{0, 1} & = & {x^{0, 1, 1}, \dots, x^{0, 1, K}}, \\ X_{0, 2} & = & {x^{0, 2, 1}, \dots, x^{0, 2, K}}, \end{matrix}

(65)

where

x^{0, j, i} = x^{0, j} - Δ_{0} + {(i - 1)}_{2} ⊙ 2 Δ_{0}, j = 1, 2,

(66)

x^{0, 1}

and

x^{0, 2}

are the initial states of each robot (29),

Δ_{0} = {[0.2 0.2 π / 18]}^{T}

, and

{(i - 1)}_{2}

is a three-bit binary code of

i - 1

number,

i = 1, \dots, 2^{3} = 8

.

Two program trajectories are given as follows:

x^{*, 1} (t), x^{*, 2} (t),

(67)

which are particular solutions of the reference model:

\begin{matrix} {\dot{x}}_{1}^{*} & = & 0.5 (u_{1}^{*} + u_{2}^{*}) cos (x_{3}^{*}), \\ {\dot{x}}_{2}^{*} & = & 0.5 (u_{1}^{*} + u_{2}^{*}) sin (x_{3}^{*}), \\ {\dot{x}}_{3}^{*} & = & 0.5 (u_{1}^{*} - u_{2}^{*}), \end{matrix}

(68)

from the initial states (29) with the program control in the form of a piece-wise linear function

u_{i}^{*, j} = \{\begin{matrix} u^{+}, if u_{i}^{+} ⩽ v_{i}^{*, j} (t) \\ u^{-}, if v_{i}^{*, j} (t) ⩽ u^{-} \\ v_{i}^{*, j}, otherwise \end{matrix}, j = 1, 2, i = 1, 2,

(69)

where

v_{i}^{*, j} = ({\tilde{q}}_{i + (k - 1) m}^{(j)}) + ({\tilde{q}}_{i + k m}^{(j)} - {\tilde{q}}_{i + (k - 1) m}^{(j)}) \frac{t - (k - 1) Δ t}{Δ t}, (k - 1) Δ t ⩽ t < k Δ t,

(70)

k = 1, \dots, L

,

{\tilde{q}}^{j} = {[{\tilde{q}}_{1}^{(j)} \dots {\tilde{q}}_{(k + 1) m}^{(j)}]}^{T}

, j is the trajectory number,

j = 1, 2

,

\begin{matrix} {\tilde{q}}^{1} & = & [16.79762 - 4.89174 2.55602 5.61872 15.76362 \\ 17.26232 14.74983 1.26504 14.95531 19.27486 \\ 18.02104 3.14018 0.72182 18.03608 18.22479 \\ {16.78534 11.56819 10.32455 - 19.40889 14.18273]}^{T}, \\ {\tilde{q}}^{2} & = & [16.96601 - 9.20929 14.65269 8.57269 19.99868 \\ 16.33206 - 1.24072 19.81766 19.99986 - 6.24985 \\ 15.88627 19.94497 16.34476 18.28070 3.31875 \\ {17.03179 11.19494 16.76657 - 5.28858 17.98243]}^{T} . \end{matrix}

(71)

Both program trajectories have the last points in the terminal states of

\begin{matrix} x^{*, 1} (t_{f}) & = & x^{f, 1} = {[10 10 0]}^{T}, \\ x^{*, 2} (t_{f}) & = & x^{f, 2} = {[0 0 π]}^{T} . \end{matrix}

(72)

It necessary to find a control function in the form of

u = h (x^{*, j} - x), j = 1, 2,

(73)

where j is the program trajectory number.

The quality criterion is given as follows:

J_{5} = \sum_{j = 1}^{2} \sum_{k = 1}^{K} (p_{1} ∥ x^{f, j} - x (t_{f, k}, x^{0, k}) ∥ + \int_{0}^{t_{f, k}} ∥ x^{*, j} (t) - x (t, x^{0, k}) ∥ d t) \to min_{h (x^{*, j} - x)} .

(74)

To solve the control synthesis problem, machine learning control via the network operator method was used. The network operator method was performed with the following parameters: an NOP matrix size of

24 \times 24

, six source nodes for the NOP graph, including three nodes for variables and three nodes for searched parameters, and two sink nodes for the output components of the control vector. The variational genetic algorithm parameters were as follows: the number of possible solutions in initial population—512; the number of crossover operations in one generation—128; the number of generations—128; the depth of variation—7; the number of generations between change in the basic solution—20; the number of bits for coding a parameter—16.

The following solution was obtained:

h_{i} (x^{*} - x) = \{\begin{matrix} u^{+}, if u^{+} ⩽ {\tilde{h}}_{i} (x^{*} - x) \\ u^{-}, if {\tilde{h}}_{i} (x^{*} - x) ⩽ u^{-} \\ {\tilde{h}}_{i} (x^{*} - x), otherwise \end{matrix},

(75)

where

\begin{array}{l} {\tilde{h}}_{1} (x^{*} - x) = μ (W) + H + ρ_{2} (exp (G)) + ρ_{3} (D) + ρ_{3} (B) + C^{2} + \\ exp (B) + ϑ (q_{2} (x_{2}^{*} - x_{2}) sgn (x_{1}^{*} - x_{1})) + {(x_{3}^{*} - x_{3})}^{2} - (x_{2}^{*} - x_{2}), \end{array}

(76)

{\tilde{h}}_{2} (x^{*} - x) = {\tilde{h}}_{1} (x^{*} - x) + sgn (W) \sqrt{| W |} + ρ_{2} (exp (G)) - F + ln (| G |) + {(q_{3})}^{- 1},

(77)

A = q_{1} (x_{1}^{*} - x_{1}) sin (q_{1}) {(x_{2}^{*} - x_{2})}^{3},

B = q_{3} (x_{3}^{*} - x_{3}) exp (q_{2}) + q_{2} (x_{2}^{*} - x_{2}) sgn (x_{1}^{*} - x_{1}),

C = A + μ (x_{3}^{*} - x_{3}) + {(x_{2}^{*} - x_{2})}^{3} + μ (x_{1}^{*} - x_{1}),

D = B^{2} + C + sgn (A) + ρ_{2} (q_{3}) + ρ_{1} (x_{3}^{*} - x_{3}) + sgn (x_{2}^{*} - x_{2}) \sqrt{| x_{2}^{*} - x_{2} |},

E = D + {(q_{3})}^{- 1} + ρ_{1} (x_{3}^{*} - x_{3}),

F = B + exp (q_{2} (x_{2}^{*} - x_{2}) sgn (x_{1}^{*} - x_{1})) + ρ_{3} (q_{1}),

G = ρ_{1} (E) + A^{2} + {(q_{2} (x_{2}^{*} - x_{2}) sgn (x_{1}^{*} - x_{1}))}^{- 1} + tanh (x_{3}^{*} - x_{3}),

H = F + cos (F) + ρ_{1} (B) sin (q_{3} (x_{3}^{*} - x_{3}) exp (q_{2})) + arctan (q_{2} (x_{2}^{*} - x_{2}) sgn (x_{1}^{*} - x_{1})) + A - A^{3},

W = H - H^{3} + E^{2} + sgn (B) - B + ln (| q_{3} (x_{3}^{*} - x_{3}) exp (q_{2}) |) + cos (x_{1}^{*} - x_{1}),

q_{1} = 15.9375

,

q_{2} = 1.29028

,

q_{3} = 10.13403

. A network operator matrix is given in Appendix A. Here, the trajectory number is not indicated since it is assumed that the stabilisation system provides the control object motion along any given program trajectory from some class.

Figure 6 and Figure 7 show the perturbed trajectories from the first (Figure 6) and the second (Figure 7) given initial states.

If the obtained motion stabilisation system (75) was inserted into the mathematical models of the control objects (27), and the system was simulated without disturbances to the program trajectories obtained via the control (42) applied to the reference model (68). Then, the quality criterion for the optimal control problem was

J_{3} = 2.088

, instead of

1.801

, which was obtained upon solving the origin optimal control problem. This result was expected, as the optimal control problem was solved for objects without a motion stabilisation system.

4.4. Optimal Control Problem with Advanced Control Object Model

Let us consider the same optimal control problem, but for the advanced mathematical model of the control object. There are two control objects described using the advanced mathematical model, including reference models for program trajectory generation.

\begin{matrix} {\dot{x}}_{1}^{*, j} & = & 0.5 (u_{1}^{(j)} + u_{2}^{(j)}) cos (x_{3}^{*, j}), \\ {\dot{x}}_{2}^{*, j} & = & 0.5 (u_{1}^{(j)} + u_{2}^{(j)}) sin (x_{3}^{*, j}), \\ {\dot{x}}_{3}^{*, j} & = & 0.5 (u_{1}^{(j)} - u_{2}^{(j)}), \\ {\dot{x}}_{1}^{(j)} & = & 0.5 (h_{1} (x^{*, j} - x^{j}) + h_{2} (x^{*, j} - x^{j})) cos (x_{3}^{(j)}), \\ {\dot{x}}_{2}^{(j)} & = & 0.5 (h_{1} (x^{*, j} - x^{j}) + h_{2} (x^{*, j} - x^{j})) sin (x_{3}^{(j)}), \\ {\dot{x}}_{3}^{(j)} & = & 0.5 (h_{1} (x^{*, j} - x^{j}) - h_{2} (x^{*, j} - x^{j})), \end{matrix}

(78)

where

j = 1, 2

.

A hybrid evolutionary algorithm was used. The previously found optimal solution (42) was included in the initial population as one of the possible solutions. This solution is the best in the initial population as other possible solutions are generated randomly, which makes it possible to not look for a completely new solution but to improve the previously found one for a problem with the advanced control object model.

The following solution for the piece-wise linear approximation of the control function (38) was obtained:

\begin{matrix} \tilde{q} & = & [17.1163 - 5.1976 16.9293 - 8.8813 3.5470 6.8936 16.4441 8.8112 \\ 16.9186 16.9069 19.0421 16.3618 15.2623 1.0616 - 0.6918 19.5165 \\ 15.3498 18.0201 19.9420 - 6.1028 17.4561 4.8083 14.9516 20.0000 \\ 0.8972 19.5355 15.9978 14.0845 19.9820 15.8831 2.4157 14.7072 \\ {7.9791 7.6057 16.7390 15.9825 - 18.8485 10.4700 - 17.8128 15.6457]}^{T} . \end{matrix}

(79)

Figure 8 shows the optimal trajectories (79) for robots with the advanced mathematical model with a quality criterion value

J_{3} = 1.937

. Optimal trajectories are marked with blue lines, and obstacles are marked with red circles.

5. Discussion

At present, obtaining the optimal control problem solution for multi-functional objects, such as robots in their functioning process, is highly desirable. This is even the case for an approximate solution that is fast and can be obtained on-board. Modern on-board processors allow us to solve problems in short periods of time, albeit in the classical statement, when the control function is searched for as a time function. It is well known and has been demonstrated in our computational experiments that such solutions obtained for original mathematical models are not feasible in practice as they lead to significant errors when the control object experiences small perturbations. The proposed approach of using the advanced method allows us to obtain optimal control problem solutions from the class of feasible control functions. The quality criterion includes penalties for violating both static and dynamic phase constraints. It should be noted that if the number and size of obstacles change, then the optimal control problem should be solved again for the same advanced model and another number of obstacles, and so on, and new optimal trajectories should be obtained, which will also belong to the class of trajectories that are stabilised via the trajectory motion stabilisation system.

The key factor in creating an advanced model is a set of control functions (7) that generate a class of trajectories (9). If the set (7) is sufficient, then the optimal control problem solution with the advanced model with a quality criterion value of (5) will be close to the optimal solution obtained solely for the reference model.

Creating an advanced model is always possible for tasks where the optimal control problem is stated for the model without uncertainties and stochastics in the right parts.

6. Conclusions

This study proposes using an advanced control object model. Initially, a universal motion stabilisation system along the specified trajectory is built for the object. It is assumed that, in the functioning process, the object will move along the optimal trajectories of a certain class. The resulting control system is then inserted into the control object. In order to quickly solve optimal control problems, an advanced object model is input into the on-board processor of the control object, which includes a mathematical model of the object with a motion stabilisation system along a given trajectory and a reference model that generates an optimal trajectory. While functioning, the evolutionary algorithm solves the optimal control problem in the classical statement in real time for an advanced control object model. The obtained solution is a control function for the reference model that generates a trajectory from a class of trajectories stabilised using the object control system.

7. Future Work

Further research will focus on developing effective methods for building advanced control object models, specifying the class of stabilisation trajectories and possibly the class of criteria for which these trajectories are optimal.

Author Contributions

Conceptualisation, A.D. and E.S.; methodology, A.D.; software, A.D. and E.S.; validation, E.S.; formal analysis, A.D.; investigation, E.S. and N.K.; resources, A.D.; writing—original draft preparation, A.D. and E.S.; writing—review and editing, E.S.; visualisation, N.K. and O.A.; supervision, A.D. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Science Committee of the Ministry of Science and Higher Education of the Republic of Kazakhstan (Grant No. AP14869851).

Data Availability Statement

The solution of the stabilisation system synthesis problem (Section 4.3), a network operator matrix, and the alphabet of used functions are given in Appendix A.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

A network operator matrix for (76) and (77) is as follows:

[\begin{matrix} 0 & 0 & 0 & 0 & 0 & 0 & 1 & 10 & 0 & 0 & 0 & 16 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 23 & 11 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 14 & 0 & 14 & 0 & 4 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 3 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 & 16 & 0 & 18 & 0 & 18 & 0 & 8 & 0 & 0 & 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 & 12 & 0 & 0 & 0 & 0 & 0 & 0 & 17 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 6 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 19 & 0 & 5 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 5 \\ 0 & 0 & 0 & 0 & 0 & 0 & 2 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 & 0 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 6 & 5 & 0 & 0 & 13 & 0 & 9 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 12 & 7 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 1 & 0 & 10 & 0 & 0 & 0 & 2 & 0 & 0 & 0 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 3 & 6 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 2 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 10 & 17 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 1 & 0 & 0 & 0 & 0 & 0 & 0 & 17 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 & 0 & 1 & 0 & 0 & 0 & 18 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 18 & 0 & 0 & 0 & 2 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 & 0 & 1 & 0 & 11 & 0 & 0 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 6 & 0 & 0 & 0 & 7 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 2 & 0 & 1 & 0 & 0 & 3 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 0 & 0 & 19 & 19 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 23 & 1 & 0 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 16 & 4 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 & 1 \\ 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 0 & 1 \end{matrix}]

The following functions with one argument are used: 1—z (identity function); 2—

z^{2}

; 3—

- z

; 4—

sgn (z) \sqrt{| z |}

; 5—

z^{- 1}

; 6—

exp (z)

; 7—

ln (| z |)

; 8—

tanh (z)

; 10—

sgn (z)

; 11—

cos (z)

; 12—

sin (z)

; 13—

arctan (z)

; 14—

z^{3}

; 16—

μ (z)

; 17—

ρ_{3} (z)

; 18—

ρ_{1} (z)

; 19—

ρ_{2} (z)

; 23—

z - z^{3}

. Functions with two arguments (on the diagonal): 1—

z_{1} + z_{2}

; 2—

z_{1} z_{2}

.

References

Brunton, S. Machine Learning for Scientific Discovery, with Examples in Fluid Mechanics. J. Fluid Mech. 2022. [Google Scholar] [CrossRef]
Brockett, R.W. Asymptotic Stability and Feedback Stabilization. In Differential Geometric Control Theory; Brockett, R.W., Millman, R.S., Sussmann, H.J., Eds.; Birkhauser: Boston, MA, USA, 1983; pp. 181–191. [Google Scholar]
Posa, M.; Kuindersma, S.; Tedrake, R. Optimization and stabilization of trajectories for constrained dynamical systems. In Proceedings of the 2016 IEEE International Conference on Robotics and Automation (ICRA), Stockholm, Sweden, 16–21 May 2016; pp. 1366–1373. [Google Scholar] [CrossRef]
Kaspirovich, I.; Mukharlyamov, R. Stabilization of Optimal Trajectories of Dynamical Systems. In Proceedings of the IUTAM Symposium on Optimal Guidance and Control for Autonomous Systems 2023, Honolulu, HI, USA, 15–17 March 2023; pp. 237–250. [Google Scholar] [CrossRef]
Laumond, J.-P. (Ed.) Robot Motion Planning and Control; Lectures Notes in Control and Information Sciences; Springer: Berlin/Heidelberg, Germany, 1998; Volume 229, 343p. [Google Scholar] [CrossRef]
Morin, P.; Samson, C. Motion Control of Wheeled Mobile Robots. In Springer Handbook of Robotics; Siciliano, B., Khatib, O., Eds.; Springer: Berlin/Heidelberg, Germany, 2008; pp. 799–826. Available online: https://link.springer.com/referenceworkentry/10.1007/978-3-540-30301-5_35 (accessed on 29 September 2024).
Aguiar, A.P.; Hespanha, J.P. Trajectory-Tracking and Path-Following of Underactuated Autonomous Vehicles with parametric Modeling Uncertainty. IEEE Trans. Autom. Control 2007, 52, 1362–1379. [Google Scholar] [CrossRef]
Dixit, S.; Fallah, S.; Montanaro, U.; Dianati, M.; Stevens, A.; Mccullough, F.; Mouzakitis, A. Trajectory Planning and Tracking for Autonomous Overtaking: State-of-the-Art and Future Prospects. Annu. Rev. Control 2018, 45, 76–86. [Google Scholar] [CrossRef]
Nguyen, A.T.; Xuan-Mung, N.; Hong, S.-K. Quadcopter Adaptive Trajectory Tracking Control: A New Approach via Backstepping Technique. Appl. Sci. 2019, 9, 3873. [Google Scholar] [CrossRef]
Fang, H.; Fan, R.; Thuilot, H.; Martinet, H. Trajectory tracking control of farm vehicles in presence of sliding. Robot. Auton. Syst. 2006, 54, 828–839. [Google Scholar] [CrossRef]
Pouya, K.; Khalil, A.; Bahram, T. A full-state trajectory tracking controller for tractor-trailer wheeled mobile robots. Mech. Mach. Theory 2020, 150, 103872. [Google Scholar] [CrossRef]
Andriën, A.R.P.; Lefeber, E.; Antunes, D.J.; Heemels, W.P.M.H. Model Predictive Control for Quadcopters With Almost Global Trajectory Tracking Guarantees. IEEE Trans. Autom. Control 2024, 69, 5216–5230. [Google Scholar] [CrossRef]
Zhang, Y.P.; Fidan, B.; Ioannou, P.A. Backstepping control of linear time-varying systems with known and unknown parameters. IEEE Trans. Autom. Control 2003, 48, 1908–1925. [Google Scholar] [CrossRef]
Mohamed, M.J.; Abbas, M.Y. Design a Fuzzy PID Controller for Trajectory Tracking of Mobile Robot. Eng. Technol. J. 2018, 36A, 100–110. [Google Scholar] [CrossRef]
Eski, İ.; Kuş, Z. Control of unmanned agricultural vehicles using neural network-based control system. Neural Comput. Appl. 2019, 31, 583–595. [Google Scholar] [CrossRef]
Jordan, M.I.; Mitchell, T.M. Machine learning: Trends, perspectives, and prospects. Science 2015, 349, 255. [Google Scholar] [CrossRef] [PubMed]
Koza, J.R. Genetic Programming: On the Programming of Computers by Means of Natural Selection; MIT Press: Cambridge, MA, USA, 1992; 819p. [Google Scholar]
Diveev, A.I.; Sofronova, E.A. The network operator method for search of the most suitable mathematical equation. In Bio-Inspired Computational Algorithms and Their Applications; InTech: Rijeka, Croatia, 2012; pp. 19–42. [Google Scholar]
Diveev, A. Refinement of Optimal Control Problem for Practical Implementation of Its Solution. Dokl. Math. 2023, 107, 28–36. [Google Scholar] [CrossRef]
Diveev, A.I.; Sofronova, E.A. Universal Stabilisation System for Control Object Motion along the Optimal Trajectory. Mathematics 2023, 11, 3556. [Google Scholar] [CrossRef]
Diveev, A.; Sofronova, E.; Konyrbaev, N. A Stabilisation System Synthesis for Motion along a Preset Trajectory and Its Solution by Symbolic Regression. Mathematics 2024, 12, 706. [Google Scholar] [CrossRef]
Sofronova, E.A.; Diveev, A.I. Universal Approach to Solution of Optimization Problems by Symbolic Regression. Appl. Sci. 2021, 11, 5081. [Google Scholar] [CrossRef]
Diveev, A. Hybrid evolutionary algorithm for optimal control problem. In Intelligent Systems and Applications—IntelliSys 2022; Arai, K., Ed.; Lecture Notes in Networks and Systems; Springer: Cham, Switzerland, 2023; Volume 543, pp. 726–738. [Google Scholar]

Figure 1. Structure of the advanced model.

Figure 2. Design of a control system based on the advanced model.

Figure 3. Optimal trajectories (– solid line for the first robot; - - dashed line for the second robot).

Figure 4. Trajectories from four disturbed initial states (− black solid lines for the first robot; - - black dashed lines for the second robot) and optimal trajectories (− blue solid line for the first robot; - - blue dashed line for the second robot).

Figure 5. Robot trajectories with the control system (58) from four disturbed initial states (− black solid lines for the first robot; - - black dashed lines for the second robot).

Figure 6. Trajectories from eight perturbed points (black lines) of the first initial state for the control system (75) and the optimal trajectory (blue line).

Figure 7. Trajectories from eight perturbed points (black lines) of the second initial state for the control system (75) and the optimal trajectory (blue line).

Figure 8. Optimal trajectories of robots (blue lines) obtained with the advanced mathematical model.

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Diveev, A.; Sofronova, E.; Konyrbaev, N.; Abdullayev, O. Advanced Model with a Trajectory Tracking Stabilisation System and Feasible Solution of the Optimal Control Problem. Mathematics 2024, 12, 3193. https://doi.org/10.3390/math12203193

AMA Style

Diveev A, Sofronova E, Konyrbaev N, Abdullayev O. Advanced Model with a Trajectory Tracking Stabilisation System and Feasible Solution of the Optimal Control Problem. Mathematics. 2024; 12(20):3193. https://doi.org/10.3390/math12203193

Chicago/Turabian Style

Diveev, Askhat, Elena Sofronova, Nurbek Konyrbaev, and Oralbek Abdullayev. 2024. "Advanced Model with a Trajectory Tracking Stabilisation System and Feasible Solution of the Optimal Control Problem" Mathematics 12, no. 20: 3193. https://doi.org/10.3390/math12203193

APA Style

Diveev, A., Sofronova, E., Konyrbaev, N., & Abdullayev, O. (2024). Advanced Model with a Trajectory Tracking Stabilisation System and Feasible Solution of the Optimal Control Problem. Mathematics, 12(20), 3193. https://doi.org/10.3390/math12203193

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Advanced Model with a Trajectory Tracking Stabilisation System and Feasible Solution of the Optimal Control Problem

Abstract

1. Introduction

1.1. Relation to the Literature

1.2. Contribution

2. Advanced Control Object Model and Problem Statement

3. Machine Learning for Stabilisation System Synthesis

4. Computational Experiment

4.1. Optimal Control Problem

4.2. Control Synthesis Problem

4.3. Stabilisation System Synthesis

4.4. Optimal Control Problem with Advanced Control Object Model

5. Discussion

6. Conclusions

7. Future Work

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI