Path Planning in Threat Environment for UUV with Non-Uniform Radiation Pattern.

The problem of optimal trajectory planning of the unmanned underwater vehicle (UUV) is considered and analytically solved. The task is to minimize the risk of detection of the moving object by a static sonar while moving between two given points on a plane. The detection is based on the primary acoustic field radiated by the object with a non-uniform radiation pattern. In the first part of the article, the probability of non-detection is derived. Further, it is used as an optimization criterion. The non-uniform radiation pattern of the object differentiates this work from previous research in the area. The optimal trajectory and velocity law of the moving object are found, as well as the criterion value on it.


Introduction
The extensive development of unmanned mobile vehicles has led to a new area of control problems known as "path planning in the threat environment." The threat environment is defined as a set of agents (they are called conflicting) that the controlled vehicle must not approach while performing its main task. The control goal is to minimize the negative impact of conflicting agents by selecting the route, parameters of its movement and/or operating modes of technical devices. Depending on the nature of the specific task, the objectives of the negative impact can be considered to be the detection of an object, the approach to the conflicting object to the distances from which it is possible to defeat, and so forth. In the problem of evasion the moving object has to remain covert for the detector system [1]. These problems are extremely relevant for modern-day warfare applications. The technical progress allows using of non-trivial onboard algorithms for unmanned vehicles from wide scientific areas such as optimal control and differential games.
In Reference [2], the problem of automated path planning of combat unmanned aerial vehicles (UAVs) in the presence of radar-controlled surface-to-air missiles is considered. This task includes the interaction of three subsystems-the aircraft and its characteristics, the radar and its capabilities, and the missile and its striking properties. The paper proposes an approach to a model consisting of three subsystems united in a single system. In general, the literature on UAV path planning can be divided into two groups. The first group considers the problem under the assumption of isotropic radar capabilities (i.e., the independence of the reflected signal from the UAV's spatial orientation), and the second group assumes the implementation of such dependence in the model. Mathematical criteria for the success of missions performed by moving objects can be the following-the probability of rescue, the motion time on the trajectory, the mathematical expectation of the time interval until the first detection on the path, the path length, and so forth [3][4][5][6][7][8][9]. These criteria

Features of the Reception of Broadband Signals by Antenna Arrays
An antenna array with N H receiving elements is a multi-channel receiving device with output signals x i (t), i = 1, ..., N H , whose vector x(t) is represented in the time and frequency domains by the relations

X(ω) = M(ω)S(ω) + ξ(ω).
(1) The matrix function h(·) characterizes the effects of the propagation of the signal s(t) generated by the source, as well as the effects of possible distortions introduced by the receiving elements. The front of the wave received in the far zone is considered flat, and then the signal s(t) becomes a scalar function of time. The wave front does not reach the receiving elements simultaneously, but with delays τ i = (n · r i )/c, where n is the unit direction vector of the wave front, r i is the coordinate vector of the i-th antenna element and c is the signal propagation speed. In a non-dispersive medium the matrix h(·) becomes a vector with components h i (t) = δ(t − τ i ), that reflect only the time delay.
In the two-dimensional problem, the vectors n, r are located in the same plane and it is assumed that the reception is carried out on a linear equidistant lattice with an inter-element distance d located along the axis Ox, and the angle between n and the axis Oy is equal θ. The signals from various array elements are weighed, generally speaking, with complex weights w i , i = 1, ..., N H and summed. The frequency response of such an antenna array has the form If the array is tuned to the frequency ω 0 of the received signal then the function w i e −j(i−1)ψ depends on the angle θ. Moreover, it determines the normalized radiation pattern of the antenna array G(θ) =10 log(|A(θ)| 2 /N 2 H ) (in terms of power). In a dispersive medium, the signal s(t) in Equation (1) is a vector with a component s i (t) at the input of the i-th element of antenna.
If the signal s i (t) has a carrier frequency f i = f 0 , i = 1, ..., N H , then it is impossible, generally speaking, to tune the array simultaneously to each of these signal frequencies. This fact determines the difference in the methods of receiving narrow-band and wide-band signals on the antenna arrays. During broadband reception, the signal s i (t) received by the i-th antenna element is fed to a transverse filter with amplitude and phase frequency characteristics that are adjustable for it, wherein each channel has its own transverse filter. If the latter is synthesized on the basis of a multi-tap delay line (with a delay ∆ i between adjacent taps), the choice of value ∆ i depends on the frequency sub-band of the signal s i (t) that this filter processes.
Consider some channel of the array that processes a frequency-limited portion [−F, F] of the input signal spectrum. The Fourier transform S( f ) of the output signal of such a filter has a limited support. By Kotelnikov's sampling theorem, a discrete set of points s(i) := s(t i ), defined by the formula s(i) = F −F S( f )e j2π f i∆t d f (∆t is the time interval between adjacent sample points), is enough to restore the function from these points. Namely, just put S( f ) = ∆t ∑ ∞ i=−∞ s(i)e −j2π f i∆t . Similarly, the function s(t) can be reconstructed from a discrete sequence s(i) according to the same theorem. In particular, if t = k∆t, we have s(k∆t) = ∆t s(k)2F = s(k). At intermediate points, the use of an expression of the type sin t/t serves as an interpolation scheme which restores s(t) by values s(i).
For digital computers, the use of the Kotelnikov's theorem is not entirely convenient. It would be desirable to have a slightly different form of the Fourier transform, in which they are also specified discretely, but the limits of summation are finite both in the direct and inverse Fourier transforms. This form is used in the theory of the fast Fourier transform (FFT), where the time sequence s(i), i = 0, 1, ..., N 0 − 1 and its frequency FFT sequence S(k), k = 0, 1, ..., N 0 − 1 are set by formulas The FFT algorithm can be given in a matrix notation [20] where s is a vector representation of the sequence s(i), i = 0, 1, ..., N 0 − 1, S is a vector representation of the sequence S(k), k = 0, 1, ..., N 0 − 1 and W is a unitary matrix with elements From this fact, the equivalence of the results obtained using the time and frequency representations of the data follows. In particular, if s is a random vector of Gaussian time samples, then the vector S in Equation (2) of frequency samples of its Fourier image will also be Gaussian. Recalculation of covariance matrices from one representation to another is simplified due to the unitarity of the transition matrix. In this paper, both representations are used equally. In particular, such an important parameter in the problems of detecting and evaluating signals as the signal-to-noise ratio is easily recalculated.
Finally, the last remark concerns the inclusion of scattering indicatrix (in power) for an object emitting a signal. The indicatrix depends on the course angle of the object. Multiplying the scattering indicatrix and the radiation pattern of the receiving antenna, we obtain a function W(ω, θ, φ) depending on the frequency and two angular variables, which determines the strength of the target at the receiving point. It can be shown that for the case of a spatial (and not just flat) problem, the function W(ω, θ, φ) can be represented by the Chebyshev polynomials with coefficients g mn (ω) depending on the frequency. The coefficients g mn (ω) are downloaded in the form of tables from the database for the entire set of operating frequencies of the monitoring station.
The selection of weighting coefficients of antenna arrays and frequency channel parameters for a broadband signal processing is not considered in the work. As noted in the introduction, the purpose of the work is not to optimize the detection procedure, but to optimize the trajectory of the object in order to evade detection.

Non-Detection Probability of UUV under Passive Surveillance
The covertness of the UUV on the selected trajectory with a given velocity law can be characterized by the probability that during the passage of the whole route it will not be detected at any tact. Such probability is denoted by P nd and will be called the non-detection probability of UUV on the trajectory. Let us denote as T 0 the UUV transit time of the whole trajectory, and the duration of the tact will be ∆T. Then the entire trajectory can be divided into N = T 0 ∆T segments, on each of which a decision is made about the absence of an object and the corresponding probabilities P (j) nd , j = 1, ..., N are calculated. Then, assuming statistical independence of the signals on each segment, one of the simplest variants of the probability measure on the trajectory is defined as the product of the probabilistic measures on the segments of the trajectory In the case of several SSS, the product of expressions of the form Equation (3) over all available SSS is taken [2]. Thus, the criterion in the formalization problem of the stealth concept of UUV is the Formula (3). Now the task is to construct the optimal trajectory and the optimal UUV velocity law, maximizing the UUV non-detection probability P nd .
In passive mode, the signal on the elements of the antenna array is described by the formula where x(t), s(t), ξ(t) are Gaussian input signal, object signal and noise, respectively, and N T = ∆T ∆t -number of discrete time samples. Here δ = 0 in the case of hypothesis H 0 (a signal from UUV is absent) and δ = 1 in the case of alternative H 1 (a signal from UUV is present).
If the bearing (with the angle θ) to the UUV does not coincide with the normal to the antenna line and it is known, then the hydrophone input signal is where d is the distance between antenna elements, c is a speed of sound in water and N H is the number of hydrophones. This signal at the k-th hydrophone is a function of time and after sampling in time it turns into a vector of size N T . The covariance matrix of this vector is equal to the sum of the covariance matrices K s + K of the signal and noise respectively, where, due to the independence of time samples of noise, the matrix becomes diagonal, and the matrix of the signal from the object, due to the assumed stationarity, is a Toeplitz matrix with elements where a pair of indices (l, k) fixes the numbers of hydrophones, and i∆t corresponds to the time shift of the signals during sampling by time in step ∆t = t i+1 − t i . Similarly, ∆τ = d sin θ/c means the inter-element delay in the equidistant linear array when the wave is incident at an angle θ. The time sequence of N T vectors in the number of N H pieces can be represented as a single column vector Y t = (y (1) , y (2) , ..., y (N H ) ) T with N T N H elements. In modern SSS with linear antenna array the incoming signal is often preprocessed in the receiving path using FFT. Consider the detector, which performs FFT of time-sampled signals coming from the hydrophones of the linear antenna. As a result, we get a sample of Gaussian centered complex random vectors x( f ), f = 1, . . . , N F of dimension equal to the number of hydrophones N H in antenna. Further processing of the received signal is assumed not in the time but in the frequency domain, since in the case of broadband reception, it is possible to select the frequency ranges most informative for the received useful signal. Therefore, the FFT is applied to the temporal components of the vector Y t . After the FFT conversion, the observation signal is described as The detector works according to the threshold principle, according to which the observations obtained at one tact are converted into decisive statistics compared with the threshold. If the threshold is exceeded, a decision is made about the presence of the object. A false alarm is when the threshold is exceeded in the absence of an object. Such an event is random, and the threshold is selected from the condition that the probability of false alarm is equal to a given small number α (0 < α < 1).
After pre-processing at each hydrophone (band-pass filtering, time-shift of the input signal, sampling by time of the continuous signal and FFT of them), the input signal to be decided is a set of random vector-columns Y (k) ( f ), k = 1, ..., N F with a dimension equal to the number of hydrophones N H . Moreover, Y (k) = ξ (k) in the case of a hypothesis and Y (k) = S (k) + ξ (k) in the case of an alternative.
Then the likelihood ratio for Gaussian vector Y can be determined as [21] where K ξ is the covariance matrix of a random vector Y of the frequency components of the signal in the case of a hypothesis H 0 , and K σ + K ξ is the covariance matrix of the vector in the case of an alternative H 1 . Assuming noise independence at various hydrophones, covariance matrices can be represented in a block form where K is the covariance matrix of a noise which is the same on each of the frequency channels of the antenna array processor, and K I J s is the Toeplitz covariance matrix of the vector signal s( f ) from an object with elements k , where ∆ f = 1/∆t, ∆u = 1/∆τ and O is the square zero matrix. For simplicity we assume that signal s is stationary. Then the covariance matrix of signal s can be represented as K σ = σ 2 s R s , where R s is the normalized covariance matrix, and σ 2 s is the variance of the signal from the object.

Non-Detection Probability of UUV under Passive Surveillance at a Small Signal/Noise Ratio
The Hermitian form in the likelihood ratio Equation (4) is reduced to which allows in the case of a small signal/noise ratio to significantly simplify this expression, bringing it to the form Then in the far detection zone the likelihood ratio is approximated as a function of the statistics [22] where In this case, the likelihood ratio is a monotonically non-decreasing function of statistics Q and there is a uniformly the most powerful criterion for testing the hypothesis H 0 against H 1 . This criterion is to compare the statistics Q from Equation (5) with the threshold h chosen from the condition that the probability of false alarm equals to a given number α: P(Q > h|H 0 ) = α.
By virtue of the central limit theorem the probability distribution of the statistics in Equation (5) 1 ] in the case of the alternative H 1 . When calculating the UUV probability of detection it is assumed that the bearing on the UUV is known. In this case, the probability of UUV being undetected in the next cycle is obtained as where Φ is a function of the standard normal distribution Φ(x) = 1 √ 2π x −∞ exp(−t 2 /2)dt. Due to Equation (5) the next expressions for distribution of statistics Q moments are valid Taking into account Equations (7) - (9) the Formula (6) is representable in the form In the absence of information about covariances between hydrophones we accept Finally, the UUV non-detection probability on the trajectory is equal (with a small ratio signal/noise) The next lemma is valid.

Lemma 1.
On the i-th cycle of observation non-detection probability of UUV under passive surveillance at a small signal/noise ratio can be approximately formulated as Proof. Choose the non-detection probability of the form Equation (10) and consider it at a small signal/noise ratio Taylor series expansion with respect to a small signal/noise parameter leads Equation (12) to the form which is simplified to an expression P nd The total probability of UUV detection formally corresponds to the probability of occurrence of a detection event at least once in a series of N observations and for independent events (observations) is calculated by the formula P d = 1 − ∏ N i=1 P nd i , or, if you introduce the concept of "risk", where R i = − ln P nd i is the risk for the i-th observation segment. Given Equation (12), for enough small α Formula (13) takes the form Multiplying and dividing the last expression by ∆T (tact duration) and assuming that T 0 = N∆T (UUV time on the route), we obtain The last sum is represented as an integral, which we call the threat functional [4,5] in the problem of UUV route planning for the case of passive sonar This conclusion coincides with the result given in Reference [3] in the case of one sonar for with the non-detection probability in the form where F n is χ 2 -probability distribution with n degrees of freedom, α is a false alarm probability, γattenuation coefficient, r j is the distance from UUV to sonar at j tact, starting from the moment of appearance of UUV on the trajectory, and υ j is a constant speed of UUV at this tact, σ 0 , σ ξ , µ, r 0 , υ 0 are some model parameters. The number of factors in the product is equal to the number N of tacts when moving UUV along the trajectory. To use the Formula (14) for calculations, it is necessary to ensure the smallness of the first term and to estimate the value of the multiplier standing before the sign of the sum in the second term. The of this multiplier on the probability of a false alarm is shown in Figure 1. Moreover, for the path planning task, it is not required to know the exact values of the information processing parameters included in expression Equation (14) because they are not included in Equation (15).

Non-Detection Probability of UUV under Active Surveillance
Consider the case of an active location, when the observation model has the form , ξ( f ) are Gaussian complex input signal, useful signal and noise, respectively, after FFT conversion. The meaning of δ is the same. In this case, the statistics for comparing with the threshold is as follows where K ξ is the covariance matrix of noise vector. Statistics Equation (18) is linear in observations, P(T ≤ h|H 1 ) has a normal distribution and the non-detection probability on the observation cycle after sending the probing signal is written in the form of Equation (8), but here In the absence of information about covariances between hydrophones the probability of non-detection at one sending of the probing signal is equal to The number of factors in the product is equal to the number of cycles during the passage of the entire trajectory of UV. As in Equation (11), the non-detection probability on the trajectory in active mode is the product of all observation cycles on the trajectory It should be noted that in Formulas (11) and (20), the variance of the useful signal is a function not only of the distance r to the object, but also of the attenuation coefficient γ when the acoustic wave propagates in a stratified medium.
Choose the non-detection probability of the form Equation (20) and consider it at a small signal/noise ratio with the aid of Taylor series. Then next formula is valid. Like Formula (15), summing up all the cycles of observation, we obtain the treat functional part of active mode equal to The treat functional Equation (15) or (21) on a segment of the trajectory has as input parameters the data of sonar systems, including their location (coordinates and parameters of detection active or passive mode, orientation of antenna, etc.) and calculated for these values of the ratio signal/noise reference source (its speed and the direction of sonar), the values of latitude and longitude start and end points of the segment, the speed of the UUV.

About Risk Functional
The path planning problem for object moving in a threat environment will be determined as the problem of calculus of variations of the risk functional Equation (15). Due to Equation (16), after omitting the constants the expression under integral depends on instantaneous level of signal S(t), received by the sonar [ This signal itself depends on the receiving diagram of the sensor and the radiation pattern of the object as well as properties of the marine environment where multiplier G(ϕ) is responsible for diagram of sensor antenna, g(β) is a radiation pattern of the moving object, r is the current distance between sensor and evading object, v -absolute instant velocity of the object, γ -signal saturation factor in water. The geometric meaning of angles ψ and ϕ is as follows. Here ψ is the angle of rotation of object's velocity vector and ϕ is the angle of rotation of radius vector as shown in Figure 2. Angle ψ − ϕ = β is the angle in triangle built from radial v r and transversal v ϕ velocities of the object, for example, the angle between velocity of the object and its projection on radius vector as shown in Figure 3.The exponents k and µ characterize the physical field used for detection. Depending on values of k and µ this can be magnetic, thermal, acoustic or electromagnetic fields. Moreover, the risk R is the integral value of this signal, so the criterion of optimization Equation (15) is a function of phase coordinates of the object and qualities of the sonar and the object itself: We consider sonar antenna diagram to be homogeneous, so G(ϕ) ≡ 1, and there is no additional signal attenuation or gain in the environment so γ = 1.
We study the case of k = µ, namely µ = 2 for acoustic field in underwater environment. Non-uniform radiation pattern of the object can be described as The example of radiation pattern is presented in Figure 4.

Mathematical Statement of the Problem
Therefore we can formulate the optimal path planning problem.

Problem 1.
It is required to find such trajectory (r * (t), ϕ * (t)), which minimizes the functional where v is a velocity of the object, r is the distance between the sensor and the object. Boundary conditions are Time T 0 of moving on route from point A to B is fixed. The detection system consists of one static sonar placed in the origin of Cartesian coordinate system with X axis passing through point A. The polar coordinate system is used for solution simplicity.

Solution of the Path Planing Problem
Lemma 2. Substitution of variable ρ = ln r brings functional Equation (24) to the form Proof of lemma 2 is given in the Appendix A. Instead of solving the original Problem 1 according to Lemma 2 it is necessary to solve a two point boundary value variational problem on the minimum of the functional Equation (25).

Problem 2.
It is required to find the trajectory (ρ * (t), ϕ * (t)), which minimizes the functional . (26) with boundary conditions The following lemma holds.
Theorem 1. Suppose that 0 < F 1 < F(η) < F 2 for all η ∈ R 1 is a twice continuously differentiated function of η, where F 1 , F 2 are some constant values, andρ(t),φ(t) exist and are continuous functions of t. Then the extremal trajectory satisfies the following system of equations Proof. The Euler-Lagrange equations for the variational problem Equation (26) are After substituting the functional S into system Equation (28) and due to ∂S Here C 1 , C 2 are some constant values, which are first integrals of system Equation (28) as well as S * . After multiplying first Equation (29) onρ, second -onφ and summing, so on after multiplying first Equation (29) on −ρ, second -onφ and summing, we have system The implementation the condition of theorem F > F 1 > 0 and substitution the S * into second Equation (30) give Dividing the second equation from Equation (31) on the first and substituting η =φ ρ the equation with respect to variable η one can obtain It means that left and right sides of Equation (32) equal to each other at some fixed points η * . From the linearity on C 1 , C 2 system Equation (31) follows its solvability at any right-hand side. Due to fixed value η * and left side of Equation (31) the statement of theorem follows.
The general form of optimal trajectory can be obtained. Lemma 4. Let S(ρ,ρ, ϕ,φ, t) = ρ 2 +φ 2 g (β (η)), g(β) -thrice continuously differentiated function of β, then Hessian matrix H equals Proof. After calculating the first partial derivatives of S onρ,φ, using equality β = arctan φ ρ , one can get Then we can calculate the second partial derivatives of S onρ,φ and obtain Equation (33). Computing the determinant of the Hessian matrix gives Equation (34). Proof. First of all, it is required to check the Legandre conditions. It means that Hessian matrix is positively defined. Indeed second order minor coincides with det H which is strictly positive. Among first order minors H 11 and H 22 at list one of them is positive because both values are fixed on the optimal trajectory and their sum equals Hence both values H 11 and H 22 are strictly positive too. The Jacobin conditions have the form because S ρρ = 0, S ϕρ = 0, S ρφ = 0, S ϕφ = 0. The Equation (36) take the form which coincides with explicit form of Equation (28) and leads to Equation (29). It means that the sufficient conditions of strong minimum of risk functional Equation (25) is fulfilled [23].

Corollary 1.
According to the theorems 1 and 2 the optimal trajectory has the form of logarithmic spiral.
The equation of optimal trajectory of the UUV on the plane is Lemma 5. The functional value Equation (25) on the optimal trajectory Equation (38) equals , and depends on boundary conditions only.
The proof of lemma is given in Appendix A.
The theorems above guarantee that optimal trajectory of the object is logarithmic spiral. However, it is unclear, whether this trajectory must consist of one segment or several and have points of switching along the route. To answer this question let us compare the values of the risk functional in two cases. In the first, object moves from A straight to B, and in second -from A to C and then to B. In first case risk R * AB , according to Lemma 5, equals The risk on the trajectory consisting of two segments is given in the next Lemma.
The minimal value of risk on the two-segment trajectory consequently passing through the points A, C, B during time interval T 0 equals a) in the case of 0 ≤ β m < β * b) in the case of β * < β m ≤ π/2 The proof of Lemma 6 is given in Appendix A.
Due to the fact that real acoustic radiation patterns of UUV have more complex shapes, Lemma 6 provides a simple way to check the type of optimal trajectory. Indeed, if in one of the cases of Lemma 6, R * AC + R * CB < R * AB , then the logarithmic spiral directly connecting the points A, B is not optimal and two-segment trajectory is preferable in a sense of risk functional value.

Discussion of the Results
In previous articles the similar problem without radiation pattern was considered. The risk functional has a form . In Reference [1], it was proven that optimal trajectory and velocity in the case of µ = 2 for underwater environment, considered in the current article, are the same as Equation (38). Generally speaking, this is a non-trivial fact. However, the optimal risk value in Reference [1] equals Comparing this formula with Equation (39), one can see that taking into account heterogeneous radiation pattern of the moving object reduces risk value, as g(β), being normalized for comparison by the meaning max(g(β)), is less than 1. Then the risk value Equation (39) is less than the one in Reference [1].
Matlab procedure has been developed to simulate and validate found analytical solution. Figures 5 and 6 show results of this simulation. In each figure left picture shows the chosen radiation pattern and right picture shows extremal trajectories, obtained using this pattern: path AC 1 B represents case a) of the Lemma 6, AC 2 B -case b) and AB -case c) respectively. As a result numerical and analytical solutions have been found to coincide with each other that supports Lemma 6.
The values of risk functional and time intervals have been calculated. In the first example for case a) T 1 = 8.8566, R * AC 1 = 0.078529, T 2 = 11.1434, R * C 1 B = 0.062401, R * AC 1 B = 0.13909, for case b) T 1 = 11.1434, R * AC 2 = 0.078529, T 2 = 8.8566, R * C 2 B = 0.067223, R * AC 2 B = 0.13909, for case c) R * AB = 0.14739. One can see that R * AC 1 B = R * AC 2 B . Also it is clear that R * AC 1 = R * C 2 B and R * C 1 B = R * due to geometry of placement points C 1 , C 2 on plane, which are defined by appropriate choice of radiation pattern. This is also fair for the second example, where for case a): T 1 = 6.7286, R AC 1 = 0.079755, , that is, two-segment route has a less risk of detection than the one of the simple straight route. Thus this solution is different from results of Reference [1] and presents an advantage in the sense of risk functional minimization. Whereas in Reference [1], due to velocity optimization risk value on the optimal path was reduced by 40% in average, now due to radiation pattern risk has been reduced by factor g(β * ) max(g(β)) up to value min(g(β)) max(g(β)) .
In both examples the maximum primary acoustic radiation of UUV while object moves along the trajectory AB falls in the direction of SSS or, in other words, g(β * ) = max(g(β)). Therefore, under other boundary conditions, the risk functional reduction will be significantly greater. This presents an advantage of new solution in the sense of risk functional minimization. Moreover, in both examples R * AB > R * AC 1 B , that is, two-segment route has a less risk of detection than the one of the simple straight route (risk reduction is by 6% and 8% respectively), that is the result described analytically in Lemma 6. Now let us suppose that non-uniform radiation pattern of the object is described as Coefficients K 1 and K 2 define the character of radiation and K 1 + K 2 = 1. The example of this radiation pattern is presented in Figure 4 for K 1 = 0.25, K 2 = 0.75. The functional value Equation (25) on the optimal trajectory equals and depends on boundary conditions only. In this case according to Lemma 4 det H = 4K 1 (K 1 + K 2 ) = 4K 1 > 0 and Theorem 2 can be reformulated as follows. Proof. Indeed, the variation of functional Equation (25) has a form R(ρ + δρ,ρ +δρ, ϕ + δϕ,φ +δ ϕ, t) − R(ρ,ρ, ϕ,φ, t) = where δρ, δϕ are variations of respective coordinates and smooth enough functions of time. The first and second summands are zeros due to fixed boundary conditions for variations, the third summand equals zero due to fulfilled Euler-Lagrange equations. The rest is positive as an integral of a weighted sum of squares of functions. Hence any variation of optimal trajectory enlarges the value of risk functional.

Conclusions
Above, for simplicity, it was assumed that in the detection process of a UUV in passive mode using a linear antenna, bearing θ on the UUV is known. In fact, the bearing search for the target is carried out by additional optimization of statistics from observations, each of which depends on its direction θ to the target in the range (−π/2 < θ < π/2). This means that the entire range of angles is divided into a finite number of angles (directions) θ m , m = 1, ...M. For each one it is necessary to solve the detection problem, that is, to build observation statistics depending on the bearing θ m and then calculate and compare the corresponding detection probabilities. This procedure significantly increases the number of calculations. Therefore, in the future, an optimization method in the statistics space will be proposed, after which the probability of detection will be calculated only once.
When overcoming a particular network-centric system, the risk functional may consist of a weighted sum of the functions related to different search means. The UUV route planning problem in this case is reduced for example to a two-point variational problem or an optimal control problem in which the control variables enter into the permanent dispersion of the signal.

Conflicts of Interest:
The authors declare no conflict of interest.

Abbreviations
The following abbreviations are used in this manuscript: Therefore the full velocity of the moving object while moving on the optimal trajectory changes according to law Then, since the derivatives of ρ and ϕ are one can easily obtain Equation (39).
Proof of Lemma 6. The values of risks on each segment are calculated as follows where and Now we have to minimize the sum of R * AC + R * CB . Firstly, let us find optimal times T 1 and T 2 : Using the condition Equation (A7), this equation takes a form To find the value T 2 that minimizes this sum, let us differentiate it by variable T 2 : Is is easy to find Thus T 1 equals According to the lemma conditions we have Now it is possible to use β m as a known value One can express ρ C and ϕ C from this system: After substituting Equation (A15) into Equation (A6) variables a and b take a form Now it is possible to rewrite Equation (A9), using Equations (A16), (A12) and (A11) to obtain As ϕ B −ϕ A ρ B −ρ A = tan β * , simplifying the equation above one can get the statement of the lemma.