1. Introduction
The active tracking of weak targets in the off-shore scenario has always been a hot issue of research in the field of underwater acoustics engineering [
1,
2,
3,
4]. Active sonar, which can simultaneously obtain the point measurements of the range and bearing of the target [
5,
6], is widely used for target surveillance and tracking in scenarios such as harbor and naval bases. A recent study [
7] compared the signal-to-reverberation ratio (SRR) of pulsed active sonar (PAS) and continuous active sonar (CAS) and analyzed the impact on detection probability. Under ideal SRR conditions, the PAS maintains a higher probability of detection and ranging accuracy [
8], while the CAS [
9] increases the number of continuous detection opportunities. For a lower SRR, it is difficult for active sonar to distinguish target echo signals and reverberation in both the time domain and frequency domain [
10].
In complex off-shore environments, due to the reflection of the objects, such as dams, ships, buoys, etc., and the multipath propagation, the number of high-energy clutters is numerous. Underwater target tracking methods combined with conventional threshold detection have been widely studied [
8,
10,
11] in this environment. However, for the low SRR conditions, this kind of method faces challenges [
12,
13]. Primarily, the weak target signals may be damaged or discarded after the high detection threshold, which leads to interruption or even missing target trajectories. On the other hand, the low detection threshold may cause plentiful false target echoes within a single measurement frame, which leads to error association with clutters of the target trajectory. Above all, the classic target tracking techniques are not reliable for the weaker target in complex underwater scenarios.
To resolve the problem, some methods that take non-thresholding measurements as input data for weak target tracking have been proposed [
14,
15,
16,
17]. This kind of approach exploits the complete received data to enhance the energy of the target echo through multiple continuous frames. It performs the tracking and detection simultaneously. The particle-filter-based approach [
15,
18] inherits the advantages of the particle filter that is not limited by the nonlinear and non-Gaussian conditions and has been widely applied to underwater signal processing [
19,
20,
21] in recent years. Duan et al. [
20] presented a multipath time delay tracking approach based on particle filtering by introducing the correlation function of the signal-related peaks into the measurement equation. The results indicated that it can track time delays effectively by taking advantage of the continuous evolution relationship of the correlations between multipath arrivals submerged by the background noise. Wei et al. [
21] combined a data fitting and particle filter to recursively estimate the joint multi-target probability density (JMPD) and achieved multi-target bearing tracking.
In recent years, some research has been conducted on active sonar tracking methods based on the Bayesian framework. Saucan et al. [
22] take into account the impulsive nature of active sonar signals in the prior information, proposing a robust tracking algorithm, and effectively tracking the direction of arrival (DOA) of multiple echoes. After that, they [
23] propose a Cardinalized Probability Hypothesis Density (CPHD) filter for tracking multiple distributed targets from impulsive observations. Zhang et al. [
24] designed low-power and high refresh rate active sonar signals and achieved effective tracking of weak targets. It can be concluded that the non-thresholding tracking method is suitable for tracking problems with weak target echoes.
It must be pointed out that the non-thresholding tracking method can be also ineffective when the reverberation appears as large patches. In general, these patches are relatively steady in adjacent frames and are the so-called steady component of reverberation. The rest is the dynamic component. Recent studies have revealed a strong connection of the separation of the steady component and the dynamic component with the matrix decomposition problem [
25,
26,
27]. By taking advantage of the strong reverberation correlation between adjacent detection frames, a reverberation-suppression method based on the low-rank and sparsity decomposition (LRSD) was proposed [
28]. It can effectively reduce the similar steady components of reverberation in multiple frames and has been widely applied in underwater acoustic engineering [
29,
30]. The non-zero entries contained in the sparse matrix after LRSD can be treated as the measurements for the non-thresholding tracking method to track the target in the reverberation that appears as large patches.
However, in a low SRR environment characterized by fluctuations in clutter quantity and distribution, the tracking method utilizing a sparse matrix may experience significant fluctuations in the posterior probability of the target existence. This can lead to interruptions or even the complete absence of the target trajectory. Aiming at designing a stable tracking method in this complex scenario, this work proposes an adaptive PF-based tracking method (APF) for joint reverberation suppression. The weak echo signals are separated from the steady component of reverberation by exploiting the LRSD to the multi-frame joint matrices. Then, the random dynamic reverberation and clutter are filtered out by utilizing the continuous evolution of the target over time in the state space. The prominent feature of this approach is introducing adaptive measurement likelihood ratio (AMLR) into the posterior update step, which solves the problem of unstable tracking due to the strong fluctuation in the number of false measurements per frame. Finally, the method is implemented by particle filtering.
The rest of this paper is organized as follows.
Section 2 introduces the methodology for the APF-tracker, especially providing the details of the measurements, state space model, and adaptive Bayesian filter theory. In
Section 3, the implementation detail and steps of particle filtering for the proposed method are presented. In
Section 4, a simulation study is conducted to evaluate the tracking performance of the proposed method. In
Section 5, the performance of the proposed method is validated through four experimental datasets carried out on the harbor scene. The conclusions are given in
Section 6.
3. Implementation with Particle Filter
In general, the posterior spatial PDF does not have an analytic solution and there-fore needs to be implemented numerically. The PF-based approaches have emerged as a potent tool in underwater signal processing, offering distinct advantages that are not constrained by the nonlinearity and non-Gaussian conditions. It approximates the posterior spatial PDF
by a set of weighted particles
, where
is the
ith particle,
denotes the corresponding normalized weight, and
. Then, the approximation of
can be expressed as
where
represents the Dirac delta function concentrated at
.
Draw a set of particles from which obeys the uniform distribution in measurement space. The algorithm is then implemented in steps as follows and summarized in Algorithm 1.
Predict: The particles are evaluated by the state transition model using Equation (1). This step creates a set of weighted particles
. Draw a set of newborn particles
from target birth density
at frame
k. Then, compute the posterior probability
with Equation (6). The predicted spatial PDF of target is approximated by the persistent and newborn particles, whose weights are given as
A newborn set of particles
is generated from the
at frame
k + 1. Union the particles as the following
where
denotes the total number of particles.
Update: From the observation model in
Section 2.2 and Equation (5), the likelihood
for each particle
and measurement
can be computed by Equation (13). Then, use the present particles to approximate the integration term in Equation (11) by
The posterior probability
is then updated with Equation (8), and each weight
is updated based on Equation (13) as
The particle weights are normalized with
Resample: Obtain a new set of particles with identical weights
by resampling from
We adopt the regularization strategy [
34] in resampling step to reduce the effect of sample impoverishment.
Output: For target tracking problems, set a reporting threshold
to determine whether the tracking is successful at the present frame. When the posterior probability
is higher than
, it is judged that the tracking is successful and outputs
and
. Subsequently, the estimated state of target can be calculated using
Repeating the above steps, the particle approximation of the posterior spatial PDF and the posterior probability for k = 1, 2, …, K is obtained.
Algorithm 1: Flow of Our Tracking Algorithm |
Initialization: , , , ; |
State Predict: |
1. Evolve particles with Equation (1), to obtain . |
2. Draw a set of newborn particles from . |
3. Compute with Equation (6). |
4. Compute the weights of particles with Equations (15) and (16) at k + 1: and . |
5. Draw newborn particles from at k + 1. |
6. Union the set of predict particles with Equation (17). |
Measurement Update: |
7. Compute the likelihood for each particle and measurement with Equation (12). |
8. Compute with Equations (11) and (18). |
9. Update with Equation (8). |
10. Update the weight of particles and normalize weights according to Equations (19) and (20). |
Resampling: |
11. Resample times from to obtain a new set of particles . |
Output: |
12. If , output the quantities , and , repeating the above steps. |
For off-shore short-range active sonar, real-time detection of the surveillance area is required. It is generally required to detect potential targets as soon as possible after each scan, so the computational efficiency of the algorithm needs to be considered. The computational complexity of this approach mainly involves two stages: the LRSD, and the target tracking algorithm. The computational complexity of LRSD based on ADMM [
33] is mainly contributed by singular value decomposition [
30,
35]. For the signal processing in
Section 2.2, the computational complexity of singular value decomposition is
, where
represents the singular value computational complexity. In Equation (A4), the computational complexity for calculating the low-rank matrix
L is
. In Equation (A5), the computational complexity of the sparse matrix
S is expressed as
. The computational complexity of the Lagrange multiplier
Y is
. The computational complexity of the Frobenius norm in iterative processing is
. In the preprocessing step of obtaining the sparse matrix
S, the computational complexity of one iteration processing is
The PF-based method is a dynamic and recursive algorithm whose computational complexity comes from particle state transition, the weights update, and resampling [
36,
37]. Adopting the common notation for computational complexity [
38], the computational complexity of the two tracking methods can be expressed as
where
represents the computational complexity of particle state transition,
N is the number of particles,
is the number of state dimensions,
is the computational complexity of weight update step,
is the number of measurements at
kth frame,
is the computational complexity of the resampling, and
K is the number of tracking frame.
4. Simulation Study
Virtual target tracking simulations were employed with different SRR conditions. By overlaying target echo signals of varying energies onto measured reverberation data, controlled SRR conditions for tracking experiments were simulated. This approach takes advantage of the reverberation background from the experimental data to evaluate the performance in a more realistic environment.
The background datasets of simulations are collected from a 64-elements uniform line array (ULA) in harbor scenes. The dataset consists of 332 continuous frames. These frames are processed using conventional beamforming. Each individual frame is structured as a matrix containing 192 grids of range and 241 grids of bearing. We have artificially added a target echo with a two-dimensional Gaussian energy distribution into each frame following the strategy outlined in [
28]. Let
represent the energy distribution of the target in the bearing-range spatial spectrum. We defined the SRR by
where
represents the bearing-range spatial spectrum energy matrix of the
kth frame and
represents the maximum value among all matrix cells.
According to the definition of Equation (24), the datasets containing target trajectories under different SRR conditions are established. These datasets will be used to validate the performance of the proposed method.
Figure 3 shows the bearing-range spatial-spectral of the same background data frame with SRRs of −5 dB and −15 dB, respectively. It can be observed that the background is filled with large patches of reverberation and clutter. In comparison to
Figure 3a, the target echo is barely observable in
Figure 3b.
According to [
29], setting the trade-off parameter
and the penalty factor parameter
in LRSD yielded favorable results in reverberation suppression.
Figure 4 depicts the reverberation suppression results (i.e.,
) for two consecutive frames with SRRs of −5 dB and −15 dB, respectively. All data have been normalized in terms of energy. The dynamic clutters result in a substantial number of false target echoes. Comparing
Figure 4a,b, it can be observed that LRSD effectively enhances the target echo signals and the fluctuations in the number and distribution of clutter between adjacent frames are not significant when the SRR is high. Conversely, in
Figure 4c,d, when the SRR decreases to −15 dB, the target echoes are faint, and the fluctuations of clutter become significant. It is evident that as the target echo becomes exceedingly weak, the reverberation suppression capability of LRSD decreases accordingly. In such cases, in addition to the steady component of reverberation, the high-energy dynamic components remain and mask the target echoes.
To run the PF-tracker and the APF-tracker, the parameters are set as follows: the specified probability
,
, and
, the number of particles for both trackers is 10,000, the number of newborn particles is 2000, and the reporting threshold
is set to 0.6. The target is moving based on the transition probability of model [
32]
and the state transition matrix for the nearly constant velocity model is
where
denotes identity matrix,
denotes the Kronecker product, and
of 1 s is the sampling period of the sensor. The target state vector is
. The variables
and
represent the target position. The
and
represent the target velocity. The covariance
Q of corresponding process noise
can be set as:
where
represents the intensity of process noise and the value is set to 10.
The measurement Equation, a fundamental component of the nonlinear underwater target tracking model, establishes the mapping between the predicted target state and the measurement space. It can be expressed as follows:
and the measurement noise covariance matrix
. The
of 3.85 m and
of 0.5° denote the standard deviation of the range and bearing measurement, respectively.
For each group of tracking experiments, the target trajectory of 60-frame is added to the continuous reverberation background. In addition to the difference in SRR, the target trajectory is also uncertain for each experiment. The generation of target trajectories is subject to the following constraints: (1) The 60-frame trajectory of the target is always within the sonar detection range. (2) The target state equation follows Equation (25). (3) The initial velocity of the target is within the range of (4 m/s, 6 m/s).
Figure 5 illustrates the true trajectory of the target and the corresponding tracking results by the two methods.
Figure 5a shows the accumulation of all clutter in
for 60 frames overlaying the true target trajectory.
Figure 5b shows the tracking results of the APF-tracker and PF-tracker with SRR of −5 dB, where the value of bearing-range points on trajectories are presented by posterior probabilities. The PF-tracker misses one frame trajectory marked by the white rectangle. The matrixes
shown in
Figure 4a,b corresponds to the previous frame and this missing frame, respectively. Due to significant changes in the number and distribution of clutter from the 13th frame to the 14th frame, the posterior probability of the PF-tracker rapidly decreases. In contrast, the posterior probability of APF-tracker does not show a significant decrease. Moreover, the overall posterior probability of the APF is higher than that of the PF as shown by the brighter intensity of the target trajectory. The tracking results with the SRR of −15 dB are shown in
Figure 5c. The strong fluctuations in clutter occur after applying LRSD for reverberation suppression, as shown in
Figure 4c,d. In this condition, the posterior probability of the PF-tracker exhibits significant fluctuations, leading to the continuous multi-frame missing trajectory (marked by the white rectangle). In contrast, the APF-tracker reliably tracks the target and maintains a higher overall posterior probability.
Figure 6 analyzes the relationship between the posterior probability and the parameters in two tracking experiments. The APF-tracker adaptively updates the false alarm parameter
based on the number of false echoes
as demonstrated in Equation (10). Therefore, the measurement likelihood ratio
adjusts correspondingly, which alleviates the fluctuation of the posterior probabilities
. In contrast, the ratio
of PF-tracker almost remains unchanged resulting in the drop of the
with the increase of
. For example, the ratio
of the APF-tracker decreases at the 14th frame when the clutters
leap sharply, while the ratio
of the PF-tracker has no reaction. Above all, the APF-tracker demonstrates a superior ability of adjustment based on the measurements, leading to more complete tracking trajectories and higher posterior probabilities. Furthermore, as shown in the bottom panels of
Figure 6, the APF-tracker demonstrates superior convergence speed and stability in terms of the parameter
when compared to the PF-tracker.
The performance assessment of the two tracking methods is conducted through Monte Carlo simulations, employing the aforementioned scenario and parameters. The statistic results are compared using box plots, as shown in
Figure 7. The APF-tracker maintains overall higher and more stable posterior probabilities under both SRR conditions, while the PF-tracker experienced a significant performance degradation at SRR of −15 dB. Specifically, the APF-tracker exhibits higher median and mean posterior probabilities by about 6%, indicating superior overall tracking performance compared to the PF-tracker. The interquartile ranges (IQR) of the APF-tracker are narrower than those of the PF-tracker, indicating a more concentrated distribution of posterior probabilities. Therefore, the APF-tracker is expected to be more stable. Furthermore, the APF-tracker exhibited fewer outliers (the red plus signs), also indicating its higher stability. Based on the above analysis, the APF-tracker is more suitable for the low SRR environment with the fluctuation of clutter number and distribution.
The two-sample
t-test [
39] is a robust and unbiased method for conducting statistical hypothesis testing. This test evaluates whether the mean and median of posterior probabilities generated with the APF and PF trackers are statistically equivalent. The null hypothesis posits no difference in the mean and median values of the posterior probabilities between the two trackers. Distinguishing the significant differences between the two data groups under the given significance level of 0.05. The analysis is conducted under two SRR conditions: −5 dB and −15 dB. At SRR = −5 dB, we observed
p-values of 0.0182 for the mean and 0.0212 for the median. Similarly, at SRR = −15 dB, the
p-values for the mean and median are 0.0005 and 0.0011, respectively. These results indicate statistically significant differences in both the mean and median of the posterior probabilities generated by the APF and PF trackers under varying SRR conditions.
Monte Carlo experiments are conducted with SRR ranging from −25 dB to 0 dB. For each tracking experiment, when the number of frames with
accounted for more than 80% of the total frames, the track is denoted as successful. The success rate of tracking
is then defined as the ratio of the successful tracking. The results are presented in
Table 1 with different SRRs. When the SRR is below −25 dB, both tracking methods fail. When the SRR is greater than −2.5 dB, both methods achieve a 100% success rate. As the SRR decreases from −5 dB to −15 dB, the success rate of the APF-tracker surpasses that of the PF-tracker with an increasing difference. Notably, at an SRR of −15 dB, the APF-tracker exhibits a 40% higher success rate compared to the PF-tracker.
Figure 8 illustrates the true trajectory of three targets and the corresponding tracking results using the two methods at SRR of −10 dB. The speed of targets 1 and 2 is about 4 m/s, and the speed of target 3 is about 0.6 m/s.
Figure 8a shows the true trajectories (solid green lines), as well as the tracking results obtained using the APF method (red trajectory) and the PF method (blue trajectory). The starting positions and movement directions of the targets are visually depicted by black arrows.
Figure 8b showcases the posterior probability outcomes for both methods, with
denoting the posterior probability of the
lth target. To enhance the visibility of the trajectory for the slow-moving Target 3, we increase the value of joint frame
during the LRSD processing. Additionally, we adjusted Constraint 3 by modifying the speed range to 0.5 m/s–6 m/s.
The results demonstrate that APF achieves better convergence and trajectory stability than PF for all three targets. Specifically, the PF experiences one frame loss in tracking Target 1 at frame 39, while the APF tracks the trajectory successfully. From frames 48 to 60, the PF suffered continuous multi-frame trajectory loss for Target 2 (marked by rectangle), while the APF was only missing one frame. Moreover, the PF method exhibits more pronounced posterior probability fluctuations for the slow-moving Target 3 compared to APF. These findings highlight the superior performance of APF over PF in terms of convergence speed and trajectory stability for multiple target tracking.
5. Experimental Results and Discussion
In this Section, we evaluate the performance of the proposed method using four experimental datasets of active sonar captured during May 2020 in a shallow water harbor. The active sonar system was deployed at a depth of 5 m below the surface, while the average depth of the surveillance area was approximately 12 m. The active transmission signal utilized was LFM with a modulation band ranging from 50 kHz to 70 kHz. The active transmission signal had a period of 1 s with a duration of 40 ms. The horizontal array consisted of 64 elements with a uniform interval of 0.012 m. The sampling frequency is 200 kHz. The target traveled back and forth at a depth of approximately 5 m. Each frame of received data underwent processing via conventional broadband beamforming (CBBF). The tracker parameters setting for the experimental datasets remained the same as discussed in
Section 4.
Figure 9 presents the results of the two tracking methods on dataset A.
Figure 9a displays the pseudo-color image accumulation of 60 frames after reverberation suppression, where the target trajectory (highlighted by a red rectangle) cannot be distinguished clearly from the cluttered background due to the presence of high-energy clutter. Two suspected target trajectories (highlighted by white ellipses) formed by high-energy clutter are relatively more prominent.
Figure 9b depicts the tracking results of the PF-tracker and APF-tracker, respectively. In the tracking results of the PF-tracker, there are two noticeable trajectory interruptions (highlighted by white rectangles). In contrast, the APF-tracker performs complete tracking of the target trajectory. Besides, the APF-tracker initiates the tracking at a further range of 712 m compared to the 677 m of the PF tracker. It should be noted that the false target trajectories in
Figure 9a are caused by random dynamic clutters and do not have continuous spatial-temporal evolution characteristics. Therefore, both trackers effectively filter them out.
The
curves presented in
Figure 10 display pronounced fluctuations between frames. There are 10 frames with an
value over 200, indicating that the dynamic clutter is still evident after reverberation suppression. Consequently, the performance of the PF-tracker significantly deteriorates at these frames. Specifically, from frame 28 to frame 34, the
of the PF-tracker decreases sharply, which leads to a trajectory missing in the corresponding frames of
Figure 9b. By comparing the tracking results from frame 73 to frame 97, it can be observed that the fluctuation of
has a more significant degradation on the
of the PF-tracker. Furthermore, compared with the APF-tracker, the
of the PF-tracker exceeds the tracking threshold after a delay of nine frames, which indicates its convergence speed is inadequate. In conclusion, the APF-tracker demonstrates stronger applicability in the reverberation environment with strong fluctuations of clutter compared to the PF-tracker.
Figure 11 exhibits the comparison of tracking trajectories by the two methods across four experimental datasets. It is noteworthy that despite conducting the experiments within the same region, the clutter background in each dataset varies due to the time-varying characteristic of the underwater environment. Furthermore, the speed and direction of the target differ across each experiment. The dashed rectangles in the Figure show the locations where trajectory missing occurred with the PF-tracker in each dataset. It is evident that the PF-tracker exhibits insufficient tracking stability in this shallow water environment. As shown in
Figure 11a,b, the APF-tracker demonstrates superior continuous tracking performance when the target motion state is relatively complex. Specifically, when the motion direction of the target changes (marked by circles), the corresponding tracking trajectory of the APF-tracker changes more promptly than that of the PF-tracker. Additionally, due to the faster convergence of the
pk value in the APF-tracker, it can track the target trajectory earlier.
Table 2 presents comprehensive details and the ratio
of successfully tracked frames to the total frames in each dataset. Based on the results provided in
Table 2, the APF-tracker demonstrates an overall tracking performance that exceeds that of the PF-tracker by more than 14% in each dataset. In summary, the APF-tracker exhibits substantial superiority over the PF-tracker when large amounts of false target echoes are caused by high-energy dynamic reverberation.