A Novel iTransformer-Based Approach for AIS Data-Assisted CFAR Detection

Suo, Yongfeng; Yuan, Zhenkai; Cui, Lei; Li, Gaocai; Sun, Mei

doi:10.3390/jmse13081475

Open AccessArticle

A Novel iTransformer-Based Approach for AIS Data-Assisted CFAR Detection

by

Yongfeng Suo

¹,

Zhenkai Yuan

¹,

Lei Cui

^1,*

,

Gaocai Li

¹

and

Mei Sun

²

¹

Navigation College, Jimei University, Xiamen 361021, China

²

School of Computer and Information Engineering, Xiamen University of Technology, Xiamen 361024, China

^*

Author to whom correspondence should be addressed.

J. Mar. Sci. Eng. 2025, 13(8), 1475; https://doi.org/10.3390/jmse13081475

Submission received: 27 June 2025 / Revised: 21 July 2025 / Accepted: 25 July 2025 / Published: 31 July 2025

(This article belongs to the Section Ocean Engineering)

Download

Browse Figures

Versions Notes

Abstract

Detection of small vessels is of great significance for maritime safety assurance, abnormal vessel tracking, illegal fishing supervision, and combating smuggling. However, the radar reflection intensity of small vessels is low, making them difficult to detected with the radar’s constant false-alarm rate (CFAR) algorithm. To enhance the detection capability for small vessels, we propose an improved CFAR scheme. Specifically, we first compared traditional CFAR processing results of radar data with automatic identification system (AIS) data to identify some special targets. These special targets, which possessed AIS information, but remained undetected by radar, enabled an iTransformer model to generate more reasonable CFAR threshold adjustments. iTransformer adaptively lowered the threshold of the areas around these targets until they were detected by radar. This process made it easier to discover the small boats in the surrounding area. Experimental results showed that our method reduces the missed detection rate of small vessels by 73.4% and the false-alarm rate by 60.7% in simulated scenarios, significantly enhancing the CFAR detection capability. Overall, our study provides a new solution for ensuring maritime navigation safety and strengthening illegal supervision, while also offering new technical references for the field of radar detection.

Keywords:

CFAR; AIS; vessel detection; iTransformer

1. Introduction

The accurate detection of small vessels is of paramount importance for maritime security, fisheries management, and collision avoidance [1]. These small vessels are characterized by small radar cross sections, highly random navigation trajectories, and unpredictable appearance patterns. Some even purposefully evade monitoring. These characteristics collectively exacerbate the challenge of supervising such vessels. To address this issue, existing detection methods for small vessels mainly include CFAR detection, infrared thermal imaging, optical camera imaging, and synthetic aperture radar (SAR) [2,3,4]. However, these detection methods are still prone to missed detections when applied to the inspection of small vessels. Among these methods, the CFAR algorithm has been widely used due to its cost-effectiveness, all-weather operability, illumination independence, and long detection range. Thus, CFAR is particularly suitable for long-term and large-area vessels monitoring in fixed sea regions. Improving CFAR detection accuracy for small vessels can effectively enhance maritime safety supervision capacity.

1.1. Related Work

Current detection methodologies for small vessels primarily rely on physical sensing and information fusion technologies. The constant false-alarm rate (CFAR) algorithm dominates radar-based detection due to its all-weather operability and cost-effectiveness, yet it remains susceptible to sea clutter interference in complex environments, leading to missed detections [5]. Synthetic aperture radar (SAR) enables high-resolution vessel contour identification, but suffers from prolonged data update cycles and elevated false-alarm rates in nearshore scenarios [6]. For optical/infrared sensing, optical cameras excel in nearshore surveillance due to high resolution and low cost, though their performance is constrained by weather and lighting conditions [7]. While infrared thermal imaging ensures all-weather detection, its high hardware costs hinder large-scale deployment. Automatic identification system (AIS)-based fusion techniques enhance identification accuracy by cross-correlating signal data with physical detections [8], yet they depend on equipment installation rates and fail to monitor “silent” vessels. In summary, existing methods involve trade-offs among detection efficiency, environmental adaptability, and cost. Overall, CFAR remains the cornerstone for long-term, continuous monitoring of small vessels in fixed maritime areas, balancing practicality and reliability.

The traditional CFAR algorithm estimates the power threshold of surrounding reference cells to determine target presence. Depending on the threshold calculation approach, various CFAR variants have emerged, each tailored to specific sea conditions [9]. The cell averaging (CA) CFAR offers computational simplicity and performs satisfactorily in homogeneous environments. Building on CA CFAR, the smallest-order cell averaging (SOCA) CFAR mitigates strong target interference [10]. It does so by selecting the minimum value from forward/backward training cells. In contrast, the greatest-order cell averaging (GOCA) CFAR divides reference cells into two segments [11]. It uses the maximum value for noise power estimation, which is crucial for clutter-edge scenarios. These improvements have yielded distinct benefits: SOCA has reduced the missed-detection rate, while GOCA has lowered the false-alarm rate. The ordered statistics (OS) CFAR further enhances robustness by sorting reference cells and selecting the median value, though at the cost of higher computational complexity [12]. Overall, these CFAR variants excel in their designated scenarios. However, detection performance deteriorates significantly when applied to mismatched conditions, highlighting their limited adaptability and generalizability in dynamic, complex environments [13,14,15].

To enhance the CFAR algorithm’s ability to dynamically adjust thresholds based on environmental factors and improve small-vessel detection, recent studies have increasingly leveraged deep learning for optimization. In 2019, Chia-Hung Lin et al. proposed DL-CFAR (deep learning-based CFAR), aiming to address the masking effect of CA-CFAR and its variants in multitarget scenarios with lower computational cost [16]. The algorithm uses a deep learning model to learn target structures in distance-Doppler (RD) graphs, removes target patterns to derive pure noise graphs, and thus estimates noise levels more accurately for robust CFAR detection. Subsequently, in 2022, Tzvi Diskin and colleagues introduced CFARnet, a framework that theoretically proved the equivalence between CFAR-constrained Bayesian optimal detectors and the generalized likelihood ratio test (GLRT) [17]. Experiments showed that CFARnet achieved flexible trade-offs between false-alarm rates and detection accuracy. In 2024, Ignacio Roldan et al. developed a data-driven radar detector inspired by computer vision using a 2D CNN backbone [18]. This method employs cross-sensor supervision and unlabeled radar–lidar data to overcome traditional CFAR limitations in complex urban environments with extended targets. These studies demonstrate that deep learning effectively enhances CFAR’s feature extraction and environmental adaptability, improving small-target detection and system robustness. However, current methods still face challenges such as high data dependency, high annotation costs, and limited adaptability to non-uniform environments.

A cutting-edge model for multivariate time-series analysis, iTransformer demonstrates exceptional capabilities in processing multivariate data and conducting robust feature correlation analysis [19]. This implies that it can comprehensively analyze the complex interrelations among various influencing factors in dynamic marine conditions, thereby exhibiting superior adaptability and generalization. This architecture inherits the fundamental modules of the transformer framework, yet it redefines the framework’s design to suit the unique characteristics of time-series data [20]. For multivariate time-series analysis, iTransformer employs a variable-centric strategy. The model captures inter-variable correlations via self-attention mechanisms while leveraging temporal features encoded by the feedforward network to facilitate prediction [21]. This integrated design endows iTransformer with exceptional adaptability to diverse environments, rendering it highly effective in handling non-uniform data.

To enhance the CFAR algorithm’s applicability across complex sea conditions and improve small-vessel detection, we propose an iTransformer-based CFAR improvement that integrates AIS as prior knowledge. Specifically, this method constructs a weakly supervised evaluation index by comparing inputs and outputs of the model to form the loss function, and introduces the Gumbel-softmax algorithm to regulate training dynamics. iTransformer is leveraged to analyze the complex interrelations among diverse sea conditions, radar waveforms, raw CFAR outputs, and AIS data, thereby deriving an adaptively adjusted CFAR threshold. This threshold empowers the CFAR algorithm to achieve more effective small-vessel detection.

1.2. Contributions

To enhance CFAR with iTransformer and integrate AIS as prior knowledge, we conducted the following key work.

To adapt to real-world scenarios, we performed a comparative analysis between the model inputs (radar signals, AIS data, enhanced CA-CFAR detection results, and spatial encoding) with the improved CFAR threshold outputs. This analysis yielded a weakly supervised evaluation index for constructing the loss function. The design enabled the model to prioritize targets with AIS data, but missed by radar, adaptively reducing detection thresholds around these targets to enhance the detection of other small vessels.
To avoid falling into local optima and neglecting less influential factors during training, we employed the Gumbel-softmax annealing algorithm. This algorithm regulated the relaxation degree throughout training, enabling extensive exploration in the early stage and focused optimization in the later stage.

2. Methods

The proposed improved CFAR method based on the iTransformer model is shown in Figure 1. This method mainly consists of three parts: the data part, the simulation scene construction part, and the model training part. In the data processing stage, we applied cubic spline interpolation to AIS data to align it with radar data, ensuring that there were corresponding AIS data every minute. Additionally, we stochastically generated position coordinates for AIS-unannotated vessels, which were incorporated into radar waveform modeling. These AIS-unannotated small vessels served as the reference for final validation of the model’s detection performance. In the simulation scenario construction, we modeled sea clutter, obstacle echoes, and vessel echoes based on radar echo simulation principles, then combined and rasterized these waveforms into radar data. The radar waveforms were processed by the original CFAR, and the CFAR processing results and thresholds obtained were encapsulated with the radar and AIS data as the model input. In the model training phase, we augmented the input data with two-channel spatial position information and employed the iTransformer to capture complex correlations among multiple variables in the input data. The iTransformer model output a two-dimensional adaptive threshold distribution. Then, we designed a weakly supervised evaluation system as the loss function to compare the input and output, enabling the model to continuously iterate in the desired learning direction in the absence of labels. Finally, we used the Gumbel-softmax annealing algorithm to adjust the relaxation degree of the entire learning process of the model.

2.1. Construction of Simulated Scenes

We constructed a simulation scenario to facilitate the easier adjustment of parameters for control experiments and verification of model performance. The complete radar echo simulation process is shown in Figure 2. Specifically, we first introduced actual AIS data as the spatial coordinates of vessels with AIS. Then, based on the Monte Carlo method, we added a certain number of obstacles and vessels without AIS information to the samples at each time point. By modeling the radar echoes of vessels, obstacles, and sea clutter, we simulated real radar echo signals as realistically as possible. In addition, we also obtained the threshold and processing results of CFAR before improvement through CA-CFAR processing, which is conducive to helping the model understand how to improve the threshold distribution. The data generated by the above process were rasterized and input as the dataset of the model. The simulated experimental scenario was structured around the core logic of “data-driven modeling–multimodal signal synthesis–spatiotemporal dynamic alignment”, achieving faithful replication of complex marine environments through in situ measurements and physical modeling integration.

The simulated experimental scenario was structured around the core logic of “data-driven modeling–multimodal signal synthesis–spatiotemporal dynamic alignment”, achieving faithful replication of complex marine environments through the integration of in situ measurements and physical modeling. The complete simulation process of the radar echo is shown in Figure 2 [22,23,24]. Sea clutter signal generation is grounded in the statistical properties of empirical data, utilizing a hybrid Rayleigh-K distribution model: Rayleigh distributions simulate homogeneous clutter in high-sea-state regions, while K distributions are introduced in inshore reef areas to characterize spiky amplitude statistics. The texture component is generated via a gamma distribution (parameters co-regulated by topographic gradients and sea state indices), and the speckle component employs zero-mean complex Gaussian processes to model microscale scattering stochasticity. The superposition of the K-distribution and the Rayleigh distribution is shown in Figure 3 [25]. Obstacle echo modeling relies on the geometric theory of diffraction (GTD), partitioning the near-field diffraction zone, interference zone, and far-field shadow zone. Using the method of moments (MoM), the Doppler shift–incidence angle relationship of obstacle scattering fields at the X-band is calculated to physically model multipath interference, interference fringes, and clutter attenuation effects [26]. Since this paragraph pertains to the construction of the simulation environment, and the important theories involved have little relevance to the main innovations of this paper, they will not be elaborated on in the main text. The relevant theoretical content can be found in Appendix A.

2.2. Early Iterative Ideas and Limitations

In earlier work, we proposed using AIS as prior knowledge to adjust CFAR thresholds. During original CFAR detection, the monitored sea area is gridded into cells. The detection threshold for each cell is calculated by referencing the average radar power in a surrounding region. Consequently, adjacent cells share most reference cells during threshold calculation: if one cell has an excessively high threshold, its neighbors often show similar overestimation. When AIS information exists in a cell, but no radar target is detected, this indicates excessively high thresholds in that cell and its vicinity. Thus, we iteratively reduced the thresholds in this region until radar targets were detected and matched with AIS data, as shown in Figure 4. The adjusted thresholds around the cell then became more reasonable, facilitating the identification of AIS-unannotated small vessels.

However, this approach introduced three key challenges. First, defining the spatial scope for threshold reduction was challenging, i.e., how to mathematically characterize the adjustment region. Second, determining the reduction strategy—fixed value across the region or greater reduction near its center—was non-trivial. Finally, 2D CFAR algorithms required accounting for both distance and direction of reference cells, complicating spatial correlation modeling. In summary, relying on simple iterative models for threshold adjustment is subjective and theoretically unsound. Empirically setting model parameters may yield acceptable performance in specific scenarios, but fail entirely in novel contexts. However, this iterative idea itself can effectively enhance the detection capabilities of small ships. Thus, we employed an iTransformer neural network to learn the complex relationships among historical radar data, AIS data, and CFAR detection results. iTransformer has excellent capabilities in analyzing multiple couplings and perceiving spatial characteristics, and can better utilize the laws of analyzing sea conditions to obtain a more reasonable threshold distribution.

2.3. Model Training

We implemented three key improvements to adapt the iTransformer model for non-AIS-transmitting vessel detection scenarios. The architecture of the improved iTransformer model is shown in Figure 5. First, we incorporated spatial position information to help the model learn the complex connections of multichannel input information in the spatial domain. Second, we designed a weakly supervised composite evaluation system to address the shortage of non-AIS-transmitting vessel samples. We constructed the constraint and parameter terms using the general evaluation criteria for radar data detection. These terms were derived through input–output analysis and comparison, reducing reliance on label data. Finally, we adopted the Gumbel-softmax annealing algorithm to achieve differentiable continuous relaxation. By dynamically adjusting the temperature parameter, the algorithm balanced the model’s exploration and convergence capabilities, effectively addressing the problem of traditional methods being prone to local optima during the initial training phase.

2.3.1. Spatial Position Enhancement Mechanism

We added two channels of spatial position encoding information to the original four-channel input variables before flattening, recording the horizontal and vertical coordinates of each grid cell. This ensured that spatial coordinate information was retained even after flattening The multi-head self-attention mechanism can analyze the spatial information of these two channels as variables and establish cross-modal correlations among radar data, AIS information, and spatial coordinates. The spatial position encoding method is shown in Figure 6.

2.3.2. Parameter Term of Weakly Supervised Evaluation System

We constructed a loss function framework that was independent of strong labels by fusing prior AIS knowledge and radar detection results, enabling adaptive optimization of CFAR thresholds. Focused on core detection scenarios with valid AIS data (AIS = 1), the framework guided the model to learn complex spatiotemporal correlation features without labeled data through differentiated reward–penalty mechanisms and dynamic constraint strategies. The weakly supervised evaluation system uses a loss function with synergistic parametric and constraint terms based on four typical detection scenarios for AIS = 1. The parametric terms quantify model performance by defining four scenarios for AIS-enabled vessel detection logic, as follows.

Correct detection maintenance (case 1): When both pre- and post-improvement CFAR detection results are “1” and AIS = 1, the model stably identifies known vessels. A mild reward weight (0.005) reinforces detection consistency to avoid erroneous elimination of valid targets.
Missed-detection correction (case 2): If the post-improvement detection is “1,” pre-improvement detection is “0,” and AIS = 1, the model corrects the missed detection of weak-signal targets by traditional CFAR. A high reward weight (1.1) prioritizes optimizing such critical scenarios.
Uncorrected missed detection (case 3): When both pre- and post-improvement detections are “0” and AIS = 1, the model fails to detect AIS-enabled vessels. A penalty weight (−1.2) forces the model to enhance sensitivity to low-signal-to-noise-ratio targets.
Erroneous missed detection (case 4): If the post-improvement detection is “0,” pre-improvement detection is “1,” and AIS = 1, the model erroneously eliminates valid detections. The strongest penalty (−1.7) suppresses such severe misjudgments.

The parametric loss is calculated by summing the product of scenario-averaged detection probabilities and weights, defined as:

L_{scene} = \sum_{j = 1}^{B} \sum_{i = 1}^{4} w_{i} \cdot p_{i, j}

(1)

Here,

B

represents the batch size,

w_{i}

corresponds to the weights of the four scenarios, and

p_{i, j}

denotes the detection probability of the j-th sample in the i-th scenario. This design forces the model to prioritize the optimization of the detection stability and missed-detection correction ability for vessels enabled with AIS through the preset reward and punishment logic in the direction we hope for improvement.

2.3.3. Constraint Term of Weakly Supervised Evaluation System

We introduced an exponential constraint term into the system to maintain the target count within a fixed range of

(m, n)

. This approach avoids extreme false alarms or missed detections caused by threshold fluctuations. The range

(m, n)

was determined based on the average number of AIS-containing cells throughout the training process for computational convenience. The constraint term formula is as follows:

f (x) = \{\begin{array}{l} \max (1.0 - k \times {(m - x)}^{1.5}, - 1.0) & if x < m \\ 1.0 & if m \leq x \leq n \\ \max (1.0 - k \times {(x - n)}^{1.5}, - 1.0) & if x > n \end{array}

(2)

The mathematical function graph of the constraint term is shown in Figure 7. When the detection count falls within the range of

(m, n)

, the constraint term considers the target count as expected and gives a reward (value of 1). When the count is out of this range, the constraint term decreases, and the further it deviates, the faster the decrease becomes, until it reaches the threshold value of −1. This mechanism not only guides the model to constrain the number of detection results but also tolerates a certain degree of deviation in target detection.

2.3.4. Constraint Term of Gumbel-Softmax Annealing Algorithm

We introduced the Gumbel-softmax annealing algorithm [26,27,28] to avoid the local optimum of the model caused by the weakly supervised evaluation system. This approach dynamically adjusted the temperature parameter τ to balance the model’s exploration and convergence capabilities. Its mechanism was deeply integrated into the model architecture to form a “temperature–attention–evaluation” collaborative optimization loop. The specific implementation methods were as follows.

During the initial training phase, a high-temperature strategy (

τ

= 1) was adopted to make the decision distribution nearly uniform, forcing the model to widely explore potential correlations between radar echoes and AIS data. In the middle phase, an exponential decay mechanism (

τ

=

τ_{0} \cdot

0.95ᵗ) was introduced to gradually reduce randomness and focus on stable feature correlations. In the later phase, a dynamic response mechanism monitored real-time metrics such as evaluation loss change rate and attention entropy. The formula for local temperature compensation coupling is:

τ_{effective} = τ \cdot σ (w \cdot P + b)

(3)

A temperature regularization term was introduced to constrain the parameter optimization direction and avoided abnormal temperature fluctuations:

L_{reg} = | τ - τ_{optimal} |^{2}

(4)

Here,

τ_{optimal}

represents the theoretical optimal temperature statistically derived from historical data. Minimizing the regularization term ensured that the temperature parameter converged to a reasonable interval. This balance between the model’s exploration and convergence capabilities was achieved by this optimization. In simulated scenarios, the approach also enabled robust threshold adjustment and detection accuracy. The degrees of influence of case 1 and case 2 on the training process during the process from high temperature to low temperature are shown in Figure 8.

3. Experiments

3.1. Study Area and Experimental Setup

We selected a real-world AIS dataset covering the northern South China Sea (17.793918° N–32.495048° N, 106.4104° E–124.888187° E), focusing on vessel movement characteristics in the Taiwan Strait (a critical East Asian shipping lane) and the southeastern East China Sea–northern South China Sea traditional fishing grounds. The research area—a 10 nm² sea zone (115.64428° E–115.65263° E × 25.14915° N–25.1575° N)—was determined via random sampling to feature moderate vessel density. The AIS data spanned 1–17 December 2024.

The experimental dataset comprised 23,651 independent samples, each corresponding to a marine monitoring scenario at a specific time. Each sample’s six-channel input matrix included mean radar power, CFAR detection thresholds and results, AIS vessel status, longitude–latitude encoding, and topographic feature encoding. To ensure the model fully learned complex inter-channel couplings while avoiding overfitting from temporal continuity, a non-temporal random partitioning strategy was adopted. The dataset was divided into a training set (2556 samples), validation set (548 samples), and test set (547 samples): 70%, 15%, and 15%, respectively. Stratified sampling was used during partitioning to maintain consistent ratios of AIS-present and AIS-absent vessels across datasets.

To validate the rationality of the results, we conducted six repeated experiments on the same dataset. Given that all samples were processed out of order during both dataset partitioning and model training, these experiments can be treated as six independent trials. The dataset included 23,651 sample points (collected at one-minute intervals), with each training cycle comprising 32 samples—a total of 740 training cycles.

3.2. Comparison of the CFAR Algorithm Before and After Improvement in a Single Sample

As shown in Figure 9, we selected a sample at 14:00 on 8 December 2024 to visually demonstrate the CFAR threshold changes before and after optimization. Compared with the original CFAR threshold, the optimized one shows more distinct distribution features and larger differences in adjacent thresholds. Specifically, significant threshold variations are observed around predefined vessel positions when compared with preassigned coordinates. Analysis of CFAR detection results indicates that the optimized method has significantly alleviated the missed-detection and false-alarm issues present in the original approach.

3.3. Analysis of the Changing Trends of Parameter Terms

As previously discussed, constructing the loss function requires discussing the parameter terms (i.e., the four cases). Figure 10 analyzes the trends of these parameter terms across six training processes. To facilitate observation amid numerous training steps, we averaged values every ten steps before plotting the line graph. Specifically, while correct detection maintenance (case 1) exhibits significant fluctuations, its overall trend remains stable. Uncorrected missed detection (case 3, representing unimproved missed detections post-optimization) and erroneous missed detection (case 4, indicating aggravated missed detections) both decrease markedly after learning, stabilizing near zero by the 400th training step. For missed-detection correction (case 2, reflecting improved missed detections), the number of successful corrections was low initially, but stabilized at slightly below five per sample as training progressed. This aligns with the experimental design of adding six non-AIS vessels per sample, verifying the model’s effectiveness in reducing missed detections. Notably, case 2 counts decreased moderately during mid-training, likely because the higher penalty coefficient for case 4 prioritized avoiding its occurrence. These patterns were consistent across all six experiments, with parameter term trends remaining uniform. This demonstrates that the model is resilient to temporal shuffling and exhibits robust stability.

3.4. Analysis of the Changing Trends of Constraint Terms

We added the previously mentioned constraint term to maintain the quantity of the test results within a reasonable range. Given that the average number of AIS targets across all samples was 5.874, we set the constraint range (m, n) to (5, 12). As illustrated in Figure 11, the initial number of detected targets frequently exceeds the upper limit of this range. However, as training progresses, the count stabilizes within (5, 12), validating the constraint term’s effectiveness during training.

3.5. Analysis of Model Learning Trends

Averaging the CFAR thresholds of all cells across pre- and post-improvement model outputs reveals the overall CFAR threshold pattern, as depicted in Figure 12. This average difference partially reflects the model’s learning of radar echo power patterns in the sea area. Specifically, compared with the initially preset obstacle power distribution, the improved CFAR algorithm yields threshold distributions that more clearly align with actual obstacle locations. Before the improvement, the central sea area showed higher power values in CFAR threshold calculations due to the inclusion of multiple obstacle regions in the reference cells. This inconsistency with actual obstacle distribution was resolved by the improved CFAR algorithm, further verifying the model’s superiority in learning spatial radar echo distribution patterns. However, a notable spatial deviation exists between obstacles in the lower-right sea area and the corresponding high-power regions of the improved radar threshold. This suggests that the model’s learning of echo patterns may be influenced by extraneous factors or that minor local optima occurred during training—both warranting further investigation.

The statistics of the CFAR target detection results and error rates are shown in Table 1 and Table 2. The improved CFAR reduced the average number of cells that were undetected and missed detections from 3.7152/4.1379 to 1.2461/1.1012. The false-alarm rate dropped from 0.794% to 0.312% and the missed-detection rate from 1.034% to 0.275%, showing 60.7% and 73.4% improvements. These results demonstrate the improved CFAR’s significantly enhanced detection accuracy and reliability.

4. Discussion

The experimental results show that the iTransformer model with AIS data information added can effectively enhance the detection ability of the CFAR algorithm for small vessels. The variations in various parameters in the experiment and the overall trend of the improved CFAR thresholds all conform to our expectations. However, several areas for improvement remain in the experimental process.

4.1. Real-World Validation

This experiment was trained and validated in a simulated scenario to enable low-cost annotation for large-scale training. However, simplified sea clutter modeling in simulations fails to fully reproduce the sea clutter in the real scene, and the experiment omitted minor random factors, introducing uncertainties for real-scenario model deployment. To address these limitations, many of the existing similar studies on simulated scenarios chose to conduct predefined experiments in small-scale real scenarios to validate model generalization and robustness [29,30,31]. The addition of real data can make the parameter settings of the model more reasonable. In future work, we will also deploy experiments in a smaller range of real scenarios, using test boats as label targets., thereby enhancing the credibility and authenticity of the experiment.

4.2. Model Training Speed Optimization

The sea area selected in this experiment is moderate and the divided grids are relatively sparse, which can ensure that the sample size of each training sample is within a reasonable range. However, when it is necessary to monitor a broader sea area or divide denser grids, the amount of sample data will significantly increase and the training speed of the iTransformer model will drop sharply. Aiming at the problem of the training speed of similar deep learning models, the current research mainly realizes the improvement in the training speed at the software level through two methods: parallelization strategy coupled with fragmentation optimization technology [32] and data distillation technology [33]. In future research, we will add model parallelism, data parallelism, and gradient compression functions to accelerate the training speed.

4.3. Enhance the Local Detection Ability

We compared the CFAR variation trend with the energy distribution of predefined obstacles and observed local deviations, misalignments, or diffusion despite overall distribution consistency. This may be attributed to the model prioritizing global features at the expense of local accuracy. Existing studies have shown that pyramid attention mechanisms [34], Fourier spectrum gating [35], and adaptive fragmentation strategies [36] effectively address such issues. In future research, we will focus on adaptive fragmentation strategies, implementing finer-grained segment division for abrupt change regions to balance global–local influence and enhance local precision.

5. Conclusions

This study used an iTransformer-based CFAR threshold optimization method by constructing simulated radar echo scenarios that replicate real sea conditions. The experimental results demonstrated the effectiveness of the proposed model. Using six-channel rasterized data, experiments across 23,651 samples validated the model’s performance. The enhanced iTransformer model improved detection efficacy through three key mechanisms: a spatial encoding enhancement, a weakly supervised composite evaluation system, and a dynamic target number constraint. Experimental results showed that the optimized CFAR reduced the average number of missed-detection cells and false-detection cells significantly. Threshold distribution visualization confirmed that the optimized results more clearly reflected the spatial characteristics of predefined obstacles. These findings validate the method’s effectiveness in learning the complex coupling between sea clutter and target signals. Overall, after training, the model’s ability to inspect small vessels was significantly enhanced while taking into account both adaptability and accuracy.

Author Contributions

Conceptualization, Y.S., Z.Y. and L.C.; methodology, Z.Y. and G.L.; software, Z.Y. and M.S.; validation, Y.S. and Z.Y.; formal analysis, Z.Y. and G.L.; data curation, Z.Y. and G.L.; writing—original draft preparation, Z.Y. and M.S.; writing—review and editing, Z.Y. and L.C.; funding acquisition, L.C. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the National Natural Science Foundation of China (No. 52371369), the Key Projects of National Key R & D Program (No. 2021YFB390150), the Natural Science Project of Fujian Province (No. 2022J01323, 2021J01822, 2020J01660, 20230019), Natural Science Founda-tion of Xiamen, China under Grant (No. 3502Z202473059, 3502Z202573055 and 3502Z202471079), the Chunhui Project Foundation of the Education Department of China under Grant 202200324, the University Natural Science Foundation (ZQ2024110).

Data Availability Statement

The original contributions presented in this study are included in the article. Further inquiries can be directed to the corresponding author(s).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Appendix A.1. Characteristics of the Rayleigh Distribution

The Rayleigh distribution is a classic statistical model used to describe the amplitude characteristics of scatterers in radar echoes. Its probability density function is:

f (x; σ) = \frac{x}{σ^{2}} \exp (- \frac{x^{2}}{2 σ^{2}})

(A1)

Here,

σ^{2}

represents the scale parameter, which is related to the average power. This distribution assumes that radar echoes consist of a large number of independent and identically distributed scatterers whose contributions cancel each other out, eventually forming a Gaussian distribution, as described by the central limit theorem. However, the Rayleigh distribution is only suitable for simulating sea clutter under low-resolution or low-sea conditions. When the radar resolution is high or the sea conditions are complex, its performance will deviate from actual data.

Appendix A.2. Characteristics of the K Distribution

The K distribution is a non-Rayleigh distribution model widely used in the simulation of high-resolution radar sea clutter. It describes the non-Gaussian characteristics of sea clutter through a composite structure, namely:

p (z) = \int_{0}^{\infty} \frac{1}{\sqrt{π τ}} e^{- \frac{z^{2}}{τ}} \cdot \frac{b^{ν}}{Γ (ν)} τ^{ν - 1} e^{- b τ} d τ

(A2)

Here,

ν

represents the shape parameter and

b

is the scale parameter. The texture component

τ

follows the

Γ

-distribution, describing the modulation effect of large-scale waves. The speckle component follows a Gaussian distribution, representing the randomness of microscale scattering. The shape parameter (

ν

) of the K distribution determines its tail characteristics. The smaller the

ν

, the “sharper” the distribution, which is more suitable for simulating sea clutter under high-sea conditions. In addition, the K distribution can better fit the coherent characteristics of sea clutter, such as the speckle features shown in radar images.

Appendix A.3. Theoretical Support for Superimposing Rayleigh and K Distributions to Simulate Sea Clutter

Combining the Rayleigh distribution with the K distribution can more comprehensively simulate the complex characteristics of sea clutter. For example, the K-Rayleigh distribution improves the fitting accuracy of the model to sea clutter by introducing additional Rayleigh components to simulate non-Bragg scattering (such as isolated sea peaks). This composite model can better capture the multi-scale characteristics of sea clutter, including short-term Rayleigh components and long-term K distribution components. In addition, the physical basis of the K distribution lies in its ability to reflect the aggregation effect of sea surface waves, while the Rayleigh distribution is used to describe small-scale scattering effects. Therefore, combining the two not only improves the flexibility of the model but also enhances its practicality in radar target detection.

Appendix A.4. Zero-Mean Complex Gaussian Process

In radar signal processing, the zero-mean complex Gaussian process is often used to simulate the “speckle” component in sea clutter, and its mathematical form is [8]:

f (t) ~ C N (0, σ^{2})

(A3)

C N (0, σ^{2})

denotes the complex Gaussian distribution with a mean of zero and a variance of

σ^{2}

.

Appendix A.5. Influence of Seabed Topographic Gradient Parameters on Clutter Power

The seabed topographic gradient (

\nabla H

) changes the surface microscale wave structure through wave refraction, and its quantitative relationship with clutter power is expressed as a formula. The quantitative relationship between seabed topographic gradient and clutter power is shown in Figure A1.

P_{clutter} (x, y)

is the clutter power,

P_{0}

is the basic reference power,

\nabla H (x, y)

represents the gradient of the seabed terrain at the spatial coordinate

(x, y)

, reflecting the topographic change characteristics, N is the total number of discrete components,

β_{k}

is the correlation coefficient of the k-th discrete component,

A_{k}

is its amplitude parameter, and

r_{k}

is the distance between this component and the calculation point

(x, y)

. The formula comprehensively considers the influence of terrain and discrete components to quantify the clutter power distribution.

Figure A1. Quantitative relationship between seabed topographic gradient and clutter power.

Appendix A.6. Obstacle Scattering and Multipath Effect

Fixed obstacles such as artificial reefs produce multipath effects through electromagnetic scattering, and the surrounding sea areas can be divided into three characteristic zones [11,12], as follows.

The clutter amplitude in the near-field diffraction zone (

r < 3 D

) follows a Rician distribution and the Doppler spectrum width increases by 20%–30%, which is caused by specular reflection and edge diffraction on the obstacle surface.

The power fluctuation in the interference zone (

3 D < r < 10 D

) is ±10 dB, forming interference fringes with a period that is related to the phase difference of dual-path reflection.

The clutter power in the far-field shadow zone (

r > 10 D

) is reduced by 6–8 dB and the spatial correlation is enhanced (correlation coefficient > 0.7), which is caused by energy attenuation due to obstacle occlusion. Analysis of obstacle influence shows that its modulation on clutter is frequency-dependent.

Appendix A.7. Spatiotemporal Correlation Model

The spatiotemporal correlation function of sea clutter is:

R (Δ x, Δ t) = σ^{2} \exp (- \frac{| Δ x |}{L_{c}} - \frac{| Δ t |}{T_{c}}) \cdot \cos (2 π f_{d} Δ t)

(A4)

The simulation image of the sea clutter spatiotemporal correlation model is shown in Figure A2.

Figure A2. Simulation of sea clutter spatiotemporal correlation model.

The spatial correlation length (

L_{c}

) and time correlation constant (

T_{c}

) are modulated by terrain: in the submarine canyon area,

L c < 30 m

,

T_{c} \approx 2 s

(dominated by strong local scattering); and at the top of the seamount,

L c < 150 m

,

T_{c} \approx 8 s

(significant large-scale wave modulation). The time-varying characteristic analysis shows that within the tidal cycle (12 h), the fluctuation amplitude can reach ±30%, which is negatively correlated with the current velocity (R = −0.78). This characteristic has important guiding significance for the design of coherent integration time of radar systems. The verification of spatial inhomogeneity shows that the spatial correlation length of sea clutter is negatively correlated with the terrain gradient R = −0.65), which means that the clutter in complex terrain areas has more local characteristics and the global statistical assumption of traditional CFAR is invalid.

Appendix A.8. Comprehensive Integration of MoM and GTD for Obstacle Echo Modeling

The method of moments (MoM) serves as a foundational numerical technique for precisely simulating electromagnetic scattering from maritime obstacles such as reefs and vessels. Its core formulation discretizes the electric field integral equation to solve for surface current density distribution, expressed as:

\hat{n} \times E^{i} = \hat{n} \times (j ω μ \int_{S} \bar{G} \cdot J_{s} d S^{'})

(A5)

where Js represents surface current density and

\bar{G}

denotes the free-space Green function. Within our implementation, Rao–Wilton–Glisson (RWG) basis functions discretize ship structural edges while calibrated material parameters (seawater: ϵr = 80 − j60; reefs: ϵr = 15 − j2) ensure physical authenticity. The resulting radar cross section (RCS) data directly feed into radar echo synthesis, enabling accurate replication of obstacle-induced backscatter patterns. This approach maintains a verification error below 0.3 dB when benchmarked against analytical solutions, establishing the physical credibility essential for reliable constant false-alarm rate (CFAR) threshold optimization.

Appendix A.9. GTD-Driven Scattering Mechanisms and System Integration

Complementing MoM, the geometric theory of diffraction (GTD) provides an efficient high-frequency approximation for modeling diffraction and reflection phenomena across large-scale obstacles. Its fundamental formulation is:

E^{d} = E^{i} \cdot D \frac{e^{- j k s}}{\sqrt{s}}

(A6)

This formula incorporates diffraction coefficients D with surface roughness compensation

D_{rough} = D_{UTD} \cdot \exp  [- {(\frac{4 π δ \cos θ_{i}}{λ})}^{2}]

, where δ quantifies RMS roughness. Crucially, GTD partitions scattering effects into three distinct regimes: near-field regions (r < 3D) exhibit Rician-distributed echoes with spectral broadening; interference zones manifest periodic fading (±10 dB fluctuations); and far-field shadow areas demonstrate 6–8 dB power attenuation. These computationally efficient scattering signatures combine with MoM outputs to form the composite obstacle echo

E^{o b s}

, which is subsequently integrated with sea clutter and target returns in the radar simulation pipeline.

References

Corbane, C.; Najman, L.; Pecoul, E.; Demagistri, L.; Petit, M.A. Complete processing chain for vessel detection using optical satellite imagery. Int. J. Remote Sens. 2010, 31, 5837–5854. [Google Scholar] [CrossRef]
Zhang, S.; Wu, R.; Xu, K.; Wang, J.; Sun, W. R-CNN-based vessel detection from high resolution remote sensing imagery. Remote Sens. 2019, 11, 631. [Google Scholar] [CrossRef]
Wang, Y.; Wang, C.; Zhang, H.; Dong, Y.; Wei, S. A SAR dataset of vessel detection for deep learning under complex backgrounds. Remote Sens. 2019, 11, 765. [Google Scholar] [CrossRef]
Mahafza, B.R. Radar Systems Analysis and Design Using MATLAB; Chapman and Hall/CRC: Boca Raton, FL, USA, 2005. [Google Scholar]
Gandhi, P.P.; Kassam, S.A. Analysis of CFAR processors in nonhomogeneous background. IEEE Trans. Aerosp. Electron. Syst. 1988, 24, 427–445. [Google Scholar] [CrossRef]
Geng, X.; Zhao, L.; Shi, L.; Yang, J.; Li, P.; Sun, W. Small-Sized Ship Detection Nearshore Based on Lightweight Active Learning Model with a Small Number of Labeled Data for SAR Imagery. Remote Sens. 2021, 13, 3400. [Google Scholar] [CrossRef]
Swamidoss, I.N.; Al Mansoori, A.; Shajahan, S.; Al Remeithi, H.; Al Marzooqi, A.; Bouamer, T.; Sayadi, S. Image quality assessment of thermal images for Maritime surveillance applications. In Proceedings of the SPIE Future Sensing Technologies 2024, Yokohama, Japan, 28 May 2024; SPIE: Bellingham, WA, USA, 2024; Volume 13083, pp. 270–277. [Google Scholar]
Keerthiga, K.; Narmadha, M. Next-Gen Maritime Security by Leveraging Advanced AIS Analytics for Precision Detection and Monitoring of Boats in Ocean 2024. In Proceedings of the Third International Conference on Smart Technologies and Systems for Next Generation Computing (ICSTSN), Villupuram, India, 18–19 July 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–6. [Google Scholar]
Cao, C.; Zhang, J.; Meng, J.; Zhang, X.; Mao, X. Analysis of ship detection performance with full-, compact-and dual-polarimetric SAR. Remote Sens. 2019, 11, 2160. [Google Scholar] [CrossRef]
Zhao, J.; Jiang, R.; Wang, X.; Gao, H. Robust CFAR detection for multiple targets in K-distributed sea clutter based on machine learning. Symmetry 2019, 11, 1482. [Google Scholar] [CrossRef]
Al-dabaa, M.M.; Emran, A.A.; Yahya, A.; El-Mashade, M.; Aboshosha, A. Optimizing Multiple-Target CFAR Detection Efficacy through Advanced Intelligent Clustering Algorithms within K-Distribution Sea Clutter Environments. J. Al-Azhar Univ. Eng. Sect. 2024, 19, 250–269. [Google Scholar] [CrossRef]
Weiss, M. Analysis of some modified cell-averaging CFAR processors in multiple-target situations. IEEE Trans. Aerosp. Electron. Syst. 2007, 20, 102–114. [Google Scholar] [CrossRef]
Rihan, M.Y.; Nossair, Z.B.; Mubarak, R.I. An improved CFAR algorithm for multiple environmental conditions. Signal Image Video Process. 2024, 18, 3383–3393. [Google Scholar] [CrossRef]
Liu, H.; Song, J.; Ren, L.; Sun, S.; Guo, C.; Ding, Z.; Xu, C. CFAR Detection of High Grazing Angle Sea-Clutter Based on KR Distribution. J. Phys. Conf. Ser. 2019, 1169, 012022. [Google Scholar] [CrossRef]
Hansen, V.G.; Sawyers, J.H. Detectability loss due to “greatest of” selection in a cell-averaging CFAR. IEEE Trans. Aerosp. Electron. Syst. 1980, 31, 115–118. [Google Scholar] [CrossRef]
Lin, C.H.; Lin, Y.C.; Bai, Y.; Chung, W.-H.; Lee, T.-S.; Huttunen, H. DL-CFAR: A novel CFAR target detection method based on deep learning. In Proceedings of the 2019 IEEE 90th Vehicular Technology Conference (VTC2019-Fall), Honolulu, HI, USA, 22–25 September 2019; IEEE: Piscataway, NJ, USA, 2019; pp. 1–6. [Google Scholar]
Diskin, T.; Beer, Y.; Okun, U.; Wiesel, A. CFARNet: Deep Learning for Target Detection with constant false alarm rate. Signal Process. 2024, 223, 109543. [Google Scholar] [CrossRef]
Roldan, I.; Palffy, A.; Kooij, J.F.P.; Gavrila, D.M.; Fioranelli, F.; Yarovoy, A. See further than cfar: A data-driven radar detector trained by lidar. In Proceedings of the 2024 IEEE Radar Conference (RadarConf24), Denver, CO, USA, 13 June 2024; IEEE: Piscataway, NJ, USA, 2024; pp. 1–6. [Google Scholar]
Liu, Y.; Hu, T.; Zhang, H.; Wu, H.; Wang, S.; Ma, L.; Long, M. itransformer: Inverted transformers are effective for time series forecasting. arXiv 2023, arXiv:2310.06625. [Google Scholar]
Tian, Z.W.; Qian, R.L. Chinese Water Demand Forecast Based on iTransformer Model; IEEE Access: Piscataway, NJ, USA, 2024. [Google Scholar]
Jia, W.; Guan, S.; Xue, Y. TL-iTransformer: Revolutionizing sea surface temperature prediction through iTransformer and transfer learning. Earth Sci. Inform. 2024, 17, 4847–4857. [Google Scholar] [CrossRef]
Watts, S. Modeling and simulation of coherent sea clutter. IEEE Trans. Aerosp. Electron. Syst. 2012, 48, 3303–3317. [Google Scholar] [CrossRef]
Watts, S. A new method for the simulation of coherent sea clutter. In Proceedings of the 2011 IEEE RadarCon (RADAR), Kansas City, MO, USA, 23–27 May 2011; IEEE: Piscataway, NJ, USA, 2011; pp. 52–57. [Google Scholar]
Arikan, F.; Vural, N. Simulation of sea clutter at various frequency bands. J. Electromagn. Waves Appl. 2005, 19, 529–542. [Google Scholar] [CrossRef]
Zeng, P.; Zhang, Y.; Xia, X.; Zhang, J.; Du, P.; Hua, Z.; Li, S. Research on Sea Clutter Simulation Method Based on Deep Cognition of Characteristic Parameters. Remote Sens. 2024, 16, 4741. [Google Scholar] [CrossRef]
Jang, E.; Gu, S.; Poole, B. Categorical reparameterization with gumbel-softmax. arXiv 2016, arXiv:1611.01144. [Google Scholar]
Shen, J.; Zhen, X.; Worring, M.; Shao, L. Variational multi-task learning with gumbel-softmax priors. Adv. Neural Inf. Process. Syst. 2021, 34, 21031–21042. [Google Scholar]
Li, Y.; Liu, J.; Lin, G.; Hou, Y.; Mou, M.; Zhang, J. Gumbel-softmax-based optimization: A simple general framework for optimization problems on graphs. Comput. Soc. Netw. 2021, 8, 5. [Google Scholar] [CrossRef]
Akhtar, J.; Olsen, K.E. A neural network target detector with partial CA-CFAR supervised training. In Proceedings of the 2018 International Conference on Radar (RADAR), Brisbane, Australia, 27 August 2018; IEEE: Piscataway, NJ, USA, 2018; pp. 1–6. [Google Scholar]
Rohman, B.P.A.; Kurniawan, D.; Miftahushudur, M.T. Switching CA/OS CFAR using neural network for radar target detection in non-homogeneous environment. In Proceedings of the 2015 International Electronics Symposium (IES), Surabaya, Indonesia, 29–30 September 2015; IEEE: Piscataway, NJ, USA, 2015; pp. 280–283. [Google Scholar]
Cheikh, K.; Soltani, F. Application of neural networks to radar signal detection in K-distributed clutter. IEE Proc. Radar Sonar. Navig. 2006, 153, 460–466. [Google Scholar] [CrossRef]
Dehghani, M.; Djolonga, J.; Mustafa, B.; Padlewski, P.; Heek, J.; Gilmer, J.; Steiner, A.P.; Caron, M.; Geirhos, R.; Alabdulmohsin, I.; et al. Scaling vision transformers to 22 billion parameters. In Proceedings of the International Conference on Machine Learning, Vancouver, BC, Canada, 13–19 July 2025; PMLR: New York, NY, USA, 2023; pp. 7480–7512. [Google Scholar]
Radosavovic, I.; Dollár, P.; Girshick, R.; Gkioxari, G.; He, K. Data distillation: Towards omni-supervised learning. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Salt Lake City, UT, USA, 18–23 June 2018; pp. 4119–4128. [Google Scholar]
Hussain, T.; Anwar, A.; Anwar, S.; Petersson, L.; Wook Baik, S. Pyramidal attention for saliency detection. In Proceedings of the 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW), New Orleans, LA, USA, 19–20 June 2022; IEEE: New York, NY, USA, 2022; pp. 2877–2887. [Google Scholar]
Zhou, T.; Ma, Z.; Wen, Q.; Sun, L.; Jin, R. Fedformer: Frequency enhanced decomposed transformer for long-term series forecasting. In Proceedings of the International Conference on Machine Learning, Baltimore, MD, USA, 17–23 July 2022; PMLR: New York, NY, USA, 2022; pp. 27268–27286. [Google Scholar]
Zhang, J.; Guo, L.; Song, L.; Gao, S.; Hao, C.; Li, X. PatchTCN: Patch-Based Transformer Convolutional Network for Times Series Analysis. In Proceedings of the 2024 3rd International Symposium on Computing and Artificial Intelligence, New York, NY, USA, 22–24 November 2024; pp. 1–9. [Google Scholar]

Figure 1. The overall architecture of the improved CFAR method based on the iTransformer model. In the scene simulation part, we generated the distribution information of vessels based on AIS data and generated special targets for model verification. Then, we conducted radar simulation modeling and original CFAR processing to generate various data for training. In the model training part, these input data were encapsulated and handed over to iTransformer for training. The reverse learning process of model training was regulated through the weakly supervised evaluation system and the Gumbel-softmax annealing algorithm.

Figure 2. The radar echo modeling process for simulating real-world sea conditions consists of three components. The yellow area denotes radar simulation modeling of various factors generating radar waveforms in actual sea areas, which are superimposed to form a complete radar echo. The blue area represents rasterization and CA-CFAR processing of the generated complete radar echo, while the gray area signifies the four-channel data generated as the dataset throughout the simulation process.

Figure 3. The simulation of sea clutter was obtained by generating the noise of Rayleigh distribution and the noise of K-distribution and superimposing them.

Figure 4. The threshold is adjusted using a simple iterative method: when AIS data fail to match a radar-detected target, the threshold around the AIS position is continuously iteratively reduced until a matching radar target appears. This process facilitates the detection of missed targets in the surrounding area. In the picture, the orange cells represent the areas that have been set to a lower threshold.

Figure 5. Improved iTransformer model architecture. The blue area represents the input data of the model, including 4-channel data generated by simulating radar echoes in the sea area and 2-channel data encoded as spatial positions. The green area represents the training process of iTransformer. The flesh-colored regions represent the output of the iTransformer process and the reverse learning process. The CFAR threshold information of the improved output channel of the iTransformer model. The threshold information of this output will be compared with the input data to construct the loss function and participate in the reverse learning process of the model.

Figure 6. Spatial position coding refers to adding two-channel information on the basis of the original four-channel data obtained through simulation, storing the horizontal and vertical coordinate information, respectively.

Figure 7. The mathematical function of the constraint term, which indicates that when the number of detected targets is within the specified range (m, n), the constraint term excites the model. However, when it exceeds this range, the value of the constraint term drops rapidly, which in turn punishes the model. The minimum value of the constraint term will not be less than −1.

Figure 8. The degrees of influence of case 1 and case 2 on the training process during the process from high temperature to low temperature. The randomness is relatively large in the high-temperature stage. The difference in influence between case 1 and case 2 on the model is relatively small, and the model exploration degree is high. As the temperature drops, the gap in their influence on the model widens, and the model gradually focuses on the stable laws summarized from multiple training steps.

Figure 9. The sample from 8 December 2024, 14:00. By comparing the radar thresholds obtained before and after the improvement with the rasterized radar power distribution, the detection results before and after the improvement were obtained. Then, by comparing the detection results with the initially preset vessel position information, the situations of correct detection, false detection, and missed detection can be determined.

Figure 10. Trends of parameter terms. Case 1 represents correct detection maintenance, case 2 represents missed-detection correction, and case 3 represents uncorrected missed detection. Case 4 stands for erroneous missed detection. The changing trends of these parameter terms largely reflects the improvement effect of the iTransformer model on the CFAR detection algorithm, which is obtained from the output of the iTransformer and in turn affects the process of reverse learning. The background colors in the picture - red, yellow and blue - respectively represent the high-temperature stage, the moderate-temperature stage and the low-temperature stage.

Figure 11. Trends in the number of detection targets. Based on the average number of cells containing AIS data in the sample, the reasonable range is set at 5–12. During the evaluation, it was considered that the output results beyond this range were less reasonable.

Figure 12. Analysis of the distribution change trend of the CFAR threshold. By calculating the average value of each cell for the threshold distribution before and after the CFAR improvement in the test set, we can to some extent reflect the overall trend of the CFAR threshold distribution before and after the improvement. The difference between the two represents the overall impact of the iTransformer model on the improvement in CFAR.

Table 1. CFAR object detection results (validation set sample size: 3548).

Detection Method	Total Number of Cells	Average Number of Undetected Cells	Average Number of Cells That Missed Detection
CFAR before improvement	400	3.7152	4.1379
Improved CFAR	400	1.2461	1.1012

Table 2. Statistics on false-alarm rate and missed-detection rate.

	False-Alarm Rate	Missed-Detection Rate
CFAR before improvement	0.794%	1.034%
Improved CFAR	0.312%	0.275%
Improvement extent	60.706%	73.404%

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Suo, Y.; Yuan, Z.; Cui, L.; Li, G.; Sun, M. A Novel iTransformer-Based Approach for AIS Data-Assisted CFAR Detection. J. Mar. Sci. Eng. 2025, 13, 1475. https://doi.org/10.3390/jmse13081475

AMA Style

Suo Y, Yuan Z, Cui L, Li G, Sun M. A Novel iTransformer-Based Approach for AIS Data-Assisted CFAR Detection. Journal of Marine Science and Engineering. 2025; 13(8):1475. https://doi.org/10.3390/jmse13081475

Chicago/Turabian Style

Suo, Yongfeng, Zhenkai Yuan, Lei Cui, Gaocai Li, and Mei Sun. 2025. "A Novel iTransformer-Based Approach for AIS Data-Assisted CFAR Detection" Journal of Marine Science and Engineering 13, no. 8: 1475. https://doi.org/10.3390/jmse13081475

APA Style

Suo, Y., Yuan, Z., Cui, L., Li, G., & Sun, M. (2025). A Novel iTransformer-Based Approach for AIS Data-Assisted CFAR Detection. Journal of Marine Science and Engineering, 13(8), 1475. https://doi.org/10.3390/jmse13081475

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Novel iTransformer-Based Approach for AIS Data-Assisted CFAR Detection

Abstract

1. Introduction

1.1. Related Work

1.2. Contributions

2. Methods

2.1. Construction of Simulated Scenes

2.2. Early Iterative Ideas and Limitations

2.3. Model Training

2.3.1. Spatial Position Enhancement Mechanism

2.3.2. Parameter Term of Weakly Supervised Evaluation System

2.3.3. Constraint Term of Weakly Supervised Evaluation System

2.3.4. Constraint Term of Gumbel-Softmax Annealing Algorithm

3. Experiments

3.1. Study Area and Experimental Setup

3.2. Comparison of the CFAR Algorithm Before and After Improvement in a Single Sample

3.3. Analysis of the Changing Trends of Parameter Terms

3.4. Analysis of the Changing Trends of Constraint Terms

3.5. Analysis of Model Learning Trends

4. Discussion

4.1. Real-World Validation

4.2. Model Training Speed Optimization

4.3. Enhance the Local Detection Ability

5. Conclusions

Author Contributions

Funding

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix A.1. Characteristics of the Rayleigh Distribution

Appendix A.2. Characteristics of the K Distribution

Appendix A.3. Theoretical Support for Superimposing Rayleigh and K Distributions to Simulate Sea Clutter

Appendix A.4. Zero-Mean Complex Gaussian Process

Appendix A.5. Influence of Seabed Topographic Gradient Parameters on Clutter Power

Appendix A.6. Obstacle Scattering and Multipath Effect

Appendix A.7. Spatiotemporal Correlation Model

Appendix A.8. Comprehensive Integration of MoM and GTD for Obstacle Echo Modeling

Appendix A.9. GTD-Driven Scattering Mechanisms and System Integration

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI