Next Article in Journal
Concurrent Decadal Trend Transitions of Sea Ice Concentration and Sea Surface pCO2 in the Beaufort Sea
Previous Article in Journal
Strong Longitudinal and Latitudinal Differences of Ionospheric Responses in North American and European Sectors During the 10–11 October 2024 Geomagnetic Storm
Previous Article in Special Issue
Multiscale Attention-Enhanced Complex-Valued Graph U-Net for PolSAR Image Classification
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

SAR Radio Frequency Interference Suppression Based on Kurtosis-Guided Attention Network

1
Aerospace Information Research Institute, Chinese Academy of Sciences, Beijing 100094, China
2
University of Chinese Academy of Sciences, Beijing 100190, China
3
School of Electronic, Electrical and Communication Engineering, University of Chinese Academy of Sciences, Beijing 101408, China
4
Laboratory of Target Cognition and Application Technology (TCAT), Beijing 100190, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2026, 18(2), 255; https://doi.org/10.3390/rs18020255
Submission received: 8 December 2025 / Revised: 10 January 2026 / Accepted: 11 January 2026 / Published: 13 January 2026

Highlights

What are the main findings?
  • A kurtosis-guided attention network (KANet) is proposed, which utilizes temporal kurtosis with an attention mechanism to guide the network to focus on interference-corrupted regions.
  • Proposing a quantitative evaluation framework for phase fidelity to address phase variations in SAR interference suppression.
What are the implication of the main findings?
  • The proposed method addresses the challenge of distinguishing complex RFI from high-energy ground backscatter based solely on signal energy, thereby improving the interference suppression performance.
  • This work validates the method’s high phase fidelity during interference removal via quantitative evaluation, benefiting phase-sensitive SAR applications.

Abstract

Radio-frequency interference (RFI) severely degrades the imaging quality of synthetic aperture radar (SAR), especially when the interference energy is strongly coupled with ground backscatter in both the time and frequency domains. Existing algorithms typically rely on energy contrast or component decomposition in transform domains, which limits their ability to cleanly separate complex RFI from high-power echoes. Exploiting the fact that kurtosis is insensitive to ground clutter and background noise, this paper proposes an interference suppression network based on the temporal kurtosis guidance mechanism. Specifically, a statistical prior vector capturing the non-Gaussian characteristics of RFI is constructed using kurtosis in the time–frequency domain and is integrated into a multi-scale attention mechanism, allowing the network to more effectively concentrate on interfered regions. Meanwhile, a systematic framework is established for the quantitative assessment of phase fidelity in the reconstruction of complex-valued SAR echoes. On this basis, by exploiting the strong generalization capability and high processing efficiency of data-driven models, the proposed network achieves improved RFI separation and enhanced reconstruction accuracy of underlying scene features. Ablation experiments validated that the design of a kurtosis-guided module can reduce the mean square error (MSE) loss by 14.87% compared to the basic model. Furthermore, regarding the phase fidelity, the correlation coefficient between the suppressed signal and the original true signal reached 0.99. Finally, GF-3 satellite data are used to further demonstrate the effectiveness and practicality of the proposed method.

1. Introduction

Synthetic Aperture Radar (SAR) is an active microwave remote sensing system that achieves two-dimensional high-resolution imaging by transmitting wideband signals and utilizing the synthetic aperture principle to overcome physical aperture limitations. Its all-day, all-weather observation capabilities are significantly superior to those of traditional remote sensing techniques, such as optical and infrared sensors [1,2,3,4]. However, during the acquisition of target information, SAR systems are inevitably susceptible to interference from other in-band electromagnetic radiation sources, known as Radio Frequency Interference (RFI). RFI is typically much stronger than the target’s scattering signals and can mask target information, severely degrading imaging quality and the accuracy of information extraction [5]. Figure 1 illustrates an example of RFI-corrupted data from the GF-3 satellite. In this scenario, the RFI covers both land and ocean regions, where the energy of the stronger part of ground echoes is comparable to that of the interference, making it difficult to distinguish them based solely on energy threshold values. Consequently, developing efficient and robust RFI suppression methods that can eliminate interference while maximally preserving the original radar signal information has become an important scientific problem and engineering challenge in the field of SAR research.
Existing interference suppression methods can be broadly categorized into non-parametric, parametric, semi-parametric, and deep learning methods. Non-parametric methods operate by transforming the data to other domains where interference components can be more easily separated, such as notch filtering [6,7,8], adaptive filtering [9,10], and subspace projection filtering [11,12,13]. These methods are computationally simple and easy to implement, but their suppression performance heavily depends on the separability between interference and the radar signal. If their respective features overlap, the filtering process will also remove useful echoes, resulting in an irreversible loss of target’s information [14]. Parametric methods achieve interference suppression by modeling the echo signal [15,16,17]. For instance, to address the complex motion-induced non-stationarity, Zhang et al. [17] constructed an accurate echo model for FDA-MIMO SAR and utilized adaptive compensation to suppress mainlobe deceptive jamming. However, their performance bound is jointly determined by the complexity of the model and the precision of parameter estimation, and their suppression efficacy may degrade significantly in complex interference scenarios. In contrast to the previous two categories, semi-parametric methods leverage the specific attributes of different signal components, employing iterative optimization of a loss function to achieve matrix decomposition. This approach effectively separates interference from the mixed signal while preserving the radar signal. Existing semi-parametric models primarily include sparse low-rank models and weighted matrix decomposition [18,19,20,21,22]. Joy et al. [18] separated the RFI signal matrix from the SAR echo matrix by constructing low-rank models in the range and azimuth directions. Liu et al. [19], jointly using transform-domain characteristics, constructed a sparse model for wideband interference in the time-frequency domain and designed a joint estimation algorithm to simultaneously achieve RFI suppression and signal-of-interest recovery within an optimization framework. Huang et al. [20] employed a matrix decomposition algorithm that uses a weighted strategy to approximate the rank function, as well as an algorithm that uses the rank’s upper bound as a prior constraint, to suppress narrowband RFI in SAR systems. Although these semi-parametric methods can achieve good results, they require dataset-specific hyperparameter tuning and iterative computation. This results in high computational complexity and poor generalization, which is particularly pronounced in large-scale data processing.
With the rapid development of deep learning (DL) technology, researchers are increasingly applying DL methods to the SAR field, which include many neural network-based interference suppression techniques. Neural networks can automatically extract hierarchical features of targets from images and possess strong generalization capabilities: once trained, they can be directly transferred for application in similar scenes. Fan et al. [23] employed a deep convolutional neural network (CNN) for interference detection in the time-frequency domain, subsequently using deep residual networks to extract and reconstruct the time-frequency features of the target signal. Xu et al. [24] utilized a generative adversarial network (GAN) to map input echoes containing wideband interference to their corresponding counterparts without interference, thereby achieving interference suppression. Tao et al. [25] proposed a hybrid method fusing sparse low-rank models with stacked recurrent neural network (RNN) to achieve RFI extraction and suppression. Compared to traditional approaches, DL-based interference suppression methods yield better performance, but they also face several limitations. On one hand, existing suppression networks focus on the high energy or texture of the RFI, but their performance often degrades when ground targets and interference are highly coupled within the same signal pulse, in which case new criteria need to be introduced to distinguish between the two components. On the other hand, most existing evaluations of interference suppression efficacy are primarily concerned with amplitude errors in the resulting signal or image, while ignoring the critical information contained in the signal phase, and a quantitative framework for assessing phase fidelity is still lacking.
To address these issues, this paper proposes an interference suppression Network featuring an Attention mechanism guided by signal kurtosis characteristics (KANet). The core innovation of this method lies in its physics-informed attention design. For RFI suppression tasks, common attention mechanisms compute attention weights solely based on image domain features extracted from the input data, such as texture and energy distributions. However, when RFI and high energy ground echoes exhibit similar characteristics in the image domain, these mechanisms may fail to effectively distinguish between them. In contrast, the proposed kurtosis-guided attention incorporates statistical properties of the signal. Specifically, kurtosis is a fourth-order moment that captures the non-Gaussian nature of RFI. This statistical information at the signal-level is inherently inaccessible to conventional attention mechanisms, allowing the proposed approach to precisely locate RFI-corrupted regions even when RFI and ground echoes appear similar in image domain.
The main contributions of this paper can be concluded as follows:
  • Firstly, considering the non-Gaussian nature of interference, the principle of kurtosis difference between SAR echoes and RFI signals in time-frequency domain is numerically derived, providing a theoretical basis for the proposed method.
  • Secondly, the distribution of RFI in the echo pulse is mapped to a multi-scale kurtosis vector, guiding the network to increase attention to the interfered time-frequency units, achieving adaptive feature weighting.
  • Thirdly, a fidelity evaluation framework based on the phase information of the reconstructed signal is established, evaluating the phase recovery results from three perspectives: phase preservation error, dispersion, and correlation coefficient.
  • Finally, the proposed method is comprehensively validated experimentally based on semi-physical simulation data and real spaceborne SAR data.
Experimental results demonstrate that, compared with existing deep learning-based interference suppression algorithms, the proposed method effectively integrates traditional interference detection metrics with deep neural networks. By incorporating a kurtosis-guided attention mechanism, it can effectively separate long-duration interference from high-energy ground echoes. Furthermore, the method ensures both reconstruction accuracy and phase fidelity of the signal after interference suppression, which clearly highlights its superiority.

2. Theoretical Foundation

2.1. Formulation of SAR Echoes

Assuming the SAR system transmits a linear frequency modulation (LFM) signal, it can be expressed as Equation (1):
S t x ( t ) = rect ( t / T p ) exp [ j 2 π ( f 0 t + k r t 2 / 2 ) ]
In Equation (1), f 0 is the center frequency, k r is the chirp rate, T p is the pulse width, and t is the fast time. For a SAR system, the received echo signal consists of a superposition of the target signal, the interference signal, and the noise signal. The noise signal primarily stems from environment, whereas the interference signal originates from other in-band radiation sources. Typically, the energy of the interference signal is greater than that of the target signal and the noise signal. The interfered echo signal can be modeled as Equation (2):
S r x ( m , n ) = S t ( m , n ) + S n ( m , n ) + S i ( m , n )
where m is the azimuth pulse index, n is the range time unit index, S t is the target signal, S n is the noise signal, and S i is the interference signal.

2.2. Formulation of RFI Signals

The interference signals discussed in this paper refer to unintentional interference, as opposed to intentional jamming aimed at actively disrupting SAR systems. Based on the relative bandwidth of the interference signal with respect to the SAR signal, RFI can be classified as narrowband interference (NBI) and wideband interference (WBI). This section introduces three representative RFI types and provides their mathematical expressions. These formulations are consistent with the widely recognized interference models adopted in the literature [14,21,23], thereby ensuring the physical fidelity of our RFI simulation. These include single-tone interference as an example of NBI, and linear frequency modulation interference and sinusoidal modulation interference as examples of WBI.
The expression for narrowband interference is given by Equation (3):
I N B ( m , n ) = l = 1 L A l ( m ) · exp ( j 2 π f l n + j ϕ l )
where L is the total number of interference signals, A l ( m ) represents the amplitude of each single-tone interference, f l represents the frequency of each interference, and ϕ l is its initial phase.
The expression for linear frequency modulation wideband interference is given by Equation (4):
I W B C M ( m , n ) = l = 1 L B l ( m ) · exp ( j 2 π f l n + j π γ l n 2 )
where L is the total number of interference signals, B l ( m ) represents the amplitude of each interference, f l represents the frequency of each interference, and γ l represents the chirp rate of the interference.
The expression for sinusoidal modulation wideband interference is given by Equation (5):
I W B S M ( m , n ) = l = 1 L C l ( m ) · exp ( j β l sin ( 2 π f l n + ϕ l ) )
where L is the total number of interference signals, C l ( m ) represents the amplitude of each interference, β l represents the modulation coefficient of each interference, f l represents the frequency of each interference, and ϕ l is its initial phase.
Figure 2 illustrates the frequency-domain and time-frequency-domain representations of the three aforementioned interference types. In each subfigure, the plot on the left shows the spectrum, and the plot on the right shows the time-frequency representation.

3. Kurtosis-Guided RFI Suppression Network

This chapter elaborates on the core principles and architectural design of the proposed kurtosis-guided attention-based interference suppression network (KANet). The main innovation of this method lies in the design of a kurtosis prior guidance module. Its effectiveness is based on the significant difference in the statistical distributions of SAR signals and RFI. This module aims to quantify the impulsive characteristics of the signal along the time dimension, thereby generating a statistical metric capable of accurately identifying RFI.
To clearly elucidate the method, the content of this chapter is organized as follows: First, we analyze the theoretical feasibility and practical effectiveness of using kurtosis as an indicator of RFI presence, providing the theoretical justification for the method. Building on this foundation, we then introduce the key technical details, including the overall network framework, the backbone architecture, the specific implementation of the kurtosis prior guidance module, and the data normalization strategy.

3.1. Analysis of the Kurtosis Prior

Specifically, the complex representation of the SAR signal and background noise exhibits quasi-Gaussian statistical characteristics. In contrast, most RFI exhibits non-Gaussian, impulsive, and transient characteristics, manifesting as short-duration, high-energy spikes. To effectively capture this disparity, this study employs instantaneous kurtosis as a key statistical measure. The kurtosis of the signal’s time-frequency spectrogram is calculated using a sliding time window to characterize the likelihood of RFI presence within each time unit. Kurtosis is a fourth-order moment that characterizes the peakedness and tailedness of a probability distribution. It is calculated as follows.
Given an input Short-Time Fourier Transform (STFT) time-frequency matrix Y , as shown in Equation (6), with dimensions F × T , where F is the number of frequency units and T is the number of time windows.
Y = y 1 , 1 y 1 , 2 y 1 , T y 2 , 1 y 2 , 2 y 2 , T y F , 1 y F , 2 y F , T
y t = y 1 , t , y 2 , t , y F , t T
Equation (7) represents the vector composed of signal amplitudes across all frequency units at time index t.
μ t , the mean of the signal within the t-th time window is given by Equation (8):
μ t = 1 F f = 1 F y f , t
The central second-order moment (variance), m 2 , t , is given by Equation (9):
m 2 , t = 1 F f = 1 F ( y f , t μ t ) 2
The central fourth-order moment, m 4 , t , is given by Equation (10):
m 4 , t = 1 F f = 1 F ( y f , t μ t ) 4
The kurtosis, g t , within this time window is given by Equation (11):
g t = m 4 , t ( m 2 , t ) 2 3
Kurtosis is defined as the standardized fourth-order moment minus 3. Since the standardized fourth-order moment of a standard Gaussian distribution is 3, this subtraction (of 3) serves to calibrate the kurtosis of the standard Gaussian distribution to zero.
To theoretically demonstrate why kurtosis can effectively identify RFI, we analyze two distinct cases: the interference-free SAR signal and the interference-corrupted signal. We then calculate the range of kurtosis values for each case.
Firstly, for an interference-free SAR signal, the echo S t ( t ) is traditionally modeled as the coherent superposition of numerous independent scatterers within a resolution cell. According to the Central Limit Theorem (CLT), this leads to a complex Gaussian representation of the echo and Rayleigh distribution for its amplitude. While the Rayleigh model provides an adequate description for homogeneous media, SAR echoes often exhibit pronounced spatial heterogeneity due to complex ground textures and land cover variability, resulting in heavy-tailed statistics that deviate substantially from the Rayleigh distribution. To accurately capture this behavior, we employ the K-distribution, which arises from a multiplicative combination of Rayleigh speckle and Gamma distributed texture [26,27,28]. By parameterizing the scene heterogeneity through its shape parameter, the K-distribution not only recovers the Rayleigh law as a limiting special case corresponding to homogeneous echoes, but also provides an excellent fit to the highly heterogeneous, heavy-tailed echoes observed over complex terrains.
After transforming the signal to the time-frequency domain via the STFT, we consider the signal’s amplitude distribution along the frequency axis within a fixed time window. The STFT is a linear transformation of the underlying random process, so within each time window the set of signal samples can be modeled as samples drawn from a K-distributed amplitude random process. Specifically, the statistical calculation is performed using the time-frequency amplitudes within a single time window. Consider the amplitude vector of the interference-free SAR signal across all F frequency units at the t-th time index. Let this vector be denoted as x = [ x 1 , x 2 , , x F ] T , which corresponds to the magnitude of y t (defined in Equation (7)) in the absence of interference. The elements x i of this vector are modeled as independent realizations of a K-distributed random variable X. Its probability density function (PDF) under the K-distribution model is given by Equation (12):
f X ( x ; ν , α ) = 2 α Γ ( ν + 1 ) x 2 α ν + 1 K ν x α , x > 0 , α > 0 , ν > 0 ,
where ν is the shape parameter controlling the degree of heterogeneity, α is the scale parameter, Γ ( · ) is the Gamma function, and K ν ( · ) denotes the modified Bessel function of the second kind of order ν .
The raw n-th-order moment of the K-distributed x , E [ X n ] , can be computed in closed form as in Equation (13):
E [ X n ] = 2 n α n Γ 0.5 n + 1 Γ 0.5 n + ν + 1 Γ ( ν + 1 ) .
The kurtosis of the K-distributed interference-free signal, g k , is defined analogously to Equation (11), and can be written in terms of raw moments as in Equation (14):
g k = E [ X 4 ] 4 E [ X 3 ] E [ X ] + 6 E [ X 2 ] E [ X ] 2 3 E [ X ] 4 E [ X 2 ] E [ X ] 2 2 3 .
By substituting Equation (13) with n = 1 ,   2 ,   3 ,   4 into Equation (14), we obtain an explicit expression of the kurtosis of signal X. After straightforward algebra, the scale parameter α cancels out and the resulting kurtosis depends only on ν , as given in Equation (15):
g k ( ν ) = 32 ( ν + 1 ) ( ν + 2 ) Γ ( ν + 1 ) 4 12 π Γ ν + 3 2 2 Γ ( ν + 1 ) 2 3 π 2 Γ ν + 3 2 4 4 ( ν + 1 ) Γ ( ν + 1 ) 4 π Γ ν + 3 2 2 Γ ( ν + 1 ) 2 2 3 .
This result shows that the kurtosis g k of the interference-free SAR signal becomes an explicit function of the parameter ν . For homogeneous or mildly heterogeneous scenes (large ν ), g k ( ν ) approaches the Rayleigh limit and remains a small positive value; for more heterogeneous backgrounds, g k ( ν ) increases accordingly but still remains within a relatively small range in practice. For example, using the closed-form expression in Equation (15), we obtain g k ( 0.1 ) 3.64 for a highly heterogeneous scene, whereas for ν = 100 the value drops to g k ( 100 ) 0.25 . Therefore, for an interference-free SAR signal, the calculated kurtosis remains a small value close to zero.
Next, we consider the case where the SAR signal is superimposed with RFI, and calculate the kurtosis of the corrupted signal within a fixed time window. Assume the signal within this time window has F total samples along the frequency axis. The amplitude vector of the interference-free SAR signal, denoted as x = [ x 1 , x 2 , , x F ] T , consists of samples that follow the K-distribution described above. Concurrently, assume an RFI with an intensity far greater than that of the SAR signal corrupts M of the F frequency units. The signal amplitudes at these M interfered frequency samples are defined as a vector A = [ A 1 , A 2 , , A M ] T . For analytical simplicity, we assume the RFI is a single-tone interference affecting only one frequency unit (i.e., M = 1 , thus A simplifies to a scalar A). Therefore, the amplitude vector of the SAR signal superimposed with RFI can be expressed as z = [ x 1 , x 2 , , x i 1 , A , x i + 1 , , x F ] T . We now calculate the kurtosis of the combined signal z .
The raw n-th-order moment about the origin of the mixed signal z , E [ Z n ] , is given by Equation (16):
E [ Z n ] = i = 1 F 1 x i n + A n F = F 1 F E [ X n ] + A n F ,
where E [ X n ] is the raw n-th-order moment about the origin for the K-distribution, calculated as shown in Equation (13). According to the kurtosis calculation in Equation (11), we need to compute the central fourth-order moment and second-order moment of z . The central fourth-order moment is given by Equation (17):
m 4 , z = 1 F i = 1 F ( z i μ z ) 4 = E [ Z 4 ] 4 E [ Z 3 ] E [ Z ] + 6 E [ Z 2 ] E [ Z ] 2 3 E [ Z ] 4
The E [ Z n ] (for n = 1 ,   2 ,   3 ,   4 ) are calculated by substituting Equation (13) into Equation (16). These moments are then substituted into Equation (17) to yield a result expressed in terms of F, A, ν and α , as given by Equation (18):
m 4 , z = F 1 F E [ X 4 ] + A 4 F 4 F 1 F E [ X 3 ] + A 3 F F 1 F E [ X ] + A F + 6 F 1 F E [ X 2 ] + A 2 F F 1 F E [ X ] + A F 2 3 F 1 F E [ X ] + A F 4
where E [ X ] , E [ X 2 ] , E [ X 3 ] , and E [ X 4 ] follow from the K-distribution model in Equation (13).
The central second-order moment of Z is given by Equation (19):
m 2 , z = 1 F i = 1 F ( z i μ z ) 2 = E [ Z 2 ] E [ Z ] 2
Similarly, by substituting E [ Z 2 ] and E [ Z ] (obtained from Equation (16) with E [ X n ] given by Equation (13)), we derive a result expressed in terms of F, A, ν and α , as given by Equation (20):
m 2 , z = F 1 F E [ X 2 ] + A 2 F F 1 F E [ X ] + A F 2
Finally, the kurtosis of signal Z is calculated as shown in Equation (21):
g z ( F , A , ν , α ) = F 1 F E [ X 4 ] + A 4 F 4 F 1 F E [ X 3 ] + A 3 F F 1 F E [ X ] + A F + 6 F 1 F E [ X 2 ] + A 2 F F 1 F E [ X ] + A F 2 3 F 1 F E [ X ] + A F 4 F 1 F E [ X 2 ] + A 2 F F 1 F E [ X ] + A F 2 2 3
where E [ X n ] are again given by the K-distribution moments in Equation (13).
To evaluate Equation (21), we substitute practical data to estimate the kurtosis of the interference-corrupted signal. We set F = 256 and in this case, each time window of the time-frequency signal contains 256 sampling points along the frequency axis. Since g z depends on the background statistics through ( ν , α ) as well as on the interference amplitude A, we consider a representative parameter setting with ν = 3 and α = 1 . Subsequently, let A be k times the mean amplitude of the SAR signal. Based on a comparative analysis of the amplitudes of interference-corrupted signals in real-world SAR data, we set the range of k to be between 3 and 10. Substituting the values of F = 256 , ν = 3 , α = 1 and A = k E [ X ] into Equation (21) demonstrates that the kurtosis of the interference-corrupted signal is significantly increased, reaching a magnitude that is typically more than 5 to 10 times higher than that of the interference-free signal (given in Equation (15)). For instance, with k = 10 , numerical evaluation yields g z 55 , indicating more than an order-of-magnitude increase. Evidently, unlike the interference-free SAR signal, the kurtosis value increases rapidly when a time window is superimposed with interference. It is also worth noting that although the above derivation assumes a single-tone interference ( M = 1 ) for analytical simplicity, the conclusion generalizes to multi-tone RFI ( M > 1 ). Since RFI typically exhibits sparsity in the time-frequency domain ( M F ), the signal retains distinct heavy-tailed characteristics even with increased M. Consequently, the kurtosis remains significantly higher than that of the interference-free background, ensuring the metric’s robustness against diverse RFI forms.
To validate the practical effectiveness of the proposed theory, verification was performed using real-world GF-3 SAR data. Figure 3 presents the analysis across four distinct scenarios, displaying SAR images alongside time-frequency domain spectrograms and their corresponding time-sequenced kurtosis curves. The results demonstrate a precise temporal alignment between kurtosis peaks and RFI presence. In the interference-corrupted cases (Scenarios 1, 2, and 3), the kurtosis values increase significantly, exhibiting a strong response to RFI. In contrast, Scenario 4 confirms the metric’s robustness against complex ground echoes: in the absence of RFI, the kurtosis curve remains stable at a low level despite the presence of strong terrain backscatter. This verifies that the kurtosis metric effectively targets impulsive interference while remaining insensitive to typical Gaussian-like background echoes.
This experiment strongly demonstrates that the kurtosis calculation is insensitive to Gaussian-like signals, such as ground echoes, and responds strongly only to RFI. It can serve as a highly sensitive and robust RFI localization metric. In summary, a kurtosis value close to zero indicates the absence of RFI within that time window; conversely, a large positive kurtosis value clearly indicates RFI contamination. Therefore, this kurtosis value can be used to guide the attention mechanism, directing the network to focus on RFI-affected regions and thereby improving suppression efficacy.

3.2. Overall Framework

The overall workflow of the RFI suppression method in this paper is illustrated in Figure 4. First, the input signal is transformed into the time-frequency domain, and the amplitude information is divided into blocks along the time axis and sent into the network after passing through a min-max normalization strategy. Second, to enhance the network’s RFI suppression performance by leveraging the information of the SAR signal’s physical characteristics, we designed a kurtosis prior guidance module. This module quantifies the non-Gaussianity of the RFI by calculating the signal’s kurtosis in a time window-wise manner, and maps the kurtosis to an attention weight through a three-layer fully connected network. This weight undergoes adaptive pooling to match the feature map dimensions at different network levels, thereby performing feature weighting. Subsequently, guided by this module, the interference suppression network adaptively suppress the RFI. It then reconstructs the interference-free amplitude spectrum, which undergoes a de-normalization process. This spectrum is combined with the phase information to reconstruct the complex time-frequency domain signals. Finally, this representation is transformed back into the time domain via the inverse short-time Fourier transform (ISTFT), completing the RFI suppression process.

3.3. Backbone Network

This section details the backbone network of the proposed method. The core objective of the network is to effectively transform the input time-frequency matrix to achieve RFI suppression and reconstruct the SAR signals. To this end, the backbone must satisfy two key design requirements:
  • It must possess strong multi-scale feature extraction capabilities to capture the complex patterns of diverse RFI types.
  • It must be able to effectively separate RFI features from the features of the useful signal and accurately reconstruct the interference-free signal.
Figure 5 illustrates the overall architecture of the backbone network.
As shown in Figure 5, the proposed backbone networks consists of three parts: an encoder, a bottleneck module, and a decoder. In the encoder, the input time-frequency domain amplitude spectrum first passes through an initial convolutional layer for downsampling and shallow feature extraction. Subsequently, the feature maps pass sequentially through multiple encoding stages composed of residual units to progressively extract deeper, more abstract features. In these stages, the network employs strided convolution to reduce spatial resolution thereby increasing the receptive field, and introduces dilated convolution in the deeper stages to further expand the receptive field without additional spatial downsampling. The bottleneck module is responsible for channel compression and information integration of the deepest features from the encoder, refining semantic information in preparation for the decoding path. In the decoder, the network progressively restores spatial resolution through upsampling operations. After each upsampling step, features from the corresponding encoder level are fused via skip connections, effectively combining shallow spatial details with deep semantic information. Finally, the decoder restores the feature map to its original size and outputs the single-channel, reconstructed amplitude spectrum. The network employs the MSE function as its loss function. To clearly elucidate the specific operations of each module, the network structure is defined by the following equations.
The operation of the initial feature extraction layer is given by Equation (22):
E 1 = E n c o d e r 1 ( S T F T ( X ) )
where S T F T ( X ) is the input signal in the time-frequency domain.
The calculation for the i-th encoding stage is given by Equation (23):
E i = E n c o d e r i ( E i 1 )
where E i is the output of i-th encoding stage, and E n c o d e r i is i-th encoder.
The operation of the bottleneck module is given by Equation (24):
Z = A l i g n ( S q u e e z e ( E 5 ) )
The S q u e e z e convolutional layer performs feature compression, and the A l i g n convolutional layer performs channel alignment.
Equation (25) is the calculation of upsampling and skip connection:
F = C o n c a t ( U p s a m p l e ( Z ) , E 3 )
The decoder output is given by Equation (26):
Y = S i g m o i d ( D e c o d e r ( F ) )
The decoder output is then passed through a Sigmoid activation function to yield the final output.
The schematic diagram of the residual unit in Figure 5 is shown in Figure 6.

3.4. Kurtosis-Guided Attention Module

The core of the proposed method is the time window-wise kurtosis calculation performed on the time-frequency domain signal. This operation transforms the 2D time-frequency matrix into a 1D kurtosis prior vector. Each element in this vector corresponds precisely to a time window, and its value directly reflects the likelihood of RFI presence within that window. This kurtosis prior vector, therefore, provides precise guidance for the attention mechanism, transforming the interference suppression task from an unconstrained search problem into a prior-guided optimization problem. This enhances the network’s suppression efficacy and accuracy.
To convert this guidance signal into attention weights that the network can utilize, we designed an attention weight generation module based on a Multi-Layer Perceptron. This module takes the aforementioned kurtosis prior vector as input and maps it to an attention weight vector. This mapping is achieved through a linear layer, a batch normalization layer, and a Sigmoid activation function. The specific architecture is illustrated in Figure 7.
Within this module, the Linear layer is responsible for feature transformation; the Batch Normalization (BN) layer is used to accelerate convergence and training; and the final Sigmoid activation function ensures the output attention weight vector has values in the [0, 1] range.
In our multi-scale attention implementation, the attention mechanism is flexibly embedded at multiple levels of the encoder. Given a feature map output by an intermediate layer, the dimension of the attention weight vector is first adjusted via adaptive average pooling to match the dimensions of the feature map. Subsequently, the adjusted weight vector k is broadcast across the channel and frequency dimensions and multiplied element-wise with the input feature map, thereby achieving adaptive feature enhancement:
F o u t = F i n B r o a d c a s t ( k )
Through the operation in Equation (27), the network is guided to assign higher weights to feature regions exhibiting kurtosis anomalies, thereby achieving precise RFI suppression. A schematic diagram of this module is shown in Figure 8.

3.5. Data Normalization Strategy

The data normalization strategy employed in this paper is a paired Min-Max normalization, which is applied to the input interference-corrupted data and its corresponding ground truth data. For any given value x in the features, the normalization is calculated as shown in Equation (28):
x = x m i n ( X ) m a x ( X ) m i n ( X )
where m i n ( X ) is the minimum value of the data, m a x ( X ) is the maximum value, and x is the normalized value. Traditional independent normalization method calculates separate scaling factors for the interference-corrupted input data and the ground truth data. Since the presence of interference will change the m a x ( X ) of the data, this approach causes the background signal (which is common to both) to be scaled inconsistently. This leads to the erroneous amplification of the background signal during network inference.
To circumvent this issue, we apply a joint processing approach to each sample pair, which consists of the interfered data and its corresponding ground truth. Specifically, we first identify the maximum and minimum values across both the input and target data within that pair. These shared extrema are then used to normalize both the interfered data and the ground truth data. This method ensures that the input and the target within each training pair share a uniform linear scaling baseline. This allows the network to focus on learning the task of interference suppression, rather than compensating for the scale discrepancies introduced by inconsistent normalization, thereby guaranteeing the accuracy of the final output.

4. Experiments and Analysis

4.1. Dataset and Experimental Setting

The training of neural networks relies on large-scale paired samples, i.e., input signals containing interference and their corresponding interference-free ground truths. However, in real world SAR observation, obtaining clean, interference-free echo data for interference-corrupted data is often infeasible. This constitutes the primary challenge in data acquisition.
To address this issue, this study employed the semi-physical simulation dataset SarTF, constructed and publicly released by Shen et al. [29], for model training and evaluation. This dataset generates paired samples by injecting simulated RFI into interference-free SAR signals. Its background signals are derived from two data types: simluated multi-point target echos and real X-band airborne SAR data. By controlling the intensity of the injected interference, the Signal-to-Interference Ratio (SIR) of all samples was set within the range of −10 dB to −5 dB. Finally, both the interference-corrupted signals and the original signals were transformed into the time-frequency domain to constitute the final paired training data. The SarTF dataset contains a total of 9747 sample pairs and is divided into a training set and a test set at an 80%/20% ratio. Figure 9 shows the time-frequency spectrograms of sample data from the training set, where (a) shows the interference-corrupted data, and (b) shows the corresponding interference-free data.
The raw data underwent a preprocessing pipeline to be converted into network inputs. Specifically, the complex echoes were transformed into the time-frequency domain using the STFT with a window length of 256 and an overlap of 255. Subsequently, a paired Min-Max normalization strategy was applied to map the amplitude values to [ 0 ,   1 ] . The time-frequency matrix was then segmented along the time axis into blocks of size 256 × 256 . For the network output, the matrices were denormalized using the corresponding input scaling parameters and reassembled to reconstruct the complete time-frequency matrix.
During the training of the network proposed in this paper, we established the hyperparameter setting to ensure optimal model performance. The batch size was set to 16, balancing training efficiency with good gradient estimation quality. For optimization, we selected the Adam optimizer for gradient backpropagation and parameter updates, as its adaptive learning rate mechanism effectively handles sparse gradients. A step decay strategy was adopted for the learning rate schedule: the initial learning rate was set to 2 × 10 4 and was halved every 4 training epochs with a decay factor of 0.5. This progressive adjustment facilitates model fine-tuning in the later training stages. Specific to the architecture of the kurtosis prior-guided module, we set key parameters: the kurtosis-guided attention mechanism was applied to the Encoder 2, Encoder 3 and Encoder 4 stages, the attention visualization interval was set to 50 batches, and gradient clipping was enabled to prevent training instability. The experimental results show that after 15 training epochs, the proposed network achieved excellent interference suppression performance, thus validating the effectiveness of the designed hyperparameter configuration.
For the experimental setup, we designed multiple experiments to verify the method’s performance. First, we took interference-free airborne and spaceborne SAR data from several scenarios and corrupted them with complex interference, which was a composite of the three types mentioned in Section 2.2. The proposed method was then used to suppress the interference in this interference-corrupted data. Several representative algorithms from the SAR RFI suppression field were selected for comparison, including time-frequency domain notching [30], as well as other neural networks such as PISNet [29] and FDNet [31]. Second, we performed validation on real-world data, selecting an RFI-corrupted scene from the GF-3 satellite to verify the suppression performance in a real-world scenario. Subsequently, we conducted a phase analysis and phase fidelity assessment of the network’s output using simulated data. The detailed parameters of the data used in the experiments are listed in Table 1.

4.2. Evaluation Metrics

For a comprehensive quantitative analysis of the experimental results, this section details the evaluation metrics employed in this work, including their physical significance and mathematical definitions. Based on their focus of evaluation, these metrics are categorized into two main classes: amplitude and image-domain metrics, and phase fidelity metrics.

4.2.1. Amplitude Reconstruction Metrics

This paper employs Peak Signal-to-Noise Ratio (PSNR), Structural Similarity (SSIM), Image Contrast (IC), and Image Entropy (IE) as the metrics to evaluate interference suppression performance. Suppose the size of the image to be evaluated is N a × N r , where X denotes the interference-free image and X ^ denotes the reconstructed image output by the suppression algorithm.
PSNR is one of the most widely used metrics for assessing image reconstruction quality. It evaluates the degree of distortion by calculating the mean squared error (MSE) between the corresponding pixels of the ground truth image and the processed image. A higher PSNR value indicates that the processed image is closer to the ground truth image at the pixel level and suffers from less distortion. It is expressed in decibels (dB).
First, the MSE between the two images is calculated, as given in Equation (29):
MSE = 1 N a × N r i = 1 N a j = 1 N r [ X ( i , j ) X ^ ( i , j ) ] 2
The PSNR is then calculated from the MSE, as shown in Equation (30), where M A X I is the maximum possible value for the image pixels.
PSNR = 10 · log 10 ( ( M A X I ) 2 MSE )
SSIM is a metric that evaluates the similarity of structural information between images, which aligns well with human visual perception. SSIM provides a comprehensive evaluation of the similarity between two images based on three components: luminance, contrast, and structure. Its value ranges from −1 to 1, where a value closer to 1 indicates a higher degree of structural and perceptual similarity between the two images. The calculation formula of SSIM is shown in Equation (31):
SSIM = [ l ( m , n ) ] α [ c ( m , n ) ] β [ s ( m , n ) ] γ
where l ( m , n ) is the luminance comparison term at pixel ( m , n ) , c ( m , n ) is the contrast comparison term, and s ( m , n ) is the structure comparison term. The parameters α , β , and γ are positive constants.
IC primarily measures the degree of difference between bright and dark regions in an image, as well as its dynamic range. It ranges from 0 to 1, where a value closer to 1 indicates stronger contrast between light and dark areas. In interference suppression scenarios, strong interference typically raises the noise floor in darker regions, thereby reducing the image contrast. An effective suppression algorithm can restore the contrast to a level approaching that of the ground truth image upon interference removal. In this paper, the IC is calculated as follows: a local window is defined around each pixel, and the maximum intensity value I m a x and minimum intensity value I m i n within that window are computed. These values are then used in Equation (32) to calculate the local contrast, C k . k represents the window index, and K is the total number of windows. Finally, the local contrasts from all windows are averaged to yield the final contrast value.
C k = I max I min I max + I min
IC = 1 K k = 1 K C k
IE is used to measure the uncertainty of the image information content or the pixel grayscale distribution. Because RFI injects pseudo-random information into the image, causing the pixel distribution to be chaotic and raising the entropy value, therefore the change in entropy can reflect the effect of interference suppression. Effective suppression should be able to remove the interference information, make the pixel distribution restore order, and thus lower the image entropy. The lower the entropy value, it indicates that the pixel distribution is more concentrated, and the uncertainty is smaller. Its calculation formula is as follows:
IE = k = 0 Q 1 p k · log 2 p k
where Q is the total number of gray levels, and p ( k ) is the probability of occurrence of gray level k in the image.

4.2.2. Phase Fidelity Metrics

To quantitatively evaluate the difference between the phase after interference suppression processing and the ground truth phase, this paper employs Mean Absolute Difference (MAD), Standard Deviation (SD), Root Mean Square Error (RMSE), and Pearson Correlation Coefficient (PCC) as phase fidelity metrics. Assume the ground truth phase is ϕ t r u e , the estimated phase is ϕ e s t , and the total number of data points is N p .
MAD measures the average magnitude of the error between the estimated phase ( ϕ e s t ) and the ground truth phase ( ϕ t r u e ), indicating the degree of systematic offset. It reflects the expected value of the overall error. A smaller MAD value signifies that the phases of the two signals are closer on average. Its calculation is given by Equation (35):
MAD = 1 N p i = 1 N p ϕ e s t ( i ) ϕ t r u e ( i )
SD measures the degree of dispersion within the phase error sequence, which reflects how closely the errors are distributed around their mean. A lower SD value indicates less random fluctuation in the error, signifying higher phase consistency. It is calculated as given in Equation (36):
SD = 1 N p 1 i = 1 N p ( ϕ e s t ( i ) ϕ t r u e ( i ) μ e r r ) 2
where μ e r r is the mean of the phase error sequence.
RMSE reflects both the systematic offset and random fluctuation of the error by calculating its root mean square. Moreover, due to the squaring operation, RMSE is more sensitive to large error values. A smaller RMSE value indicates a better match between the estimated phase and the ground truth. It is calculated as given in Equation (37):
RMSE = 1 N p i = 1 N p ( ϕ e s t ( i ) ϕ t r u e ( i ) ) 2
PCC does not directly measure the absolute magnitude of the error; instead, it evaluates the linear correlation between the two signals. It ranges from −1 to +1. A PCC value closer to +1 signifies a stronger positive correlation between the estimated phase and the ground truth phase, indicating that the signals’ overall structural features and patterns are more similar. Its calculation is given by Equation (38):
PCC = i = 1 N p ( ϕ e s t ( i ) μ e s t ) ( ϕ t r u e ( i ) μ t r u e ) i = 1 N p ( ϕ e s t ( i ) μ e s t ) 2 i = 1 N p ( ϕ t r u e ( i ) μ t r u e ) 2
where μ e s t is the mean of the estimated phase, and μ t r u e is the mean of the ground truth phase.

4.3. Experimental Results

4.3.1. Comparative Results on Simulated Data

The simulation experiment uses three distinct scenes of interference-free SAR data as background, including airborne SAR data (Scene 1 and Scene 2) and one scene of spaceborne SAR data from GF-3 satellite (Scene 3). To simulate a complex interference environment, we introduced simulated composite RFI into these data using a dynamic generation strategy. This composite interference is segmented along the azimuth direction, comprising three sequential modes: (1) NBI only, (2) WBI only, (3) a hybrid of NBI and WBI. The specific interference parameters are detailed in Table 2. It is crucial to note that the values listed in Table 2 represent the sampling ranges rather than fixed combinations. During the dataset generation, the specific parameters for each interference instance were randomly sampled from these continuous ranges. Furthermore, these diverse interference patterns are introduced at varying locations along the azimuth and range dimensions. The injected RFI signals are generated based on the standard mathematical models defined in Equations (3)–(5) in Section 2.2, ensuring that the simulated interference accurately reflects the characteristics of real-world RFI.
The interference-corrupted SAR data was processed using the proposed method and several comparison methods. SAR imaging was subsequently performed on each of the processed outputs. The resulting images are show in Figure 10. In Figure 10, each row represents a distinct experimental scenario. The correspondence between the columns (from left to right) and the respective methods is detailed in the figure caption. Scene 1 features an area traversed by a river; Scene 2 covers a mixed agricultural and built-up area; and Scene 3 depicts a land–sea interface. These regions span diverse and highly heterogeneous scene backscatter types, providing a representative set of challenging scenarios for assessing the robustness and generalization ability of the RFI suppression methods.
For a quantitative analysis of the experimental results, Table 3 lists the detailed performance metrics (PSNR, SSIM, and IC, as described in Section 4.2), which were calculated using the interference-free ground truth SAR images as the reference.
In addition to the final SAR imaging results, the processing results in the time-frequency domain are also presented. Figure 11 shows the time-frequency domain processing results for a single pulse subjected to three different interference types.
A comprehensive analysis of the aforementioned results reveals that while the traditional time-frequency domain notching method can eliminate the main lobe energy of RFI, its approach of directly setting the amplitude to zero also removes the useful signal overlapping with the interference band. This causes significant spectral information loss, and this over-suppression phenomenon inevitably leads to a loss of detail and texture in the final SAR image. In contrast, the the deep learning-based methods demonstrate superior signal fidelity. DL-based methods are able to effectively suppress RFI’s intensity to a level approaching that of the background signal, thereby greatly reducing the impact of RFI on the imaging results.The proposed method, FDNet, and PISNet all effectively preserve most of the target echo information while suppressing interference. However, careful observation reveals that PISNet’s and FDNet’s results contain more residual interference, indicating that its suppression is incomplete. Furthermore, a slight attenuation of ground echoes is observed in the FDNet’s results, suggesting that it inadvertently suppresses portions of the original SAR signal, leading to signal energy loss.
In summary, the proposed method exhibits superior performance. As shown in Table 3, our method achieves the best metrics (PSNR and SSIM) across all three experimental scenarios. This indicates that the imaging results from the proposed method are the closest to the ground truth image at both the pixel and structural levels. Furthermore, regarding the IC metric, the analysis shows that the introduction of RFI causes a significant drop in image contrast compared to the ground truth, and suppression processing restores the contrast to some extent. The IC metric of the proposed method is the best among these methods, reaching a level closest to that of the ground truth image. These comparative results preliminarily verify the superior comprehensive suppression performance of our method.

4.3.2. Validation Results on Measured Data

To validate the performance of the proposed method in real-world scenarios, we applied it to an interference-corrupted SAR data acquired over Sanya by the GF-3 satellite. Figure 12 shows the suppression result of the proposed method for a representative pulse from the data:
In Figure 12a, the top plot shows the time-frequency domain spectrogram of the original data, while the bottom plot shows the spectrogram of the data after interference suppression. A comparison of the plots before and after processing reveals that the WBI in the −20 to −10 MHz band has been removed, while the surrounding SAR signal maintains good continuity. For a more precise evaluation, we compared the frequency spectrum of the signal before and after processing, as shown in Figure 12b. The results clearly show that the frequency band where the interference is concentrated has been effectively suppressed, while the spectral morphology of the remaining bands remains largely consistent with the original signal.
Next, we demonstrate the performance of the proposed method on this real-world data through its imaging results. Figure 13 illustrates the performance of methods when processing a full real-world scene. As shown in Figure 13a, this scene is corrupted by large-area RFI, which manifest as bright stripes running through the image and severely affects its interpretability.
For a detailed evaluation, we selected a local region rich in land cover features and significantly affected by interference for a magnified analysis (Figure 13b). In this region, the strong RFI stripes mask most of the land cover textures. Subsequently, the original echo data from this region was processed for interference suppression and re-imaged using the proposed method and the comparison methods. Figure 13c–f respectively show the imaging results after processing by time-frequency domain notching, PISNet, FDNet, and the proposed method.
A visual comparison of the imaging results in Figure 13 reveals that, in the real-world data scenario, the time-frequency domain notching performs poorly, whose result losses lots of land cover information, rendering the imaging result uninterpretable. In contrast, the DL-based methods effectively suppress the interference stripes: the uniformity of the background region is restored, and the overall visual quality is significantly improved, thereby revealing the land cover structures that were originally masked by interference. A further comparison of the three DL-based methods shows that the proposed method provides the best preservation of original details while effectively filtering the interference. Its resulting land cover features are the closest to the ground truth, demonstrating superior fidelity.
Subsequently, two No-Reference metrics (as mentioned in Section 4.2), Image Contrast (IC) and Image Entropy (IE) were used to quantitatively analyze the imaging results. The results are presented in Table 4:
Table 4 provides the quantitative evaluation results based on real-world data. Regarding the IC, the original interference-corrupted image (Figure 13b) exhibits low IC because the interference raises the noise floor in darker regions. After suppression, the image contrast is improved, among which the proposed method achieves the highest IC value, indicating that its restored image has the clearest luminance definition. In terms of the IE, RFI injects pseudo-random information into the image, disrupting the pixel distribution and raising the entropy value, and the image entropy is reduced after suppression. The drastic entropy reduction from the time-frequency domain notching method is attributed to the excessive removal of land cover information and is therefore not indicative of good performance. Among the three DL-based methods, the image processed by the proposed method achieves the best entropy reduction rate, indicating that it removed the most redundant interference information and that the restored image structure is the most well-structured.
The results from the real-world data experiment further verify the superior comprehensive performance of the proposed method in suppressing interference in real-world, complex scenarios.

4.3.3. Ablation Experiment

To verify the effectiveness of the kurtosis-guided attention mechanism, we conducted an ablation experiment on the kurtosis-guided attention module. This experiment compares the baseline network (without the module) with the proposed method (which includes the module). Both networks were trained on the same training set. The results are shown in Figure 14, and the specific values are presented in Table 5.
The quantitative results of the ablation study (Table 5) clearly confirm the effectiveness of the kurtosis-guided attention module. The network with this module outperformed the baseline network on three key metrics: Mean Squared Error (MSE), PSNR, and SSIM, achieving a PSNR improvement of over 1 dB and a 14.87% reduction in MSE. This result demonstrates that leveraging the kurtosis prior to guide the network’s attention is an effective design. It significantly enhances the model’s ability to distinguish between interference and the useful signal, thereby improving the final suppression performance and reconstruction quality.

4.3.4. Evaluation of Phase Fidelity

Phase information fidelity is critical for SAR applications such as interferometric measurements. This section investigates the proposed method’s ability to preserve the original signal phase while concurrently suppressing RFI. Furthermore, an evaluation framework is introduced for the quantitative analysis of phase differences before and after RFI suppression. This study utilizes semi-physical simulation data based on real airborne SAR echoes, into which simulated RFI is injected following the methodology described in Section 4. Specifically, we compare the phases of three distinct signals: the interference-free ground truth, the interference-corrupted signal, and the signal recovered by the proposed suppression method. Figure 15 illustrates the time-frequency spectrograms and the unwrapped phase of these three signals. Figure 15a presents the time-frequency spectrograms for the three signals, which are, from top to bottom, the ground truth signal, the interfered signal, and the interference-suppressed signal. Figure 15b compares the corresponding unwrapped phases, where the true phase is marked by the blue line, the phase of the interfered signal by the red line, and the phase of the suppressed signal by the green line.
The comparison result presented in Figure 15 clearly reveals that the presence of RFI induces a substantial deviation between the phase of the interfered signal and the ground truth phase. However, after processing by the proposed method, the signal phase is significantly restored, with both its overall structure and detailed features demonstrating a high degree of consistency with the ground truth phase.
The quantitative evaluation results are presented in Table 6, where the units of MAD, SD, and RMSE are radians. The data indicate that the phase error metrics of the processed signal were significantly reduced compared with those of the interfered signal. For instance, under the WBI+NBI scenario, the MAD decreased from 821.557 to 17.243, corresponding to a 97.9% reduction. Based on both qualitative visual comparisons and quantitative analyses, the experiment demonstrates that the proposed method can effectively suppress RFI while preserving the original phase characteristics of the SAR signal effectively, which is crucial for subsequent data processing and practical applications.

5. Discussion

The experimental results presented in Section 4 demonstrate that the proposed method achieves superior interference suppression performance compared to competing methods. This improvement is largely attributed to the introduction of the kurtosis-guided attention mechanism, which effectively distinguishes RFI from SAR echoes based on their statistical discrepancies. In this section, we provide a comprehensive analysis of the proposed method, covering statistical modeling of SAR echoes, sensitivity analysis of STFT parameters for kurtosis calculation, computational complexity, and performance under low SIR conditions.

5.1. Statistical Modeling and Applicability Analysis of K-Distribution

An important theoretical premise of this work is the modeling of SAR echoes using the K-distribution. While the Rayleigh distribution is classically used to model the amplitude of SAR echoes in homogeneous regions based on the Central Limit Theorem, it is theoretically insufficient for capturing the heavy-tailed characteristics observed in heterogeneous scenes such as urban areas or land-sea interfaces [27].
To verify the applicability of the K-distribution to real SAR data, we selected a representative heterogeneous region from the experimental dataset and performed a fitting analysis on the amplitude distribution of the SAR echoes, employing both the K-distribution and the Rayleigh distribution for a comparative evaluation. The parameters for the K-distribution (shape parameter ν and scale parameter α ) were estimated using a two-step approach: initial values were calculated via the Method of Moments (MoM) [32] and subsequently refined via nonlinear least-squares fitting. The Rayleigh distribution parameter ( σ ) was estimated using the Maximum Likelihood Estimator (MLE).
As shown in Figure 16a, the selected region covers a complex transition area containing both land and water features. The corresponding amplitude distribution fitting results are presented in Figure 16b. The empirical histogram exhibits a distinct heavy tail, a characteristic manifestation of heterogeneous scattering. The standard Rayleigh fit fails to capture this behavior, significantly underestimating the probability of high-amplitude echoes. In contrast, the K-distribution fit aligns closely with the empirical data. The estimated shape parameter ν = 0.10 is close to zero, quantitatively corroborating the strong heterogeneity of the scene. This experiment strongly supports the physical validity of using the K-distribution to characterize the SAR echoes in complex scenarios.
Furthermore, we discuss the applicability of the model across varying scenarios. We acknowledge that in real-world SAR data, the statistical characteristics of SAR signals may vary with factors such as frequency bands, resolutions, and scene types. Consequently, the empirical distribution parameters will fluctuate, and in some complex scenarios, the distribution may exhibit deviations from the ideal K-distribution model. However, the robustness of our method does not rely on a mathematically perfect distributional fit. The proposed method leverages the relative disparity in statistical properties (specifically kurtosis) between RFI and background echoes. As evidenced by the calculation results in Section 3.1, interference-free SAR echoes consistently exhibit low kurtosis values, regardless of whether the scene is highly heterogeneous (well-modeled by the K-distribution with a small shape parameter ν ) or homogeneous (approaching a Rayleigh distribution with a larger ν ). Conversely, the high-power and non-stationary nature of RFI signals causes a sharp rise in kurtosis, leading to values that are significantly higher than those of the background echoes. This pronounced statistical divergence ensures that the kurtosis-guided attention mechanism serves as a robust and broadly applicable indicator for RFI localization, maintaining its effectiveness even as SAR echo model varies under different operating conditions.

5.2. Sensitivity Analysis of STFT Parameters for Kurtosis Calculation

Section 3.1 detailed the temporal kurtosis calculation employed in our proposed method. Specifically, the raw data is transformed into the time-frequency domain via STFT, and the kurtosis is computed column-wise (i.e., for each time instance) on the resulting amplitude matrix. Since the STFT parameters determine the time-frequency resolution and the number of samples for statistical calculation, they inevitably influence the kurtosis calculation. For the STFT configuration in this work, we set the window length to 256 and the overlap ratio to 255/256. This subsection experimentally investigates the impact of these STFT settings on the kurtosis calculation, providing an engineering justification for our selection.
To evaluate the impact of STFT parameters, we selected a representative RFI-corrupted SAR signal and computed its kurtosis using various STFT configurations. Figure 17a illustrates the kurtosis curves for different window lengths with fixed overlap. Changing the window length from 256 to 128 is observed to result in a global decrease in the kurtosis values. This reduction reduces the statistical contrast between RFI and background echoes, which is detrimental to interference detection. Increasing the window length to 512 significantly enhances the kurtosis in strong interference regions (red dashed box), but suppresses the values in regions with weaker interference (blue dashed box), potentially compromising the detection of subtle RFI signals. Consequently, a window length of 256 was selected as the optimal setting to balance sensitivity and localization precision. Figure 17b illustrates the kurtosis curves for varying overlap settings under a constant window length. The results indicate that the kurtosis is relatively insensitive to the overlap parameter. Therefore, we selected a high overlap ratio primarily to ensure the temporal continuity of the generated STFT spectrograms.

5.3. Computational Complexity Analysis

In this section, we conducted an analysis of computational complexity. We compared the proposed KANet with comparison methods in terms of model size (Parameters), computational cost (FLOPs), processing latency, and training efficiency. The evaluation metrics of processing latency are defined as follows: “Inference Time” denotes the network forward-pass latency for a single 256 × 256 time-frequency patch, while “Full Pipeline” measures the total processing duration for a single SAR signal pulse, encompassing the STFT, patch-wise inference, and ISTFT reconstruction. For training efficiency, we recorded the total training time required to complete 15 epochs and the peak GPU memory consumption during training with a batch size of 16. All experiments were performed on a workstation equipped with an NVIDIA GeForce RTX 4070 GPU (NVIDIA, Santa Clara, CA, USA). The quantitative comparison results are summarized in Table 7.
As indicated in Table 7, the traditional Notch Filter achieves the fastest processing speed due to its non-parametric nature. However, as demonstrated in Section 4, this efficiency comes at the cost of severe signal loss and image degradation, making it unsuitable for high-precision tasks. Among the DL-based methods, the proposed KANet demonstrates a competitive balance between performance and efficiency.
For processing latency, although the introduction of the kurtosis-guided attention module slightly increases the FLOPs, KANet achieves a single-patch inference time of 7.93 ms. Notably, despite FDNet having lower FLOPs, it exhibits higher latency than KANet, which may be attributed to its dual-branch architecture. Furthermore, the Full Pipeline times across all DL-based methods are comparable, suggesting that total runtime is dominated by the fixed overhead of pre-processing and post-processing stages.
Regarding training efficiency, the proposed KANet requires 2418 s for complete training, with a peak GPU consumption of 2116 MB. Compared to PISNet, the additional overhead introduced by the kurtosis-guided attention module remains modest, while FDNet demands more resources due to its design. These results confirm that the proposed KANet achieves superior suppression performance without imposing significant computational burden.

5.4. Performance Under Low SIR Conditions

To evaluate the limiting suppression capability of the proposed method, we conducted additional experiments under varying SIR conditions ranging from 0 dB to −20 dB. The experimental setup follows the description in Section 4 with composite RFI injected into the airborne SAR data. To ensure consistency, the interference spatial distribution and spectral characteristics were kept constant across all SIR levels, with only the interference amplitude scaled to achieve the target SIR values.
Figure 18 presents the imaging results for each SIR level in this experiment. The leftmost panel shows the ground truth SAR image without interference, serving as a reference for comparison. The remaining columns correspond to decreasing SIR values of 0 dB, −5 dB, −10 dB, −15 dB, and −20 dB, respectively. The top row depicts the SAR images contaminated by simulated composite RFI, where the interference manifests as distinct bright rectangular regions that become increasingly dominant as SIR decreases. The bottom row presents the images recovered by the proposed suppression method.
Table 8 presents the PSNR and SSIM metrics comparing the interference-corrupted images and the suppressed images against the ground truth at different SIR levels.
As shown in Table 8, the proposed method demonstrates consistent suppression performance across all tested SIR conditions. At 0 dB SIR, the method achieves a PSNR improvement of 12.41 dB and elevates the SSIM from 0.6762 to 0.9426, indicating excellent structural preservation. As the SIR decreases to −10 dB, the method still maintains substantial improvement with PSNR increasing by 11.00 dB and SSIM improving from 0.2729 to 0.7753. Under extreme conditions at −15 dB and −20 dB SIR, the suppression performance gradually degrades but remains effective. As visible in Figure 18, faint residual interference appear in the suppressed images at these extremely low SIR levels. At −20 dB SIR, the proposed method still achieves a PSNR gain of 9.23 dB, successfully recovering the main structural features of the SAR scene.
These results validate that the proposed method enables effective RFI suppression across a broad dynamic range of interference intensities. The method maintains excellent visual quality down to approximately −10 dB SIR, beyond which residual interference become visible but the overall scene structure remains well preserved. This experiment demonstrates the robustness and practical applicability of the proposed method operating in environments of severe RFI.

6. Conclusions

Addressing the challenge of RFI suppression in SAR systems, this paper proposes a deep learning network featuring an attention mechanism guided by signal statistics. The method calculates the kurtosis of time-frequency signals in a window-wise manner, which quantifies the non-Gaussianity and impulsive nature of RFI, then incorporated the kurtosis into a multi-scale attention to guide the network in suppressing interference. Furthermore, unlike most existing methods that predominantly focus on amplitude information, this study systematically analyzes the phase fidelity of the proposed method. We established a quantitative evaluation framework and verified, via simulation, that the method effectively filters interference without significantly compromising phase information.
However, as with other DL-based RFI suppression methods, the proposed method has certain limitations. Under extremely low SIR conditions, residual interference may remain in the suppressed results, as demonstrated in Section 5.4. Additionally, the generalization capability depends on the diversity of training data, and performance may be affected when the interference characteristics in practical applications differ significantly from those included in the training dataset.
Future research will focus on the following aspects: First, we aim to further refine the network architecture to enhance its accuracy and reliability under challenging conditions, such as low-SIR scenarios. Second, we plan to expand the training dataset to include more diverse interference, improving the model’s generalization capability. Third, we plan to conduct extended experiments on phase fidelity. Building on the current real-world data analysis, we will investigate the method’s impact on phase-sensitive SAR applications. Since these applications are inherently sensitive to phase errors, their performance can serve as an indirect yet practical metric for validating phase fidelity on measured SAR data.

Author Contributions

Conceptualization, J.W. (Jiajun Wu), J.S. and B.H.; methodology, J.W. (Jiajun Wu) and J.S.; software, J.W. (Jiajun Wu); validation, J.W. (Jiajun Wu), J.S. and D.Y.; formal analysis, J.W. (Jiajun Wu), J.S. and B.H.; investigation, J.W. (Jiajun Wu); resources, J.W. (Jiajun Wu) and D.Y.; data curation, J.W. (Jiajun Wu) and J.W. (Jiaxin Wan); writing—original draft preparation, J.W. (Jiajun Wu), B.H. and J.S.; writing—review and editing, J.W. (Jiajun Wu), B.H., J.S., D.Y. and J.W. (Jiaxin Wan); visualization, J.W. (Jiajun Wu); supervision, B.H. and J.S.; project administration, B.H.; funding acquisition, B.H. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by National Natural Science Foundation of China (no. 62131019).

Data Availability Statement

The data presented in this study are available on request from the corresponding author due to privacy restrictions.

Acknowledgments

The authors would like to thank the reviewers and the editor for the constructive comments.

Conflicts of Interest

The authors declare no conflicts of interest.

References

  1. Berens, P. Introduction to Synthetic Aperture Radar (SAR). In Advanced Radar Signal and Data Processing; RTO-EN-SET-086; RTO: Neuilly-sur-Seine, France, 2006; pp. 3-1–3-28. [Google Scholar]
  2. Wiesbeck, W.; Sit, L. Radar 2020: The Future of Radar Systems. In Proceedings of the 2014 International Radar Conference, Lille, France, 13–17 October 2014; pp. 1–6. [Google Scholar] [CrossRef]
  3. Chojka, A.; Artiemjew, P.; Rapiński, J. RFI Artefacts Detection in Sentinel-1 Level-1 SLC Data Based On Image Processing Techniques. Sensors 2020, 20, 2919. [Google Scholar] [CrossRef] [PubMed]
  4. Moreira, A.; Prats-Iraola, P.; Younis, M.; Krieger, G.; Hajnsek, I.; Papathanassiou, K.P. A Tutorial on Synthetic Aperture Radar. IEEE Geosci. Remote Sens. Mag. 2013, 1, 6–43. [Google Scholar] [CrossRef]
  5. Feng, Y.; Han, B.; Wang, X.; Shen, J.; Guan, X.; Ding, H. Self-Supervised Transformers for Unsupervised SAR Complex Interference Detection Using Canny Edge Detector. Remote Sens. 2024, 16, 306. [Google Scholar] [CrossRef]
  6. Guarnieri, A.M. Removing RF Interferences from P- Airplane SAR Data. In Proceedings of the IGARSS ’96. 1996 International Geoscience and Remote Sensing Symposium, Lincoln, NE, USA, 31 May 1996. [Google Scholar]
  7. Reigber, A.; Ferro-Famil, L. Interference Suppression in Synthesized SAR Images. IEEE Geosci. Remote Sens. Lett. 2005, 2, 45–49. [Google Scholar] [CrossRef]
  8. Zhao, J.; Yue, Y.; Zhang, X.; Shi, Z.; Fang, K.; Liao, G. A Joint Framework of Wavelet Filtering and Fast GSVT-LRSD Algorithm for SAR Narrowband Pulsed RFI Suppression. IEEE Trans. Geosci. Remote Sens. 2025, 63, 1–16. [Google Scholar] [CrossRef]
  9. Lord, R.; Inggs, M. Efficient RFI Suppression in SAR Using a LMS Adaptive Filter with Sidelobe Suppression Integrated with the Range-Doppler Algorithm. In Proceedings of the IEEE 1999 International Geoscience and Remote Sensing Symposium. IGARSS’99 (Cat. No.99CH36293), Hamburg, Germany, 28 June–2 July 1999; Volume 1, pp. 574–576. [Google Scholar] [CrossRef]
  10. Li, J.; Ergesh, T.; Chen, M.; Duan, X. RFI Mitigation Algorithm Based on Adaptive Filter. In Proceedings of the Third International Conference on Electrical, Electronics, and Information Engineering (EEIE 2024), Wuhan, China, 14–16 November 2024; Malik, H., Ed.; SPIE: Bellingham, WA, USA, 2025; p. 51. [Google Scholar] [CrossRef]
  11. Zhou, F.; Wu, R.; Xing, M.; Bao, Z. Eigensubspace-Based Filtering With Application in Narrow-Band Interference Suppression for SAR. IEEE Geosci. Remote Sens. Lett. 2007, 4, 75–79. [Google Scholar] [CrossRef]
  12. Tao, M.; Zhou, F.; Liu, J.; Liu, Y.; Zhang, Z.; Bao, Z. Narrow-Band Interference Mitigation for SAR Using Independent Subspace Analysis. IEEE Trans. Geosci. Remote Sens. 2014, 52, 5289–5301. [Google Scholar] [CrossRef]
  13. Wang, Z.; Wang, L.; Wang, K. Joint RFI and Clutter Suppression in a Low-Altitude Surveillance Radar. In Proceedings of the 2025 26th International Radar Symposium (IRS), Hamburg, Germany, 21–23 May 2025; pp. 1–6. [Google Scholar] [CrossRef]
  14. Tao, M.; Su, J.; Huang, Y.; Wang, L. Mitigation of Radio Frequency Interference in Synthetic Aperture Radar Data: Current Status and Future Trends. Remote Sens. 2019, 11, 2438. [Google Scholar] [CrossRef]
  15. Liu, Z.; Liao, G.; Yang, Z. Time Variant RFI Suppression for SAR Using Iterative Adaptive Approach. IEEE Geosci. Remote Sens. Lett. 2013, 10, 1424–1428. [Google Scholar] [CrossRef]
  16. Huang, Y.; Liao, G.; Xu, J.; Li, J. Narrowband RFI Suppression for SAR System via Efficient Parameter-Free Decomposition Algorithm. IEEE Trans. Geosci. Remote Sens. 2018, 56, 3311–3322. [Google Scholar] [CrossRef]
  17. Zhang, H.; Jin, G.; Zhang, H.; Wang, Y.; Cheng, Y.; Guo, Z.; Ye, S.; Zhu, D. A Fast Repeater Mainlobe Deceptive Jamming Suppression Method for FDA-MIMO SAR Under Complex Motion Condition. IEEE Trans. Aerosp. Electron. Syst. 2025; in press. [Google Scholar] [CrossRef]
  18. Joy, S.; Nguyen, L.H.; Tran, T.D. Radio Frequency Interference Suppression in Ultra-Wideband Synthetic Aperture Radar Using Range-Azimuth Sparse and Low-Rank Model. In Proceedings of the 2016 IEEE Radar Conference (RadarConf), Philadelphia, PA, USA, 2–6 May 2016; pp. 1–4. [Google Scholar] [CrossRef]
  19. Liu, H.; Li, D.; Zhou, Y.; Truong, T.K. Joint Wideband Interference Suppression and SAR Signal Recovery Based on Sparse Representations. IEEE Geosci. Remote Sens. Lett. 2017, 14, 1542–1546. [Google Scholar] [CrossRef]
  20. Huang, Y.; Liao, G.; Zhang, Z.; Xiang, Y.; Li, J.; Nehorai, A. Fast Narrowband RFI Suppression Algorithms for SAR Systems via Matrix-Factorization Techniques. IEEE Trans. Geosci. Remote Sens. 2019, 57, 250–262. [Google Scholar] [CrossRef]
  21. Li, J.; Cai, Y.; Zhang, Y.; Liang, D.; Nan, Y.; Li, B.; Liu, K.; Wang, R. RFI Suppression Scheme for Complicated Low-Rank Violation Cases. IEEE Trans. Geosci. Remote Sens. 2024, 62, 1–17. [Google Scholar] [CrossRef]
  22. Guo, Q.; Tian, Y.; Qi, L.; Wang, Y.; Li, D.; Kaliuzhnyi, M. A SAR Multiple RFI Suppression Method via Frobenius Norm and Iterative Matrix Decomposition. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2024, 17, 3927–3939. [Google Scholar] [CrossRef]
  23. Fan, W.; Zhou, F.; Tao, M.; Bai, X.; Rong, P.; Yang, S.; Tian, T. Interference Mitigation for Synthetic Aperture Radar Based on Deep Residual Network. Remote Sens. 2019, 11, 1654. [Google Scholar] [CrossRef]
  24. Xu, X.; Fan, W.; Wang, S.; Zhou, F. WBIM-GAN: A Generative Adversarial Network Based Wideband Interference Mitigation Model for Synthetic Aperture Radar. Remote Sens. 2024, 16, 910. [Google Scholar] [CrossRef]
  25. Tao, M.; Li, J.; Su, J.; Wang, L. Characterization and Removal of RFI Artifacts in Radar Data via Model-Constrained Deep Learning Approach. Remote Sens. 2022, 14, 1578. [Google Scholar] [CrossRef]
  26. Gao, G. Statistical Modeling of SAR Images: A Survey. Sensors 2010, 10, 775–795. [Google Scholar] [CrossRef]
  27. Jao, J. Amplitude Distribution of Composite Terrain Radar Clutter and the κ-Distribution. IEEE Trans. Antennas Propag. 1984, 32, 1049–1062. [Google Scholar] [CrossRef]
  28. Lee, J.-S.; Hoppel, K.; Mango, S.; Miller, A. Intensity and Phase Statistics of Multilook Polarimetric and Interferometric SAR Imagery. IEEE Trans. Geosci. Remote Sens. 1994, 32, 1017–1028. [Google Scholar] [CrossRef]
  29. Shen, J.; Han, B.; Pan, Z.; Li, G.; Hu, Y.; Ding, C. Learning Time–Frequency Information With Prior for SAR Radio Frequency Interference Suppression. IEEE Trans. Geosci. Remote Sens. 2022, 60, 1–16. [Google Scholar] [CrossRef]
  30. Yang, H.; Tao, M.; Chen, S.; Xi, F.; Liu, Z. On the Mutual Interference Between Spaceborne SARs: Modeling, Characterization, and Mitigation. IEEE Trans. Geosci. Remote Sens. 2021, 59, 8470–8485. [Google Scholar] [CrossRef]
  31. Fang, F.; Li, H.; Meng, W.; Dai, D.; Xing, S. Synthetic-Aperture Radar Radio-Frequency Interference Suppression Based on Regularized Optimization Feature Decomposition Network. Remote Sens. 2024, 16, 2540. [Google Scholar] [CrossRef]
  32. Iskander, D.; Zoubir, A.; Boashash, B. A Method for Estimating the Parameters of the K Distribution. IEEE Trans. Signal Process. 1999, 47, 1147–1151. [Google Scholar] [CrossRef]
Figure 1. An RFI-corrupted SAR image from GF-3 satellite, and the RFI spans both land and ocean regions. (a) Full-scene SAR image. (b) Enlarged view of the RFI-corrupted region. (c) Time-frequency analysis of RFI-corrupted signal.
Figure 1. An RFI-corrupted SAR image from GF-3 satellite, and the RFI spans both land and ocean regions. (a) Full-scene SAR image. (b) Enlarged view of the RFI-corrupted region. (c) Time-frequency analysis of RFI-corrupted signal.
Remotesensing 18 00255 g001
Figure 2. Frequency-domain and time-frequency-domain representations of three common interference types. (a) Narrowband interference. (b) Linear frequency modulation wideband interference. (c) Sinusoidal modulation wideband interference.
Figure 2. Frequency-domain and time-frequency-domain representations of three common interference types. (a) Narrowband interference. (b) Linear frequency modulation wideband interference. (c) Sinusoidal modulation wideband interference.
Remotesensing 18 00255 g002
Figure 3. Time-sequenced kurtosis curves of interference-corrupted pulses across multiple scenarios. Each row represents a distinct scenario. The red dashed line in the SAR images indicate the azimuthal positions of the pulse analyzed in the corresponding plots.
Figure 3. Time-sequenced kurtosis curves of interference-corrupted pulses across multiple scenarios. Each row represents a distinct scenario. The red dashed line in the SAR images indicate the azimuthal positions of the pulse analyzed in the corresponding plots.
Remotesensing 18 00255 g003
Figure 4. Overall framework of the proposed method.
Figure 4. Overall framework of the proposed method.
Remotesensing 18 00255 g004
Figure 5. Schematic Diagram of the Backbone Network.
Figure 5. Schematic Diagram of the Backbone Network.
Remotesensing 18 00255 g005
Figure 6. Schematic Diagram of the Residual Unit.
Figure 6. Schematic Diagram of the Residual Unit.
Remotesensing 18 00255 g006
Figure 7. Schematic Diagram of the Kurtosis-Guided Attention Weight Generation Module.
Figure 7. Schematic Diagram of the Kurtosis-Guided Attention Weight Generation Module.
Remotesensing 18 00255 g007
Figure 8. Schematic Diagram of Multi-scale Attention Application.
Figure 8. Schematic Diagram of Multi-scale Attention Application.
Remotesensing 18 00255 g008
Figure 9. Time-frequency spectrograms of sample data from the training set. The horizontal and vertical axes denote time and frequency, respectively. The indices (1)–(6) represent distinct data samples, indicating a one-to-one correspondence between the corrupted inputs in (a) and their respective ground truth references in (b). (a) Time-frequency spectrograms of interference-corrupted data. (b) Corresponding interference-free ground truth spectrograms.
Figure 9. Time-frequency spectrograms of sample data from the training set. The horizontal and vertical axes denote time and frequency, respectively. The indices (1)–(6) represent distinct data samples, indicating a one-to-one correspondence between the corrupted inputs in (a) and their respective ground truth references in (b). (a) Time-frequency spectrograms of interference-corrupted data. (b) Corresponding interference-free ground truth spectrograms.
Remotesensing 18 00255 g009
Figure 10. Imaging results of the comparative experiment. (a) Ground truth image. (b) Interference-corrupted image. (c) Result of time-frequency domain notching. (d) Result of PISNet. (e) Result of FDNet. (f) Result of the proposed method.
Figure 10. Imaging results of the comparative experiment. (a) Ground truth image. (b) Interference-corrupted image. (c) Result of time-frequency domain notching. (d) Result of PISNet. (e) Result of FDNet. (f) Result of the proposed method.
Remotesensing 18 00255 g010
Figure 11. Time-frequency domain suppression results of the comparative experiment. (a) Interference-corrupted signal. (b) Result of time-frequency domain notching. (c) Result of PISNet. (d) Result of FDNet. (e) Result of the proposed method.
Figure 11. Time-frequency domain suppression results of the comparative experiment. (a) Interference-corrupted signal. (b) Result of time-frequency domain notching. (c) Result of PISNet. (d) Result of FDNet. (e) Result of the proposed method.
Remotesensing 18 00255 g011
Figure 12. Validation of interference suppression performance on a single pulse from real-world GF-3 SAR data. (a) Time-frequency representation before and after suppression. (b) Frequency spectrum before and after suppression.
Figure 12. Validation of interference suppression performance on a single pulse from real-world GF-3 SAR data. (a) Time-frequency representation before and after suppression. (b) Frequency spectrum before and after suppression.
Remotesensing 18 00255 g012
Figure 13. Comparison of Interference Suppression Performance and Details on Real-world GF-3 Data. (a) Full-Scene Imaging Result. (b) Cropped view of a severely interference-corrupted scene in (a). (c) Result of time-frequency domain notching. (d) Result of PISNet. (e) Result of FDNet. (f) Result of the proposed method.
Figure 13. Comparison of Interference Suppression Performance and Details on Real-world GF-3 Data. (a) Full-Scene Imaging Result. (b) Cropped view of a severely interference-corrupted scene in (a). (c) Result of time-frequency domain notching. (d) Result of PISNet. (e) Result of FDNet. (f) Result of the proposed method.
Remotesensing 18 00255 g013
Figure 14. Results of the ablation study. The plots show the Loss, PSNR, and SSIM from left to right.
Figure 14. Results of the ablation study. The plots show the Loss, PSNR, and SSIM from left to right.
Remotesensing 18 00255 g014
Figure 15. The result of the experiment of phase fidelity. (a) Time-frequency domain spectrograms. (b) Comparison of unwrapped phases. The blue, red, and green lines represent the ground truth, the interfered signal, and the signal suppressed by the proposed method, respectively.
Figure 15. The result of the experiment of phase fidelity. (a) Time-frequency domain spectrograms. (b) Comparison of unwrapped phases. The blue, red, and green lines represent the ground truth, the interfered signal, and the signal suppressed by the proposed method, respectively.
Remotesensing 18 00255 g015
Figure 16. Statistical characteristic verification in a heterogeneous scene. (a) The SAR image used for validation, where the red box indicates the selected representative heterogeneous region for statistical analysis. (b) Comparison of amplitude distribution fitting between Rayleigh and K-distribution for the echoes within the selected region. The K-distribution (red solid line) significantly outperforms the Rayleigh distribution (blue dashed line) in characterizing the heavy-tailed distribution of the echoes.
Figure 16. Statistical characteristic verification in a heterogeneous scene. (a) The SAR image used for validation, where the red box indicates the selected representative heterogeneous region for statistical analysis. (b) Comparison of amplitude distribution fitting between Rayleigh and K-distribution for the echoes within the selected region. The K-distribution (red solid line) significantly outperforms the Rayleigh distribution (blue dashed line) in characterizing the heavy-tailed distribution of the echoes.
Remotesensing 18 00255 g016
Figure 17. Sensitivity analysis of kurtosis calculation with respect to STFT parameters. (a) Kurtosis curves under different window lengths (L) with a fixed overlap ratio (255/256). (b) Kurtosis curves under different overlap lengths with a fixed window length ( L = 256 ).
Figure 17. Sensitivity analysis of kurtosis calculation with respect to STFT parameters. (a) Kurtosis curves under different window lengths (L) with a fixed overlap ratio (255/256). (b) Kurtosis curves under different overlap lengths with a fixed window length ( L = 256 ).
Remotesensing 18 00255 g017
Figure 18. SAR imaging results under different SIR conditions. Left: ground truth image without interference. Right: columns represent SIR levels of 0 dB, −5 dB, −10 dB, −15 dB, and −20 dB from left to right; top row shows interference-corrupted images, bottom row shows suppressed images using the proposed method.
Figure 18. SAR imaging results under different SIR conditions. Left: ground truth image without interference. Right: columns represent SIR levels of 0 dB, −5 dB, −10 dB, −15 dB, and −20 dB from left to right; top row shows interference-corrupted images, bottom row shows suppressed images using the proposed method.
Remotesensing 18 00255 g018
Table 1. Detailed Sensor Parameters and Scene Descriptions of the Experimental Data.
Table 1. Detailed Sensor Parameters and Scene Descriptions of the Experimental Data.
ParameterAirborne SARGF-3 Satellite
BandX-bandC-band
Range Sampling Rate548.571 MHz66.667 MHz
PRF533.330 Hz1414.333 Hz
Bandwidth480 MHz60 MHz
Imaging ModeStripmapStripmap
Scene TypesAgricultureCoast, Ocean, Mountain
Number of Scenes23
Table 2. Simulated Interference Parameters for Different Scenarios.
Table 2. Simulated Interference Parameters for Different Scenarios.
Interference TypeParameterValue (Scene 1)Value (Scene 2)Value (Scene 3)
NBIBandwidth<10 MHz<10 MHz<10 MHz
Frequency4.5–7 GHz6–8 GHz3–6.5 GHz
Count322
LFM WBIBandwidth100–300 MHz130–200 MHz80–270 MHz
Frequency3–10 GHz6.2–8.3 GHz4.5–9 GHz
Chirp Rate 0.26 × 10 14 1.1 × 10 14 0.3 × 10 14 0.8 × 10 14 0.38 × 10 14 0.6 × 10 14
Count322
Sinusoidal Modulation WBIBandwidth25–100 MHz50 MHz30–60 MHz
Frequency0.2–1.1 MHz0.77 MHz0.4–0.8 MHz
Modulation Coefficient3–152515–30
Initial Phase 0 π π / 4 π / 4 3 π / 4
Count212
Table 3. Metric Results for the Comparative Experiment (shown in Figure 10). The best results are highlighted in bold. For the IC metric, values in parentheses indicate the relative error rate with respect to the Ground Truth (lower is better).
Table 3. Metric Results for the Comparative Experiment (shown in Figure 10). The best results are highlighted in bold. For the IC metric, values in parentheses indicate the relative error rate with respect to the Ground Truth (lower is better).
SceneMethodPSNR (dB) ↑SSIM ↑IC (Error %) ↓
Scene 1Ground Truth0.9063 (Ref)
Interfered Image16.340.47390.7846 (13.43%)
Notch22.740.79240.9034 (0.32%)
PISNet26.390.90060.9044 (0.21%)
FDNet27.140.90410.9048 (0.17%)
Proposed Method27.920.92050.9049 (0.15%)
Scene 2Ground Truth0.9068 (Ref)
Interfered Image16.110.70620.8597 (5.19%)
Notch17.440.40420.8967 (1.11%)
PISNet28.430.93770.9045 (0.25%)
FDNet28.540.93800.9049 (0.21%)
Proposed Method29.280.94710.9045 (0.25%)
Scene 3Ground Truth0.2925 (Ref)
Interfered Image13.270.20790.2092 (28.48%)
Notch15.200.30470.3206 (9.61%)
PISNet23.880.75980.2659 (9.09%)
FDNet24.320.78240.2701 (7.66%)
Proposed Method24.340.82490.2896 (0.99%)
Note: The symbols ↑ and ↓ indicate that higher values are better and lower values are better, respectively.
Table 4. Metric Results for the Real-world Data Experiment (shown in Figure 13). The best valid results are highlighted in bold.
Table 4. Metric Results for the Real-world Data Experiment (shown in Figure 13). The best valid results are highlighted in bold.
MethodIC ↑IE ↓
Interfered Image0.25437.6627
Time-Frequency Notching0.23196.2006
PISNet0.26567.2297
FDNet0.26707.3432
Proposed Method0.27327.1838
Note: The symbols ↑ and ↓ indicate that higher values are better and lower values are better, respectively.
Table 5. Ablation Study Results for the Kurtosis-Guided Attention Module. The proposed method (with the module) is compared against the baseline. The best performance metrics are highlighted in bold.
Table 5. Ablation Study Results for the Kurtosis-Guided Attention Module. The proposed method (with the module) is compared against the baseline. The best performance metrics are highlighted in bold.
MethodMSE Loss ↓PSNR (dB) ↑SSIM ↑
Baseline Network0.00085431.25680.9640
Proposed Method0.00072732.51520.9736
Improvement Rate14.87%4.03%1.00%
Note: The symbols ↑ and ↓ indicate that higher values are better and lower values are better, respectively. Values in italics represent the calculated percentage improvement of the proposed method compared to the baseline.
Table 6. Phase Fidelity Evaluation Results for Different Interference Scenarios. The units for MAD, SD, and RMSE are radians. The best results are highlighted in bold.
Table 6. Phase Fidelity Evaluation Results for Different Interference Scenarios. The units for MAD, SD, and RMSE are radians. The best results are highlighted in bold.
ScenarioSignal StateMAD ↓SD ↓RMSE ↓PCC ↑
NBIInterfered2659.5711502.4963054.6280.905
Processed70.08036.66476.3930.980
WBIInterfered689.955702.168860.066−0.684
Processed28.93316.54233.3090.994
NBI+WBIInterfered821.557408.719917.6090.429
Processed17.24320.29020.4080.990
Note: The symbols ↑ and ↓ indicate that higher values are better and lower values are better, respectively.
Table 7. Computational Complexity Comparison of Different Methods.
Table 7. Computational Complexity Comparison of Different Methods.
MethodParamsFLOPsInference (ms)Full Pipeline (ms)Training Time (s)GPU Mem (MB)
KANet (Proposed)14.50 M45.57 G7.931886.002418.102116
PISNet15.17 M44.32 G7.341816.642031.812008
FDNet13.17 M32.69 G8.631847.832551.722821
Notch Filter497.05
Table 8. Performance of the Proposed Method Under Different SIR Conditions.
Table 8. Performance of the Proposed Method Under Different SIR Conditions.
SIR (dB)Interfered ImageSuppressed Image
PSNR (dB)SSIMPSNR (dB)SSIM
015.870.676228.280.9426
−513.060.478125.800.8523
−1011.170.272922.170.7753
−1510.020.133820.130.6957
−209.380.065818.610.6002
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Wu, J.; Shen, J.; Han, B.; Yin, D.; Wan, J. SAR Radio Frequency Interference Suppression Based on Kurtosis-Guided Attention Network. Remote Sens. 2026, 18, 255. https://doi.org/10.3390/rs18020255

AMA Style

Wu J, Shen J, Han B, Yin D, Wan J. SAR Radio Frequency Interference Suppression Based on Kurtosis-Guided Attention Network. Remote Sensing. 2026; 18(2):255. https://doi.org/10.3390/rs18020255

Chicago/Turabian Style

Wu, Jiajun, Jiayuan Shen, Bing Han, Di Yin, and Jiaxin Wan. 2026. "SAR Radio Frequency Interference Suppression Based on Kurtosis-Guided Attention Network" Remote Sensing 18, no. 2: 255. https://doi.org/10.3390/rs18020255

APA Style

Wu, J., Shen, J., Han, B., Yin, D., & Wan, J. (2026). SAR Radio Frequency Interference Suppression Based on Kurtosis-Guided Attention Network. Remote Sensing, 18(2), 255. https://doi.org/10.3390/rs18020255

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop