Next Article in Journal
Correction: Kim et al. Multi-Threaded Sound Propagation Algorithm to Improve Performance on Mobile Devices. Sensors 2023, 23, 973
Previous Article in Journal
SAC-Optimized Fuzzy Variable Admittance Control for Lead-Through Teaching of Collaborative Robots
Previous Article in Special Issue
Contactless Cardiac Health Monitoring with Millimeter-Wave Radar Based on PMG-SATNet
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Multi-Scale Edge-Preserving Decomposition and Fusion Framework for Multi-Polarization Passive Millimeter-Wave Imaging

1
School of Electronic Information and Communications, Huazhong University of Science and Technology, Wuhan 430074, China
2
National Key Laboratory of Science and Technology on Multispectral Information Processing, Huazhong University of Science and Technology, Wuhan 430074, China
*
Author to whom correspondence should be addressed.
Sensors 2026, 26(11), 3577; https://doi.org/10.3390/s26113577
Submission received: 6 May 2026 / Revised: 1 June 2026 / Accepted: 2 June 2026 / Published: 4 June 2026
(This article belongs to the Special Issue Advanced Non-Invasive Sensors: Methods and Applications—2nd Edition)

Highlights

What are the main findings?
  • A hierarchical GWACF decomposition and layer-specific fusion strategy significantly enhance concealed object contrast in multi-polarization PMMW images while preserving edge integrity.
  • Experimental results confirm superior detection accuracy and robustness over mainstream methods, demonstrating reliable concealed object imaging on the human body.
What are the implications of the main findings?
  • The significant boost in concealed-object contrast and edge fidelity reduces both false alarms and missed detections, establishing multi-polarization PMMW imaging as a robust, privacy-safe solution for operational security screening.
  • The proposed hierarchical decomposition and fusion framework provides a robust, generalizable strategy for fully exploiting multi-polarization information, establishing a new reference for processing polarimetric passive imagery in security and remote sensing applications.

Abstract

Passive millimeter-wave (PMMW) imaging technology has become a highly promising technology that can protect privacy in human body security inspections. However, most existing methods rely on single-pixel and single-polarization processing mechanisms, which often lead to discrete false-alarm pixels or missed detections in practical applications. Although multi-polarization information can provide richer distinguishing features, the current methods typically depend on limited Stokes parameters or artificially designed polarization features, lacking a systematic framework to fully exploit the intrinsic potential of multi-polarization information. In this paper, we propose a novel multi-scale edge-preserving decomposition model, termed Gaussian and weighted average curvature filtering (GWACF), to hierarchically decompose a multi-polarization PMMW image into three structural layers: base structural (BS) layer, coarse structural (CS) layer, and fine structural (FS) layer. Furthermore, we also propose a fusion strategy in which a gradient-domain pulse-coupled neural network (PCNN) is employed to fuse the texture-rich CS and FS layers, while the energy attribute fusion method is applied to the BS layer where primary structure and background information play a dominant role. This method effectively leverages complementary polarimetric information without introducing artifacts or compromising edge sharpness. Experimental results demonstrate that the proposed method effectively enhances the brightness temperature (BT) contrast of concealed objects. Compared with existing mainstream methods, it exhibits notable advantages in both detection accuracy and robustness.

1. Introduction

Against the backdrop of an increasingly complex global security landscape and the public’s growing demand for safe travel, there is a pressing need for efficient, reliable, and privacy-respecting personal security screening technologies, which has become a key focus for both academia and industry [1,2,3]. Conventional metal detectors and handheld scanners are limited to identifying metallic objects, while being ineffective against non-metallic threats such as ceramic knives, plastic explosives, and liquid hazardous materials. Although active millimeter-wave imaging can penetrate clothing to reveal a human body outline [4,5], its use of transmitted electromagnetic waves has raised public concerns regarding potential health effects and personal privacy. Within this context, passive millimeter-wave (PMMW) imaging technology has gained prominence due to its distinct advantages. As a passive detection system, it only receives millimeter-wave signals radiated by the human body and concealed objects themselves, without the need for any form of electromagnetic wave emission [6,7]. Thus, it completely eliminates health risks and privacy violations at the physical level and is hailed as one of the ideal solutions for the next generation of human body security checks.
However, the practical application of PMMW imaging technology still faces a fundamental bottleneck: the low radiation brightness temperature (BT) contrast between the target and the background. Under indoor ambient temperature conditions, many non-metallic contraband objects exhibit radiation characteristics highly similar to those of human skin, resulting in extremely weak BT differences from the background in the final images, along with a low signal-to-noise ratio (SNR), which makes reliable detection and identification challenging [7,8]. Moreover, factors such as the material and thickness of clothing, fluctuations in environmental temperature, and the complex curvature of the human body further exacerbate image degradation [9,10] significantly limiting the detection performance and practical utility of traditional single-intensity imaging methods. To break through this bottleneck, researchers have turned their attention to another fundamental physical property of electromagnetic waves—polarization. Polarization information reveals the directional and structural characteristics of a target during the process of radiating or scattering electromagnetic waves, offering key insights that intensity images alone cannot provide [11]. Therefore, through polarimetric measurements, it becomes possible to extract material and structural “fingerprints” of concealed objects, enabling them to be more clearly distinguished from human skin amidst complex backgrounds.
Current techniques primarily rely on single-polarization images and single-pixel analysis modes. The information provided by a single polarization method is relatively limited, whereas multi-polarization techniques enable the acquisition of richer information [12,13]. Numerous studies have demonstrated the potential of multi-polarization sensing and processing across various applications, such as multi-polarization image enhancement [14,15], target segmentation [16,17], reflection interference suppression [18,19], material type identification [17,20], and wind direction/speed measurement [21,22]. Meanwhile, single-pixel processing methods often result in numerous discrete false positives or missed detections, thereby adversely affecting target detection and segmentation tasks. Consequently, many researchers advocate the adoption of multi-polarization imaging techniques to expand the dimensionality of BT data, enabling the extraction of richer scene information and enhanced detection performance. Previous studies have indicated that in-depth exploration and utilization of polarization information can significantly improve the detection capability of PMMW imaging [6,23]. To this end, parameters such as the degree of linear polarization (DoLP) [23], passive degree of polarization (PDoP) [11], linear polarization ratio (LPR) [16], and angle of polarization (AoP) [24] have been proposed as quantitative features for target detection in diverse scenarios, serving to measure the extent of target polarization. Such methods predominantly rely on polarization degree information for image segmentation to achieve target recognition. However, the polarization state is influenced not only by the target’s material composition but also by variations in target geometry and incidence angles, both of which can alter the observed polarimetric characteristics. These factors may lead to missed detections or false positives in polarization-based detection methods under specific conditions. Furthermore, the effective fusion of multi-dimensional polarization features with radiative intensity, spatial texture, and other relevant attributes is essential for constructing a robust detection system with low false alarm rates, representing a crucial step toward transitioning this technology from laboratory research to practical application.
To address the aforementioned challenges, this paper innovatively proposes a multi-polarization image fusion framework that combines multi-scale edge-preserving decomposition based on Gaussian and weighted average curvature filtering (GWACF) with a gradient-domain pulse-coupled neural network (PCNN). Unlike existing methods that mainly rely on handcrafted features such as Stokes parameters, simple polarization difference, or polarization ratio, our method, for the first time, cascades a weighted average curvature filter (WACF) with Gaussian filtering (GF) to hierarchically decompose an image into a base structural layer (BS), a coarse structural layer (CS), and a fine structural layer (FS). The uniqueness of this decomposition lies in the fact that WACF can simultaneously preserve edges and suppress noise, while the GF progressively strips away structural information at different scales, thus enabling each layer to carry complementary polarization features. Regarding the fusion strategy, we design distinct fusion rules for different layers: for the texture-rich CS and FS layers, a dual-channel PCNN modulated by multi-scale morphological gradient (MSMG) is adopted; for the BS layer that carries the dominant background, an energy-attribute weighted fusion scheme is used. This overall “decomposition–layering–adaptive fusion” pipeline has not been reported in the field of passive millimeter-wave (PMMW) multi-polarization image processing. In comparison, although the sub-region polarization fusion method [25] also exploits multi-polarization information, its fusion is primarily performed on sub-blocks of Stokes parameter images, lacking multi-scale separation of image structures. The Fisher vector polarization method [7] focuses on enhancing region classification through Fisher encoding, which essentially remains a feature-level post-processing rather than hierarchical fusion prior to image reconstruction. Deep learning-based methods [26] primarily rely on convolutional neural networks or Transformers for end-to-end object detection, and they have indeed achieved remarkable progress in performance. However, these methods typically require large amounts of annotated data and suffer from limited interpretability. In contrast, our method is a physics-driven image fusion method that does not rely on training data, thus offering better generalization and interpretability. Through edge-preserving decomposition and layer-wise differentiated fusion, it is physically more consistent with the millimeter-wave polarization scattering mechanism. The main contributions of this paper are:
(1)
This paper designs a multi-scale edge-preserving decomposition model, which integrates Gaussian filtering with weighted average curvature filtering, termed GWACF. The model is designed to hierarchically represent multi-polarization PMMW images by decomposing them into three structural layers: the FS layer, the BS layer, and the CS layer. The FS layer and the CS layer jointly preserve the rich texture and specific structural information of the image, while the BS layer serves as the bottom-level expression of the image, capturing core elements such as primary structure and background content.
(2)
In both the FS and CS layers, since they preserve the texture details and partial structural features of the image, the gradient-domain PCNN method performs well in fusing fine-textured images and is therefore employed for the fusion of these two layers. As for the BS layer, which contains the primary background information and overall structural contours of the image, the fusion effect at this level directly determines the overall quality of the final image. Given that the energy attribute (EA) fusion method performs exceptionally well when dealing with image with asymmetric information distribution, and is particularly suitable for coarse approximate image layers. Consequently, it is applied to the fusion of the BS layer.
(3)
The proposed method was validated through multi-polarization PMMW imaging detection experiments. Experimental results demonstrate that the fusion method produces high-quality and robust imaging of concealed objects on the human body. Qualitative and quantitative comparisons with several competitive baseline methods further highlight its performance advantages.
The remainder of this paper is organized as follows. Section 2 presents the necessary preliminaries. The proposed fusion method is described in Section 3. Experimental results and performance analysis are provided in Section 4 to validate the effectiveness of our method. Finally, conclusions are given in Section 5.

2. Preliminaries

2.1. Analysis of the PMMW Imaging Model

The PMMW imaging security system detects concealed dangerous objects under a person’s clothing by measuring the effective radiation temperature distribution within the target scene. In practical experimental tests, the PMMW radiation temperature data received by the imaging system is the combined effect of the testing environment, the human body, and the target under examination.
As shown in Figure 1a, the radiation model for the case where no concealed objects are carried on the human body can be derived as follows:
T E c = e c T c + r c T a + t c T E b c
T E b c = e b T b + r b T E c + t b T a
where TEc denotes the effective radiation temperature in front of the clothing, Tc is the absolute temperature of the clothing, and Ta represents the absolute temperature of free space; ec, rc, and tc represent the emissivity, reflectivity, and transmissivity of the clothing, respectively. TEbc denotes the effective radiation temperature between the skin and the clothing, while Tb is the absolute temperature of the human body; eb, rb, and tb represent the emissivity, reflectivity, and transmissivity of the human body, respectively.
By substituting Equation (2) into Equation (1), the effective radiation temperature obtained by the PMMW imaging system for the front clothing area of the human body can be expressed by the following expression:
T E c = e c T c + r c T a + t c ( e b T b + t b T a ) 1 t c r b
In the scenario of a concealed object carried on the human body, as illustrated in Figure 1b, the radiation model is revised to the following:
T E c = e c T c + r c T a + t c T E o b
T E o b = e o b T o b + r o b T E c + t o b T E b c
T E b c = e b T b + r b T E o b + t b T a
where TEob is the effective radiation temperature in front of the concealed object. Eob, rob, and tob represent the emissivity, reflectivity, and transmissivity of the concealed object, respectively. Tob denotes the absolute temperature of the concealed object.
Furthermore, substituting Equations (5) and (6) into Equation (4), Equation (3) can be rewritten as:
T E c = e c T c + r c T a 1 T o b r b + t c e o b T o b + t o b e b T b + t b T a 1 + t c r o b t o b r b

2.2. Pulse-Coupled Neural Network

The pulse-coupled neural network (PCNN), as a third-generation artificial neural network, offers the key advantage of requiring no parameter training. Decades of development have demonstrated its significant potential in various fields, including image denoising [27,28], image segmentation [29,30], target detection [31,32], and image fusion [33,34].
A PCNN neuron (Figure 2) consists of three components: the receptive, modulation, and pulse generation fields, responsible for signal reception, internal state computation, and output, respectively. Its internal state computation is divided into two parts: connection and feedback inputs. The neuron communicates with its neighboring neurons via synaptic weights M and W, while receiving external input stimuli I. Each neuron retains its previous state, which decays over time.
F i j ( n ) = e α F F i j ( n 1 ) + I i j + V F k l M i j k l Y k l ( n 1 )
L i j ( n ) = e α L L i j ( n 1 ) + V L k l W i j k l Y k l ( n 1 )
where Fij denotes the feedforward component of the neuron embedded at position (i, j) in the two-dimensional neural array, n represents the iteration number of the neuron, and Ykl corresponds to the neuronal output at time (n − 1). Lij refers to the corresponding connection input. All three components possess memory capabilities, with their state values decaying over time. The parameters αF and αL are the temporal decay constants for the feeding and connection inputs, respectively, governing their decay rates. The terms VF and VL denote the amplification coefficients of the connection weights for the feeding and connection input. These coefficients help to prevent the saturation of neuronal output, which is usually normalied to constants.
By coupling Fij and Lij, the internal activity Uij of the neuron is constituted, where β denotes the coupling strength. The mathematical expression for the internal activity is given by:
U i j ( n ) = F i j ( n ) ( 1 + β L i j ( n ) )
The output neuron Yij is generated by comparing the internal activity Uij of the neuron with its dynamic threshold θij.
Y i j ( n ) = 1 ,   U i j ( n ) > θ i j ( n ) 0 , otherwise
The dynamic threshold θij is a variable parameter, as it sharply increases when the neuron fires (Yij > θij). It then gradually decays back until the neuron fires again. This process is mathematically described by Equation (12), as shown below:
θ i j ( n ) = e α θ θ i j ( n 1 ) + V θ Y i j ( n )
where αθ is the time constant of decay for the dynamic threshold θij, and Vθ denotes the amplification factor associated with it.

3. Methodology

This section systematically presents the proposed multi-polarization PMMW image fusion method, whose overall workflow is depicted in Figure 3. The framework sequentially comprises the following five core modules: GWACF-based multi-scale decomposition, fusion of fine structural (FS) layer, fusion of coarse structural (CS) layer, fusion of base structural (BS) layer, and final image reconstruction.
The process begins by acquiring observational data at four linear polarization angles (0°, 45°, 90°, and 135°), which are recorded as TB0, TB45, TB90, and TB135, respectively. These polarization measurements are then linearly averaged to yield two components, TA and TB, defined as follows.
T A = T B 0 + T B 90 2
T B = T B 45 + T B 135 2
Subsequently, the GWACF decomposition algorithm is employed to decompose the TA and TB images into three distinct layers: the FS layer, CS layer, and BS layer. The corresponding BS layers are coalesced using an energy property-based fusion strategy. In contrast, the fusion of the FS and CS layers is achieved using a gradient-domain PCNN approach. The final fused image is reconstructed by integrating these combined layers.
From the perspective of physical rationality, the pairing strategy adopted in this paper—(TB0 and TB90) as one group, and (TB45 and TB135) as the other—exhibits clear completeness in the polarization basis. Each group forms an orthogonal polarization basis, enabling comprehensive characterization of any linear polarization state. These two groups are sensitive to the structural and dielectric properties of the target along the horizontal/vertical and diagonal directions, respectively, thereby providing complementary polarization signatures. Moreover, this pairing effectively suppresses the strong dependence of a single polarization channel on target orientation, significantly enhancing the robustness of concealed target detection while reducing redundancy and noise interference under the premise of preserving complementary polarization information.
As for the direct fusion of all four polarization channels, although theoretically feasible, it suffers from notable limitations. First, there exists informational redundancy among the four polarization angles; directly fusing them introduces substantial repetitive information into the fusion network, which may lead to feature competition and unstable fusion decisions. Second, simultaneously feeding all four polarization channels into the fusion model greatly increases computational complexity and may introduce nonlinear inter-channel interference, resulting in artifacts or edge blurring in the fused image.
In contrast, the proposed method first constructs two complementary components, TA and TB, which physically mitigate signal fluctuations caused by target orientation while preserving polarization anisotropy information. This provides a more robust input for subsequent edge-preserving decomposition and hierarchical fusion. Therefore, direct fusion of all four channels is not considered in this paper.

3.1. GWACF Multi-Scale Decomposition

For a 2D image T, its weighted average curvature filtering (WACF) [35] can be expressed as:
G w T = k T 2 G T = T 2 T T 2
where ∇· and ∇ denote the divergence operator and the gradient operator, respectively.
In the case of k = 2, the expression of Equation (15) simplifies to:
G w T = Δ T T x 2 T x x + 2 T x T y T x y + T y 2 T y y T x 2 + T y 2
where Δ is the isotropic Laplacian operator. Tx and Ty are the first-order partial derivatives along the x and y directions, respectively. Txx, Txy and Tyy represent the corresponding second-order partial derivatives, respectively.
Within a 3 × 3 window, the investigation considers eight possible normalized half-window directions, with corresponding kernels generated for all eight cases:
f 1 = 0 0 0 1 / 6 1 1 / 6 1 / 6 1 / 3 1 / 6 ,   f 2 = 0 1 / 6 1 / 6 0 1 1 / 3 0 1 / 6 1 / 6 , f 3 = 1 / 6 1 / 3 1 / 6 1 / 6 1 1 / 6 0 0 0 ,   f 4 = 1 / 6 1 / 6 0 1 / 3 1 0 1 / 6 1 / 6 0 , f 5 = 1 / 12 0 0 1 / 3 1 0 1 / 6 1 / 3 1 / 12 ,   f 6 = 0 0 1 / 12 0 1 1 / 3 1 / 12 1 / 3 1 / 6 , f 7 = 1 / 12 1 / 3 1 / 6 0 1 1 / 3 0 0 1 / 12 ,   f 8 = 1 / 6 1 / 3 1 / 12 1 / 3 1 0 1 / 12 0 0 .
Furthermore, the eight distance values ri can be computed using the eight kernels, respectively.
r i = f i T ,   i = 1 , 2 ,   , 8
where * denotes the convolution operation. The discrete form of Equation (16) can be expressed as:
G w r h ,   h = argmin r i ;   i = 1 , 2 ,   , 8
For the purpose of analysis, the WACF procedure is represented as:
T o u t = WACF T i n
where Tout and Tin represent the filtered output image and the original input image before filtering, respectively. WACF(·) denotes the WACF operation.
Based on this, the paper proposed an image decomposition method that integrates Gaussian filtering with weighted average curvature filtering, namely the GWACF method, whose overall workflow is illustrated in Figure 4. Tin refers to the input image, T G F m (m = 1, 2, 3) denotes the resultant image after the m-th Gaussian filtering operation, and T W m is the result after performing the WACF operation. The decomposed layers T FSm , T BS and T CSm are then given by the following equations:
T F S m = T i n T W m , m = 1 T G F m 1 T W m ,   m = 2 , 3
T B S = T G F 3
T C S m = T W m T G F m
where T G F m and T W m can be respectively calculated as follows:
T G F m = G F T i n , m = 1 G F T G F m 1 ,   m = 2 , 3
T W m = W A C F T i n , m = 1 W A C F T G F m 1 , m = 2 , 3
where the symbol GF(·) denotes the Gaussian filtering operator.
After decomposition via the GWACF operator, Tin can be composed of three distinct hierarchical levels.
T in = m = 1 3 T F S m + T C S m + T B S
Therefore, TA and TB in Figure 3 can be decomposed into T A F S m , T B F S m , T A B S , T B B S , T A C S m and T B C S m through Equations (21) to (23), respectively.

3.2. Fusion Strategy for Fine and Coarse Structural Layers

The multi-scale morphological gradient (MSMG) is an edge extraction method that integrates multi-scale strategies with mathematical morphological operations. By performing morphological operations at different structural element scales and fusing the results, it effectively enhances and captures image contours and detailed features with improved accuracy. The specific steps are as follows:
(1)
The multi-scale structural elements are denoted as:
M S v = M S 1 M S 1 M S 1 v ,   v = 1 , 2 , , M
where MS1 denotes a basic structural unit, using a 3 × 3 matrix structuring element as the basic unit, i.e., MS1 = [1, 1, 1; 1, 1, 1; 1, 1, 1]. v represents the scale factor, and ⨁ indicates the morphological dilation operation.
(2)
For an image T, its gradient feature Gv is characterized by the morphological gradient operator as follows:
G v T = T M S v T Θ M S v
where Θ denotes the morphological erosion operation.
(3)
The output value ρ of MSMG can be computed as the weighted sum of the gradients across different scales:
ρ = v = 1 N w v G v T
where wv denotes the gradient weight at the v-th scale and can be expressed as:
w v = 1 2 v + 1
Both the CS and FS layers carry the texture properties and partial structural characteristics of polarized images. The gradient-domain PCNN fusion strategy is well-suited for fusing images containing small-scale texture features, and is therefore adopted in this paper. Specifically, the CS and FS layers obtained by applying MSMG calculations to TA and TB are utilized as the connection strengths input in the PCNN, thereby constituting an MSMG-modulated dual-channel PCNN model, as shown in Figure 5. Its mathematical expression is given as follows.
According to the introduction in reference [36], under the premise of preserving the biological characteristics of the original model, the receptive field of the PCNN model can be simplified as:
F i j ( 1 ) ( n ) = I i j ( 1 ) ( n )
F i j ( 2 ) ( n ) = I i j ( 2 ) ( n )
L i j ( n ) = V L k l W i j k l Y k l ( n 1 )
where I i j ( 1 ) and I i j ( 2 ) denote the input stimuli received by channel 1 and channel 2, respectively, with their magnitudes corresponding to the pixel values at location (i, j) in the two input images. Lij represents the connection parameter, and the connection weight amplification factor VL is set to 1. Wijkl denotes the connection weight matrix between neurons. When each neuron is positioned at the center of a 3 × 3 (or 5 × 5) weight matrix, its adjacent pixels correspond to the neurons within this matrix. The connection weights between neurons are closely related to their spatial distance.
Therefore, we define the connection weight as the inverse square of the Euclidean distance between connected neurons—specifically, the connection weight between neuron ij and neuron kl is given by:
W ijkl = 0 , if   i = k = j = l 1 i k 2 + j l 2 , otherwise
The modulation field is defined as:
U ij ( n ) = Max U i j ( 1 ) ( n ) ,   U i j ( 2 ) ( n )
where Uij denotes the internal activity of the dual-channel output, while U i j ( 1 ) and U i j ( 2 ) correspond to the internal activities of channel 1 and channel 2, respectively, which can be expressed as:
U i j ( 1 ) ( n ) = F i j ( 1 ) ( n ) 1 + β 1 L i j ( n )
U i j ( 2 ) ( n ) = F i j ( 2 ) ( n ) ( 1 + β 2 L i j ( n ) )
Here, β1 and β2 represent the connection strengths of channel 1 and channel 2, respectively. As for the pulse generator field of this network, it can be represented by Equations (11) and (12).
Despite the notable advances PCNN has achieved in image fusion techniques, this method still exhibits a critical limitation: each pixel in its network architecture corresponds to an independent neuron. If PCNN is directly applied for fusion, pixels representing the same content within an image block may be activated asynchronously, leading to biased fusion decisions. Consequently, the final fused image may exhibit undesirable pixel-level mutations and block artifacts.
As mentioned above, the MSMG exhibits excellent capability in extracting image edge features. Therefore, it is employed as a pre-modulation processing unit for PCNN to effectively enhance spatial correlations across different layers. In this paper, the image results processed by MSMG are used as the connection strength input of the network, and the detailed mathematical representation is given as follows:
β 1 = ρ 1
β 2 = ρ 2
where ρ1 and ρ2 are defined as the outputs of the two input images under the MSMG operator, which can be derived from Equation (29).
Then, the fused FS and CS layers are denoted as:
T F F S m ( i , j ) = T A F S m ( i , j ) , if   U i j ( 1 ) U i j ( 2 ) T B F S m ( i , j ) , otherwise
T F C S m ( i , j ) = T A C S m ( i , j ) , if   U i j ( 1 ) U i j ( 2 ) T B C S m ( i , j ) , otherwise
where U i j ( 1 ) and U i j ( 2 ) can be obtained from Equations (36) and (37).

3.3. Fusion Strategy for Base Structural Layer

The BS contains the primary information of the polarized image (such as the main texture and the background). Given the information asymmetry inherent in polarization images, an energy-attribute fusion strategy is employed at this level to effectively balance disparities and achieve information complementarity and enhancement. The implementation of this strategy involves three main steps:
(1)
Calculate the eigenvalues HA and HB of the BS layer, which are respectively expressed as:
H A = X ¯ A + X ˜ A
H B = X ¯ B + X ˜ B
where HA and HB represent the feature values of the BS layer. X ¯ A and X ¯ B represent the mean values of T A B S and T B B S , respectively, while X ˜ A and X ˜ B denote the median values of T A B S and T B B S , respectively.
(2)
Calculate the energy attribute functions EA and EB, which are respectively given by:
E A ( i , j ) = e α G T A B S ( i , j ) H A
E B ( i , j ) = e α G T B B S ( i , j ) H B
where EA and EB represent the energy attribute functions of the BS layer. αG denotes the gain coefficient, we set αG = 4 in this paper.
(3)
Obtain the final fused result T F B S for the BS layer by weighted averaging:
T F B S ( i , j ) = E A ( i , j ) T A B S ( i , j ) + E B ( i , j ) T B B S ( i , j ) E A ( i , j ) + E B ( i , j )

4. Validation Experiments

Two volunteers were recruited for the experiment, as illustrated in Figure 6. One volunteer participated in Scenario 1, while the other volunteer performed Scenarios 2 and 3. In each scenario, five types of concealed objects were randomly placed on the volunteer’s chest, abdomen, and thigh pockets, with all objects positioned inside the clothing and close to skin. Imaging was conducted using a W-band focal plane scanning system [37], operating within the 70–110 GHz frequency range (bandwidth: 40 GHz). The radiometer channels featured a noise figure better than 3.5 dB, with an integration time of 280 μs, achieving a radiometric sensitivity better than 0.5 K.
A high-density polyethylene dielectric lens with a diameter of 460 mm was employed to focus the millimeter waves, and the observation distance was 2.5 m. Imaging under different linear polarizations was realized by rotating the radiometer and feed antenna. Polarization channel calibration was performed using the two-point calibration method: a blackbody absorbing material (emissivity: 0.999) was placed in front of the system, with its physical temperature varying between 25 °C and 60 °C, while a cold temperature reference was obtained by immersing the material in liquid nitrogen (approximately 77 K). Calibration was conducted prior to each polarization angle acquisition to compensate for system drift. To assess stability, each volunteer/scenario was independently measured three times. All experiments were carried out indoors at an ambient temperature of approximately 25 °C, and linear polarization images at 0°, 45°, 90°, and 135° were acquired using the W-band PMMW system.

4.1. Concealed Contraband Detection Imaging Results

In this paper, we adopt an MSMG-PCNN model to fuse the FS and CS layers. For the PCNN, the decay time constant αθ is set to 3, and the amplification factors VL and Vθ are 1 and 20, respectively. The number of iterations is fixed at 100, with the stopping rule defined as reaching the maximum iteration count. A 3 × 3 connection kernel is used as the neighborhood size. For the morphological gradient, the scale factor is v = 3, and the basic structural element MS1 is a 3 × 3 all-ones structuring element. The GF is applied with σ = 20 and a kernel size of 5 × 5. In the WACF decomposition, the number of iterations per scale is set to 2. For the EA fusion, the gain coefficient is αG = 4.
As shown in Figure 6, the results of three sets of concealed contraband detection imaging experiments are presented, corresponding to three different scenarios from top to bottom in sequence. Figure 6a shows the detection imaging results for Scenario 1, where five concealed objects are annotated with color boxes, including metal pliers (#N1), a utility knife (#N2), an alcohol bottle (#N3), a mobile phone (#N4), and a charging case (#N5). Figure 6b displays the image processing result for Scenario 2, containing a water bottle (#N1), a ceramic knife (#N2), a handgun (#N3), an alcohol bottle (#N4), and a utility knife (#N5). Figure 6c is the imaging result for Scenario 3, which includes a water bottle (#N1), an alcohol bottle (#N2), a handgun (#N3), a glue (#N4), and a ceramic knife (#N5).
As can be observed from the imaging results, all four linear polarization modes—horizontal (TB0), 45° (TB45), vertical (TB90), and 135° (TB135) linear polarization—struggle to effectively distinguish concealed contraband from the human body background, thereby significantly reducing the detection probability. This reveals an inherent limitation of linear polarization in PMMW security checks: its detection performance is highly dependent on the angle between the direction of the target and the polarization direction. For instance, a knife exhibits the strongest signal when aligned parallel to the polarization direction, while the response weakens considerably under orthogonal orientation, easily leading to missed detections. Additionally, non-metallic or structurally complex concealed contraband is difficult to identify due to its weak polarimetric signature. Moreover, the polarization wave scattering caused by the curvature and contour of the human body will also introduce interference and increase the difficulty of image interpretation.
To improve the detection capability of concealed objects, we further investigated multipolarization fusion methods. The results processed by seven fusion methods—polarization summation average (PSA) [25], principal component analysis (PCA) [38], discrete cosine transform (DCT) [39], laplacian pyramid fusion (LPF) [40], subregion fusion (SF) [25], and two advanced deep learning fusion models, FDFuse [41] and LSRNet [42]—along with the proposed fusion method are presented in Figure 6. It can be seen that the proposed fusion strategy effectively suppresses image noise while enhancing the intensity contrast between the concealed object and the human background, and presenting a more complete object shape and clearer contour features.

4.2. Performance Analysis

As can be observed from the detection imaging results in Figure 6, the proposed method effectively reconstructs the shape and contour features of concealed contraband. To further quantitatively evaluate its reconstruction performance, we introduce entropy [43], blind/referenceless image spatial quality evaluator (BRISQUE) [44], natural image quality evaluator (NIQE) [45], perception-based image quality evaluator (PIQE) [46], and signal-to-noise ratio (SNR) [25] for comprehensive analysis.
In PMMW imaging, the acquired images inherently suffer from SNR and pronounced noise interference. Under such conditions, high entropy values often originate not from meaningful target information but from background noise and random fluctuations. Pursuing high entropy alone may instead indicate insufficient noise suppression, which is detrimental to subsequent detection and recognition tasks. The core of the proposed method lies in enhancing the brightness temperature contrast of concealed targets and emphasizing target contours and structural information, rather than preserving all textural details including noise. Consequently, in the fused image, fluctuations in background regions are effectively suppressed, target information is strengthened, and the pixel distribution becomes more concentrated. The moderate reduction in entropy therefore reflects the effectiveness of the method, rather than signifying information loss. In fact, in PMMW tasks, lower entropy generally corresponds to a clearer background and more salient targets. Hence, entropy should not be interpreted in isolation as “information richness”. Instead, it must be jointly interpreted with task-relevant metrics such as SNR and detection accuracy.
On the other hand, although no-reference image quality assessment metrics such as BRISQUE, NIQE, and PIQE were originally designed for natural images, their core function is to measure structural integrity, perceptual quality, and the degree of distortion, and they are not strictly confined to natural scenes. These metrics are built upon spatial-domain statistical features, patch-based statistical models, or perceptual features, and they exhibit high sensitivity to blur, noise, block artifacts, and structural distortions—precisely the critical factors that need to be evaluated in PMMW image fusion. In PMMW imaging, quality assessment focuses on edge preservation, structural clarity, and noise suppression, which align closely with the statistical characteristics captured by the aforementioned no-reference metrics. Therefore, employing these metrics to evaluate the fusion performance of the proposed method is both reasonable and valid.
For the three experimental scenarios depicted in Figure 6, the computed results of each evaluation metric (entropy, BRISQUE, NIQE, PIQE) are summarized in Table 1. The data demonstrate that the proposed fusion strategy outperforms all comparative methods across all metrics, achieving the best PMMW imaging reconstruction performance. Furthermore, as evidenced by the SNR value for each concealed contraband in Table 2, the proposed method achieves the highest SNR value. This significantly enhances the contrast between concealed contraband and the surrounding background, thereby effectively improving target detectability.

4.3. Performance on ROC Curves

To comprehensively assess the detection performance, this paper adopts the ROC curve proposed in [47] as an analytical tool. This curve clearly illustrates the trade-off between detection sensitivity and specificity by plotting the relationship between the true positive rate (TPR) and false positive rate (FPR) across varying thresholds. The area under the curve (AUC) serves as a measure of the overall performance, where TPR reflects the detection sensitivity and FPR is related to its specificity. Their respective formulas are expressed as follows:
TPR = TP TP + FN
FPR = FP FP + TN
where TP (true positive) denotes pixels that truly belong to the target object and are correctly identified as such. TN (true negative) refers to pixels that truly belong to the background and are correctly classified as background. FP (false positive) represents pixels that truly belong to the background but are incorrectly identified as the target object. FN (false negative) indicates pixels that truly belong to the target object but are incorrectly identified as background.
Regarding the generation of ROC curves, we need to further clarify the following two points. First, we provide a detailed description of the threshold setting protocol: after normalizing the pixel values of the fused images to the range [0, 1], we traverse all possible thresholds at a step size of 0.01, and calculate the TPR and FPR at each step, thus generating a complete ROC curve. Second, regarding the generation of ground-truth masks, three researchers independently performed manual annotation based on the actual positions and contours of concealed objects in the original PMMW images. The final binary masks were determined by majority voting, while background regions were selected from typical areas free of interference around the targets. All methods under comparison (including the proposed method and the seven competing methods) strictly use the identical masks for pixel-wise evaluation, ensuring a consistent and fair basis for computing TP, FP, TN, and FN.
As illustrated in Figure 7, the ROC curves provide a visual representation of the detection outcomes for each concealed object across the three experimental scenarios. The experimental results demonstrate that the ROC curve of the proposed method is consistently closest to the upper-left corner in all detection tasks and achieves the largest AUC values. This finding indicates that the proposed method maintains a significant advantage in detecting concealed objects, exhibiting superior discriminative ability and higher detection accuracy. In other words, the proposed method attains the largest AUC for concealed contraband in all cases, further confirming its excellent detection performance.

4.4. Ablation Experiments

To further validate the contribution of each key component in the proposed framework, we conducted a series of ablation experiments on the first experimental scenario. Specifically, we evaluated the following variants: (1) Decomposition architecture: replacing GWACF with Gaussian filtering alone or WACF alone; (2) Texture layer fusion: removing the MSMG modulation from PCNN; (3) Background layer fusion: replacing EA fusion with simple weighted averaging; (4) Decomposition layers: using one-layer (BS only) or two-layer (BS + FS) decomposition; and (5) Input construction: using the original four-channel polarization images instead of TA/TB. The quantitative results are shown in Table 3, Table 4, Table 5, Table 6 and Table 7. The complete model consistently outperformed all variants across all evaluation metrics, confirming the necessity and effectiveness of each proposed module. In particular, GWACF achieved the best edge-preserving decomposition, MSMG-PCNN enhanced texture fusion quality, EA fusion maintained structural integrity, and the three-level decomposition provided the optimal multi-scale representation.

4.5. Time Complexity Analysis

The proposed method mainly consists of three parts: GWACF multi-scale decomposition, MSMG-PCNN fusion (applied to the FS and CS layers), and energy attribute fusion (applied to the BS layer). Let the size of a single input image be H × W. The number of GWACF decomposition scales is set to S = 3, and during the WACF decomposition process, the number of iterations at each scale is set to L. The number of MSMG scales is denoted as R. The number of PCNN iterations is N, and the size of the neighborhood window is K × K.
(1)
GWACF decomposition: Each layer consists of GF operation (complexity O(HW)) and WACF operation. WACF involves convolution with eight directional kernels, and performing L iterations for each convolution yields a complexity of O(K2LHW). Therefore, the complexity of a single layer is O(8K2LHW). Consequently, the total complexity of the three-layer GWACF decomposition is O(3·(HW + 8K2LHW)), which can be further expressed as O(HW + K2LHW).
(2)
MSMG-PCNN fusion: This is used for the FS and CS layers (a total of 2S = 6 layers). For each scale v in MSMG, morphological dilation and erosion operations are performed on the image. The morphological operation for each pixel requires traversing all pixels within the window, so the complexity for a single scale is O(HWv2). Summing over all scales yields O(HWR3). In the PCNN, each iteration involves computing the neighborhood connection weights (complexity O(K2HW)) and updating the internal activity (complexity O(HW)). With a total of N iterations, the complexity for a single layer is O(NK2HW). Consequently, the overall complexity of MSMG-PCNN fusion is O(6·(HWR3 + NK2HW)), which can be further simplified to O(HWR3 + NK2HW).
(3)
Energy attribute fusion (BS layer): This involves only pixel-wise mean and exponential operations, yielding a complexity of O(HW).
In summary, the overall complexity of the proposed method is:
O(HW·(R3 + NK2 + K2L))
where R = K = 3, L = 2, and N is set to 100, so the overall complexity is linearly related to the size of the image.
To verify the practical efficiency, we measured the average running time of each algorithm on the same hardware platform (a computer equipped with an i7-12700H CPU and a GeForce RTX 3060 GPU). The experimental results are presented in Table 8. For images with a resolution of 175 × 360, the average processing time of the proposed method is approximately 0.69 s. Although this time is higher than those of the other fusion methods, considering the significant improvement in detection accuracy and edge preservation achieved by our method, this computational cost remains within an acceptable range for practical security inspection scenarios. It is anticipated that the use of higher-performance processors, combined with further exploration of acceleration algorithms, could potentially enable real-time imaging at video rates in the future.

5. Conclusions

This paper addresses the insufficient application of multi-polarization technology in PMMW security imaging by proposing a multi-scale edge-preserving image decomposition framework integrated with a gradient-domain PCNN fusion strategy. The method employs a GWACF model to decompose images into three layers—FS, BS, and CS layers—which are subsequently fused using gradient-domain PCNN and energy-attribute methods. This enables effective complementary of multi-polarization information while preserving essential texture and edge details. Experimental results demonstrate that the proposed fusion method outperforms existing mainstream methods across multiple evaluation metrics, significantly enhancing target contour and structural information while providing superior image quality for subsequent detection and segmentation tasks. This study has verified the potential of the multi-polarization PMMW technology in the detection of concealed objects in the human body, and it is applicable to scenarios such as non-intrusive security checks at smart transportation hubs, covert security protection in public places, and non-destructive flaw detection in industries.
In terms of future research works, the following two aspects will be focused on: firstly, further optimization of the fusion algorithm to improve computational efficiency and achieve real-time imaging capability at video rate, and secondly, exploration of integration with deep learning-based fusion methods to enhance the extraction and characterization of multi-polarization target features.

Author Contributions

Conceptualization, X.C. and F.H.; data curation, X.C.; formal analysis, X.C. and D.Z.; funding acquisition, D.Z. and F.H.; methodology, X.C. and F.H.; project administration, X.C.; resources, X.C. and F.H.; supervision, F.H., J.S., B.F. and J.T.; validation, X.C. and F.H.; writing—original draft, X.C.; writing—review and editing, X.C., F.H., J.S., B.F. and J.T. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Natural Science Foundation of China under grant 62271219 and in part by the Fundamental Research Funds for the Central Universities (Huazhong University of Science and Technology).

Data Availability Statement

All data relevant to this study are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

List of Acronyms

The following abbreviations are used in this manuscript:
AoPAngle of polarization
AUCArea under the curve
BRISQUEBlind/referenceless image spatial quality evaluator
BSBase structural
BTBrightness temperature
CSCoarse structural
DCTDiscrete cosine transform
DoLPDegree of linear polarization
EAEnergy attribute
FNFalse negative
FPFalse positive
FPRFalse positive rate
FSFine structural
GWACFGaussian and weighted average curvature filtering
LPFLaplacian pyramid fusion
LPRLinear polarization ratio
MSMGMulti-scale morphological gradient
NIQENatural image quality evaluator
PCAPrincipal component analysis
PCNNPulse-coupled neural network
PDoPPassive degree of polarization
PIQEPerception based image quality evaluator
PMMWPassive millimeter-wave
PSAPolarization summation average
SFSubregion fusion
SNRSignal-to-noise ratio
TNTrue negative
TPTrue positive
TPRTrue positive rate
WACFWeighted average curvature filtering

References

  1. Zhuravlev, A.; Razevig, V.; Rogozin, A.; Chizh, M. Microwave imaging of concealed objects with linear antenna array and optical tracking of the target for high-performance security screening systems. IEEE Trans. Microw. Theory Tech. 2023, 71, 1326–1336. [Google Scholar] [CrossRef]
  2. Ji, L.; Mou, Y. Research on the feasibility of application of millimeter-wave security screening equipment in civil aviation. In Proceedings of the 2021 IEEE 3rd International Conference on Civil Aviation Safety and Information Technology (ICCASIT), Changsha, China, 20–22 October 2021; pp. 71–75. [Google Scholar]
  3. Wang, Y.; Cheng, Y.; Liu, B.; Xiong, H.; Zhang, L.; Wang, Y.; Qiu, J. Polarization-guided 3-D reconstruction for occluded faces using passive millimeter-wave single-direction imaging. IEEE Trans. Microw. Theory Tech. 2025, 73, 10744–10757. [Google Scholar] [CrossRef]
  4. Liu, C.; Yang, M.; Sun, X. Towards robust human millimeter wave imaging inspection system in real time with deep learning. Prog. Electromagn. Res. 2018, 161, 87–100. [Google Scholar] [CrossRef]
  5. Wang, X.; Guo, S.; Li, J.; Zhao, Y.; Liu, Z.; Jiao, C.; Mao, S. Self-paced feature attention fusion network for concealed object detection in millimeter-wave image. IEEE Trans. Circuits Syst. Video Technol. 2022, 32, 224–239. [Google Scholar] [CrossRef]
  6. Wang, Y.; Cheng, Y.; Tang, P.; Ma, H.; Zhang, L.; Wang, Y.; Qiu, J. Pathway detection using fusion polarization features in passive millimeter-wave imaging. IEEE Trans. Circuits Syst. Video Technol. 2025, 36, 2685–2696. [Google Scholar] [CrossRef]
  7. Cheng, Y.; Tian, X.; Zhu, D.; Wu, L.; Zhang, L.; Qi, J.; Qiu, J. Regional-based object detection using polarization and Fisher vectors in passive millimeter-wave imaging. IEEE Trans. Microw. Theory Tech. 2023, 71, 2702–2713. [Google Scholar] [CrossRef]
  8. Jiang, Z.; Cheng, Y.; Meng, Z.; Wang, N.; Qiu, J. Physics-based feature fusion of passive and active millimeter-wave imaging under noise illumination for handheld concealed object detection. IEEE Trans. Microw. Theory Tech. 2025, 73, 9402–9414. [Google Scholar] [CrossRef]
  9. Yang, H.; Yang, Z.; Hu, A.; Liu, C.; Cui, T.; Miao, J. Unifying convolution and transformer for efficient concealed object detection in passive millimeter-wave images. IEEE Trans. Circuits Syst. Video Technol. 2023, 33, 3872–3887. [Google Scholar] [CrossRef]
  10. Shen, X.; Dietlein, C.; Grossman, E.; Popovic, Z.; Meyer, F. Detection and segmentation of concealed objects in terahertz images. IEEE Trans. Image Process. 2008, 17, 2465–2475. [Google Scholar] [CrossRef]
  11. Su, J.; Tian, Y.; Hu, F.; Cheng, Y.; Hu, Y. Material clustering using passive millimeter-wave polarimetric imagery. IEEE Photon. J. 2019, 11, 5500109. [Google Scholar] [CrossRef]
  12. Cheng, Y.; Wu, H.; Ren, X.; Wang, N.; Qi, J.; Qiu, J. Object segmentation using polarization random feature in passive millimeter-wave imaging. IEEE Trans. Geosci. Remote Sens. 2024, 62, 5302112. [Google Scholar] [CrossRef]
  13. Wang, Y.; Yu, A.; Cheng, Y.; Qi, J. Matrix diffractive deep neural networks merging polarization into meta-devices. Laser Photon. Rev. 2024, 18, 2300903. [Google Scholar] [CrossRef]
  14. Kim, W.; Moon, N.; Kim, H.; Kim, Y. Linear polarization sum imaging in passive millimeter-wave imaging system for target recognition. Prog. Electromagn. Res. 2013, 136, 175–193. [Google Scholar] [CrossRef]
  15. Cheng, Y.; Wang, Y.; Niu, Y.; Zhao, Z. Concealed object enhancement using multi-polarization information for passive millimeter and terahertz wave security screening. Opt. Exp. 2020, 28, 6350–6366. [Google Scholar] [CrossRef]
  16. Cheng, Y.; Wang, Y.; Niu, Y.; Rutt, H.; Zhao, Z. Physically based object contour edge display using adjustable linear polarization ratio for passive millimeter-wave security imaging. IEEE Trans. Geosci. Remote Sens. 2021, 59, 3177–3191. [Google Scholar] [CrossRef]
  17. Wu, L.; Zhu, J.; Peng, S.; Xiao, Z.; Wang, Y. Post-processing techniques for polarimetric passive millimeter wave imagery. Appl. Comput. Electromagn. Soc. J. 2018, 33, 512–518. [Google Scholar]
  18. Li, N.; Zhao, Y.; Pan, Q.; Kong, S. Removal of reflections in LWIR image with polarization characteristics. Opt. Exp. 2018, 26, 16488–16504. [Google Scholar] [CrossRef] [PubMed]
  19. Wang, C.; Miyazaki, D. PolAttNet: A deep learning framework for polarized image reflection removal with multi-attention mechanisms. IEEE Access 2025, 13, 30762–30778. [Google Scholar] [CrossRef]
  20. Wilson, J.; Schuetz, C.A.; Dillon, T.E.; Eng, D.L.K.; Kozacik, S.; Prather, D.W. Display of polarization information for passive millimeter-wave imagery. Opt. Eng. 2012, 51, 091607. [Google Scholar] [CrossRef]
  21. Yueh, S.; Wilson, W.; Li, F.; Nghiem, S.; Ricketts, W. Polarimetric brightness temperatures of sea surfaces measured with aircraft K- and Ka-band radiometers. IEEE Trans. Geosci. Remote Sens. 1997, 35, 1177–1187. [Google Scholar] [CrossRef]
  22. Yueh, S. Directional signals in windsat observations of hurricane ocean winds. IEEE Trans. Geosci. Remote Sens. 2008, 46, 130–136. [Google Scholar] [CrossRef]
  23. Tang, F.; Gui, L.; Liu, J.; Chen, K.; Lang, L.; Cheng, Y. Metal target detection method using passive millimeter-wave polarimetric imagery. Opt. Exp. 2020, 28, 13336–13351. [Google Scholar] [CrossRef]
  24. Garcia, N.; Erausquin, I.; Edmiston, C.; Gruev, V. Surface normal reconstruction using circularly polarized light. Opt. Exp. 2015, 23, 14391–14406. [Google Scholar] [CrossRef]
  25. Cheng, Y.; Zhang, L.; Guo, D.; Wang, N.; Qi, J.; Qiu, J. Subregional polarization fusion via Stokes parameters in passive millimeter-wave imaging. IEEE Trans. Ind. Inform. 2024, 20, 8585–8595. [Google Scholar] [CrossRef]
  26. Yang, H.; Zhang, D.; Hu, A.; Liu, C.; Cui, T.J.; Miao, J. Transformer-based anchor-free detection of concealed objects in passive millimeter wave images. IEEE Trans. Instrum. Meas. 2022, 71, 5012216. [Google Scholar] [CrossRef]
  27. Shen, C.; Wang, D.; Tang, S.; Cao, H.; Liu, J. Hybrid image noise reduction algorithm based on genetic ant colony and PCNN. Vis. Comput. 2017, 33, 1373–1384. [Google Scholar] [CrossRef]
  28. Deng, X.; Ma, Y.; Dong, M. A new adaptive filtering method for removing salt and pepper noise based on multilayered PCNN. Pattern Recognit. Lett. 2016, 79, 8–17. [Google Scholar] [CrossRef]
  29. Helmy, A.; El-Taweel, G. Image segmentation scheme based on SOM–PCNN in frequency domain. Appl. Soft Comput. 2016, 40, 405–415. [Google Scholar] [CrossRef]
  30. Wei, S.; Hong, Q.; Hou, M. Automatic image segmentation based on PCNN with adaptive threshold time constant. Neurocomputing 2011, 74, 1485–1491. [Google Scholar] [CrossRef]
  31. Ranganath, H.; Kuntimad, G. Object detection using pulse coupled neural networks. IEEE Trans. Neural Netw. 1999, 10, 615–620. [Google Scholar] [CrossRef]
  32. Zhou, T.; Si, J.; Wang, L.; Xu, C.; Yu, X. Automatic detection of underwater small targets using forward-looking sonar images. IEEE Trans. Geosci. Remote Sens. 2022, 60, 4207912. [Google Scholar] [CrossRef]
  33. Nie, R.; Cao, J.; Zhou, D.; Qian, W. Multi-source information exchange encoding with PCNN for medical image fusion. IEEE Trans. Circuits Syst. Video Technol. 2021, 31, 986–1000. [Google Scholar] [CrossRef]
  34. Panigrahy, C.; Seal, A.; Mahato, N. MRI and SPECT image fusion using a weighted parameter adaptive dual channel PCNN. IEEE Signal Process. Lett. 2020, 27, 690–694. [Google Scholar] [CrossRef]
  35. Tan, W.; Thitøn, W.; Xiang, P.; Zhou, H. Multi-modal brain image fusion based on multi-level edge-preserving filtering. Biomed. Signal Process. Control 2021, 64, 102280. [Google Scholar] [CrossRef]
  36. Deng, X.; Yan, C.; Ma, Y. PCNN mechanism and its parameter settings. IEEE Trans. Neural Netw. Learn. Syst. 2019, 31, 488–501. [Google Scholar] [CrossRef]
  37. Cheng, Y.; Qiao, L.; Zhu, D.; Wang, Y.; Zhao, Z. Passive polarimetric imaging of millimeter and terahertz waves for personnel security screening. Opt. Lett. 2021, 46, 1233–1236. [Google Scholar] [CrossRef] [PubMed]
  38. Batur, E.; Maktav, D. Assessment of surface water quality by using satellite images fusion based on PCA method in the Lake Gala, Turkey. IEEE Trans. Geosci. Remote Sens. 2019, 57, 2983–2989. [Google Scholar] [CrossRef]
  39. Cao, L.; Jin, L.; Tao, H.; Li, G.; Zhuang, Z.; Zhang, Y. Multi-focus image fusion based on spatial frequency in discrete cosine transform domain. IEEE Signal Process. Lett. 2015, 22, 220–224. [Google Scholar] [CrossRef]
  40. Yao, J.; Zhao, Y.; Bu, Y.; Kong, S.; Chan, J. Laplacian pyramid fusion network with hierarchical guidance for infrared and visible image fusion. IEEE Trans. Circuits Syst. Video Technol. 2023, 33, 4630–4644. [Google Scholar] [CrossRef]
  41. Cheng, M.; Huang, H.; Liu, X.; Mo, H.; Wu, S.; Zhao, X. FDFuse: Infrared and visible image fusion based on feature decomposition. IEEE Trans. Instrum. Meas. 2025, 74, 5021413. [Google Scholar] [CrossRef]
  42. Yang, B.; Hu, Y.; Liu, L.; Liu, Y.; Li, J. LSRNet: A novel interpretable low-rank sparse representation guided fusion network for polarization and intensity images. IEEE Trans. Image Process. 2026, 35, 4961–4974. [Google Scholar] [CrossRef]
  43. Xu, Y.; Zhu, D.; Hu, F.; Fang, B.; Fu, P. Target imaging using compressed sampling in synthetic aperture interferometric radiometer. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5301515. [Google Scholar] [CrossRef]
  44. Mittal, A.; Moorthy, A.; Bovik, A. No-reference image quality assessment in the spatial domain. IEEE Trans. Image Process. 2012, 21, 4695–4708. [Google Scholar] [CrossRef] [PubMed]
  45. Zhang, Z.; Sun, W.; Zhou, Y.; Wu, H.; Li, C.; Min, X.; Liu, X.; Zhai, G.; Lin, W. Advancing zero-shot digital human quality assessment through text-prompted evaluation. IEEE Trans. Image Process. 2025, 34, 3503–3517. [Google Scholar] [CrossRef] [PubMed]
  46. Venkatanath, N.; Praneeth, D.; Sumohana, S.; Swarup, S. Blind image quality evaluation using perception based features. In Proceedings of the 21st 2015 Twenty First National Conference on Communications (NCC), Mumbai, India, 27 February–1 March 2015; pp. 1–6. [Google Scholar]
  47. Zhang, J.; Xu, C.; Gao, Z.; Rodrigues, J.; de Albuquerque, V. Industrial pervasive edge computing-based intelligence IoT for surveillance saliency detection. IEEE Trans. Ind. Inform. 2021, 17, 5012–5020. [Google Scholar] [CrossRef]
Figure 1. Schematic of the human body under radiation measurement. (a) The situation in which there are no concealed objects; (b) The situation in which objects are hidden beneath clothing.
Figure 1. Schematic of the human body under radiation measurement. (a) The situation in which there are no concealed objects; (b) The situation in which objects are hidden beneath clothing.
Sensors 26 03577 g001
Figure 2. Schematic diagram of the PCNN neuron model.
Figure 2. Schematic diagram of the PCNN neuron model.
Sensors 26 03577 g002
Figure 3. The framework of the proposed fusion strategy.
Figure 3. The framework of the proposed fusion strategy.
Sensors 26 03577 g003
Figure 4. Flowchart of the GWACF decomposition method.
Figure 4. Flowchart of the GWACF decomposition method.
Sensors 26 03577 g004
Figure 5. Gradient domain PCNN model with MSMG modulation.
Figure 5. Gradient domain PCNN model with MSMG modulation.
Sensors 26 03577 g005
Figure 6. Imaging results for concealed object detection in the human body. (a) Detection imaging results for experimental scenario 1, with concealed objects including a metal pliers (#N1), a utility knife (#N2), an alcohol bottle (#N3), a mobile phone (#N4), and a charging case (#N5). (b) Detection imaging results for experimental scenario 2, with concealed objects including a water bottle (#N1), a ceramic knife (#N2), a handgun (#N3), an alcohol bottle (#N4), and a utility knife (#N5). (c) Detection imaging results for experimental scenario 3, with concealed objects including a water bottle (#N1), an alcohol bottle (#N2), a handgun (#N3), a glue (#N4), and a ceramic knife (#N5).
Figure 6. Imaging results for concealed object detection in the human body. (a) Detection imaging results for experimental scenario 1, with concealed objects including a metal pliers (#N1), a utility knife (#N2), an alcohol bottle (#N3), a mobile phone (#N4), and a charging case (#N5). (b) Detection imaging results for experimental scenario 2, with concealed objects including a water bottle (#N1), a ceramic knife (#N2), a handgun (#N3), an alcohol bottle (#N4), and a utility knife (#N5). (c) Detection imaging results for experimental scenario 3, with concealed objects including a water bottle (#N1), an alcohol bottle (#N2), a handgun (#N3), a glue (#N4), and a ceramic knife (#N5).
Sensors 26 03577 g006
Figure 7. Comparison of ROC curves for multiple methods in each concealed object. (ae) Concealed object #N1~#N5 of experimental scenario 1 results. (fj) Concealed object #N1~#N5 of experimental scenario 2 results. (ko) Concealed object #N1~#N5 of experimental scenario 3 results.
Figure 7. Comparison of ROC curves for multiple methods in each concealed object. (ae) Concealed object #N1~#N5 of experimental scenario 1 results. (fj) Concealed object #N1~#N5 of experimental scenario 2 results. (ko) Concealed object #N1~#N5 of experimental scenario 3 results.
Sensors 26 03577 g007
Table 1. Computed results of evaluation metrics for various processing methods. The bold values indicate the best performance.
Table 1. Computed results of evaluation metrics for various processing methods. The bold values indicate the best performance.
ScenarioMethodEntropyBRISQUENIQEPIQE
Scenario 1TB06.9140.5812.8754.41
TB456.8939.2513.8856.50
TB906.8638.4315.4055.47
TB1356.8338.0713.9055.06
TPSA6.7533.9411.5342.80
TPCA6.6934.0610.0842.03
TDCT6.5435.7310.2242.66
TLPF6.7134.4610.1441.46
TSF6.8238.2913.9153.40
TFDFuse6.2134.5810.3142.65
TLSRNet6.0333.3110.1840.46
TProposed5.9129.5010.0332.15
Scenario 2TB06.8641.0912.9255.19
TB456.8541.1011.3156.74
TB906.7440.0512.2954.38
TB1356.8741.0211.8555.09
TPSA6.6035.9110.4545.45
TPCA6.1142.4914.7963.39
TDCT6.4936.1710.4044.13
TLPF6.3833.4410.4543.19
TSF6.8440.2511.9656.57
TFDFuse5.4633.488.8743.29
TLSRNet5.4031.638.0641.47
TProposed5.2521.516.0123.09
Scenario 3TB06.7940.9612.4355.79
TB456.8541.0811.7756.56
TB906.8341.3612.6757.74
TB1356.8240.9812.6255.57
TPSA6.6136.4810.8245.40
TPCA6.6336.5910.7045.61
TDCT6.5040.6310.5644.66
TLPF6.6035.9510.1345.40
TSF6.5926.017.3728.05
TFDFuse6.2135.665.1328.43
TLSRNet6.1732.194.7925.19
TProposed6.0318.774.2418.44
Table 2. SNR (Unit: dB) for each concealed object in the experimental results. The bold values indicate the best performance.
Table 2. SNR (Unit: dB) for each concealed object in the experimental results. The bold values indicate the best performance.
ScenarioNumberConcealed ObjectTB0TB45TB90TB135TPSATPCATDCTTLPFTSFTFDFuseTLSRNetTProposed
Scenario 1#N1Metal pliers6.897.155.726.017.677.847.697.987.188.088.199.39
#N2Utility knife10.0110.718.4510.4311.7911.7811.6612.1611.5112.1512.3313.56
#N3Alcohol bottle11.4011.3510.1910.8111.7612.0812.0612.2711.5412.7112.9413.61
#N4Mobile phone9.139.467.1610.0510.4210.4910.4610.6510.2810.8611.0211.35
#N5Charging case10.7210.428.729.5811.3912.4612.4112.5511.6712.0412.2613.03
Scenario 2#N1Water bottle9.026.477.407.2211.2911.2711.3111.416.8811.8912.0112.92
#N2Ceramic knife6.291.712.793.987.517.557.527.536.478.068.379.29
#N3Handgun9.666.298.319.2310.9110.8810.9211.167.9511.3711.5311.72
#N4Alcohol bottle7.083.351.847.576.156.226.166.176.727.297.519.48
#N5Utility knife6.195.633.735.826.816.866.846.876.237.938.268.95
Scenario 3#N1Water bottle10.216.541.0811.9612.4112.3912.4512.613.6613.6213.9014.77
#N2Alcohol bottle9.7511.248.7110.5712.1312.1512.1812.7712.5513.1513.4915.73
#N3Handgun10.1710.497.0210.0311.2911.3111.0511.3410.9911.9212.1713.24
#N4Glue5.352.812.053.654.644.634.835.035.776.857.088.31
#N5Ceramic knife2.151.320.861.732.682.712.763.543.894.574.656.87
Table 3. SNR (Unit: dB) for each concealed object in the decomposition architecture ablation experiment results. The bold values indicate the best performance.
Table 3. SNR (Unit: dB) for each concealed object in the decomposition architecture ablation experiment results. The bold values indicate the best performance.
Metal PliersUtility KnifeAlcohol BottleMobile PhoneCharging Case
GF only6.8610.989.458.1210.62
WACF only7.3111.6711.389.7111.70
GWACF (complete)9.3913.5613.6111.3513.03
Table 4. SNR (Unit: dB) for each concealed object in the texture layer fusion ablation experiment results. The bold values indicate the best performance.
Table 4. SNR (Unit: dB) for each concealed object in the texture layer fusion ablation experiment results. The bold values indicate the best performance.
Metal PliersUtility KnifeAlcohol BottleMobile PhoneCharging Case
PCNN (without MSMG)8.2311.9212.7810.1611.57
PCNN + MSMG (complete)9.3913.5613.6111.3513.03
Table 5. SNR (Unit: dB) for each concealed object in the background layer fusion ablation experiment results. The bold values indicate the best performance.
Table 5. SNR (Unit: dB) for each concealed object in the background layer fusion ablation experiment results. The bold values indicate the best performance.
Metal PliersUtility KnifeAlcohol BottleMobile PhoneCharging Case
Simple weighted average8.7712.8112.9310.8212.35
EA fusion (complete)9.3913.5613.6111.3513.03
Table 6. SNR (Unit: dB) for each concealed object in the different decomposition layers ablation experiment results. The bold values indicate the best performance.
Table 6. SNR (Unit: dB) for each concealed object in the different decomposition layers ablation experiment results. The bold values indicate the best performance.
Metal PliersUtility KnifeAlcohol BottleMobile PhoneCharging Case
BS only5.418.858.086.937.74
BS+FS7.5310.7510.668.519.18
BS + FS + CS (complete)9.3913.5613.6111.3513.03
Table 7. SNR (Unit: dB) for each concealed object in the input construction method ablation experiment results. The bold values indicate the best performance.
Table 7. SNR (Unit: dB) for each concealed object in the input construction method ablation experiment results. The bold values indicate the best performance.
Metal PliersUtility KnifeAlcohol BottleMobile PhoneCharging Case
TB0, TB45, TB90, TB135 direct fusion8.8812.4912.7310.1711.36
TA/TB construction (complete)9.3913.5613.6111.3513.03
Table 8. Average execution time of various methods (Unit: s).
Table 8. Average execution time of various methods (Unit: s).
MethodAverage Execution Time
TPSA0.0085
TPCA0.2686
TDCT0.0153
TLPF0.3631
TSF0.3327
TFDFuse0.0968
TLSRNet0.1501
TProposed0.6873
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Chen, X.; Hu, F.; Zhu, D.; Su, J.; Fang, B.; Tao, J. A Multi-Scale Edge-Preserving Decomposition and Fusion Framework for Multi-Polarization Passive Millimeter-Wave Imaging. Sensors 2026, 26, 3577. https://doi.org/10.3390/s26113577

AMA Style

Chen X, Hu F, Zhu D, Su J, Fang B, Tao J. A Multi-Scale Edge-Preserving Decomposition and Fusion Framework for Multi-Polarization Passive Millimeter-Wave Imaging. Sensors. 2026; 26(11):3577. https://doi.org/10.3390/s26113577

Chicago/Turabian Style

Chen, Xinpeng, Fei Hu, Dong Zhu, Jinlong Su, Bo Fang, and Jingyu Tao. 2026. "A Multi-Scale Edge-Preserving Decomposition and Fusion Framework for Multi-Polarization Passive Millimeter-Wave Imaging" Sensors 26, no. 11: 3577. https://doi.org/10.3390/s26113577

APA Style

Chen, X., Hu, F., Zhu, D., Su, J., Fang, B., & Tao, J. (2026). A Multi-Scale Edge-Preserving Decomposition and Fusion Framework for Multi-Polarization Passive Millimeter-Wave Imaging. Sensors, 26(11), 3577. https://doi.org/10.3390/s26113577

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop