Microwave Reconstruction Method Based on Information Metamaterials and End-to-End Deep Learning

Shi, Hongyin; Song, Jiale; Guo, Jianwen

doi:10.3390/electronics14091731

Open AccessArticle

Microwave Reconstruction Method Based on Information Metamaterials and End-to-End Deep Learning

by

Hongyin Shi

^1,2,*

,

Jiale Song

^1,2 and

Jianwen Guo

³

¹

Intelligence Science and Technology, Beijing University of Civil Engineering and Architecture, Beijing 102616, China

²

Beijing Key Laboratory of Super Intelligent Technology for Urban Architecture, Beijing University of Civil Engineering and Architecture, Beijing 102616, China

³

School of Electrical Engineering & Automation, Henan Institute of Technology, Xinxiang 453003, China

^*

Author to whom correspondence should be addressed.

Electronics 2025, 14(9), 1731; https://doi.org/10.3390/electronics14091731

Submission received: 10 March 2025 / Revised: 20 April 2025 / Accepted: 22 April 2025 / Published: 24 April 2025

Download

Browse Figures

Versions Notes

Abstract

:

Microwave computational imaging (MCI) based on coded apertures does not rely on relative motion between the radar platform and the target, enabling forward-looking imaging. The performance of MCI depends on the computational methods and modulation of the coded aperture, particularly its design. However, current research methods treat the optimization of the coded aperture and computational imaging processing as independent tasks, with no unified framework to link these two aspects, limiting the potential for improving system performance. This paper proposes a novel deep learning-based MCI framework that jointly optimizes the coded aperture and image reconstruction process. Unlike traditional methods that decouple these two stages, our approach trains the sensing and reconstruction networks in an end-to-end fashion. The key novelty lies in constructing an end-to-end imaging network based on a convolutional neural network (CNN) where the coded aperture is modeled as a convolutional layer within the network. Physical constraints on the coded aperture are enforced by adding regularizers to the loss function. Simulation experiments demonstrate that under low signal-to-noise ratio (SNR) and low compression ratio conditions, the proposed method improves peak signal-to-noise ratio (PSNR) by 5 dB to 8 dB, enhances SSIM by 10% to 15%, and reduces relative imaging error by 0.5% to 1%.

Keywords:

microwave computational imaging; deep learning; information metamaterials; encoding aperture optimization

1. Introduction

Microwave imaging technology is based on the study of the propagation and scattering of microwaves in various complex media. By inverting the geometric shape, spatial position, electromagnetic characteristics, and other parameters of the target from its scattering field, the non-destructive detection of the target can be achieved [1]. It can perform both geometric and physical imaging of the target simultaneously with high resolution. Therefore, microwave imaging technology has important applications in target remote sensing [2], safety detection [3], biomedical imaging [4], and other fields [5]. With the continuous progress of computing speed and signal processing technology, microwave imaging technology is gradually developing towards computational imaging by solving a large number of linear equations to invert the target scene [6]. Computational imaging utilizes a universal mathematical imaging equation to obtain scene information through different measurement modes. Each measurement value corresponds to a measurement mode, and by combining the prior information from both, the real scene can be reconstructed. The effectiveness of computational imaging largely depends on the design of measurement modes, with the key being the design of multiple orthogonal measurement modes [7,8,9].

In recent years, the rapid development of metamaterials has provided new avenues for microwave computational imaging. Information metamaterials are composite materials with artificially designed structures that exhibit extraordinary physical properties not found in natural materials. They enable the precise control and modulation of electromagnetic waves by adjusting their structure and parameters [10].

Microwave computational imaging based on information metamaterials combines metamaterials with computational imaging technology to achieve high-resolution imaging [11]. The design of the encoding aperture is a key factor affecting imaging quality [12,13,14,15]. Different encoding apertures lead to different measurement modes, with a lower correlation between apertures resulting in better imaging performance. Hunt et al. [12] proposed a metamaterial aperture for microwave computed imaging. By utilizing the frequency domain dispersion characteristics of metamaterials and treating each frequency point as a measurement mode across a wide frequency band, microwave computational imaging can be achieved. To further improve the measurement mode, Sleasman et al. [16,17] proposed active loading by adding PIN diodes, varactor diodes, etc., to the dispersion metamaterial structure. By adjusting the different states of the diodes, non-correlated measurement modes can be extended in the time domain dimension, thereby improving imaging quality.

The concept of digital aperture coding antenna was first proposed by Cui [18] et al. in 2014, which constitutes the basic unit of digital coding metamaterials that can be individually controlled. It can form a real-time and controllable array encoding antenna, providing rich observation modes for detection and imaging. Compared to the newly proposed digital metamaterials, reflective frequency selective metamaterials can provide low degrees of freedom in the radiation field and contain less scene information in the received scene echoes. The use of transmissive programmable metamaterials in [19] results in high energy radiation efficiency at each frequency point and a low correlation receiving pattern at sampling frequency points within a frequency band, which is beneficial for improving measurement distance and reducing noise reception, achieving microwave imaging at a single frequency of 9.2 GHz.

At the decoding end, in order to ensure the quality of the recovered signal and have high performance requirements for the reconstruction algorithm, early work mainly considered that the image is sparse in the transformation domain, such as discrete cosine transform [20] or discrete wavelet transform [21]. In addition to the sparsity of the transformation domain, the sparsity of the spatial domain has also been widely applied in image perceptual reconstruction. The most prominent method among them is total variational regularization [22,23]. The model proposed by Zhang et al. [24] simultaneously utilizes the local sparsity and non-local self-similarity of images, achieving good reconstruction performance. However, these traditional algorithms suffer from high computational complexity and poor reconstruction quality [25], and are not suitable for practical applications.

The existing aperture coding perception system heavily relies on an iterative optimization algorithm (i.e., sparse enhancement optimization algorithm) that is computationally expensive, which seriously restricts the demand for real-time perception. In recent years, the integration of deep learning and microwave computational imaging has become a powerful approach to enhance various imaging performance metrics. Methods proposed in studies, such as [26,27,28,29], typically employ deep learning solely for image reconstruction, utilizing random or fixed coding apertures. These approaches do not optimize the encoding process, leading to limitations in imaging performance under low signal-to-noise ratios or high compression scenarios. Some research focuses on physical-layer learning-based metasurface optimization [30,31], attempting to design information metamaterials through learning-driven strategies. However, these methods often rely on principal component analysis (PCA) or non-end-to-end feature extraction techniques, making unified modeling and training challenging.

The performance of microwave computational imaging systems based on information metamaterials depends on two components: microwave measurement and digital post-processing [32,33,34,35]. In the coding mode design of metamaterial antennas, random coding generates a relatively uniform radiation field, leading to a high correlation in the observation matrix, which hampers image restoration. In digital post-processing, iterative optimization algorithms incur high computational costs. Traditional imaging methods based on sparse reconstruction and optimization suffer from high computational complexity, slow convergence, and poor robustness under low signal-to-noise ratio (SNR) or high compression scenarios. To address these challenges, we propose a novel deep learning-based MCI framework that jointly optimizes the encoding aperture and reconstruction process in an end-to-end fashion. The network consists of two parts: a compression measurement subnetwork and a reconstruction subnetwork. It directly learns the end-to-end mapping between the target scene and the reconstructed image, combining the design of encoded aperture patterns with computational imaging methods. It leverages deep learning to connect and optimize the design of the encoding aperture and target the reconstruction algorithm within a unified framework.

2. Modeling of MCI Systems Based on Transmitting Information Metamaterials

The microwave computational imaging (MCI) system based on transmissive information metamaterials is shown in Figure 1. The MCI system consists of a transmitter and receiver, processor terminal, and a coding aperture antenna at the receiving end that works as phase modulation. The signal reflected from the target can be received by a coded aperture antenna. The 1-bit digitally controlled coded aperture antenna is employed for the quick modulation of the received signal phase. Then, the echo signal is sampled and sent to the processor to reconstruct the target scatterers. The scattered electromagnetic waves are phase modulated by the transmissive aperture coding antenna at the receiving end and finally received by the receiving antenna. The signal processing terminal reconstructs the target through computational imaging.

In the two-dimensional MCI imaging model, we assume that the imaging plane is divided into

W

grid units, and the strong scatterers of the target are at the center of the grid cells. The number of encoded array elements for information metamaterial antennas is

M

. Assuming that the transmitting signal

s_{t} (t)

reaches the target at time

t_{n}

, the reflected signal at the w-th imaging grid is expressed as follows:

s (t_{n}, w) = s_{t} (t_{n} - d_{w} / c) .

(1)

where

d_{w}

denotes the distance from the transmitting antenna to the w-th grid. Then, the reflected signal arriving at the m-th coded-aperture antenna is the sum of the reflected signals from all the imaging plane grids:

\begin{matrix} s (t_{n}, m) = \sum_{w = 1}^{W} A_{m} (t_{n}) s (t_{n} - \frac{d_{m, w}}{c}, w) \exp (j φ_{m} (t_{n})) β_{w} \\ = A_{m} (t_{n}) \sum_{w = 1}^{W} s (t_{n} - \frac{d_{w} + d_{m, w}}{c}) \exp (j φ_{m} (t_{n})) β_{w} . \end{matrix}

(2)

where

A_{m} (t_{n})

denotes the amplitude modulation factor of the m-th coding unit at the moment

t_{n}

,

d_{m, w}

represents the distance between the m-th coding array element and the w-th imaging grid, and

φ_{m} (t_{n})

is the phase modulation factor of the m-th coding unit,

β_{w}

which is expressed as the scattering coefficient of the w-th imaging grid. The echo signal received by the receiving antenna located behind the information metamaterial is a superposition of

M

modulated signals:

\begin{matrix} S r (t_{n}) = \sum_{m = 1}^{M} \sum_{w = 1}^{W} A_{m} (t_{n}) s (t_{n} - \frac{d_{m}}{c}, m) \exp (j φ_{m} (t_{n})) β_{w} \\ = A_{m} (t_{n}) \sum_{w = 1}^{W} s_{t} (t_{n} - \frac{d_{w} + d_{m, w} + d_{m}}{c}) \exp (j φ_{m} (t_{n})) β_{w} . \end{matrix}

(3)

where

d_{m}

represents the distance between the m-th metamaterial coding array element and the receiving antenna, and in order to express it in the form of the multiplication of the reference signal with the scattering coefficient, the Equation (3) is rewritten as follows:

S r (t_{n}) = \sum_{w = 1}^{W} S (t_{n}, w) \cdot β_{w} .

(4)

where

S (t_{n}, w)

is the reference signal corresponding to the w-th grid cell in the imaging region.

S (t_{n}, w) = \sum_{m = 1}^{M} A_{m} (t_{n}) s_{t r} (t_{n} - \frac{d_{w} + d_{m}}{c}) \exp (j φ_{m} (t_{n})) .

(5)

At the signal transmitter end, a single signal is emitted, and the entire process is divided into

N

time segments. The echo signals received at the receiver end at different time intervals are represented as the following echo signal matrix:

[\begin{matrix} S r (t_{1}) \\ S r (t_{2}) \\ ⋮ \\ S r (t_{n}) \\ ⋮ \\ S r (t_{N}) \end{matrix}] = [\begin{matrix} S (t_{1}, 1) & S (t_{1}, 2) & \dots & S (t_{1}, w) & \dots & S (t_{1}, W) \\ S (t_{2}, 1) & S (t_{2}, 2) & \dots & S (t_{2}, w) & \dots & S (t_{2}, W) \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ S (t_{n}, 1) & S (t_{n}, 2) & \dots & S (t_{n}, w) & \dots & S (t_{n}, W) \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ S (t_{N}, 1) & S (t_{N}, 2) & \dots & S (t_{N}, w) & \dots & S (t_{N}, W) \end{matrix}] \cdot [\begin{matrix} β_{w} \\ β_{w} \\ ⋮ \\ β_{w} \\ ⋮ \\ β_{w} \end{matrix}] + [\begin{matrix} ω_{1} \\ ω_{2} \\ ⋮ \\ ω_{n} \\ ⋮ \\ ω_{N} \end{matrix}] .

(6)

where

ω_{N}

denotes the measurement noise at the nth coded measurement receive echo reception, and

N

is the number of coded measurement patterns. The form of the matrix equation expressed in Equation (6) can also be written in the form of a vector matrix:

S r = S \cdot β + ω .

(7)

Equation (7) is the sensing model of the system. Where

S r \in C^{N \times 1}

is the vector of echo signals throughout the

N

measurements.

S \in C^{N \times W}

represents the reference signal matrix, also known as the sensing matrix, whose correlation is determined by the coding mode of the coded aperture.

β \in C^{W \times 1}

is the vector of scattering coefficients corresponding to the center of the grid cell of the imaging region, and

ω \in C^{W \times 1}

denotes the noise vector.

The compression ratio is the ratio of the number of measured codes to the number of image grids, as follows:

γ = N / W .

(8)

From the imaging modeling process, it can be seen that the solution of the scattering coefficients mainly relies on the row-to-row and column-to-column correlations of the reference signal matrix. The stronger the modulation capability of the coded antenna, the stronger the non-correlation of S, and the more accurate the solution of the target scattering coefficient.

The sparsity criterion of the target is measured by the relationship between the L1 and L2 norms of a given vector [34].

S p a r s n e s s (β) = \sqrt{N} - \frac{\sum |β_{i}|}{\sqrt{\sum | β_{i} |^{2}}} / \sqrt{N} - 1 .

(9)

where

β

denotes the scattering coefficient of the imaging target, N is the pixel of the imaging target, and the influence of image sparsity on the reconstructed image is very important. A high sparsity image means that there are many zero pixels in the image, and in image reconstruction, a high sparsity image can often be better reconstructed. This is because images with high sparsity have fewer non-zero pixels, which can be reduced in storage space and computational complexity through compression and sparse representation methods.

3. Method

In order to improve the quality of target reconstruction, Existing methods mainly focus on two aspects [36,37]: the design of encoding modes in the measurement phase and the design of computational algorithms in the reconstruction phase, which are also referred to as microwave encoders and computational decoder design.

In previous studies, when employing 1-bit information metamaterial antennas, the random matrix with equal 0/1 probabilities (p = 0.5) demonstrated superior robustness in most scenarios. Consequently, the majority of coding pattern selections were either directly based on random matrices for aperture coding imaging or involved the optimization of coding matrices.

In the reconstruction stage, most methods are based on optimization problems, utilizing compressed sensing theory and prior information to adjust the loss function through regularization, such as Tikhonov regularization [38], Orthogonal Matching Pursuit (OMP) [39], Total Variation Augmented Lagrangian Alternating Direction Algorithm [40], and CNN [41]. However, existing methods often overlook the link between these two phases, typically considering them separately, which limits the improvement of imaging quality.

In the field of optics, data-driven deep learning methods leverage an increasing number of datasets to learn non-linear transformations, mapping projected encoded measurements to the desired output. This enables direct target reconstruction from projected encoded measurements by easily altering the deep neural network architecture and loss function. The use of a large number of data-driven methods to design encoding apertures means that the design of microwave encoders and computational decoders can be synchronized, achieving the joint optimization of both.

The microwave computational imaging system consists of two stages: the measurement stage and the reconstruction stage. In the proposed end-to-end framework, these two stages are integrated into a unified network, as illustrated in Figure 2.

In MCI based on information metamaterials, the metamaterial antenna updates the encoding and modulates the echo phase each time the target is measured. Although the reference matrix

S

is determined by the encoding mode of the metamaterial antenna, inferring the encoding mode from the known reference matrix

S

is difficult. Therefore, it is not feasible to directly parameterize the reference matrix into convolutional layers to simulate the measurement process of the system. In order to directly represent the encoding mode of metamaterial antennas as convolutional layers, the reference matrix is rewritten as follows:

S = D B .

(10)

The dimensions of each matrix in the equation are

D \in Z^{N \times M}

,

B \in C^{M \times W}

. The composition of matrix D is as follows:

D = [\begin{matrix} \exp (j φ_{1} (t_{1})) & \dots & \exp (j φ_{m} (t_{1})) & \dots & \exp (j φ_{M} (t_{1})) \\ \exp (j φ_{1} (t_{2})) & \dots & \exp (j φ_{m} (t_{2})) & \dots & \exp (j φ_{M} (t_{2})) \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ \exp (j φ_{1} (t_{n})) & \dots & \exp (j φ_{m} (t_{n})) & \dots & \exp (j φ_{M} (t_{n})) \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ \exp (j φ_{1} (t_{N})) & \dots & \exp (j φ_{m} (t_{N})) & \dots & \exp (j φ_{M} (t_{N})) \end{matrix}] .

(11)

Each of its rows represents the coding pattern of the informative metamaterial antenna in a particular measurement, and the elements of the matrix D consist of 1 or −1 since the modulation phase of the metamaterial antenna array elements is 0 or π. The elements of the coding matrix D are completely determined by the coding pattern of the metamaterial antenna, and thus, the matrix is referred to as the coding pattern matrix.

The construction method of matrix B is as follows:

B = [\begin{matrix} S^{'} (t_{1}, 1, 1) & S^{'} (t_{2}, 1, 2) & \dots & S^{'} (t_{n}, 1, w) & \dots & S^{'} (t_{N}, 1, W) \\ S^{'} (t_{1}, 2, 1) & S^{'} (t_{2}, 2, 2) & \dots & S (t_{n}, 2, w) & \dots & S^{'} (t_{N}, 2, W) \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ S^{'} (t_{1}, m, 1) & S^{'} (t_{2}, m, 2) & \dots & S^{'} (t_{n}, m, w) & \dots & S^{'} (t_{N}, m, W) \\ ⋮ & ⋮ & ⋮ & ⋮ & ⋮ & ⋮ \\ S^{'} (t_{1}, M, 1) & S^{'} (t_{2}, M, 2) & \dots & S (t_{n}, M, w) & \dots & S (t_{N}, M, W) \end{matrix}],

(12)

S (t_{n}, w) = \sum_{m = 1}^{M} S' (t_{n}, m, w) \cdot \exp (j φ_{m} (t_{m n})) .

(13)

Each column of matrix

B

represents the unmodulated reference signal corresponding to the wth imaging grid, and its elements are determined by the intrinsic parameters of the MCI-based information metamaterial aperture system, such as the imaging distance of the system, the number of array elements of the informative metamaterial antenna, and the radar signal parameters. Therefore,

B

is referred to as the system matrix.

As shown in Figure 3a, the following section provides a detailed explanation of the four parts of an end-to-end network.

Pretreatment: As expressed in Equation (10), the reference matrix has been represented in the form of the product between the coding matrix D and the system matrix B. Now, the sensing matrix of the system is transformed into the following form:

S r = S β = D B β = D β' .

(14)

The dimensions of the matrices in Equation (14) are

β' \in C^{M \times 1}

. The transformation of the aforementioned sensing model is facilitated by the fact that matrix B is a constant matrix—once the parameters of the imaging system are fixed, B remains invariant. The transformation process of

β' = B β

is called preprocessing.

Compressed measurement subnetwork: to simulate the measurement process using a convolutional layer, based on the sensing model

S r = D β'

, the

N

N

rows of the coding pattern of the matrix D in Equation (14) are considered as

N

filters, each of size

1 \times 1 \times M

. The echo vector is the result of filtering vector

β'

N

times. Thus, the measurement process can be modeled as a convolutional layer with

N

filters, and since it is non-overlapping sampling, the step size of the convolutional layer is set to 1 × 1, and its input is a vector of length

M

, and its output is a vector of length

N

. The elements of the coding pattern matrix D correspond to each other and can be automatically updated for optimization in the network. Since the matrix D is a binary matrix consisting of 1 or −1, constraints need to be added so that the weights of the convolutional layer after training are {1, −1}. After training, the weights of this layer are the optimized coding pattern, which can be used to replace the random coding pattern in subsequent measurements, thus obtaining better reconstruction results.

Initial reconstruction subnetwork: For the initial reconstructed image, the common method is usually to calculate the pseudo inverse matrix of the reference matrix.

x = pinv (S) \cdot S r .

(15)

where pinv denotes the pseudo-inverse matrix operator and the dimension of the pseudo-inverse matrix is

pinv (S) \in C^{W \times N}

,

x

\in C^{W \times 1}

denotes the initial reconstructed image, and

S r \in C^{N \times 1}

is the echo vector, with each element being a complex number. In order to transform the above initial image reconstruction process into a convolutional layer form, similar to the measurement subnetwork in the previous section, we use a convolutional layer with

W

filters to express the initial image reconstruction process, where each filter is of size

1 \times 1 \times N

.

Since the output of the measurement subnetwork is an

N \times 1

complex vector, the two column vectors of the real and imaginary parts of Sr are taken as inputs to the initial reconstruction subnetwork, which outputs two vectors of length W. In order to transform the output vectors into an initial image, a shaping operation is added to the back-end of the convolutional layer. This operation can reconstruct the vector of

1 \times W

dimension into an initial image matrix of

\sqrt{W} \times \sqrt{W}

dimension, and then input it into the deep reconstruction subnetwork of the backend for further reconstruction.

The image generated by the initial reconstruction subnetwork is still defocused. Therefore, in order to obtain high-resolution imaging results, this paper applies U-Net to achieve an accurate reconstruction of the target, and the network structure is shown in Figure 3b [41]. The proposed reconstruction network is based on a U-Net-style encoder-decoder architecture. It mainly consists of an encoder, decoder, and skip connection. The encoder first performs convolution and pooling operations on the input image, gradually reducing the size of the image and increasing the number of channels, in order to extract the features of the image, and uses a linear rectification function

Relu (x) = \max \{0, x\}

as the activation function. The decoder fuses the extracted features through upsampling and convolutional layers and achieves the transformation from the feature domain to the image domain. The input is a complex-valued image with size

32 \times 32 \times 2

, representing the real and imaginary components. The encoder includes four convolutional layers (kernel size

3 \times 3

) with ReLU activation, interleaved with max-pooling layers (stride 2). The decoder uses transposed convolutions (

2 \times 2

) and skip connections to restore spatial resolution. The final output layer is a single-channel

32 \times 32 \times 1

image passed through a Sigmoid activation. Training is performed using the Adam optimizer with a learning rate of 1 × 10⁻⁴, batch size of 32, for 200 epochs. The U-Net network can effectively process microwave images containing noise or incomplete data. The skip connections in the network preserve high-resolution features and enhance imaging details.

The convolutional layer in the constructed compressive measurement subnetwork can automatically learn the optimal coding patterns, but the learned weights are floating point numbers without constraints, which is difficult to achieve in practical applications. Therefore, some constraints need to be imposed on the optimization process to ensure that the learned coding patterns are binary. To this end, we introduce a binary regularizer in the loss function to implement the constraints, as shown in the following equation:

R_{ρ} (ϕ) = \frac{1}{M} \sum_{m = 1}^{M} {(ϕ_{m} + 1)}^{2} {(ϕ_{m} - 1)}^{2} .

(16)

In Equation (16),

ϕ

denotes the coded aperture pattern and

M

denotes the number of coding units of the metamaterial antenna. It can be seen that when the

ϕ

term is taken as 1 or −1, the minimum value of

R_{ρ}

(

ϕ

) can be obtained. It is formulated as a continuous approximation of the sign function, commonly used in binary neural networks, and pushes the learned patterns toward the desired discrete values {−1, 1}. This helps match the physical constraint of 1-bit programmable metasurfaces, which support only two phase states (0 and π).

In the MCI system, the more uncorrelated the random radiation field generated by the metamaterial antenna in time, the better the imaging results will be. The non-coherence of the radiation field in the time dimension is achieved by changing the encoding mode. Therefore, a regularization function is designed as follows to reduce the correlation between each encoding mode:

R_{σ} (ϕ) = \frac{1}{M} \sum_{m = 1}^{M} (\prod_{n = 1}^{N} ϕ_{m}^{n}) .

(17)

In Equation (17),

N

represents the number of encoding measurements and

ϕ_{m}^{n}

represents the value of the m-th encoding unit during the nth measurement. The weaker the correlation of

N

encoding measurement modes, the smaller the

R_{ρ}

(

ϕ

).

R_{σ}

(

ϕ

) penalizes the mutual correlation among different encoding patterns. Specifically, it minimizes the off-diagonal Frobenius norm of the Gram matrix formed by the rows of the encoding matrix. This design is inspired by compressed sensing theory, in which lower mutual coherence leads to better signal recovery. Therefore, the regularization term is added to the loss function to reduce the correlation between each encoding mode.

The imaging process of MCI is expressed as an end-to-end network that directly learns an end-to-end mapping between the compression measurements and the target image. The input and labels during training of this network are the target images themselves. Similar to deep neural networks used for image restoration, we use the mean square error (MSE) as the loss function of the proposed end-to-end network, and its training process can be interpreted as the following loss function optimization problem:

\{ϕ^{*}, θ^{*}\} \in \underset{ϕ, θ}{a r g m i n} L (ϕ, θ),

(18)

L (ϕ, θ) = \frac{1}{K} \sum_{k = 1}^{K} {‖N_{θ} (M_{ϕ} (f_{k})) - f_{k}‖}^{2} + ρ R_{ρ} (ϕ) + σ R_{σ} (θ) .

(19)

where

ϕ

represents the weight of the compressed measurement subnetwork, i.e., the coded aperture pattern,

θ

represents the weight of the reconstruction network,

ϕ

^* and

θ^{*}

denote the optimal weights,

M_{ϕ}

represents the compressed sampling network,

N_{θ}

represents the reconstruction network, and

{f_{k}}_{k = 1}^{K}

represents the training data.

R_{ρ} (ϕ)

and

R_{σ} (θ)

denote two regularizers,

R_{σ} (θ)

serves to suppress the overfitting phenomenon during the training process,

R_{ρ} (ϕ)

is different from

R_{σ} (θ)

, which serves to impose constraints on the coding aperture patterns, such that the coding patterns must be binary or the correlation between different coding patterns is as small as possible, etc.

ρ

and

σ

are the regularization parameters of the two regularizers. Adaptive moment estimation is used as an optimizer of the network parameters during training.

In the training stage, each image is treated as an imaging scene, and the original target image is preprocessed to obtain the processed scattering coefficient matrix

β'

. The

N

line encoding mode of matrix D in the compressed measurement subnet is equivalent to

N

filters, and the scattering coefficient matrix

β'

is filtered

N

times by the compressed measurement subnet to obtain the target vector Sr. The binary constraint

R_{ρ} (ϕ)

and correlation constraint

R_{σ} (θ)

are added to the compressed measurement subnet.

R_{ρ} (ϕ)

can convert the learned encoding patterns into binary form, while

R_{σ} (θ)

reduces the correlation between encoding patterns. In the reconstruction subnetwork, the echo vector Sr obtained from the measurement subnetwork is convolved and shaped to obtain the initial image matrix of

\sqrt{N} \times \sqrt{N}

. Then, it is fed into the u-net network to obtain the final output image. The mean square error is set as the loss function, and the derivatives

\partial L / \partial N_{θ}

and

\partial L / \partial M_{ϕ}

of the backpropagation loss function are used to calculate the minimum loss function during the training process. After updating and optimizing, the weight of matrix D is obtained, which is the optimal encoding mode.

After the training is completed, we used the optimal encoding mode as the optimized aperture encoding mode. At this point, the signal before phase modulation of the input metasurface antenna was inputted, and after optimization, the aperture coding modulation and decoder demodulation were used to obtain the reconstructed image at the output end. The entire process is shown in Figure 4.

4. Experiments

The system parameters are shown in Table 1. In the simulation, the MNIST [42] and airplane datasets were selected as the target training datasets. The choice of MNIST in our work is motivated by its wide adoption as a benchmark in computational imaging and inverse problem research. It offers a controlled and well-characterized setting that enables the reproducible evaluation of algorithmic performance, especially in the early stages of model development. The aircraft dataset, on the other hand, introduces shape-level complexity and serves as a bridge between abstract digits and structured man-made targets. The dataset consists of 60,000 training images and 20,000 test images, with each dataset representing half of the total. These datasets represent the imaging targets in the microwave imaging system. The target dimensions are standardized at 32 × 32 pixels, with each image comprising an identical number of scattering points. Consistent with radar target characteristics, the scattering point values are normalized to random values within the range [0, 1], which subsequently serve as both the input and labels for the end-to-end network. We use the U-NET convolutional neural network to process the datasets. The initial learning rate is set to 10⁻⁴, and the code is implemented in TensorFlow. The neural network is trained on a desktop computer with an NVIDIA 2080 GPU and CUDA version 10.0.

In order to verify the performance of the proposed method, it was compared with several commonly used competitive algorithms. Specifically, commonly used computational imaging reconstruction algorithms include the OMP algorithm, sparse Bayesian learning (SBL) algorithm, and the alternative direction multiplier method (ADMM). The method of using random coded aperture and only using deep neural networks for recovery tasks is called Random-CA, and the method used in this article is called learning CA.

All other algorithms compared to end-to-end methods use random encoded aperture patterns. To quantitatively measure the performance of different algorithms, this paper evaluates the reconstruction quality by calculating three quantitative image quality indicators, including relative error (RE), peak signal-to-noise ratio (PSNR), and structural similarity index measure (SSIM). The smaller the value of RE, the higher the values of PSNR and SSIM, indicating a better reconstruction quality of the target image.

RE = \frac{{‖\hat{x} - x‖}_{2}^{2}}{{‖x‖}_{2}^{2}} .

(20)

In Equation (20),

\hat{x}

and

x

represent the original signal and the recovered signal, respectively.

PSNR = 10 \log_{10} (\frac{{Max}^{2}}{MSE}) .

(21)

In Equation (21),

Max

is the maximum pixel value of the original image, MSE is the mean square error.

SSIM (x, h) = l {(x, h)}^{α} c {(x, h)}^{χ} s {(x, h)}^{ε} .

(22)

In Equation (22),

x

and

h

are the pixel values of the original and reconstructed images, respectively,

l, c, s

are the brightness, contrast, and structure functions, respectively, and

α, χ, ε

are the weights.

4.1. Reconstruction Results of Targets with Different Sparsity

Consider imaging tasks on targets with different sparsity. Figure 5 shows the results of reconstructing different sparsity targets using multiple algorithms. The original target images are number 7, number 2, and aircraft, with sparsities of 0.7288, 0.6620, and 0.4320, respectively. We set the SNR to 20 dB and the compression ratio to 0.5. The quantitative analysis of imaging quality is shown in Table 2. On the one hand, it can be seen that the proposed method can successfully reconstruct the target and is superior to typical computational imaging reconstruction algorithms. On the other hand, compared with random CA, this method also achieves better reconstruction results, proving the effectiveness of the deep coding aperture pattern optimization proposed in this paper.

4.2. Noise Robustness

In the simulation experiment, we used the MNIST dataset and aircraft image dataset, with the original target compression ratio set to 0.5, and added Gaussian white noise with different SNR to the echo for imaging. We compared this method with four algorithms to visually observe and quantitatively analyze imaging quality. Figure 6 shows the imaging results under different SNR, and Figure 7 shows the quantitative analysis line chart of three measurements. From the comparison of imaging results and imaging performance, it can be seen that the method proposed in this paper has good noise robustness.

4.3. Reconstruction Results of Targets with Different Compression Ratio Measurements

In order to investigate the reconstruction performance of the proposed method under compression measurements, the echo signal was imaged with a SNR of 20 dB, and handwritten digital targets were imaged under different compression ratios. The imaging results under different compression ratio measurements are given in Figure 8, and Figure 9 shows the line graphs of quantitative imaging performance metrics under different compression ratios. It can be seen that the method in this paper significantly improves the recoverable compression ratio, i.e., the measurement pattern is limited, the target is sparsely sampled, the proposed method can still perform the recovery task when other algorithms are unable to reconstruct the target image under low compression ratios, and better performance can be obtained when the target is measured using the coded aperture learnt from the end-to-end network.

To assess the stability and reliability of our proposed Learned-CA method across various experimental conditions, we performed 50 independent Monte Carlo simulations for each predefined compression ratio. The 95% confidence intervals were computed using the statistical formula

C I_{95 %} = \bar{x} \pm s / \sqrt{n}

based on the standard deviation of experimental results from each group (where

\bar{x}

is the sample mean,

s

is the standard deviation, n = 50), thereby demonstrating the method’s statistical stability and confidence level.

An observation of Table 3, Table 4 and Table 5 reveals that the proposed algorithm consistently maintains tight confidence intervals across various experimental settings, which reflects its robustness and the statistical stability of its performance.

4.4. Computational Complexity and Inference Efficiency

To assess the practicality of the proposed method in real-world applications, especially under real-time or resource-constrained scenarios, we present a comprehensive analysis of the computational complexity, inference speed, and memory usage of our deep learning-based reconstruction framework. A comparative evaluation against conventional iterative methods, including ADMM, OMP, and SBL, is also provided.

The inference process of our model consists of a fixed number of convolutional and linear operations, resulting in a theoretical computational complexity of

O (n^{2})

, where n denotes the image dimension. In contrast, traditional iterative methods involve multiple matrix multiplications, sparse recovery, and inversion operations, leading to higher computational burdens—typically exceeding

O (n^{3})

over several iterations. We benchmarked all methods on an NVIDIA RTX 2080 GPU(NVIDIA, Santa Clara, CA, USA) using 32 × 32 input images at a compression ratio of

γ = 0.3

. As shown in Table 6, the proposed method achieved an inference time of 28 ms and GPU memory usage of approximately 480 MB, which is significantly lower than ADMM (470 ms, 1200 MB) and OMP (320 ms, 960 MB). In terms of FLOPs, the proposed network requires ~85 M floating-point operations, compared to 920 M for ADMM and 750 M for OMP, confirming its computational efficiency. Moreover, since the architecture remains fixed across varying SNR levels and compression ratios, the inference cost does not increase under more challenging conditions, unlike iterative methods, whose performance and convergence are highly sensitive to γ and noise levels.

5. Conclusions

This paper focuses on the optimization of encoding aperture in MCI systems. We have designed a microwave computational imaging method based on information metamaterials and end-to-end deep learning, for the joint optimization of coding aperture design and computational imaging methods in MCI systems. This method converts the encoding aperture of the end-to-end network into a weight layer for optimization. To make the optimized encoding aperture physically achievable, multiple regularization functions are used to implement constraints. Compared with existing computational imaging algorithms, this paper combines encoding aperture design and computational imaging into end-to-end networks and trains them to obtain the optimal encoding method. This network has certain advantages over classical imaging algorithms in reconstructing targets with different sparsity at low SNR. This system can reconstruct high-resolution sparse targets under compressed measurement, with good noise resistance and an improved recoverable compression ratio.

Future Work

To further enhance the applicability and practicality of the proposed method, several future research directions are worth pursuing.

First, although the current study validates the effectiveness of our approach through numerical simulations, real-world implementation remains a critical next step. We plan to construct a small-scale experimental prototype using 1-bit programmable metamaterials based on PIN diode or MEMS technology. This prototype will allow us to assess the feasibility of deploying the learned encoding patterns in physical systems and to examine the system’s robustness under hardware non-idealities, such as phase deviation, switching delay, and electromagnetic interference. Second, the scalability of the proposed framework is an important concern. While the model performs well on 32 × 32 resolution images, we encountered GPU memory overflow during training on higher-resolution data (e.g., 128 × 128). To overcome this limitation, we will explore modular decoupling strategies—such as stage-wise training—where the reconstruction network is pre-trained and the encoding pattern is subsequently optimized—as well as patch-wise processing and lightweight network design. These approaches are expected to improve the model’s efficiency and scalability. Third, the manufacturability of the optimized binary encoding patterns will be systematically studied. Although current programmable metamaterials support binary phase modulation, fabrication tolerances and imperfect switching may introduce deviations from the ideal {−1, 1} values. To mitigate this, we incorporated binary constraints and correlation-penalizing regularization during training to enhance pattern robustness. Future work will also consider quantization-aware training and hardware-in-the-loop simulations to better align the learned encoding patterns with real-world hardware characteristics.

Through these efforts, we aim to bridge the gap between algorithm design and physical implementation, advancing toward practical, real-time, high-resolution microwave computational imaging systems.

Author Contributions

Conceptualization, H.S., J.G.; methodology, J.S.; software, J.S.; validation, J.S.; formal analysis, H.S. and J.G.; investigation, J.S.; resources, H.S.; data curation, J.S.; writing—original draft preparation, J.S.; visualization, J.S.; supervision, H.S., and J.G.; project administration, H.S.; funding acquisition, H.S. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under Grant 62071414, the R&D Program of Beijing Municipal Education Commission under Grant KZ202210016021, and the Cultivation project Funds for Beijing University of Civil Engineering and Architecture under Grant X24031.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Data are contained within the article.

Conflicts of Interest

The authors declare no conflicts of interest.

References

Imani, M.F.; Gollub, J.N.; Yurduseven, O.; Diebold, A.V.; Boyarsky, M.; Fromenteze, T.; Pulido-Mancera, L.; Sleasman, T.; Smith, D.R. Review of Metasurface Antennas for Computational Microwave Imaging. IEEE Trans. Antennas Propag. 2020, 68, 1860–1875. [Google Scholar] [CrossRef]
Qiu, W.; Martorella, M.; Zhou, J.; Zhao, H.; Fu, Q. Three-dimensional inverse synthetic aperture radar imaging based on compressive sensing. IET Radar Sonar Navigat. 2015, 9, 411–420. [Google Scholar] [CrossRef]
Khalesi, B.; Sohani, B.; Ghavami, N.; Ghavami, M.; Dudley, S.; Tiberi, G. A Phantom Investigation to Quantify Huygens Principle Based Microwave Imaging for Bone Lesion Detection. Electronics 2019, 8, 1505. [Google Scholar] [CrossRef]
Cannatà, A.; Elahi, A.; O’halloran, M.; Pasian, M.; Di Meo, S.; Matrone, G.; Amin, B. Microwave Bone Imaging: Reconstruction of Anthropomorphic Numerical Calcaneus Phantoms for Bone Diseases Diagnosis. IEEE Access 2024, 12, 123447–123458. [Google Scholar] [CrossRef]
Yang, J.; Lee, H. Simultaneous Imaging of Static and Microwave Magnetic Field Distributions by Magneto Optical Indicator Microscopy. IEEE Access 2024, 12, 133045–133053. [Google Scholar] [CrossRef]
Sleasman, T.A.; Imani, M.F.; Diebold, A.V.; Boyarsky, M.; Trofatter, K.P.; Smith, D.R. Implementation and Characterization of a Two-Dimensional Printed Circuit Dynamic Metasurface Aperture for Computational Microwave Imaging. IEEE Trans. Antennas Propag. 2021, 69, 2151–2164. [Google Scholar] [CrossRef]
Zhou, Y.-S.; Wang, Q.; Ma, L.-L.; Li, C.-R.; Tang, L.-L.; Liu, Y.-K. Quality analysis for images acquired by a new microwave staring correlation imaging technique. In Proceedings of the 2012 IEEE International Geoscience and Remote Sensing Symposium, Munich, Germany, 22–27 July 2012; pp. 4602–4605. [Google Scholar]
Sreenivasulu, K.; Ray, K.P.; Rao, D.S.; Kumar, P.; Vengadarajan, A. X-Band 16-Channel Transmit-Receive Plank Unit for High-Resolution Imaging RADAR. IEEE Access 2024, 12, 139456–139468. [Google Scholar] [CrossRef]
He, J.; Chen, Z.; Zhao, C.; Chen, X.; Wei, Y.; Zhang, C. Wave Parameter Inversion with Coherent Microwave Radar Using Spectral Proper Orthogonal Decomposition. IEEE Trans. Geosci. Remote Sens. 2022, 60, 2006311. [Google Scholar] [CrossRef]
Yin, S.; Shi, X.; Huang, W.; Zhang, W.; Hu, F.; Qin, Z.; Xiong, X. Two-Bit Terahertz Encoder Realized by Graphene-Based Metamaterials. Electronics 2019, 8, 1528. [Google Scholar] [CrossRef]
Zhang, J.; Hu, T.; Shao, X.; Gu, H.; Xiao, Z. Wavenumber Spectrum Reconstruction Method for Microwave Computational Imaging with Re-Programmable Metasurface. IEEE Trans. Comput. Imaging 2023, 9, 383–395. [Google Scholar] [CrossRef]
Hunt, J.; Driscoll, T.; Mrozack, A.; Lipworth, G.; Reynolds, M.; Brady, D.; Smith, D.R. Metamaterial apertures for computational imaging. Science 2013, 339, 310–313. [Google Scholar] [CrossRef] [PubMed]
Xu, Y.; Zhu, D.; Hu, F.; Fang, B.; Fu, P. Target Imaging Using Compressed Sampling in Synthetic Aperture Interferometric Radiometer. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5301515. [Google Scholar] [CrossRef]
Molaei, A.M.; Hu, S.; Kumar, R.; Yurduseven, O. MIMO Coded Generalized Reduced Dimension Fourier Algorithm for 3-D Microwave Imaging. IEEE Trans. Geosci. Remote Sens. 2023, 61, 5205115. [Google Scholar] [CrossRef]
Al-Sadoon, M.A.G.; de Ree, M.; Abd-Alhameed, R.A.; Excell, P.S. Uniform Sampling Methodology to Construct Projection Matrices for Angle-of-Arrival Estimation Applications. Electronics 2019, 8, 1386. [Google Scholar] [CrossRef]
Sleasman, T.; Imani, M.F.; Xu, W.; Hunt, J.; Driscoll, T.; Reynolds, M.S.; Smith, D.R. Waveguide-Fed Tunable Metamaterial Element for Dynamic Apertures. IEEE Antennas Wirel. Propag. Lett. 2016, 15, 606–609. [Google Scholar]
Sleasman, T.; Boyarsky, M.; Pulido-Mancera, L.; Fromenteze, T.; Imani, M.F.; Reynolds, M.S.; Smith, D.R. Experimental Synthetic Aperture Radar with Dynamic Metasurfaces. IEEE Trans. Antennas Propag. 2017, 65, 6864–6877. [Google Scholar] [CrossRef]
Cui, T.J.; Qi, M.Q.; Wan, X.; Zhao, J.; Cheng, Q. Coding metamaterials, digital metamaterials and programmable metamaterials. Light Sci. Appl. 2014, 3, e218. [Google Scholar] [CrossRef]
Hunt, J.; Gollub, J.; Driscoll, T.; Lipworth, G.; Mrozack, A.; Reynolds, M.S.; Brady, D.J.; Smith, D.R. Metamaterial microwave holographic imaging system. J. Opt. Soc. Am. A 2014, 31, 2109–2119. [Google Scholar] [CrossRef]
He, L.; Chen, H.; Carin, L. Tree-structured compressive sensing with variational Bayesian analysis. IEEE Signal Process. Lett. 2009, 17, 233–236. [Google Scholar]
Mun, S.; Fowler, J.E. Block compressed sensing of images using directional transforms. In Proceedings of the IEEE International Conference on Image Processing (ICIP), Cairo, Egypt, 7–10 November 2009; pp. 3021–3024. [Google Scholar]
Bioucas-Dias, J.M.; Figueiredo, M.A. A new TwIST: Two-step iterative shrinkage/thresholding algorithms for image restoration. IEEE Trans. Image Process. 2007, 16, 2992–3004. [Google Scholar] [CrossRef]
Wang, Y.; Yang, J.; Yin, W.; Zhang, Y. A new alternating minimization algorithm for total variation image reconstruction. SIAM J. Imaging Sci. 2008, 1, 248–272. [Google Scholar] [CrossRef]
Zhang, J.; Zhao, D.; Gao, W. Group-based sparse representation for image restoration. IEEE Trans. Image Process. 2014, 23, 3336–3351. [Google Scholar] [CrossRef] [PubMed]
Hanumanth, P.; Bhavana, P.; Subbarayappa, S. Application of deep learning and compressed sensing for reconstruction of images. J. Phys. Conf. Ser. 2020, 1706, 012068–012084. [Google Scholar] [CrossRef]
Zhang, J.; Sharma, R.; García-Fernández, M.; Álvarez-Narciandi, G.; Abbasi, M.A.B.; Yurduseven, O. Deep learning for sensing matrix prediction in computational microwave imaging with coded-apertures. IEEE Access 2024, 12, 16844–16855. [Google Scholar] [CrossRef]
Benny, R.; Anjit, T.; Palayyan, M. Deep learning based non-iterative solution to the inverse problem in microwave imaging. Prog. Electromagn. Res. M 2022, 109, 231–240. [Google Scholar] [CrossRef]
Ghorbani, F.; Beyraghi, S.; Shabanpour, J.; Oraizi, H.; Soleimani, H.; Soleimani, M. Deep neural network-based automatic metasurface design with a wide frequency range. Sci. Rep. 2021, 11, 7102. [Google Scholar] [CrossRef]
Chiu, C.-C.; Li, C.-L.; Chen, P.-H.; Cheng, H.-M.; Jiang, H. Whale Optimization Algorithm with Machine Learning for Microwave Imaging. Electronics 2024, 13, 4342. [Google Scholar] [CrossRef]
Razzaque, A.; Badholia, A. PCA based feature extraction and MPSO based feature selection for gene expression microarray medical data classification. Meas. Sens. 2024, 31, 100945. [Google Scholar] [CrossRef]
Bao, J.H.; Li, W.H.; Huang, S.Q.; Yu, W.M.; Liu, C.; Cui, T.J. Physics-driven unsupervised deep learning network for programmable metasurface-based beamforming. iScience 2024, 27, 110595. [Google Scholar] [CrossRef]
Asok, A.O.; SJ, G.N.; Dey, S. Microwave Imaging with Novel Time-Domain Clutter Removal Algorithm Using High Gain Antennas for Concealed Object Detections. IEEE Trans. Comput. Imaging 2023, 9, 147–158. [Google Scholar] [CrossRef]
Dai, F.; Fu, H.; Hong, L.; Li, L.; Liu, H. Off-Grid Error and Amplitude–Phase Drift Calibration for Computational Microwave Imaging with Metasurface Aperture Based on Sparse Bayesian Learning. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5115514. [Google Scholar] [CrossRef]
Zhang, S.; Jin, S.; Dai, F. Enhancement of Metasurface Aperture Imaging via Information-Theoretic Waveform Optimization Algorithm. In Proceedings of the 2021 CIE International Conference on Radar (Radar), Haikou, China, 15–19 December 2021; pp. 2403–2406. [Google Scholar] [CrossRef]
Chen, Y.; Nasrabadi, N.M.; Tran, T.D. Hyperspectral image classification using dictionary-based sparse representation. IEEE Trans. Geosci. Remote Sens. 2011, 49, 3973–3985. [Google Scholar] [CrossRef]
Dai, F.; Zhang, S.; Li, L.; Liu, H. Enhancement of Metasurface Aperture Microwave Imaging via Information-Theoretic Waveform Optimization. IEEE Trans. Geosci. Remote Sens. 2022, 60, 5109512. [Google Scholar] [CrossRef]
Vakalis, S.; Chen, D.; Nanzer, J.A. Toward Space–Time Incoherent Transmitter Design for Millimeter-Wave Imaging. IEEE Antennas Wirel. Propag. Lett. 2020, 19, 1471–1475. [Google Scholar] [CrossRef]
Sun, Y.; Zhang, Y.; Wen, Y. Image Reconstruction Based on Fractional Tikhonov Framework for Planar Array Capacitance Sensor. IEEE Trans. Comput. Imaging. 2022, 8, 109–120. [Google Scholar] [CrossRef]
Schnass, K. Average Performance of Orthogonal Matching Pursuit (OMP) for Sparse Approximation. IEEE Signal Process. Lett. 2018, 25, 1865–1869. [Google Scholar] [CrossRef]
Güngör, A.; Çetin, M.; Güven, H.E. Compressive Synthetic Aperture Radar Imaging and Autofocusing by Augmented Lagrangian Methods. IEEE Trans. Comput. Imaging 2022, 8, 273–285. [Google Scholar] [CrossRef]
Han, Y.; Ye, J.C. Framing U-Net via Deep Convolutional Framelets: Application to Sparse-View CT. IEEE Trans. Med. Imaging. 2018, 37, 1418–1429. [Google Scholar] [CrossRef]
Lecun, Y.; Bottou, L.; Bengio, Y.; Haffner, P. Gradient-based learning applied to document recognition. Proc. IEEE 1998, 86, 2278–2324. [Google Scholar] [CrossRef]

Figure 1. Imaging model based on transmissive metamaterial antenna.

Figure 2. Joint optimization framework.

Figure 3. End-to-end network: (a) the composition of end-to-end networks; (b) deep reconstruction subnetwork.

Figure 4. Joint optimization process.

Figure 5. Imaging results of targets with different sparsity using different methods: (a) original signal; (b) ADMM; (c) OMP; (d) SBL; (e) Random-CA; (f) Learned-CA.

Figure 6. Imaging results with different signal-to-noise ratios; the bottom is the method described in this article.

Figure 7. Variation in imaging performance with different signal-to-noise ratios. (a) PSNR. (b) Relative Error. (c) SSIM. The red line represents the algorithm in this article.

Figure 8. Imaging results with different compression ratios; the bottom is the method described in this article.

Figure 9. Changes in imaging performance under different compression ratios. (a) PSNR. (b) RE. (c) SSIM. The red line represents the algorithm in this article.

Table 1. System parameters.

Parameters	Quantity
Centre frequency	15 GHz
Bandwidth	1 GHz
Pulse width	100 ns
Tuning frequency	10¹⁶ Hz/s
Imaging distance	6 m
Coding aperture size	0.5 m × 0.5 m
Number of metamaterial cells	41 × 41
Imaging grid size	0.02 m × 0.02 m
Number of imaging grids	32 × 32

Table 2. Imaging results for different methods. The best results are in bold.

Metrics	PSNR(dB)	RE	SSIM
OMP	10.79	0.0839	0.1807
ADMM	10.91	0.0811	0.0051
SBL	11.19	0.0759	0.1697
TVAL3	19.86	0.0211	0.814
Random-CA	20.16	0.0110	0.828
Learned-CA	25.86	0.0030	0.953

Table 3. The confidence intervals of PSNR for our proposed algorithm.

	Mean PSNR(dB)	95% CI Lower	95% CI Upper
0.01	16.33	16.27	16.39
0.1	23.51	23.47	23.55
0.2	24.28	24.24	24.32
0.3	24.67	24.63	24.71
0.4	25.03	24.97	25.09
0.5	25.94	24.89	24.99
0.6	26.05	26.01	26.09
0.7	26.32	26.49	26.35

Table 4. The confidence intervals of RE for our proposed algorithm.

	Mean RE	95% CI Lower	95% CI Upper
0.01	0.0359	0.0345	0.0373
0.1	0.0134	0.012	0.0148
0.2	0.0112	0.0099	0.0125
0.3	0.0098	0.0083	0.0113
0.4	0.0084	0.0072	0.0096
0.5	0.0075	0.0063	0.0087
0.6	0.0071	0.0058	0.0084
0.7	0.0058	0.0046	0.006

Table 5. The confidence intervals of SSIM for our proposed algorithm.

	Mean SSIM	95% CI Lower	95% CI Upper
0.01	0.6113	0.6032	0.6194
0.1	0.9211	0.9138	0.9284
0.2	0.9227	0.9167	0.9287
0.3	0.9246	0.9183	0.9309
0.4	0.9305	0.937	0.924
0.5	0.9342	0.9292	0.9392
0.6	0.9381	0.9333	0.9429
0.7	0.9404	0.9369	0.9439

Table 6. The complexity analysis of our proposed algorithm.

Method	Inference Time (ms)	FLOPs (×10⁶)	GPU Memory Usage (MB)
ADMM	470	920	1200
OMP	320	750	960
SBL	280	680	900
Randomed-CA	30	85	490
Learned-CA	28	85	480

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Shi, H.; Song, J.; Guo, J. Microwave Reconstruction Method Based on Information Metamaterials and End-to-End Deep Learning. Electronics 2025, 14, 1731. https://doi.org/10.3390/electronics14091731

AMA Style

Shi H, Song J, Guo J. Microwave Reconstruction Method Based on Information Metamaterials and End-to-End Deep Learning. Electronics. 2025; 14(9):1731. https://doi.org/10.3390/electronics14091731

Chicago/Turabian Style

Shi, Hongyin, Jiale Song, and Jianwen Guo. 2025. "Microwave Reconstruction Method Based on Information Metamaterials and End-to-End Deep Learning" Electronics 14, no. 9: 1731. https://doi.org/10.3390/electronics14091731

APA Style

Shi, H., Song, J., & Guo, J. (2025). Microwave Reconstruction Method Based on Information Metamaterials and End-to-End Deep Learning. Electronics, 14(9), 1731. https://doi.org/10.3390/electronics14091731

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Microwave Reconstruction Method Based on Information Metamaterials and End-to-End Deep Learning

Abstract

1. Introduction

2. Modeling of MCI Systems Based on Transmitting Information Metamaterials

3. Method

4. Experiments

4.1. Reconstruction Results of Targets with Different Sparsity

4.2. Noise Robustness

4.3. Reconstruction Results of Targets with Different Compression Ratio Measurements

4.4. Computational Complexity and Inference Efficiency

5. Conclusions

Future Work

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI