Article

Detection of Parabolic Antennas in Satellite Inverse Synthetic Aperture Radar Images Using Component Prior and Improved-YOLOv8 Network in Terahertz Regime

1 College of Electronic Science and Technology, National University of Defense Technology, Changsha 410073, China
2 Air Defense and Antimissile School, Air Force Engineering University, Xi’an 710043, China
* Author to whom correspondence should be addressed.
Remote Sens. 2025, 17(4), 604; https://doi.org/10.3390/rs17040604
Submission received: 17 December 2024 / Revised: 26 January 2025 / Accepted: 27 January 2025 / Published: 10 February 2025
(This article belongs to the Special Issue Advanced Spaceborne SAR Processing Techniques for Target Detection)

Abstract

Inverse Synthetic Aperture Radar (ISAR) imaging of space targets and their key components is of great importance. However, ISAR imagery suffers from numerous drawbacks, including a low Signal-to-Noise Ratio (SNR), blurred edges, significant variations in scattering intensity, and limited data availability, all of which constrain recognition performance. The terahertz (THz) regime has shown excellent capacity for space detection in terms of revealing the details of target structures. However, in ISAR images, as the observation aperture moves, the imaging features of extended structures (ESs) change significantly, posing challenges to subsequent recognition. In this paper, a parabolic antenna is taken as the research object, and an innovative approach for identifying this component is proposed that effectively exploits the Component Prior and Imaging Characteristics (CPICs). To tackle the challenges associated with component identification in satellite ISAR imagery, this study employs the Improved-YOLOv8 model, which combines the YOLOv8 algorithm with an adaptive detection head, the Dynamic head (Dyhead), that utilizes an attention mechanism, and with a regression box loss function, Wise Intersection over Union (WIoU), that addresses the issue of varying sample difficulty. After being trained on the simulated dataset, the model demonstrated a considerable enhancement in detection accuracy over the five baseline models, reaching an mAP50 of 0.935 and an mAP50-95 of 0.520, improvements of 0.192 and 0.076, respectively, over YOLOv8n. Ultimately, the effectiveness of the proposed method is confirmed through comprehensive simulations and anechoic chamber tests.

1. Introduction

In response to the needs of scientific and economic advancement, an extensive array of space-based systems, including space stations and satellites, has been deployed into the vast expanse of space [1,2]. To date, optical telescopes [3,4] and ground-based radars [5] are the main sensors for detecting satellites in orbit. High-resolution observations of orbiting objects are feasible with optical telescopes; however, the clarity of the resulting images is strongly influenced by lighting conditions. Ground-based radar systems are capable of continuous, round-the-clock surveillance and ISAR imaging of space objects, and they have become the predominant method for such oversight. However, they still have several drawbacks. First, limited transmission power poses challenges in detecting High Earth Orbit (HEO) satellites and small satellites, as the long transmission distance and their low Radar Cross Section (RCS) make them difficult to observe. Second, current ground-based radar systems typically operate in frequency bands below the W-band to mitigate atmospheric attenuation, which degrades radar signals as they pass through the atmosphere. However, operating at lower frequencies comes with a trade-off: it limits the imaging resolution. As a result, detailed imaging and recognition of space objects down to the component level are not possible, meaning that fine details of the satellite structure or individual components may not be discernible with these radar systems. Third, when observing a target in HEO, the change in the elevation angle of the Line of Sight (LOS) is not large enough to match the change in the azimuth angle, resulting in an unbalanced distribution of attitude parameters.
Furthermore, accurately confirming the status of the target requires a detailed understanding of component information, which current methods are unable to fully provide. In addition, certain key structural features of the object may be susceptible to fading and blurring during radar imaging. This can be problematic for intelligent image interpretation, as the loss of clarity in these features may hinder accurate identification and analysis of the object’s components or characteristics. Such limitations affect the overall effectiveness of radar surveillance and the subsequent analysis of space objects.
Terahertz (THz) waves are a form of electromagnetic radiation with wavelengths spanning from 3 mm to 30 µm, corresponding to frequencies between 0.1 THz and 10 THz. THz waves are of interest in various applications due to their unique properties, such as the ability to penetrate certain materials without significant attenuation, making them useful for non-destructive testing and imaging as well as for communication and sensing [6]. Since terahertz waves have a shorter wavelength than low-frequency microwaves, they can achieve high-resolution imaging with shorter synthetic apertures. In addition, the terahertz region has many unique characteristics, including a degree of anti-interference capability [7] and penetration [8], and is widely used in astronomical exploration and wireless communication [9,10,11]. A space-based terahertz radar imaging system is proposed because of the above advantages. In high-resolution radar applications, terahertz radar imaging is therefore expected to reveal the detailed structural characteristics of the target’s key components.
The parabolic antenna is a key component of satellite communication systems. Accurate identification of parabolic antennas is important for condition assessment and function maintenance, and it provides favorable support for subsequent three-dimensional reconstruction and pointing estimation tasks. The basic scattering characteristics and imaging methods for the terahertz frequency band are thoroughly discussed in reference [12]. That study demonstrates that the number of scattering centers in a scene increases significantly in the presence of rough surfaces, which can degrade image quality. To address the image quality degradation caused by dense scattering centers, researchers have investigated terahertz imaging enhancement techniques based on sparse regularization [13] and machine learning [14]. These methods have been shown to effectively surpass the Rayleigh limit, a fundamental resolution limit of traditional imaging techniques. When these imaging enhancement methods were first proposed, scholars aimed to achieve a level of detail in the object of interest as clear as that of optical images, even in the presence of complex scattering environments that traditionally limit the clarity of terahertz images. Our research shows that, due to the special azimuth dependence of extended structures (ESs), the key extended structures of complex targets such as parabolic antennas may disappear and be discretized into specular points, edge pair-points, or ellipse arcs, which are quite different from optical images. Therefore, a new image interpretation and translation method for parabolic antennas is needed.
The Constant False Alarm Rate (CFAR) detection technique, a well-established method, is widely utilized for its straightforwardness and efficiency [15]. A robust CFAR detector, grounded in truncated maximum likelihood principles, is adeptly implemented in environments prone to outlier interference, demonstrating adequate computational efficiency [16]. Despite this, contemporary object detection approaches leveraging deep neural networks have outperformed traditional methods in both accuracy and speed [17,18,19,20]. Rotter et al. [21] developed two alternative systems that utilize the Single-Shot MultiBox Detector (SSD) and the Tiny YOLOv2 network for the automated detection of sinking funnels generated by underground coal mining. A method integrating Interferometric Synthetic Aperture Radar (InSAR) and Convolutional Neural Networks (CNNs) has been proposed for the automatic identification of subsidence funnels caused by coal mining activities [22]. Yu et al. [23] introduced a lean model named Light You Only Look Once (YOLO)-Basin, which is designed for subsidence basin detection using the YOLOv5 network. Under the premise of key frames, a technique to estimate the parameters of a parabolic antenna is presented explicitly [24]; this method significantly reduces the computational complexity of the model and improves its accuracy. In the terahertz frequency band, by analyzing the sliding scattering center and cross-polarization imaging characteristics of parabolic antennas, an improved Hough transform method has been proposed for identifying parabolic antennas [25]. Through the analysis of the prior information of components and their imaging characteristics, the literature effectively identifies the components and is able to successfully reconstruct them in the terahertz regime [26]. These advancements offer valuable insights that inform the development of our proposed approach.
In this article, a novel method for detecting parabolic antennas is proposed. The main idea lies in making good use of the CPICs together with an Improved-YOLOv8 object detection network. To make full use of the CPICs of the component, four objects are to be detected: the satellite body and the three statuses of the parabolic antenna, namely, the specular point, edge pair-points, and ellipse arc. We not only characterize the parabolic antenna when imaging it alone but also identify the parabolic antenna based on detection of the whole satellite. This article makes the following key contributions:
(1) Inserting the CPICs of a parabolic antenna into the object detection network, a new method for parabolic antenna detection is proposed, including two steps: determining the component prior and detecting the three statuses of the parabolic antenna.
(2) Using the scattering prior which is obtained by electromagnetic simulations, the imaging characteristics of the parabolic antennas of satellites are analytically determined.
(3) Compared with previous target detection networks, the proposed method achieves better detection effects for key components.
The proposed method’s overall structure is depicted in Figure 1. In Figure 1, firstly, we obtain the original echo data of the satellite model through electromagnetic simulation software, as mentioned in “Raw data of satellite echoes”. Then, using the classical RD imaging algorithm, we obtain the “Imaging result of satellite”. Additionally, by “Analyzing the CPICs” of the parabolic antenna, we extract its imaging characteristics, providing an improvement approach for the subsequent deep learning model. Finally, combining the CPICs of the parabolic antenna, we convert the recognition of the parabolic antenna into the detection of “Specular reflection point”, “Edge pair-point”, and “Ellipse arc”, and we execute “Training the Networks”. The organization of the remainder of this article is as follows. In Section 2, the THz imaging characteristics of parabolic antennas are discussed. In Section 3, the Improved-YOLOv8 network is illustrated in detail. The electromagnetic simulation results and relevant analysis are provided, and the results are verified by the anechoic chamber experiment in Section 4. Finally, we give the conclusions of this paper in Section 5.

2. THz Imaging Characteristics of Parabolic Antennas

The observation geometry of the spatial target for the space-based terahertz radar is depicted in Figure 2. The LOS vector links the $O_{Target}$ and $O_{Radar}$ points. In the case of a three-axis stabilized space target, its orientation remains constant with respect to the target coordinate system, and the projection onto the imaging plane is determined by the corresponding LOS angles of the radar.
The radar LOS can be represented as:
$\mathrm{LOS} = \left[\cos\theta\cos\varphi,\ \cos\theta\sin\varphi,\ \sin\theta\right]^{T}$
where the superscript $T$ denotes transposition and $\theta \in [-90^{\circ}, 90^{\circ})$ and $\varphi \in [-180^{\circ}, 180^{\circ})$ represent the elevation and azimuth angles, respectively. The orientation of a parabolic antenna can be expressed as follows:
$\mathbf{k} = \left[\cos\beta\cos\alpha,\ \cos\beta\sin\alpha,\ \sin\beta\right]^{T}$
where $\mathbf{k}$ is usually called the normal vector and $\alpha \in [-180^{\circ}, 180^{\circ})$ and $\beta \in [-90^{\circ}, 90^{\circ})$ represent the yaw angle and the pitch angle, respectively. The R-axis and the D-axis of the ISAR plane are determined by the LOS vector and the derivative of the LOS vector, respectively [27]. Since the projection relationship is not the key part of this article, we omit its details. It is important to recognize that all these angles can vary arbitrarily, leading to a complex problem. In practice, it is more common to focus on a single rotational axis in order to achieve the highest possible imaging quality. After further analysis, the relative elevation and azimuth angles of the target with respect to the radar LOS are denoted, respectively, as
$\vartheta = \theta - \beta$
$\psi = \varphi - \alpha$
For the sake of analysis convenience, in the imaging setup of this paper, it is assumed that the elevation angle ϑ remains constant and only the rotation accumulation of the azimuth angle ψ is considered to change. Without loss of generality, let α = 0 , namely, the relative azimuth angle ψ is equal to the φ angle.
We assume that the 2D ISAR imaging plane of the space target is composed of the R-axis and the D-axis, where the R-axis of the imaging plane is determined by the LOS vector, while the D-axis is defined based on the differentiation of the LOS vector. Given a point M = [ u , v , w ] T , which represents a 3D vector, its corresponding projection onto the 2D R–D imaging plane is referred to as N = [ r , d ] T . This projection relationship can be expressed as:
N = P · M
where P is the projection matrix, represented as
$P = \begin{bmatrix} \cos\theta\cos\varphi & \cos\theta\sin\varphi & \sin\theta \\ P_{21} & P_{22} & \cos\theta\cdot\dot{\theta} \end{bmatrix}$
and $P_{21}$ and $P_{22}$ are denoted as
$P_{21} = -\sin\theta\cos\varphi\cdot\dot{\theta} - \cos\theta\sin\varphi\cdot\dot{\varphi}, \qquad P_{22} = -\sin\theta\sin\varphi\cdot\dot{\theta} + \cos\theta\cos\varphi\cdot\dot{\varphi}$
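As a minimal numerical sketch, the projection matrix $P$ above can be assembled and applied to a 3D scatterer in Python. All angle values, angle rates, and the scatterer position below are hypothetical illustrations, not values from the paper:

```python
import numpy as np

def projection_matrix(theta, phi, theta_dot, phi_dot):
    """Build the 2x3 matrix P that projects a 3D point onto the R-D plane.

    Row 1 is the LOS direction; row 2 is its time derivative, so the second
    coordinate carries the Doppler (cross-range) information.
    """
    p11 = np.cos(theta) * np.cos(phi)
    p12 = np.cos(theta) * np.sin(phi)
    p13 = np.sin(theta)
    p21 = -np.sin(theta) * np.cos(phi) * theta_dot - np.cos(theta) * np.sin(phi) * phi_dot
    p22 = -np.sin(theta) * np.sin(phi) * theta_dot + np.cos(theta) * np.cos(phi) * phi_dot
    p23 = np.cos(theta) * theta_dot
    return np.array([[p11, p12, p13],
                     [p21, p22, p23]])

theta, phi = np.deg2rad(10.0), np.deg2rad(46.0)  # hypothetical LOS angles
theta_dot, phi_dot = 0.0, 0.01                   # hypothetical angle rates (rad/s)
P = projection_matrix(theta, phi, theta_dot, phi_dot)

M = np.array([1.0, 2.0, 0.5])                    # example scatterer position (m)
N = P @ M                                        # its 2D range-Doppler projection
```

With a fixed elevation angle ($\dot{\theta} = 0$), the first component of $N$ is simply the dot product of the LOS vector with $M$, i.e., the scatterer's range coordinate.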
where $\dot{\theta}$ and $\dot{\varphi}$ represent the instantaneous angular velocities of the elevation and azimuth angles, respectively. Additionally, if the scatter point $M$ is rotated in the $O_{Target}UVW$ coordinate system by angles $[\theta_{\mu}, \theta_{\nu}, \theta_{\omega}]^{T}$, the rotated point $\hat{M}$ is
$\hat{M} = R_W \cdot R_V \cdot R_U \cdot M$
where $R_U$, $R_V$, and $R_W$ are the rotation matrices about the $U$, $V$, and $W$ axes, respectively. Specifically,
$R_U = \begin{bmatrix} 1 & 0 & 0 \\ 0 & \cos\theta_{\mu} & -\sin\theta_{\mu} \\ 0 & \sin\theta_{\mu} & \cos\theta_{\mu} \end{bmatrix}$
$R_V = \begin{bmatrix} \cos\theta_{\nu} & 0 & \sin\theta_{\nu} \\ 0 & 1 & 0 \\ -\sin\theta_{\nu} & 0 & \cos\theta_{\nu} \end{bmatrix}$
$R_W = \begin{bmatrix} \cos\theta_{\omega} & -\sin\theta_{\omega} & 0 \\ \sin\theta_{\omega} & \cos\theta_{\omega} & 0 \\ 0 & 0 & 1 \end{bmatrix}.$
In this scenario, the projection relationship is
$N = P \cdot R_W \cdot R_V \cdot R_U \cdot M = P' \cdot M$
where $P' = P \cdot R_W \cdot R_V \cdot R_U$.
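The three axis rotations above compose into a single rotation before projection. A small sketch, using illustrative angles chosen for this example only, verifies the basic property that the composed matrix is a proper rotation (orthonormal, determinant one):

```python
import numpy as np

# Rotation matrices about the U, V, and W axes of the target coordinate system.
def rot_u(a):
    return np.array([[1, 0, 0],
                     [0, np.cos(a), -np.sin(a)],
                     [0, np.sin(a),  np.cos(a)]])

def rot_v(a):
    return np.array([[ np.cos(a), 0, np.sin(a)],
                     [0, 1, 0],
                     [-np.sin(a), 0, np.cos(a)]])

def rot_w(a):
    return np.array([[np.cos(a), -np.sin(a), 0],
                     [np.sin(a),  np.cos(a), 0],
                     [0, 0, 1]])

# Hypothetical rotation angles (radians), not values from the paper.
mu, nu, om = np.deg2rad([5.0, -3.0, 12.0])
R = rot_w(om) @ rot_v(nu) @ rot_u(mu)  # combined rotation applied to M

# A rotation preserves lengths, so R must be orthonormal:
assert np.allclose(R.T @ R, np.eye(3))
```

Because $R$ is orthonormal, projecting the rotated scatterer with $P$ is equivalent to projecting the original scatterer with the modified matrix $P' = P \cdot R$.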
Moreover, in the geometric model of THz-ISAR imaging, the motion of the target relative to the radar can be independently decomposed into translational and rotational components. Taking into account the initial slant range and the instantaneous change in slant range due to translational motion, the instantaneous slant range R i ( t m ) of scatter point i can be expressed as:
$R_i(t_m) = r_0 + r(t_m) + x_i \sin(\omega t_m) + y_i \cos(\omega t_m)$
where $t_m$ denotes the slow time, $r_0$ represents the initial slant range, $r(t_m)$ signifies the slant range of the target’s rotation center as it varies with slow time, $x_i$ and $y_i$ are the coordinates of the $i$-th scatter point, and $\omega$ is the rotation speed of the target. It is assumed that the radar emits a linear frequency-modulated (LFM) signal:
$s(\hat{t};\ t_m) = \mathrm{rect}\left(\frac{\hat{t}}{T_p}\right) \cdot \exp\left(j 2\pi f_c t + j\pi \gamma_0 \hat{t}^2\right).$
Following the delay time
$\tau = \frac{2 R_i(t_m)}{c},$
the echo signal that has been acquired is
$s_i(\hat{t};\ t_m) = \mathrm{rect}\left(\frac{\hat{t} - 2R_i(t_m)/c}{T_p}\right) \cdot \exp\left(j 2\pi f_c \left(\hat{t} - \frac{2R_i(t_m)}{c}\right) + j\pi \gamma_0 \left(\hat{t} - \frac{2R_i(t_m)}{c}\right)^2\right)$
where T p denotes the pulse duration, f c stands for the carrier frequency, and γ 0 indicates the frequency deviation of the linear frequency-modulated (LFM) signal. We proceed to perform dechirping and pulse compression on the received echo signal:
$S(f_r;\ t_m) = \sum_{i=0}^{Q-1} \sigma_i \cdot \mathrm{sinc}\left(T_p\left(f_r + \frac{2\gamma_0 R_i(t_m)}{c}\right)\right) \cdot \exp\left(-j\frac{4\pi f_c}{c} R_i(t_m)\right)$
where Q denotes the number of scatter points and σ i denotes the radar cross section of i-th scatter point.
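The dechirp-and-compress step above can be illustrated numerically: after dechirping, each scatterer contributes a single beat tone whose frequency is proportional to its relative range, so an FFT yields the range profile. The carrier, bandwidth, pulse width, and scatterer positions below are hypothetical, not the paper's radar settings:

```python
import numpy as np

c = 3e8
fc = 220e9            # hypothetical THz carrier frequency (Hz)
B, Tp = 5e9, 100e-6   # hypothetical bandwidth (Hz) and pulse width (s)
gamma0 = B / Tp       # chirp rate of the LFM signal
fs = 2 * B            # fast-time sampling rate after dechirp (illustrative)
t = np.arange(-Tp / 2, Tp / 2, 1 / fs)

ranges = np.array([1.0, 2.5])   # scatterer ranges relative to the reference (m)
sigmas = np.array([1.0, 0.6])   # scattering amplitudes

# Dechirped echo: each scatterer is a tone at beat frequency -2*gamma0*R/c.
echo = sum(s * np.exp(-1j * 2 * np.pi * (2 * gamma0 * r / c) * t)
           for s, r in zip(sigmas, ranges))

# Pulse compression: FFT over fast time yields sinc peaks (the range profile).
profile = np.abs(np.fft.fftshift(np.fft.fft(echo)))
freqs = np.fft.fftshift(np.fft.fftfreq(len(t), 1 / fs))
est_range = -freqs[np.argmax(profile)] * c / (2 * gamma0)  # strongest scatterer
```

The strongest peak recovers the first scatterer's range to within the range resolution $c/(2B)$, consistent with the sinc-peak locations in the compressed signal above.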
Based on the imaging geometry above, the imaging characteristic of one ideal parabolic antenna is first analyzed.
The geometry of the paraboloid within the Cartesian coordinate system is depicted in Figure 3, and its equation could be presented in this manner:
$x = \frac{y^2 + z^2}{4K},$
where K represents the focal length. Given the rotational symmetry of the paraboloid, this study focuses on its projection onto the x–y plane, which forms a parabola. In Figure 3, the parabola is depicted as an orange curve. The diameter is represented by D, and φ denotes the angle between the electromagnetic wave transmitted by the radar and the y-axis. It is assumed that the scenario satisfies the far-field condition, so the electromagnetic wave emitted by the radar can be regarded as a plane wave. The focal point is denoted by G, and P is the specular reflection point on the parabola for the given incidence angle. ρ is the distance between G and P, and β is the angle between line segment GP and the y-axis. Since one of the parabola’s properties is that β = 2φ, the equation of the parabola can be formulated in polar coordinates as:
$\rho = \frac{2K}{1 + \cos(2\varphi)}.$
Figure 3 illustrates that the location of the specular reflection point moves along the parabola as the incidence angle of the wave varies, and the range of the angle $\varphi$ can be computed as follows:
$\varphi \in \left[-\frac{1}{2}\arctan\left(\frac{8DK}{16K^2 - D^2}\right),\ \frac{1}{2}\arctan\left(\frac{8DK}{16K^2 - D^2}\right)\right].$
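The polar equation and the specular-point interval can be checked with a short sketch. The dish parameters below match the example later in this section (radius 25 cm, focal length 25 cm); the geometry itself follows the formulas above:

```python
import numpy as np

K = 0.25   # focal length (m)
D = 0.50   # dish diameter (m), i.e., radius 25 cm

def rho(phi):
    """Focus-to-specular-point distance from the polar form of the parabola."""
    return 2 * K / (1 + np.cos(2 * phi))

# Half-width of the angular interval over which a specular point exists.
phi_max = 0.5 * np.arctan(8 * D * K / (16 * K**2 - D**2))

# Sanity check: at normal incidence (phi = 0) the specular point is the
# vertex of the parabola, whose distance from the focus is exactly K.
assert np.isclose(rho(0.0), K)
```

For these parameters the specular point exists over roughly $\pm 26.6^{\circ}$; outside this interval only the rim scattering (edge pair-points or ellipse arc) remains, which is exactly the regime analyzed next.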
Owing to this distinctive feature, the paraboloid is classified as a sliding scattering center [28]. The specular reflection point dominates the RCS, while the rim of the paraboloid also contributes to it [29]. Nonetheless, when the specular reflection point is present, this smaller contribution is often negligible. Therefore, in this case, the mirror-like scattering point is the characteristic signature of paraboloid imaging.
According to the analysis of the sliding scattering center of the parabolic antenna, the imaging characteristics of a parabolic antenna can be categorized into specular scattering points and non-specular scattering points. Furthermore, through observations from multiple simulation experiments, we have found that the non-specular scattering points can be divided into two main categories: edge pair-points and ellipse arcs. To illustrate the imaging characteristics of a parabolic antenna, we give an electromagnetic simulation example. The parameters of the parabolic antenna are as follows: radius = 25 cm, focal depth = 25 cm, edge width = 1 cm, and relative elevation angle $\vartheta = 10^{\circ}$. Three typical observation apertures, AB, CD, and EF, are selected, and their imaging results are shown in Figure 4c–e, respectively. The images are presented on a dB magnitude scale. The imaging result under aperture AB has one specular reflection point, whereas those of apertures EF and CD contain scattering points near the edge of the paraboloid, which is consistent with the above theoretical analysis. Furthermore, it is observed in Figure 4a,b that the amplitude discrepancies at the edge points change substantially when the observation aperture is no longer aligned with the specular direction. This implies that the amplitude of the less intense scattered point could fall outside the dynamic range, resulting in it being obscured.
Therefore, the imaging properties of the satellite are analyzed by forward modeling and electromagnetic computations. For the forward modeling, the satellite’s Computer-Aided Design (CAD) model and its geometry and size are depicted in Figure 5. In particular, the satellite model adopted in this paper has a total length of 8.6 m, with each side of the sail panel measuring 3.3 m in length and 0.8 m in width and the diameter of the parabolic antenna being 1.1 m, as illustrated in Figure 5. Based on the material composition and shell curvatures listed in Table 1, five primary scattering elements are identified: the parabolic antenna, solar panels, main body, propeller, and lens, all of which are clearly discernible in the imaging outputs. In addition, through analysis, we believe that the parabolic antenna, solar panels, and main body are the primary components of the satellite and are mainly characterized by single reflections, with multiple scattering occurring in a few cases. Additionally, although the lens and propellers are cavity structures that lead to multiple scattering, their component sizes are relatively small, they occupy a minor proportion of the satellite, and they do not interfere with the targets we aim to detect. Considering the above analysis and the need to conserve computational resources, we have limited our simulation calculations to single scattering in this study.
For the CAD model (in the second row) and the corresponding imaging characteristics (in the first row), we give three typical observation apertures, as shown in Figure 6. All observation apertures have the same width, $3^{\circ}$, and the observation center angles $\varphi$ are $10^{\circ}$, $46^{\circ}$, and $70^{\circ}$. From left to right, they show the three typical imaging characteristics, namely, specular point, edge pair-points, and ellipse arc, respectively. It can be found that the main body and solar panels are clearly discernible in all three observation apertures. In the case of the specular reflection point, as shown in Figure 6a, the parabolic antenna itself is not visible in the image; the specular scattering point occupies the majority of the energy, is located to the left of the satellite, and aligns with the component prior analysis. In the case of the non-specular direction, there are two statuses of the parabolic antenna, corresponding to Figure 6b,c. It is easy to see that the parabolic antenna can be distinctly found in the image when it appears in the shape of an ellipse, as shown in Figure 6c, and the parabolic antenna is discretized into two endpoints when there are other strong scattering points in the observation aperture, as shown in Figure 6b. Readers can verify the position of the paraboloid by comparing the optical image at the corresponding viewing angle.

3. Improved Network for Component Detection

This study proposes a research framework for the automatic detection of satellite parabolic antenna components based on Inverse Synthetic Aperture Radar (ISAR) and deep learning. This section mainly develops a parabolic antenna component detection model based on the modified YOLOv8 algorithm, according to the features of the parabolic antenna analyzed in the previous section under terahertz radar imaging. Therefore, this section mainly introduces the basic structural framework of the proposed network, the relevant evaluation indicators, and the other algorithms involved in the comparison.
  • A. Structure of Improved-YOLOv8 model
The most recent addition to the YOLO family of target detection algorithms, YOLOv8, boasts improvements across several key components over its earlier iterations. YOLOv8 incorporates an anchor mechanism inspired by YOLOX [30], which offers benefits in dealing with objects of elongated, unconventional shapes. Additionally, for the purpose of loss calculation in positive sample matching, YOLOv8 utilizes an adaptive multi-positive sample matching technique.
A variety of internal and external factors often result in size variations for satellites and their components, even in regions that appear identical. This diversity affects the size of the target detection area for the parabolic antenna. Yet, the target detection network generates detection boxes based on the fixed dimensions of a predefined box, which lacks flexibility. Further analysis of the samples in the dataset reveals that most samples have clear features and can be readily identified as simple samples. In contrast, a smaller number of samples have indistinct features, which raises the risk of misclassification, and are labeled as difficult samples. This creates a notable imbalance between simple and difficult samples. Nonetheless, it is essential for the model to be trained to recognize and differentiate these difficult samples so that the network’s detection capabilities can be improved.
Furthermore, the research introduces a refined algorithm derived from YOLOv8, designed for the automated detection of targets across multiple scales. Initially, a Dyhead [31] module, which employs an attention mechanism, is integrated into the algorithm’s head component to enable flexible detection of parabolic antenna targets that vary in size. Then, within the Intersection over Union (IoU) loss component, the standard Complete Intersection over Union (CIoU) from the YOLOv8 model is replaced with WIoU [32] to tackle the issue of sample imbalance between those that are easy to detect and those that are challenging. Figure 7 illustrates the primary architecture of the Enhanced-YOLOv8 algorithm.
(1) Dyhead: The attention mechanism is a sophisticated processing strategy that prioritizes relevant information while excluding irrelevant, extraneous data. It adeptly captures the relative importance of various semantic levels and selectively boosts features in alignment with the dimensions of an individual object [31]. The Dyhead module integrates the target detection head and the attention mechanism together. This methodology seamlessly converges several self-attention components across different feature layers, spatial positions, and output channels, facilitating the concurrent acquisition of scale, spatial, and task awareness within a cohesive target detection architecture. This integration remarkably promotes the representational power of the detection head without augmenting the computational complexity. As illustrated in Figure 8, the detection head receives a scaled feature pyramid $F$, represented as a tensor $F \in \mathbb{R}^{L \times S \times C}$, where $L$ is the number of layers within the pyramid, $S = H \times W$ with $H$ and $W$ the height and width, and $C$ denotes the number of channels of the median-level feature. The general procedure for applying self-attention can be expressed as:
$W(F) = \pi(F) \cdot F$
where π ( · ) denotes an attention mechanism. A straightforward approach to addressing this attention mechanism involves the use of fully connected neural networks. However, training the attention function across all dimensions is computationally intensive and, in practice, not feasible due to the tensor’s high dimensionality. Alternatively, we decompose the attention function into three consecutive attentions, each dedicated to concentrating on a single perspective:
$W(F) = \pi_C\Big(\pi_S\big(\pi_L(F) \cdot F\big) \cdot F\Big) \cdot F$
where $\pi_L(\cdot)$, $\pi_S(\cdot)$, and $\pi_C(\cdot)$ represent three distinct attention functions, each applied to dimensions $L$, $S$, and $C$, respectively.
This module is equipped with scale-aware attention functions across the horizontal axis, and this module seamlessly integrates a range of different scales and thus effectively adjusts to the importance of various feature intensities. The scale-aware attention functions can be expressed as:
$\pi_L(F) \cdot F = \sigma\left(f\left(\frac{1}{SC}\sum_{S,C} F\right)\right) \cdot F$
where $f(\cdot)$ denotes a linear function approximated by a $1 \times 1$ convolutional layer and $\sigma(x) = \max(0, \min(1, (x+1)/2))$ is the hard-sigmoid function. The hard sigmoid provides a bounded, computationally cheap normalization of the attention weights, which is essential for the stable operation of the attention mechanism in neural networks.
The spatially aware attention module operates within the spatial domain (specifically, along height and width), strategically pooling features from the same spatial locations across layers to yield more distinct feature representations, and can be expressed by
$\pi_S(F) \cdot F = \frac{1}{L}\sum_{l=1}^{L}\sum_{k=1}^{K} w_{l,k} \cdot F(l;\ p_k + \Delta p_k;\ c) \cdot \Delta m_k,$
where K represents the count of sparsely sampled positions, with p k + Δ p k indicating a relocated point achieved through the application of a spatial shift Δ p k that the model has learned, which is designed to concentrate on a region of discriminative interest. Additionally, Δ m k denotes a significance factor that the model has learned at the position p k . Both of these elements are derived from the input features extracted at the median level of the feature set F .
The task-specific attention module operates at the channel level, enabling the dynamic engagement or disengagement of operational channels to accommodate and support a variety of tasks, and can be expressed as
$\pi_C(F) \cdot F = \max\left(\alpha^1(F) \cdot F_c + \beta^1(F),\ \alpha^2(F) \cdot F_c + \beta^2(F)\right)$
where $F_c$ denotes the feature tensor extracted from the $c$-th channel and the hyperparameter vector $[\alpha^1, \alpha^2, \beta^1, \beta^2]^T = \theta(\cdot)$ is produced by a specialized function $\theta(\cdot)$ that the model learns in order to adjust activation thresholds. The function $\theta(\cdot)$ is implemented in a manner akin to that described in reference [33]: first, a global average pooling operation across the $L \times S$ dimensions decreases the feature dimensionality; this is followed by two fully connected neural layers and a normalization layer, and the output is then normalized using a shifted sigmoid activation function, constraining it to the range $[-1, 1]$.
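The three-way decomposition above can be sketched as a toy NumPy computation. This is not the trained Dyhead module: the learned functions ($f$, the deformable offsets, and $\theta$) are replaced with fixed illustrative weights, so only the sequencing of scale-, spatial-, and task-aware attention is demonstrated:

```python
import numpy as np

def hard_sigmoid(x):
    """sigma(x) = max(0, min(1, (x + 1) / 2)), as used by the scale attention."""
    return np.maximum(0.0, np.minimum(1.0, (x + 1.0) / 2.0))

L_dim, S, C = 3, 16, 8  # levels, spatial positions (H*W), channels (toy sizes)
F = np.random.default_rng(0).normal(size=(L_dim, S, C))

# Scale-aware attention: one weight per pyramid level from mean-pooled features
# (the 1x1-conv linear function f is omitted in this sketch).
pi_L = hard_sigmoid(F.mean(axis=(1, 2)))   # shape (L,)
F1 = pi_L[:, None, None] * F

# Spatial-aware attention, heavily simplified: a fixed per-position importance
# standing in for the learned deformable sampling and weights.
w = hard_sigmoid(F1.mean(axis=(0, 2)))     # shape (S,)
F2 = w[None, :, None] * F1

# Task-aware attention: channel-wise max of two affine activations with
# illustrative (alpha, beta) pairs standing in for theta(.).
a1, b1, a2, b2 = 1.0, 0.0, 0.5, 0.1
F3 = np.maximum(a1 * F2 + b1, a2 * F2 + b2)
```

Each stage reweights the feature tensor along exactly one axis, which is why the decomposition avoids the cost of a full attention over all $L \times S \times C$ dimensions.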
(2) WIoU: In YOLOv8, the loss function is constructed from three distinct parts, which are divided into two unique streams, namely, classification and regression. For the former aspect, a Binary Cross-Entropy (BCE) loss is employed. For the latter aspect, the regression is handled using the CIoU bounding box loss and the Distribution Focus (DF) loss. A composite loss function is formulated by assigning appropriate weights to these three individual loss components, and it is represented as follows:
$\mathrm{Loss}_{total} = u \cdot \mathrm{Loss}_{BCE} + v \cdot \mathrm{Loss}_{DF} + w \cdot \mathrm{Loss}_{CIoU}$
where $u$, $v$, and $w$ serve as weighting parameters.
The YOLOv8 implementation of CIoU takes into account the overlapping region, the separation between the central points, and the shape ratio within the bounding box regression procedure. However, its definition of the aspect ratio as a relative measure is ambiguous, and it fails to consider the balance between difficult and simple samples. WIoU addresses this by implementing a dynamic, non-monotonic focusing mechanism that uses “outlierness” rather than IoU as the criterion for evaluating anchor box quality, and it also offers a well-considered approach to gradient distribution. As a result, WIoU is able to prioritize anchor boxes of average quality, thereby tackling the imbalance between challenging and easy samples and enhancing the detector’s overall effectiveness [32].
WIoU has three successive versions, v1, v2, and v3, defined by the following equations:
$\mathcal{L}_{\mathrm{WIoUv1}} = \mathcal{R}_{\mathrm{WIoU}} \times \mathcal{L}_{\mathrm{IoU}}$

$\mathcal{R}_{\mathrm{WIoU}} = \exp\left(\dfrac{(x - x_{gt})^2 + (y - y_{gt})^2}{\left(W_g^2 + H_g^2\right)^{*}}\right)$

$\mathcal{L}_{\mathrm{WIoUv2}} = r \times \mathcal{L}_{\mathrm{WIoUv1}}, \quad r = \left(\dfrac{\mathcal{L}_{\mathrm{IoU}}}{\overline{\mathcal{L}}_{\mathrm{IoU}}}\right)^{\gamma}, \quad \gamma > 0$

$\mathcal{L}_{\mathrm{WIoUv3}} = r \times \mathcal{L}_{\mathrm{WIoUv1}}, \quad r = \dfrac{\beta}{\delta\,\alpha^{\beta - \delta}}$

$\beta = \dfrac{\mathcal{L}_{\mathrm{IoU}}}{\overline{\mathcal{L}}_{\mathrm{IoU}}} \in [0, +\infty)$
where $(x, y)$ and $(x_{gt}, y_{gt})$ stand for the centres of the anchor box and the ground-truth box, respectively, and $W_g$ and $H_g$ are the width and height of the smallest enclosing box. The superscript asterisk (*) indicates that $W_g$ and $H_g$ are detached from the computational graph. $\beta$ measures the degree of outlierness and is modulated by the hyperparameters $\alpha$ and $\delta$; r denotes the gradient gain [32].
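A minimal sketch of the WIoU v3 loss for a single anchor/ground-truth pair, following the equations above. The helper names and the use of plain tuples for boxes are illustrative, and the running mean of $\mathcal{L}_{\mathrm{IoU}}$ is passed in directly rather than tracked with momentum as in [32]:

```python
import math

def iou(box_a, box_b):
    """Standard IoU for axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(box_a[0], box_b[0]), max(box_a[1], box_b[1])
    ix2, iy2 = min(box_a[2], box_b[2]), min(box_a[3], box_b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area = lambda b: (b[2] - b[0]) * (b[3] - b[1])
    return inter / (area(box_a) + area(box_b) - inter)

def wiou_v3(anchor, gt, mean_l_iou, alpha=1.9, delta=3.0):
    """Sketch of the WIoU v3 loss for one anchor/ground-truth pair.
    mean_l_iou is the running mean of L_IoU over the batch; alpha and
    delta follow the classic values quoted in the paper."""
    l_iou = 1.0 - iou(anchor, gt)
    # Box centres and smallest enclosing box (detached in the real loss).
    cx_a, cy_a = (anchor[0] + anchor[2]) / 2, (anchor[1] + anchor[3]) / 2
    cx_g, cy_g = (gt[0] + gt[2]) / 2, (gt[1] + gt[3]) / 2
    wg = max(anchor[2], gt[2]) - min(anchor[0], gt[0])
    hg = max(anchor[3], gt[3]) - min(anchor[1], gt[1])
    r_wiou = math.exp(((cx_a - cx_g) ** 2 + (cy_a - cy_g) ** 2)
                      / (wg ** 2 + hg ** 2))
    l_v1 = r_wiou * l_iou
    beta = l_iou / mean_l_iou                 # "outlierness" of this anchor
    r = beta / (delta * alpha ** (beta - delta))
    return r * l_v1
```

A perfectly aligned pair yields zero loss, while anchors of average quality (moderate outlierness) receive the largest gradient gain.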
  • B. Baseline Methods
The cutting-edge approaches in deep learning for target detection can be sorted into two main kinds: those that rely on region proposals and those that use regression. Region proposal-based methods, also referred to as two-stage detection techniques, split the detection process into two stages: the initial extraction of candidate regions through algorithmic means, followed by refinement of these proposed bounding boxes. As a result, these methods tend to offer superior detection precision. Notable examples of such methods are the region-based Convolutional Neural Network (R-CNN) [34], Fast R-CNN [35], and Faster R-CNN [36]. In contrast, regression-based methods, also known as one-stage detection techniques, dispense with pre-selected regions: they identify the target's category and pinpoint its location in a single step, leading to rapid inference times. The YOLO series [37] and RetinaNet [38] are among the most prominent examples. This research aims at the swift and automated detection of parabolic antennas in satellite ISAR images. While two-stage methods are accurate, their intricate training processes and substantial computational demands pose challenges to real-time detection. Consequently, five representative one-stage detection methods were chosen for this investigation, YOLOv3 [39], YOLOv5 [40], YOLOv6 [41], YOLOv8 [22], and YOLOv11 [42], with the goal of enhancing the experimental workflow's efficiency.
The YOLO (You Only Look Once) series has emerged as a prominent family of object detection algorithms, with each iteration bringing significant advancements. YOLOv3, released in 2018, marked a milestone with its end-to-end architecture and ability to detect multiple objects simultaneously. Subsequently, YOLOv5, introduced in 2020, offered improved speed and accuracy thanks to its more efficient design and multi-scale training approach. Building upon the success of YOLOv5, YOLOv6 introduced a novel backbone structure and data-efficient training strategies in 2022, aiming to find a middle ground between speed and precision. YOLOv8 then continued to push the boundaries with further enhancements in detection performance and efficiency, solidifying YOLO's position as a leading family of algorithms in computer vision. Finally, YOLOv11 is a cutting-edge, state-of-the-art object detection model, built upon previous versions of YOLO and incorporating new features and improvements to further enhance performance and flexibility. Additionally, this study opted for the lighter versions of the YOLO series models: YOLOv8 and YOLOv11 were chosen in their "n" variants, YOLOv5 and YOLOv6 in their "s" variants, and YOLOv3 in its "tiny" variant.

4. Numerical and Anechoic Chamber Experiments

This section begins with an overview of the datasets and the specifics of the training process. Subsequently, a comparative evaluation of the detection performance of the various networks is presented, followed by ablation studies focused on quantitative assessments of the network components. Finally, anechoic chamber experiments are performed to demonstrate the superior robustness of the proposed method compared with the other networks.
  • A. Dataset construction and experimental details
The method is verified with electromagnetic calculation data and anechoic chamber measurements. The simulation data are constructed with electromagnetic simulation software and a satellite CAD model. The imaging parameters are configured as follows: the center frequency is 220 GHz, the bandwidth is 20 GHz, and the number of frequency sampling points is 1200; the azimuth aperture is 3° with 500 sampling points. The depression angle is set to 45°. Taking into account the imaging resemblance between neighbouring observation apertures, a set of 617 original images is acquired by sampling at a 0.58° interval over the full 360°. Figure 9 demonstrates representative imaging outcomes across a range of observation apertures.
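For orientation, applying the standard ISAR resolution formulas (our own back-of-the-envelope calculation, not quoted from the paper) to the stated parameters gives a range resolution of roughly 7.5 mm and a cross-range resolution of about 13 mm:

```python
# Resolutions implied by the stated imaging parameters, using the
# standard ISAR formulas: range resolution c/(2B) and cross-range
# resolution lambda/(2 * delta_theta).
import math

C = 3e8                      # speed of light, m/s
f_c = 220e9                  # centre frequency, Hz
bandwidth = 20e9             # bandwidth, Hz
aperture = math.radians(3)   # 3-degree azimuth aperture, in radians

range_res = C / (2 * bandwidth)               # ~7.5 mm
cross_range_res = (C / f_c) / (2 * aperture)  # ~13 mm
print(f"range resolution: {range_res * 1e3:.1f} mm")
print(f"cross-range resolution: {cross_range_res * 1e3:.1f} mm")
```

Millimetre-level resolution in both dimensions is what makes the fine antenna structures (arc, edge points, specular point) resolvable in the THz regime.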
Next, the targets are labeled for detection based on the CPICs. Provided that the main body of the satellite and the parabolic antenna are discernible in an image, they are labeled. Furthermore, the difference between the bounding boxes of the satellite and the antenna is illustrated in Figure 10. The bounding boxes of the satellite are significantly larger than the three kinds of bounding boxes for the parabolic antenna, particularly the bounding box of the specular point, which tests the multi-scale detection capability of the object detection network. Additionally, the distribution of the three antenna bounding-box types is uneven: there are approximately 400 instances of the ellipse arc, 100 of the edge pair-points, and around 50 of the specular point. The ellipse arc and edge pair-points together account for nearly 90% of the antenna labels and form the main part of the antenna.
The mean Average Precision (mAP) serves as a crucial measurement for assessing deep learning models, since it jointly captures the precision and recall behaviour of the trained detectors. It is also extensively used in object detection research, offering a standardized and reliable way to compare how effectively different models identify and recall objects. We choose mAP50 and mAP50-95 to evaluate the training results: mAP50 measures the average precision at an IoU (Intersection over Union) threshold of 0.5, while mAP50-95 averages the precision over IoU thresholds from 0.5 to 0.95. Figure 11 shows how mAP evolves during training for the different networks. The left side of Figure 11 shows that YOLOv3 trains poorly, terminating before its mAP50 reaches 0.7; YOLOv8 reaches an mAP50 of around 0.8, slightly better than YOLOv5 and YOLOv6. The proposed method exceeds an mAP50 of 0.9, maintaining a high training level. The right side of Figure 11 gives the corresponding mAP50-95 comparison: apart from the overall values being lower by about 0.3, the trend remains unchanged. To further highlight the recognition superiority of the proposed method for parabolic antenna components, the next section focuses on the performance indicators of the different algorithms for parabolic antenna detection. Moreover, the relevant training parameters are epochs = 300 and patience = 50, meaning training stops when 300 epochs are reached or when there is no significant improvement over 50 consecutive epochs; this is why the number of completed epochs differs among the algorithms in Figure 11.
Taking into account the visual proximity among neighbouring observation apertures, a selection of 620 pristine images is compiled by spacing them at intervals of 0.58 degrees within a full 360-degree range, as suggested in [26]. The dataset is segmented into training, validation, and testing subsets with allocations of 70%, 10%, and 20%, respectively. To enrich the variety of the training images, conventional data augmentation techniques such as horizontal mirroring, brightness adjustment, and the introduction of random noise are employed. To improve the model's generalization ability, mosaic enhancement is applied to each image; to reduce the computational load, images are downscaled to 0.5 times their original size; brightness is adjusted to about 0.4 of the original; and each input image is horizontally flipped with a probability of 0.5 during random training-sample generation. All experiments were conducted on a system featuring an Intel(R) Core(TM) i9-13900HX processor running at 2.20 GHz and an NVIDIA GeForce RTX 4060 graphics card. The GPU-enabled version of the PyTorch deep learning library was used as the primary framework, with CUDA 11.8 providing GPU parallelism. The SGD optimizer was chosen for updating and optimizing the network parameters, with an initial learning rate of 1 × 10−3. The mini-batch size was 32, the total number of training epochs was 300, the input image size was 640, and the number of worker threads for data loading was 8.
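The augmentation steps described above can be sketched as follows. This is a simplified NumPy stand-in for the actual pipeline, with illustrative parameter names and a nearest-neighbour shortcut for resizing:

```python
import numpy as np

def augment(image, rng, flip_p=0.5, brightness=0.4, scale=0.5):
    """Sketch of the augmentation pipeline: downscale to half size,
    scale brightness to ~0.4 of the original, flip horizontally with
    probability 0.5, and add random noise. Parameter names are ours."""
    # Downscale by striding over pixels (nearest-neighbour shortcut).
    step = int(1 / scale)
    out = image[::step, ::step].astype(np.float32)
    out = np.clip(out * brightness, 0, 255)        # brightness adjustment
    if rng.random() < flip_p:
        out = out[:, ::-1]                         # horizontal flip
    # Additive random noise, as used to enrich the training set.
    out = np.clip(out + rng.normal(0, 2.0, out.shape), 0, 255)
    return out
```

In practice these transforms (plus mosaic enhancement) would be applied on the fly by the training framework's data loader rather than by hand.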
  • B. Accuracy Verification
To assess the effectiveness of deep learning models, the primary measure employed is the mean Average Precision (mAP). This measure is commonly applied in object detection research to capture the precision and recall behaviour of different trained models [43]. The AP is computed with the interpolation-based AP evaluation technique, which approximates the area under the precision–recall curve. The calculation formula is:
$\mathrm{AP} = \dfrac{1}{n} \sum_{r \in \left\{\frac{1}{n}, \frac{2}{n}, \ldots, \frac{n-1}{n}, 1\right\}} P(r)$

$\mathrm{mAP} = \dfrac{1}{N} \sum_{k=1}^{N} \mathrm{AP}_k$
where n denotes the number of sampled recall levels, $P(r)$ signifies the precision at the recall level r, N refers to the total number of categories, and $\mathrm{AP}_k$ denotes the AP for category k. mAP50 refers to the mAP at an IoU threshold of 50%, and mAP is short for mAP50 in this paper where no ambiguity arises. Furthermore, the auxiliary assessment metrics precision, recall, and the F1 measure are defined as follows:
$\mathrm{Precision} = \dfrac{\mathrm{TP}}{\mathrm{TP} + \mathrm{FP}}$

$\mathrm{Recall} = \dfrac{\mathrm{TP}}{\mathrm{TP} + \mathrm{FN}}$

$F_1 = \dfrac{2 \cdot \mathrm{Precision} \cdot \mathrm{Recall}}{\mathrm{Precision} + \mathrm{Recall}}$
where TP stands for true positives, indicating the case where an object is accurately identified as a positive example. FP denotes false positives, involving the erroneous identification of an object that is not actually positive. FN refers to false negatives, where a legitimate object is mistakenly classified as negative.
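The metrics above translate directly into code; a minimal sketch (the function names are our own, and the AP helper takes a plain callable for $P(r)$ purely for illustration):

```python
def detection_metrics(tp, fp, fn):
    """Precision, recall and F1 from true/false positives and false
    negatives, exactly as defined above."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1

def average_precision(p_of_r, n):
    """Interpolated AP: average the precision P(r) sampled at the n
    recall levels 1/n, 2/n, ..., 1."""
    return sum(p_of_r(k / n) for k in range(1, n + 1)) / n
```

For example, `detection_metrics(8, 2, 2)` gives precision, recall, and F1 of 0.8 each.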
  • C. Comparisons with other algorithms
To evaluate the detection performance fairly, the following representative algorithms are chosen for comparison: YOLOv3, YOLOv5, YOLOv6, YOLOv8 and YOLOv11.
Figure 12 presents the detection results of the six detection networks. The images in each row illustrate the detection results of these networks for one observation aperture. The boxes marked in red, pink, orange, and yellow indicate the identified areas of the satellite, the ellipse arc, the edge pair-points, and the specular point, respectively. Our network successfully detects the parabolic antenna and satellite in all five samples, while the others show varying degrees of shortcomings. YOLOv3, YOLOv5, YOLOv6, YOLOv8, and YOLOv11 all miss the component in row 5, which indicates our network's advantage in recognizing small-scale targets. Meanwhile, YOLOv3 misses a detection in row 3, and YOLOv5 yields numerous intersecting bounding boxes for the ellipse arc, suggesting that the network requires a more refined Non-Maximum Suppression (NMS) threshold.
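The NMS step mentioned above can be illustrated with a greedy sketch; the threshold value and the box representation here are illustrative:

```python
def nms(boxes, scores, iou_thresh=0.5):
    """Greedy non-maximum suppression: keep the highest-scoring box,
    drop any remaining box whose IoU with it exceeds the threshold,
    and repeat. Boxes are (x1, y1, x2, y2) tuples; returns kept indices."""
    def iou(a, b):
        ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
        ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
        inter = max(0, ix2 - ix1) * max(0, iy2 - iy1)
        area = lambda t: (t[2] - t[0]) * (t[3] - t[1])
        return inter / (area(a) + area(b) - inter)

    order = sorted(range(len(boxes)), key=lambda i: scores[i], reverse=True)
    keep = []
    while order:
        best = order.pop(0)
        keep.append(best)
        order = [i for i in order if iou(boxes[best], boxes[i]) <= iou_thresh]
    return keep
```

Lowering `iou_thresh` suppresses more of the mutually overlapping arc detections, at the risk of merging genuinely distinct components.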
Table 2, Table 3, Table 4 and Table 5 quantitatively present the performance metrics of the various networks for the satellite, the specular point, the edge pair-points, and the ellipse arc, respectively. From Table 2 we can tell that all networks detect the satellite accurately, demonstrating stability in large-target detection. Observing Table 3, it can be found that ours achieves the highest F1, R, mAP50, and mAP50-95, while the highest P is achieved by YOLOv5. In Table 4, the proposed algorithm achieves the top values for the P, F1, mAP50, and mAP50-95 metrics, with YOLOv8 and YOLOv5 holding the top position for the R metric. In Table 5, the proposed algorithm also exhibits the highest mAP50 value, while YOLOv3 and YOLOv6, respectively, claim the top spots for the P and R metrics and YOLOv5 holds the top spots for the F1 and mAP50-95 metrics.
To focus attention on the detection of the parabolic dish antenna, the mAP50 and mAP50-95 values from Table 3, Table 4 and Table 5 are aggregated into the histogram in Figure 13. The proposed approach achieves the best recognition results for the parabolic antenna.
To gain a more detailed assessment of the detection capabilities, the precision–recall (PR) curves of the ellipse arc, the edge pair-points, and the specular point are presented in Figure 14. For the ellipse arc, the proposed algorithm's advantage is not significant. However, for the specular point and the edge pair-points, our algorithm's curve lies above and to the right of all the other curves, verifying the stability of the detection.
To illustrate the improvement brought by the combined enhancement module introduced in this paper, namely, Dyhead + WIoU (DW), on the YOLOv8 model, and to further compare the impact of different versions of WIoU, we conducted ablation experiments on the Improved-YOLOv8 model. To conserve computational resources, only one Dyhead module was added. Additionally, there exist three variations of WIoU: v1, v2, and v3, among which WIoUv3 requires the determination of the hyperparameters α and δ [32], set to the classic values α = 1.9 and δ = 3. We report mAP as the index to quantitatively assess the performance of the proposed network. The findings are presented in Table 6.
Table 6 illustrates the changes in model accuracy after adding Dyhead and the different versions of WIoU. From these results, we can conclude that combining the two modules, i.e., Dyhead + WIoUv1, yields a significant improvement in model performance. As previously described, the Dyhead module helps the network adaptively detect parabolic-antenna features at various scales in ISAR images, while WIoU to some degree mitigates the imbalance between simple and complex samples.
  • D. Recognition Performance in Anechoic Chamber Measurement Data
Figure 15 illustrates the configuration of an anechoic chamber and presents a photograph of the satellite prototype. The radar transmits a linear frequency-modulated signal. Its start frequency and end frequency are 324 GHz and 344 GHz, respectively. This is a terahertz radar system with one transmitting antenna and four receiving antennas. For this experiment, only one of the receiving channels was used. In addition, a satellite model with a length of approximately 70 cm was mounted on an accurately calibrated turntable for capturing images through various observation apertures.
Figure 16 shows the recognition outcomes of the different algorithms on the four anechoic chamber measurement images. From the figure, we can see that YOLOv3 performs poorly, misrecognizing both the satellite and the parabolic antenna; YOLOv11 recognizes all the satellites but misses all the parabolic antennas; YOLOv5 misses two parabolic antennas and also produces duplicate antenna detections; YOLOv6 and YOLOv8 each miss three parabolic antennas, and the latter additionally produces a false detection, showing weak generalizability. The proposed algorithm recognizes both the satellites and the parabolic antennas without any omissions or misjudgments, which fully demonstrates that the added modules adaptively detect targets at various scales and alleviate the imbalance between different types of samples.
  • E. The limitations of the Improved-YOLOv8
Although the proposed algorithm performs well on the electromagnetic simulation dataset and can correctly identify some anechoic chamber data, exceeding most YOLO models and to some extent demonstrating its superiority, it has not been compared with other one-stage or two-stage models. In the design phase of the algorithm, we mainly focused on data processing under ideal conditions. While it can successfully identify some anechoic chamber measurement data, we have not yet conducted an in-depth analysis and optimization for the noise interference that may be encountered in practical applications. Moreover, this paper has analyzed the imaging characteristics of smooth parabolic antennas under non-polarized conditions and achieved good recognition results; however, other types of parabolic antennas, such as those equipped with feed sources or with grid-shaped reflectors, have not been explored, so generalizability to other antenna structures cannot be guaranteed. In addition, the Improved-YOLOv8 model proposed in this study mainly modifies the head, optimizing the final detection stage of the model, but leaves the backbone and neck unchanged, so the feature extraction and enhancement process has not been further optimized. In the future, we will therefore focus on enhancing the model's multi-scale feature extraction capabilities.

5. Conclusions

In summary, this article proposes an algorithm based on component prior knowledge and an improved version of YOLOv8, which achieves the identification of parabolic antennas on satellites. With reference to the specified imaging geometry and standard imaging techniques, the CPICs of the parabolic antenna in the THz band were analyzed, and the corresponding dataset was established. By effectively using the special CPICs, we transformed the component identification problem into three different types of object detection problems. Subsequently, by incorporating Dyhead and WIoU, the multi-scale target challenges and sample imbalance in satellite ISAR images were addressed. Trained on simulated data, the Improved-YOLOv8 model achieved 0.935 and 0.520 in mAP50 and mAP50-95, respectively, surpassing the baseline YOLOv8 model. Compared with the five other detection algorithms, YOLOv3, YOLOv5, YOLOv6, YOLOv8, and YOLOv11, our method improved mAP50 by about 0.33, 0.10, 0.21, 0.19, and 0.20, and mAP50-95 by about 0.16, 0.04, 0.02, 0.08, and 0.11, respectively. The effectiveness and reliability of the proposed method in identifying parabolic antennas were also verified on the anechoic chamber measurement data.
Recognizing the parabolic antenna, a major component of satellite communication systems in space, possesses immense application potential: it plays a significant role in ensuring the success of satellite missions and in maintaining satellite safety and serviceability. In the future, we will consider applying this method to other types of satellites, as well as to the detection and recognition of parabolic antennas under low signal-to-noise ratios. Improvements to the network backbone and different training strategies will also be taken into account.

Author Contributions

Theoretical study, experiments, and writing L.Y.; experimental environment and software W.L. and R.W.; review and editing, H.W., Y.Z. and B.D. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported by the National Natural Science Foundation of China under Grant 62301573.

Data Availability Statement

Not applicable.

Acknowledgments

The authors are deeply appreciative of the meticulous attention and constructive feedback provided by the editors and reviewers.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Kou, P.; Qiu, X.; Liu, Y.; Zhao, D.; Li, W.; Zhang, S. ISAR Image Segmentation for Space Target Based on Contrastive Learning and NL-Unet. IEEE Geosci. Remote. Sens. Lett. 2023, 20, 3506105. [Google Scholar] [CrossRef]
  2. Li, C.; Zhu, W.; Qu, W.; Ma, F.; Wang, R. Component recognition of ISAR targets via multimodal feature fusion. Chin. J. Aeronaut. 2025, 38, 103122. [Google Scholar] [CrossRef]
  3. Sheng, W.; Long, Y.; Zhou, Y. Analysis of Target Location Accuracy in Space-Based Optical-Sensor Network. Acta Opt. Sin. 2011, 31, 228001. [Google Scholar] [CrossRef]
  4. Bin, Z. Simulation study of space multi-target imaging for space-based opto-electronic telescope. Opt. Tech. 2007, 55, 368–375. [Google Scholar]
  5. Avent, R.; Shelton, J.; Brown, P. The ALCOR C-band imaging radar. IEEE Antennas Propag. Mag. 1996, 38, 16–27. [Google Scholar] [CrossRef]
  6. Zandonella, C. Terahertz imaging: T-ray specs. Nature 2003, 424, 721–722. [Google Scholar] [CrossRef]
  7. Zhang, B.; Pi, Y.; Li, J. Terahertz Imaging Radar With Inverse Aperture Synthesis Techniques: System Structure, Signal Processing, and Experiment Results. IEEE Sens. J. 2015, 15, 290–299. [Google Scholar] [CrossRef]
  8. Zhang, X.; Liang, J.; Wang, N.; Chang, T.; Guo, Q.; Cui, H.L. Broadband Millimeter-Wave Imaging Radar-Based 3-D Holographic Reconstruction for Nondestructive Testing. IEEE Trans. Microw. Theory Tech. 2020, 68, 1074–1085. [Google Scholar] [CrossRef]
  9. Zhang, X.; Chang, T.; Wang, Z.; Cui, H.L. Three-Dimensional Terahertz Continuous Wave Imaging Radar for Nondestructive Testing. IEEE Access 2020, 8, 144259–144276. [Google Scholar] [CrossRef]
  10. Song, H.J.; Nagatsuma, T. Present and Future of Terahertz Communications. IEEE Trans. Terahertz Sci. Technol. 2011, 1, 256–263. [Google Scholar] [CrossRef]
  11. Cimmino, S.; Franceschetti, G.; Iodice, A.; Riccio, D.; Ruello, G. Efficient spotlight SAR raw signal simulation of extended scenes. IEEE Trans. Geosci. Remote Sens. 2003, 41, 2329–2337. [Google Scholar]
  12. Gao, J.; Wang, R.J.; Deng, B.; Qin, Y.; Li, X. Electromagnetic Scattering Characteristics of Rough PEC Targets in the Terahertz Regime. IEEE Antennas Wirel. Propag. Lett. 2017, 16, 975–978. [Google Scholar]
  13. Zhang, S.; Liu, Y.; Li, X.; Hu, D. Enhancing ISAR Image Efficiently via Convolutional Reweighted l1 Minimization. IEEE Trans. Image Process. 2021, 30, 4291–4304. [Google Scholar] [CrossRef]
  14. Gao, J.; Deng, B.; Qin, Y.; Wang, H.; Li, X. Enhanced Radar Imaging Using a Complex-Valued Convolutional Neural Network. IEEE Geosci. Remote. Sens. Lett. 2019, 16, 35–39. [Google Scholar] [CrossRef]
  15. Ai, J.; Mao, Y.; Luo, Q.; Xing, M.; Jiang, K.; Jia, L.; Yang, X. Robust CFAR Ship Detector Based on Bilateral-Trimmed-Statistics of Complex Ocean Scenes in SAR Imagery: A Closed-Form Solution. IEEE Trans. Aerosp. Electron. Syst. 2021, 57, 1872–1890. [Google Scholar]
  16. Ai, J.; Luo, Q.; Yang, X.; Yin, Z.; Xu, H. Outliers-Robust CFAR Detector of Gaussian Clutter Based on the Truncated-Maximum-Likelihood- Estimator in SAR Imagery. IEEE Trans. Intell. Transp. Syst. 2020, 21, 2039–2049. [Google Scholar]
  17. Amrani, M.; Bey, A.; Amamra, A. New SAR target recognition based on YOLO and very deep multi-canonical correlation analysis. Int. J. Remote. Sens. 2021, 43, 5800–5819. [Google Scholar] [CrossRef]
  18. Amrani, M.; Jiang, F. Deep feature extraction and combination for synthetic aperture radar target classification. J. Appl. Remote. Sens. 2017, 11, 042616. [Google Scholar]
  19. Hu, K.; Zhang, C.; He, L.; Zhu, Y. Prediction Model of Mechanical Properties of Elastic Composites Based on Machine Learning Algorithm. In Proceedings of the Artificial Intelligence for Future Society; Palade, V., Favorskaya, M., Patnaik, S., Simic, M., Belciug, S., Eds.; Springer Nature Switzerland: Cham, Switzerland, 2024; pp. 457–468. [Google Scholar]
  20. Wang, Q.; Huang, Z.; Fan, H.; Fu, S.; Tang, Y. Unsupervised person re-identification based on adaptive information supplementation and foreground enhancement. IET Image Process. 2024, 18, 4680–4694. [Google Scholar] [CrossRef]
  21. Rotter, P.; Muron, W. Automatic Detection of Subsidence Troughs in SAR Interferograms Based on Convolutional Neural Networks. IEEE Geosci. Remote Sens. Lett. 2021, 18, 82–86. [Google Scholar]
  22. Guo, J.; Zhang, Z.; Wang, M.; Ma, P.; Gao, W.; Liu, X. Automatic Detection of Subsidence Funnels in Large-Scale SAR Interferograms Based on an Improved-YOLOv8 Model. IEEE Trans. Geosci. Remote. Sens. 2024, 62, 6200117. [Google Scholar] [CrossRef]
  23. Yu, Y.; Wang, Z.; Li, Z.; Ye, K.; Li, H.; Wang, Z. A Lightweight Anchor-Free Subsidence Basin Detection Model With Adaptive Sample Assignment in Interferometric Synthetic Aperture Radar Interferogram. Front. Ecol. Evol. 2022, 10, 840464. [Google Scholar]
  24. Cui, X.C.; Fu, Y.W.; Su, Y.; Chen, S.W. Physical Parameters Joint Estimation of Satellite Parabolic Antenna with Key Frame Pol-ISAR Images. IEEE Trans. Geosci. Remote. Sens. 2024, 62, 5100416. [Google Scholar]
  25. Zhang, Y.; Yang, X.; Jiang, X.R.; Yang, Q.; Deng, B.; Wang, H.Q. Attitude direction estimation of space target parabolic antenna loads using sequential terahertz ISAR images. J. Infrared Millim. Waves 2021, 40, 496–507. [Google Scholar]
  26. Fan, L.; Wang, H.; Yang, Q.; Chen, X.; Deng, B.; Zeng, Y. Fast Detection and Reconstruction of Tank Barrels Based on Component Prior and Deep Neural Network in the Terahertz Regime. IEEE Trans. Geosci. Remote. Sens. 2022, 60, 5230817. [Google Scholar]
  27. Zhou, Y.; Zhang, L.; Cao, Y.; Wu, Z. Attitude Estimation and Geometry Reconstruction of Satellite Targets Based on ISAR Image Sequence Interpretation. IEEE Trans. Aerosp. Electron. Syst. 2019, 55, 1698–1711. [Google Scholar]
  28. Liang, M.; Jin, L.; Tao, W.; Li, Y.; Wang, X. Micro-Doppler characteristics of sliding-type scattering center on rotationally symmetric target. Sci. China (Inf. Sci.) 2011, 54, 1957–1967. [Google Scholar]
  29. Keller, J.B. Geometrical Theory of Diffraction. J. Opt. Soc. Am. 1962, 52, 116–130. [Google Scholar] [CrossRef]
  30. Ge, Z.; Liu, S.; Wang, F.; Li, Z.; Sun, J. YOLOX: Exceeding YOLO Series in 2021. arXiv 2021, arXiv:2107.08430. [Google Scholar]
  31. Dai, X.; Chen, Y.; Xiao, B.; Chen, D.; Liu, M.; Yuan, L.; Zhang, L. Dynamic Head: Unifying Object Detection Heads with Attentions. arXiv 2021, arXiv:2106.08322. [Google Scholar]
  32. Tong, Z.; Chen, Y.; Xu, Z.; Yu, R. Wise-IoU: Bounding Box Regression Loss with Dynamic Focusing Mechanism. arXiv 2023, arXiv:2301.10051. [Google Scholar]
  33. Chen, Y.; Dai, X.; Liu, M.; Chen, D.; Yuan, L.; Liu, Z. Dynamic ReLU. arXiv 2020, arXiv:2003.10027. [Google Scholar] [CrossRef]
  34. Girshick, R.; Donahue, J.; Darrell, T.; Malik, J. Rich feature hierarchies for accurate object detection and semantic segmentation. arXiv 2014, arXiv:1311.2524. [Google Scholar]
  35. Girshick, R. Fast R-CNN. In Proceedings of the 2015 IEEE International Conference on Computer Vision (ICCV), Santiago, Chile, 7–13 December 2015; pp. 1440–1448. [Google Scholar] [CrossRef]
  36. Ren, S.; He, K.; Girshick, R.; Sun, J. Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 39, 1137–1149. [Google Scholar] [CrossRef] [PubMed]
  37. Redmon, J.; Divvala, S.; Girshick, R.; Farhadi, A. You Only Look Once: Unified, Real-Time Object Detection. In Proceedings of the Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016. [Google Scholar]
  38. Lin, T.Y.; Goyal, P.; Girshick, R.; He, K.; Dollár, P. Focal Loss for Dense Object Detection. IEEE Trans. Pattern Anal. Mach. Intell. 2017, 42, 2999–3007. [Google Scholar]
  39. Redmon, J.; Farhadi, A. YOLOv3: An Incremental Improvement. arXiv 2018, arXiv:1804.02767. [Google Scholar]
  40. Zhu, X.; Lyu, S.; Wang, X.; Zhao, Q. TPH-YOLOv5: Improved YOLOv5 Based on Transformer Prediction Head for Object Detection on Drone-captured Scenarios. In Proceedings of the 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Montreal, BC, Canada, 11–17 October 2021. [Google Scholar]
  41. Li, C.; Li, L.; Jiang, H.; Weng, K.; Geng, Y.; Li, L.; Ke, Z.; Li, Q.; Cheng, M.; Nie, W.; et al. YOLOv6: A Single-Stage Object Detection Framework for Industrial Applications. arXiv 2022, arXiv:2209.02976. [Google Scholar]
  42. Khanam, R.; Hussain, M. YOLOv11: An Overview of the Key Architectural Enhancements. arXiv 2024, arXiv:2410.17725. [Google Scholar]
  43. Lecun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436. [Google Scholar] [CrossRef]
Figure 1. The overall framework diagram of the proposed method.
Figure 2. The observational geometry for space-based terahertz radar in detecting space targets.
Figure 3. Geometry projection diagram of ISAR imaging.
Figure 4. Parabolic antenna imaging characteristics. (a) Three typical observation apertures. (b) Scattering intensity versus azimuth angle. (c) The specular point. (d) The edge pair-points. (e) The ellipse arc.
Figure 5. Satellite CAD model with 5 main scattering components (left) and its geometry and size (right).
Figure 6. Imaging results and corresponding CAD under three typical observation apertures.
Figure 7. Structure diagram of Improved-YOLOv8.
Figure 8. Structure diagram of Dyhead.
Figure 9. The training samples under different apertures.
Figure 10. The distribution of bounding boxes within the dataset.
Figure 11. The mAP50 (left) and mAP50-95 (right) of different networks in the training set.
Figure 12. A comparison of the detection performance of different algorithms on EM data.
Figure 13. mAP50 and mAP50-95 of different networks.
Figure 14. PR curves for three different objects.
Figure 15. Anechoic chamber experiment and satellite mock-up presentation. (a) Terahertz radar technology system. (b) Satellite model for anechoic chamber experiment.
Figure 16. Comparison of performance between different networks on anechoic chamber data.
Table 1. Scattering mechanism of each component of satellite.
| Scattering Component | Description | Scattering Mechanism |
|---|---|---|
| Parabolic antenna | Dish-shaped structure designed to receive and transmit signals | Single/multiple reflection |
| Solar panel | Crucial component that provides electrical power to the satellite | Single/multiple reflection |
| Main body | Box-like structure that houses the majority of the satellite’s instruments, electronics, and systems | Single/multiple reflection |
| Lens | Cavity structure that is a key component in cameras or telescopes | Multiple reflection |
| Propeller | Cavity device that converts rotational motion into thrust | Multiple reflection |
Table 2. The performance metrics across various networks in the context of satellite analysis.
| Networks | P | R | F1 | mAP50 | mAP50-95 |
|---|---|---|---|---|---|
| YOLOv3 | 0.968 | 1 | 0.984 | 0.994 | 0.679 |
| YOLOv5 | 0.996 | 1 | 0.998 | 0.995 | 0.871 |
| YOLOv6 | 1 | 0.98 | 0.99 | 0.995 | 0.766 |
| YOLOv8 | 0.986 | 1 | 0.993 | 0.995 | 0.824 |
| YOLOv11 | 0.996 | 0.989 | 0.981 | 0.994 | 0.794 |
| Ours | 0.996 | 1 | 0.998 | 0.995 | 0.873 |
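For reference, the F1 column in Tables 2–5 is the harmonic mean of precision (P) and recall (R). A minimal sketch that reproduces the tabulated values (the function name is ours, not from the paper):

```python
def f1_score(p: float, r: float) -> float:
    """Harmonic mean of precision and recall."""
    return 2 * p * r / (p + r) if (p + r) > 0 else 0.0

# Reproduce the YOLOv3 row of Table 2: P = 0.968, R = 1
print(round(f1_score(0.968, 1.0), 3))  # → 0.984
```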
Table 3. The performance metrics across various networks in the context of specular point analysis.
| Networks | P | R | F1 | mAP50 | mAP50-95 |
|---|---|---|---|---|---|
| YOLOv3 | 0.484 | 0.054 | 0.097 | 0.054 | 0.010 |
| YOLOv5 | 0.991 | 0.333 | 0.498 | 0.628 | 0.217 |
| YOLOv6 | 0.924 | 0.222 | 0.358 | 0.336 | 0.153 |
| YOLOv8 | 0.756 | 0.278 | 0.407 | 0.352 | 0.123 |
| YOLOv11 | 0.799 | 0.333 | 0.470 | 0.442 | 0.148 |
| Ours | 0.793 | 0.778 | 0.785 | 0.838 | 0.283 |
Table 4. The performance metrics across various networks in the context of edge pair-points analysis.
| Networks | P | R | F1 | mAP50 | mAP50-95 |
|---|---|---|---|---|---|
| YOLOv3 | 0.67 | 0.905 | 0.770 | 0.913 | 0.566 |
| YOLOv5 | 0.736 | 0.952 | 0.830 | 0.913 | 0.627 |
| YOLOv6 | 0.653 | 0.895 | 0.755 | 0.891 | 0.581 |
| YOLOv8 | 0.694 | 0.952 | 0.803 | 0.933 | 0.62 |
| YOLOv11 | 0.624 | 0.857 | 0.722 | 0.852 | 0.536 |
| Ours | 0.902 | 0.948 | 0.924 | 0.989 | 0.685 |
Table 5. The performance metrics across various networks in the context of ellipse arc analysis.
| Networks | P | R | F1 | mAP50 | mAP50-95 |
|---|---|---|---|---|---|
| YOLOv3 | 0.93 | 0.784 | 0.851 | 0.851 | 0.518 |
| YOLOv5 | 0.902 | 0.94 | 0.921 | 0.96 | 0.593 |
| YOLOv6 | 0.862 | 0.957 | 0.907 | 0.947 | 0.585 |
| YOLOv8 | 0.856 | 0.897 | 0.876 | 0.943 | 0.59 |
| YOLOv11 | 0.894 | 0.897 | 0.876 | 0.917 | 0.541 |
| Ours | 0.889 | 0.948 | 0.918 | 0.978 | 0.591 |
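The mAP50 and mAP50-95 columns in Tables 2–5 count a detection as correct when its bounding box overlaps the ground truth with IoU ≥ 0.5 (mAP50-95 averages over thresholds from 0.5 to 0.95). A minimal sketch of the underlying axis-aligned IoU computation (the helper name and box convention are ours):

```python
def iou(a, b):
    """IoU of two axis-aligned boxes given as (x1, y1, x2, y2)."""
    ix1, iy1 = max(a[0], b[0]), max(a[1], b[1])
    ix2, iy2 = min(a[2], b[2]), min(a[3], b[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_a = (a[2] - a[0]) * (a[3] - a[1])
    area_b = (b[2] - b[0]) * (b[3] - b[1])
    union = area_a + area_b - inter
    return inter / union if union > 0 else 0.0

# Two unit-overlap 2x2 boxes share area 1 of a union of 7
print(iou((0, 0, 2, 2), (1, 1, 3, 3)))  # → 0.142857...
```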
Table 6. Ablation experiment.
| Method | mAP50 |
|---|---|
| YOLOv8 | 0.743 |
| YOLOv8 + WIoUv3 | 0.808 |
| YOLOv8 + WIoUv2 | 0.737 |
| YOLOv8 + WIoUv1 | 0.917 |
| YOLOv8 + Dyhead | 0.906 |
| YOLOv8 + Dyhead + WIoUv3 | 0.859 |
| YOLOv8 + Dyhead + WIoUv2 | 0.926 |
| YOLOv8 + Dyhead + WIoUv1 | 0.935 |
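The best ablation configuration uses the WIoU v1 regression loss, which scales the plain IoU loss by a distance-based focusing factor computed from the box centers and the smallest enclosing box. A minimal standalone sketch following the published Wise-IoU formulation (the function name and scalar layout are ours; in training this operates on gradient-tracked tensors, with the enclosing-box term detached):

```python
import math

def wiou_v1(pred, gt):
    """WIoU v1 loss for boxes (x1, y1, x2, y2): IoU loss scaled by R_WIoU."""
    # plain IoU
    ix1, iy1 = max(pred[0], gt[0]), max(pred[1], gt[1])
    ix2, iy2 = min(pred[2], gt[2]), min(pred[3], gt[3])
    inter = max(0.0, ix2 - ix1) * max(0.0, iy2 - iy1)
    area_p = (pred[2] - pred[0]) * (pred[3] - pred[1])
    area_g = (gt[2] - gt[0]) * (gt[3] - gt[1])
    iou = inter / (area_p + area_g - inter)
    # squared distance between box centers
    dx = (pred[0] + pred[2]) / 2 - (gt[0] + gt[2]) / 2
    dy = (pred[1] + pred[3]) / 2 - (gt[1] + gt[3]) / 2
    # width/height of the smallest enclosing box (detached from the
    # gradient in the original formulation)
    wg = max(pred[2], gt[2]) - min(pred[0], gt[0])
    hg = max(pred[3], gt[3]) - min(pred[1], gt[1])
    r_wiou = math.exp((dx * dx + dy * dy) / (wg * wg + hg * hg))
    return r_wiou * (1.0 - iou)

# A perfectly matched box incurs zero loss
print(wiou_v1((0, 0, 2, 2), (0, 0, 2, 2)))  # → 0.0
```

Because R_WIoU ≥ 1 and grows with center offset, poorly localized boxes are penalized more heavily than under the bare IoU loss, which is consistent with the difficulty-aware behavior described for WIoU in the abstract.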
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Yang, L.; Wang, H.; Zeng, Y.; Liu, W.; Wang, R.; Deng, B. Detection of Parabolic Antennas in Satellite Inverse Synthetic Aperture Radar Images Using Component Prior and Improved-YOLOv8 Network in Terahertz Regime. Remote Sens. 2025, 17, 604. https://doi.org/10.3390/rs17040604
