Adversarial Attack for SAR Target Recognition Based on UNet-Generative Adversarial Network

Du, Chuan; Zhang, Lei

doi:10.3390/rs13214358

Open AccessArticle

Adversarial Attack for SAR Target Recognition Based on UNet-Generative Adversarial Network

by

Chuan Du

and

Lei Zhang

^*

School of Electronics and Communication Engineering, Sun Yat-sen University, Shenzhen 518107, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2021, 13(21), 4358; https://doi.org/10.3390/rs13214358

Submission received: 12 September 2021 / Revised: 12 October 2021 / Accepted: 22 October 2021 / Published: 29 October 2021

(This article belongs to the Special Issue Synthetic Aperture Radar (SAR)—New Techniques, Missions and Applications)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

Some recent articles have revealed that synthetic aperture radar automatic target recognition (SAR-ATR) models based on deep learning are vulnerable to the attacks of adversarial examples and cause security problems. The adversarial attack can make a deep convolutional neural network (CNN)-based SAR-ATR system output the intended wrong label predictions by adding small adversarial perturbations to the SAR images. The existing optimization-based adversarial attack methods generate adversarial examples by minimizing the mean-squared reconstruction error, causing smooth target edge and blurry weak scattering centers in SAR images. In this paper, we build a UNet-generative adversarial network (GAN) to refine the generation of the SAR-ATR models’ adversarial examples. The UNet learns the separable features of the targets and generates the adversarial examples of SAR images. The GAN makes the generated adversarial examples approximate to real SAR images (with sharp target edge and explicit weak scattering centers) and improves the generation efficiency. We carry out abundant experiments using the proposed adversarial attack algorithm to fool the SAR-ATR models based on several advanced CNNs, which are trained on the measured SAR images of the ground vehicle targets. The quantitative and qualitative results demonstrate the high-quality adversarial example generation and excellent attack effectiveness and efficiency improvement.

Keywords:

adversarial attack; adversarial example generation; UNet; generative adversarial network (GAN); synthetic aperture radar (SAR); automatic target recognition (ATR)

Graphical Abstract

1. Introduction

As an active imaging sensor, synthetic aperture radar (SAR) has the advantages of collecting all-time, all-weather, high-resolution images [1,2,3]. SAR-automatic target recognition (ATR) is a vital method to extract remote sensing information and plays an essential role in earth monitoring, military and homeland security [4,5,6,7]. In the field of SAR-ATR, deep convolutional neural networks (CNNs) have been proven powerful tools due to their hierarchical feature extraction ability [8,9,10,11,12]. However, several works have revealed that some security problems exist in these SAR-ATR models.

Szegedy et al. [13] first discover that by injecting well-designed tiny perturbations into image samples, adversarial examples can be intentionally produced to cause the recognition model to misclassify. This process of generating adversarial examples is named as “adversarial attack”, which has become a recent study trend [14,15,16,17,18,19] in the research field of remote sensing, radar, radio, etc. In radar signal processing, [14,15] verify that high-resolution range profile (HRRP) and SAR image target recognition models can be attacked successfully by well-designed adversarial examples. A faster C&W adversarial attack algorithm [16] is proposed to effectively fool deep CNN-based SAR target classifiers and meet real-time requirements. In the field of remote sensing, Li et al. [17] provide abundant experiments and insightful analysis on the adversarial attack of the deep CNNs-based remote sensing image scene classification. The work [18] systematically analyzes the influence of adversarial examples on classification results of remote sensing scene classifiers based on deep neural networks (DNNs), which also demonstrates that the defense capability of the classifiers to the adversarial examples can be significantly improved by adversarial training. In terms of radio propagation, white-box and black-box adversarial attack methods are explored in [20], showing the vulnerability of radio signals classification based on DNNs to adversarial examples. Due to the openness of wireless communication, the end-to-end learning communication system based on auto-encoders can be easily destroyed by the well-designed adversarial perturbations [21]. Although several adversarial attack algorithms have been proposed to generate adversarial examples, generating them with high efficiency requires more exploration.

Various adversarial attack algorithms have been proposed in recent years. For example, as a gradient-based method, the fast gradient sign method (FGSM) [22] produces adversarial examples by taking a one-step update of the original image along with the sign of the gradient of the cross-entropy classification loss function. The basic iterative method (BIM) [23] and projected gradient descent (PGD) [24] are the iterative versions of FGSM, which utilize the multiple steps gradient information to obtain better attack effectiveness. The DeepFool [25] finds the closest distance from the input image to the target classification boundary and performs an iterative attack to perturb the original image beyond the classification boundary. However, the defensive distillation algorithm [26] can defense against these existing adversarial attacks except the C&W attack [27]. As an optimization-based method, the C&W attack [27] models the adversarial examples generation as an optimization process that maximizing the confidence of the adversarial examples labeled as a wrong category while minimizing the power of the adversarial perturbations (mean-squared reconstruction error (MSE) loss). The C&W has acquired excellent adversarial attack performance. According to the attributed scattering center model, a SAR image of a target can be regard as the sum of the responses from various individual scattering centers in different range-Doppler cells [28]. Hence, the C&W’s MSE loss function is not suitable for SAR image adversarial example generation tasks, which will cause smooth target edge and blurry weak scattering centers in SAR image adversarial examples. Moreover, it is not appropriate for the adversarial attack task requiring an instant response, since its iterative optimization process costs a lot of time.

To efficiently generate adversarial examples of SAR images with sharp target edges and explicit weak scattering centers, in this paper, we propose to train a generator and discriminator in an adversarial way. We build a UNet [29] to realize the generator, which can extract the separable features of the targets from the whole SAR images to influence the recognition results. Moreover, it concatenates the low-resolution and high-resolution feature maps and learns the basic component scattering center information to generate a more refined SAR image adversarial examples. The discriminator aims to encourage that the generated adversarial examples are approximate to the real SAR images in sense of data distribution. In general, we apply the generative adversarial networks (GANs) [30] to efficiently produce high-quality adversarial examples for SAR images in white-box condition by adversarial training.

Our contributions are listed as the following.

(1): We leverage a generator to generate adversarial examples through fast network mapping rather than the iterative optimization in the previous optimization-based methods. Therefore, the proposed adversarial attack algorithm provides the SAR-ATR system with real-time attack capability.
(2): We utilize the UNet to learn the separable features of the targets to cause the misclassification of the recognition model. The UNet can also fuse the multi-resolution feature maps, benefiting the generation of SAR image adversarial examples.
(3): By introducing a discriminator, we can train the generator to produce higher-quality adversarial examples for SAR images by adversarial training, which can possess sharper target edges and more explicit weak scattering centers and achieve better attack performance.

The rest parts of this article are arranged as follows. Section 2 describes the problem definition of adversarial attack and our proposed algorithm in detail. In Section 3, we evaluate our proposed models and report experimental results. Conclusions and future works waiting to be explored are in Section 4.

2. Preliminaries

Adversarial Attack for SAR-ATR

Supposing

X

is the SAR image dataset.

x_{n} \in R^{W \times H}

is the n-th SAR image sample and

y_{n}

is the corresponding ground truth category label of

x_{n}

in the dataset

X

, where W and H denote the width and height of the SAR image, respectively.

F (\cdot)

is a target recognition model that provide a correct category prediction of a SAR image. For a commonly used deep CNN recognition model

F (\cdot)

with a softmax output layer, given an input SAR image sample x, the output of

F (x)

is

p \in R^{S}

denoting the probability distribution of the predicted categories, where

p_{s} \in [0, 1]

,

\sum_{s = 1}^{S} p_{s} = 1

and S denotes the number of the total target categories. The index of the predicted target category is an integer

C (x) = arg {max}_{s} (F {(x)}_{s}) \in [1, 2, . . ., S]

.

The aim of an adversarial attack for SAR-ATR is to generate the corresponding adversarial example

\tilde{x}

and make the SAR-ATR model misclassify. Meanwhile,

\tilde{x}

needs to be approximate to the original SAR image x under some distance metric so that their differences would not be perceived easily, where

\tilde{x} = x + δ

, and

δ

is the added tiny adversarial perturbation. The whole frameworks of SAR-ATR and adversarial attack for SAR-ATR are shown in Figure 1.

The commonly-used adversarial attack modes are introduced below.

Targeted attack: If there is a SAR image x and a designated category

t \neq y

, targeted attack aims to find an adversarial example

\tilde{x}

which is similar to x, subject to

C (\tilde{x}) = t

. Namely, the targeted attack can cause the SAR-ATR model to mislabel the adversarial example as the designated category.

Non-targeted attack: If there is no designated category for the adversarial example, the adversarial attack is reduced to a search for the adversarial example

\tilde{x}

which is similar to the original SAR image x, subject to

C (\tilde{x}) \neq y

, which is called a non-targeted attack.

3. Method

According to the attributed scattering center model, a SAR image of a target can be regard as the sum of the responses from various individual scattering centers in different range-Doppler cells [28]. The existing adversarial attack methods in SAR-ATR obtain the adversarial examples by minimizing the mean reconstruction square error, which will lead to the smooth target edge and the blurry weak scattering centers in the generated SAR image. These generated adversarial examples are obviously different from the real SAR images and possess poor deception. Therefore, to produce the adversarial examples with the characteristics of SAR images (with sharp target edges and explicit weak scattering centers), we propose an Attack-UNet-GAN algorithm to improve the quality of the generated adversarial example.

3.1. Attack-UNet-GAN

The overall architecture of our proposed Attack-UNet-GAN algorithm is shown in Figure 2b, which is consisted of three modules: a generator

G (\cdot)

, a discriminator

D (\cdot)

and a SAR-ATR model

F (\cdot)

. In the optimization-based adversarial attack algorithms, the adversarial examples of the test SAR images are generated by re-optimizing the loss function iteratively, which is of high time cost. Therefore, to construct the fast mapping from the original SAR images to the adversarial examples, we build the generator

G (\cdot)

. The input of

G (\cdot)

is the original SAR image x and its output is the adversarial example

\tilde{x} = G (x)

.

G (\cdot)

aims to learn the basic component scattering center information in the SAR images and encourage the generated adversarial examples to be indistinguishable from the original SAR images for the discriminator

D (\cdot)

. Furthermore, to generate more realistic adversarial examples with the characteristics of SAR image, the generated adversarial example

\tilde{x}

is then sent into the discriminator

D (\cdot)

, whose function is to distinguish the generated adversarial example

\tilde{x}

from the real SAR images x as possible. Thus,

G (\cdot)

and

D (\cdot)

are trained in an adversarial way and strengthen each other. To achieve the task of attacking the SAR-ART model, we should first have a high-accuracy SAR-ART model

F (\cdot)

, which takes the adversarial example

\tilde{x}

as input and outputs its predicted target label.

3.1.1. The Design of Loss Function

Targeted Attack: According to Section 2, we can describe the process of a targeted attack as the following constrained optimization problem:

\begin{matrix} \begin{matrix} min_{\tilde{x}} L_{dis} (\tilde{x}, x), s . t . C (\tilde{x}) = t, \tilde{x} \in {[0, 1]}^{W \times H} \end{matrix} \end{matrix}

(1)

where

L_{dis}

is the distance metric used for the similarity measurement between the original SAR image and the adversarial example. In this way, the differences between x and

\tilde{x}

are limited to be as small as possible. Due to the non-differentiable characteristic of the constraint

C (\tilde{x}) = t

, (1) is difficult to be solved, so some mathematical transformations are needed. Here, we construct a function

g_{1} (\cdot)

to make

C (\tilde{x}) = t

and

g_{1} (\tilde{x}) \leq 0

equivalent [27]. In this paper, the following expression of

g_{1} (\cdot)

is used:

\begin{matrix} g_{1} (\tilde{x}) = {(max_{s \neq t} (F {(\tilde{x})}_{s}) - F {(\tilde{x})}_{t})}^{+}, \end{matrix}

(2)

where s denotes the integer index of the target category.

F {(\tilde{x})}_{t}

denotes the predicted classification probability of the designated category t. The

{(z)}^{+}

is a shortened form of

\max (z, 0)

. Thus, we replace (1) by the following:

\begin{matrix} \begin{matrix} min_{\tilde{x}} L_{dis} (\tilde{x}, x), s . t . g_{1} (\tilde{x}) \leq 0, \tilde{x} \in {[0, 1]}^{W \times H} \end{matrix} . \end{matrix}

(3)

Then, the method of Lagrange multipliers is applied, and we can derive the loss function as follows:

\begin{matrix} \begin{matrix} L & = min_{\tilde{x}} (L_{dis} + L_{adv}) \\ = min_{\tilde{x}} (L_{dis} (\tilde{x}, x) + λ \cdot g_{1} (\tilde{x})) \end{matrix} . \end{matrix}

(4)

For the box constraint

\tilde{x} \in {[0, 1]}^{W \times H}

in (3), we use the sigmoid activation function to solve it, which is described in detail in Section 3.1.3. In this paper, we utilize the

l_{2}

-norm:

{∥v∥}_{2} = \sqrt{\sum_{i = 1}^{n} ({|v_{i}|}^{2})}

to realize

L_{dis}

in (4). Then, the optimization problem waited to be solved is

\begin{matrix} L = min_{\tilde{x}} ({∥\tilde{x} - x∥}_{2} + λ \cdot g_{1} (\tilde{x})), \end{matrix}

(5)

where

g_{1} (\tilde{x})

describes the distance between

max_{s \neq t} (F {(\tilde{x})}_{s})

and

F {(\tilde{x})}_{t}

. Minimizing the value of

g_{1} (\tilde{x})

will encourage the perturbed SAR image to be mislabeled as the designated category t. Minimizing the value of

{∥\tilde{x} - x∥}_{2}

aims to limit the power of the added adversarial perturbation. Note that the adversarial attack loss

L_{adv}

should describe the opposite of the difference between the predicted probability of the designated category t and the highest predicted probability among the other categories in the targeted attack, or the difference between the predicted probability of the ground truth category y and the highest predicted probability among the other categories in the non-targeted attack [27].

A smaller

g_{1} (\tilde{x})

means the probability of the input SAR image classified as the designated category t is higher. If

max_{s \neq t} (F {(\tilde{x})}_{s})

is optimized to be just smaller than

F {(\tilde{x})}_{t}

, the optimization of the loss function (5) will be finished. Hence, we introduce a threshold

η

to form a soft hinge loss on

L_{adv}

as

\begin{matrix} g_{1} (\tilde{x}) = max (- η, max_{s \neq t} (F {(\tilde{x})}_{s}) - F {(\tilde{x})}_{t}), \end{matrix}

(6)

which means that the optimization will not be stopped until

max_{s \neq t} (F {(\tilde{x})}_{s})

is

η

smaller than

F {(\tilde{x})}_{t}

. It can elevate the final probability of the input SAR image classified as the designated category t.

In this way, the loss function in targeted attack mode can be represented as:

\begin{matrix} L = min_{\tilde{x}} ({∥\tilde{x} - x∥}_{2} + λ \cdot max (- η, max_{s \neq t} (F {(\tilde{x})}_{s}) - F {(\tilde{x})}_{t})) . \end{matrix}

(7)

Non-targeted Attack: For a non-targeted attack, there is no designated category for the adversarial examples to be mislabeled as. In this way, the loss function in the non-targeted attack just aims to realize the misclassification of an adversarial example to a wrong category. Therefore, we should construct the

L_{adv}

in the loss function to describe the difference between the predicted probability of the ground truth category y and the highest predicted probability among the other categories [27]. Here, we replace (4) by the following loss function in the non-targeted attack:

\begin{matrix} \begin{matrix} L & = min_{\tilde{x}} (L_{d i s} + λ \cdot L_{a d v}) \\ = min_{\tilde{x}} ({∥\tilde{x} - x∥}_{2} + λ \cdot g_{2} (\tilde{x})) \end{matrix}, \end{matrix}

(8)

where

g_{2} (\tilde{x})

measures the difference between the predicted probability of the wrong category and the predicted probability of the ground truth category y. The selected expression of

g_{2} (\tilde{x})

is as the following:

\begin{matrix} g_{2} (\tilde{x}) = {(F {(\tilde{x})}_{y} - max_{s \neq y} (F {(\tilde{x})}_{s}))}^{+}, \end{matrix}

(9)

where

F {(\tilde{x})}_{y}

denotes the probability of the adversarial example

\tilde{x}

recognized as the ground truth category y. In order to elevate the robustness of the algorithm and the misclassification confidences of the adversarial examples, we also introduce a threshold

η

. Then

g_{2} (\tilde{x})

can be written as:

\begin{matrix} g_{2} (\tilde{x}) = max (- η, F {(\tilde{x})}_{y} - max_{s \neq y} (F {(\tilde{x})}_{s})), \end{matrix}

(10)

which means that the optimization of adversarial example

\tilde{x}

will not be stopped, until

F {(\tilde{x})}_{y}

is

η

smaller than

max_{s \neq y} (F {(\tilde{x})}_{s})

.

Hence, the loss function for optimizing the adversarial example

\tilde{x}

in the non-targeted attack can be written as:

\begin{matrix} L = min_{\tilde{x}} ({∥\tilde{x} - x∥}_{2} + λ \cdot max (- η, F {(\tilde{x})}_{y} - max_{s \neq y} (F {(\tilde{x})}_{s}))) . \end{matrix}

(11)

3.1.2. The Introduction of UNet and GAN

To realize the real-time generation of the adversarial example, a generator

G_{θ} (\cdot)

is built as shown in Figure 2, replacing the iterative searching process of the adversarial example in the C&W algorithm. The input of the generator

G_{θ} (\cdot)

is the original SAR image x, and its output is the adversarial example

\tilde{x} = G_{θ} (x)

. Therefore, the

\tilde{x}

in (7) and (11) can be replaced by

G_{θ} (\tilde{x})

. Moreover, to generate the adversarial examples with the characteristics of SAR images, we also introduce a discriminator

D_{ϕ} (\cdot)

to construct an adversarial training method, as shown in Figure 2. The input of the discriminator is the adversarial example

\tilde{x}

or original SAR image x, and its output is a scalar indicating whether the input is a real SAR image. The generator aims to generate the adversarial example

\tilde{x}

that is similar to the real SAR image x in the sense of data distribution, so as to deceive the discriminator

D_{ϕ} (\cdot)

. The goal of the discriminator

D_{ϕ} (\cdot)

is to distinguish the adversarial example

\tilde{x}

from the original SAR image x. So the adversarial training loss [30] can be expressed as:

\begin{matrix} \begin{matrix} L_{GAN} = E_{x} log D_{ϕ} (x) + E_{x} log (1 - D_{ϕ} (\tilde{x})) \\ = E_{x} log D_{ϕ} (x) + E_{x} log (1 - D_{ϕ} (G_{θ} (x))) \end{matrix}, \end{matrix}

(12)

where we denote the

E_{x} \equiv E_{x \sim p_{d a t a}}

,

θ

and

ϕ

are the parameters of the generative network and discriminative network, respectively. Note that the adversarial training loss aims to encourage the data distribution of the generated adversarial examples approximate to that of the real SAR images, leading to the generated adversarial examples with sharp target edges and explicit weak scattering centers. Thus, the generated adversarial examples possess the characteristics of SAR images and are highly deceptive.

Finally, the whole loss function of the algorithm Attack-UNet-GAN can be expressed as the following:

\begin{matrix} L = L_{dis} + λ \cdot L_{adv} + L_{GAN}, \end{matrix}

(13)

where

λ > 0

is a suitably chosen constant that controls the relative importance of misclassification loss

L_{adv}

.

For the optimization of the networks

G_{θ} (\cdot)

and

D_{ϕ} (\cdot)

, we solve the min-max game

arg {min}_{θ} {max}_{ϕ} L

by alternating the iterative optimizations of

G_{θ} (\cdot)

and

D_{ϕ} (\cdot)

. When we fix the parameters

ϕ

and optimize

{min}_{θ} L = {min}_{θ} (L_{dis} + λ \cdot L_{adv}) + {min}_{θ} (L_{GAN})

, the optimization of the first term

{min}_{θ} (L_{dis} + λ \cdot L_{adv})

can make the target recognition model misclassify and the added adversarial perturbations imperceptible. The optimization of the second term

{min}_{θ} (L_{G A N}) = {min}_{θ} (E_{x} log (1 - D_{ϕ} (G_{θ} (x))))

can make the output of

D_{ϕ} (G_{θ} (x))

approach 1, that is,

G_{θ} (\cdot)

can make the generated adversarial examples

G_{θ} (x)

similar to the real SAR images. When we fix the parameters

θ

and optimize

{max}_{ϕ} L = {max}_{ϕ} (L_{G A N}) = {max}_{ϕ} (E_{x} log D_{ϕ} (x) + E_{x} log (1 - D_{ϕ} (G_{θ} (x))))

, the optimization of it can make the output of

D_{ϕ} (x)

approach 1 and the output of

D_{ϕ} (G_{θ} (x))

approach 0, that is,

D_{ϕ} (\cdot)

can make the discriminator distinguish the real SAR images and the generated adversarial examples as possible. Thus, the generator and discriminator can be optimized in an adversarial way. The whole training process is outlined in Algorithm 1. Once

G_{θ} (\cdot)

is well-trained on the training SAR image data and SAR-ATR model, it can produce an adversarial example for each test SAR image through the one-step network mapping instead of the iterative optimization of the adversarial perturbations.

Algorithm 1: An example for the training process in Attack-UNet-GAN algorithm

1: Train a high-accuracy deep CNN-based SAR-ATR model

F (\cdot)

.

2: Build a generator

G_{θ} (\cdot)

and a discriminator

D_{ϕ} (\cdot)

.

3: Initialize generator parameters

θ

and discriminator parameters

ϕ

.

4: Set the appropriate mini-batch size, learning rate and network parameters and so on.

5: for number of training iterations do

6: Sample a mini-batch of m SAR images

\{x^{(1)}, . . ., x^{(m)}\}

from the training dataset, whose corresponding ground truth labels are

\{y^{(1)}, . . ., y^{(m)}\}

;

7: Feed the mini-batch of SAR images into the generator

G_{θ} (\cdot)

to generate the adversarial examples

\{{\tilde{x}}^{(1)}, . . ., {\tilde{x}}^{(m)}\}

and get the loss function

L_{dis}

;

8: Feed the generated adversarial examples into the SAR-ATR model

F (\cdot)

to output the prediction probability for each category and get the loss function

L_{adv}

;

9: Feed the generated adversarial examples and original SAR images into the discriminator

D_{ϕ} (\cdot)

alternately and get the loss function

L_{GAN}

;

10: Update the parameters

θ

and

ϕ

by SGD alternately on the whole loss function

arg {min}_{θ} {max}_{ϕ} L

in (13)

11: end for

3.1.3. The Detailed Network Architecture

The discriminator D: The detailed network architecture of the discriminator D is shown in Figure 3. Its input is the adversarial example

\tilde{x}

or original SAR image x with the size of

128 \times 128

. There are five blocks in the discriminator, each of which contains a

4 \times 4

convolutional layer with the stride size of 2 and padding size of 1 followed by a batch-normalization layer. The activation function between each block is a leaky ReLU function. The output layer is a sigmoid layer and outputs a scalar indicating whether the input sample is an original SAR image.

The generator G: In this paper, we introduce a UNet to realize the generator G and generate the adversarial examples. The UNet is initially proposed for the task of medical image segmentation [29]. It is a symmetric U-shaped deep CNN, which is consisted of an encoder and a decoder. The encoder is a stack of convolution, activation and pooling layers learning the separable features and basic component scattering center information. The sizes of the extracted feature maps are reduced by the encoder, which are then progressively expanded by the decoder. The decoder realizes the SAR image adversarial example generation with transposed convolutions. It combines the different resolution feature maps by a sequence of up-convolutions and the concatenation with corresponding feature maps of the same size from the encoder. This combination causes more sufficient feature information to be propagated to the higher resolution layers of the decoder, which can benefit the precise SAR image adversarial example generation.

The detailed UNet architecture is illustrated in Figure 4. The network architecture is symmetric for the encoder and decoder. The size of the input SAR image is

128 \times 128

. The number of layers, the resolution of each feature map and the number of feature map channels are also shown in Figure 4. The block of the encoder is consisted of two

3 \times 3

convolution layers followed by a

2 \times 2

max-pooling layer. The block of the decoder contains a

2 \times 2

up-convolution layer followed by two

3 \times 3

convolution layers. The output is the adversarial example, whose size is equal to that of the input SAR image. Note that we take the sigmoid layer as the output layer of the UNet, to restrict the values of the generated adversarial examples to the range of [0, 1].

4. Experiment

In this section, we use the well-trained SAR-ATR models on the public measured SAR image data to verify and test our proposed adversarial attack algorithm. We compare its attack performance with others competitive adversarial attack algorithms by attacking these deep CNN models. The experiments prove our algorithm’s competitive effectiveness, excellent efficiency and high-quality adversarial example generation.

4.1. Dataset and Experimental Setup

4.1.1. Dataset

The famous public measured SAR image data of the ground vehicle targets, the moving and stationary target acquisition and recognition (MSTAR) dataset [31,32], is utilized in our experiment. It is provided by the Air Force Research Laboratory and the Defence Advanced Research Projects Agency (AFRL/DARPA) [31]. This SAR image dataset is acquired leveraging the X-band HH polarization “STARLOS” spotlight SAR platform with the resolution of 0.3 m × 0.3 m. As the significant dataset for SAR-ATR performance evaluation, it contains abundant SAR images of vehicle targets and ground clutter. There are ten categories of vehicle targets in the dataset, such as BTR70, BTR60, BRDM2 and BMP2 (armored personnel carrier); 2S1 (rocket launcher); D7 (bulldozer); ZIL131 (truck); T62 and T72(tank); ZSU234 (air defense unit) [33], which are indexed by category labels 1, 2, …, 10, respectively. These SAR images in each category cover all target-aspect angles in the range of

[0^{\circ}, 360^{\circ}]

with a relative flat grass or exposed soil background. The adjacent target-aspect angle intervals are within

[1^{\circ}, 2^{\circ}]

. Notice that all targets are stationary targets. The optical images and corresponding SAR images of the targets are displayed in Figure 5.

We rescale the collected SAR images as

128 \times 128

pixels and obtain 5950 slice images. Each slice image is labeled as one of the ten kinds of targets. In addition, we carry out the amplitude normalization pre-processing to guarantee that the value of each SAR image pixel is limited within the range of

[0, 1]

. To validate the proposed algorithm’s generalization capability, the target-depression angles of the training and test SAR images are different. The target-depression angles and the numbers of the training and test images before the data augmentation are also listed in Table 1. In the training phase of the SAR-ATR models, the commonly used training data augmentation techniques [10], such as pose synthesis, translation and speckle noising, are also applied to alleviate the effects of overfitting and get the high-accuracy SAR-ATR models. Specifically, we first use one SAR image to produce 10 synthesized pose SAR images (rotating the SAR images). Then, they are translated by five times randomly. Finally, we perform the speckle noising augmentation operations on each translated SAR image with the parameter a (the maximum intensity of noise samples) set as 0.5, 1.0 and 1.5 [10].

4.1.2. Baselines and Experimental Setup

The following adversarial attack algorithms are the baselines compared with our algorithm:

Fast Gradient Sign Method (FGSM) [22]: Adversarial examples are generated by taking one-step update of the input along with the sign of the gradient of the cross-entropy loss function.
Basic Iterative Method (BIM) [23]: It is an extension of the FGSM by running a finer optimization for multiple iterations.
Project Gradient Descent (PGD) [24]: It is an iterative version of the FGSM, which takes multiple small steps iteratively while randomly adjusts the updating direction after each step.
DeepFool [25]: It finds the closest distance from the original image to the classification boundary and performs an iterative attack to perturb the original image beyond the classification boundary.
Carlini and Wagner’s Attack (C&W) [27]: The adversarial examples are generated by maximizing the probability of the adversarial example labeled as a wrong category while minimizing the power of the adversarial perturbations.

For the attacked SAR-ATR model, we use the standard deep learning classifiers, AlexNet [34], VGGNet16 [35] and ResNet32 [36], which are trained on the MSTAR dataset and have a classification accuracy of over

96 %

. The generator G is realized by a UNet [29] making the output and input SAR image size the same, whose detailed architecture is shown in Figure 4. For the discriminator D, the deep CNN [37] shown in Figure 3 is utilized to achieve it. For the distance metric function in this paper, we choose the

l_{2}

-norm. To optimize the generator and discriminator parameters, we adopt the Adam optimizer [38] with the learning rate

10^{- 4}

, the hyperparameter

β_{1} = 0.5

,

β_{2} = 0.999

, and the training batch size 64. We carry out all experiments in a Python program on a personal computer with a 3.7 GHz CPU, a 64 GB RAM, and a 24 GB NVIDIA Geforce RTX 3090 GPU.

4.2. Evaluation Measurements

Suppose that there are N test SAR images that can be classified correctly by the SAR-ATR model in total. The adversarial examples are generated from these N SAR images in the test dataset.

Targeted Attack: The attack success rate in the targeted attack mode is calculated by the following formula:

\begin{matrix} Ac c_{targeted} = \sum_{n = 1}^{N} I (C ({\tilde{x}}_{n}) = t^{(n)}) / N, \end{matrix}

(14)

where

I (\cdot)

denotes the indication function,

C ({\tilde{x}}_{n})

is the predicted category of the adversarial example

{\tilde{x}}_{n}

, and

t^{(n)}

is the designated category of the n-th adversarial example.

Non-targeted Attack: Without the designated category, the attack success rate in the non-targeted attack mode is calculated by the following formula:

\begin{matrix} Ac c_{non - targeted} = \sum_{n = 1}^{N} I (C ({\tilde{x}}_{n}) \neq y^{(n)}) / N, \end{matrix}

(15)

where

y^{(n)}

is the ground truth category of the n-th original SAR image.

4.3. Attack Performance Comparison

In this experiment, we attack different SAR-ATR models based on the deep CNNs (AlexNet, VGGNet16, ResNet32) under the condition of the white-box attack, which means that the network structures and parameters of the recognition models are known. The attack success rates of different adversarial attack algorithms for different recognition models in targeted and non-targeted attack modes are shown in Table 2 and Table 3, respectively. The attack success rates can reflect the effectiveness of the adversarial attack algorithms.

Among these adversarial attack algorithms, FGSM, BIM, PGD and DeepFool are gradient-based algorithms. C&W and Attack-GAN belong to optimization-based algorithms. BIM has higher attack success rates than FGSM, because it utilizes the multiple-step gradient information to acquire a more precise optimization result. PGD performs better than BIM, since it not only takes multiple small steps gradient update iteratively as BIM, but also randomly adjusts the direction after each step to search for a better adversarial example. The attack success rates of Attack-UNet-GAN are much higher than those of FGSM and competitive with those of the other four baseline algorithms. Attack-UNet-GAN can attack the SAR-ATR models more successfully than the Attack-UNet (without discriminator D). Due to the introduction of the discriminator D, the adversarial training loss can improve the data description ability of the generator G to generate better adversarial examples. In terms of attack success rate, Attack-UNet performs better than Attack-CNN, whose generator is realized by an 8-layer CNN with

1 \times 1

convolution kernels, since the UNet fuses the multiple resolution feature maps’ information and helps the more sufficient feature information be propagated to the higher resolution layers of the decoder to generate the better adversarial examples.

4.4. Comparation of the Generation Speed

To compare the calculation efficiency of each adversarial attack algorithm, we generate adversarial examples of the same test SAR image with

128 \times 128

pixels under the same calculation condition and record the running time of each algorithm’s program. The time cost of generating a

128 \times 128

pixels SAR image’s adversarial example for different adversarial attack algorithms is shown in Table 4. Among all these algorithms, the algorithms based on our proposed framework possess the fastest adversarial example generation speed, since they gain the adversarial example through the fast network mapping of the generator, rather than the iterative optimization in the C&W algorithm or the multiple calculations of the input test SAR images’ gradients in the BIM or PGD algorithm. Especially, compared with the C&W algorithm, the generation speed of the adversarial example for Attack-UNet-GAN is improved hundreds of times.

4.5. Influence of the Constant $λ$

To study the influence of the constant

λ

in (13) on the attack performance, we use the Attack-UNet-GAN algorithm to attack the SAR-ATR model based on ResNet32 for the values of

λ

located uniformly (on the log scale) from

λ = 0.001

to

λ = 100

. We plot the attack success rates and MSE distances for different values of

λ

in Figure 6. We can see that when

λ \leq 0.01

, the attack rarely succeeds. The attack success rate gradually increases to almost

100 %

, when the value of

λ

varies from 0.01 to 1. When

λ \geq 1

, the differences between the original SAR images and the generated adversarial examples become more apparent, but the attack always succeeds. Therefore, in our experiments, we set the value of

λ

as 1 to weigh the deception and attack performance of the generated adversarial examples.

4.6. Visualization of the Adversarial Examples

In this section, we carry out experiments to show the deception performace of the generated adversarial examples by different attack algorithms. The generated adversarial examples and the corresponding adversarial perturbations by different adversarial attack algorithms in targeted and non-targeted attacks are shown in Figure 7 and Figure 8. The attacked SAR-ATR model is based on the same ResNet32 for all adversarial attack algorithms. The predicted categories of the adversarial examples by the high-accuracy SAR-ATR model and the misclassified confidences to the wrong category for different adversarial attack algorithms are shown above the corresponding adversarial examples. We can see that the adversarial perturbations of FGSM, PGD, and BIM cover most parts of the SAR images. For C&W and DeepFool, the adversarial perturbations are mainly located on the SAR images’ shadow regions. Attack-UNet and Attack-UNet-GAN can mainly concentrate the adversarial perturbations on the target regions of the SAR images, because the target region of a SAR image possesses much more separable information benefiting the target recognition task than the background clutter and shadow regions. Thus, Attack-UNet and Attack-UNet-GAN can learn and utilize this separable information through the generator G to help produce the adversarial examples and fool the SAR-ATR model. In Figure 7g and Figure 8g, the target edges are sharper and the weak scattering centers of the target are more explicit than those in Figure 7h and Figure 8h, such as the regions surrounded by the red ellipses. Because the introduction of the discriminator D can help the generated adversarial examples approximate to real SAR images in the sense of data distribution and make them possess the characteristics of SAR images. For the DeepFool attack, there are some generated adversarial examples that can fool the SAR-ATR model successfully. However, the added adversarial perturbations are too strong, making the differences between the original SAR images and adversarial examples conspicuous.

4.7. Display of the Learned Features in UNet

To exhibit the excellent target feature extracting ability of the UNet for the SAR images, we visualize the hierarchical representations of the SAR image features extracted by different CNN layers of the UNet in Figure 9. In the first row of Figure 9, they are the features from the UNet’s encoder. It can be observed that the closer the layer is to the input original SAR image, the more specialized the leaned features are. On the contrary, the farther the layer is to the input SAR image, the more fundamental the leaned features are. The features of the fourth layer (Figure 9(c4)) in the encoder can be regarded as different basic strong scatter centers to construct all of the SAR target images. The features of the third layer (Figure 9(c3)) in the encoder are the component structures used to constitute the SAR images of the targets, such as spheres, dihedrals, trihedral, corner diffractions, etc. Further, the features learnt by the first layer (Figure 9(c1)) in the encoder possess more structure information, we can find different regions of the SAR image, such as the target, shadow and clutter regions. In the second row of Figure 9, they are the features from the UNet’s decoder. It can be seen that the closer the layer is to the output adversarial example, the more specialized the learned feature are, which is symmetric to that of the UNet’s encoder.

4.8. Separability of the Extracted Features

In this section, we represent the separability of the features extracted by the UNet. We visualize the original SAR images and high dimensional features extracted by the generator of Attack-UNet-GAN by utilizing T-SNE [39] to map them to the two-dimensional subspace in Figure 10a,b, respectively. The features are extracted by the last layer in the UNet’s encoder (the layer c5 in Figure 4). In Figure 10, each dot represents a SAR image or the feature of a SAR image, and each color denotes a category. It can be seen that the features learned by the generator are more separable and discriminative than the original SAR images of the targets. That is, the generator of our model extracts the features with prominent separability, which can help generate adversarial examples and cause the SAR-ATR model to misclassify.

4.9. Misclassified Category Distributions of the Adversarial Attack

To explore the misclassified category distribution of all adversarial examples, we calculate the misclassified categories for different adversarial attack algorithms. The misclassified category distributions show the percentages of the adversarial examples mislabelled as each of others target categories to all the adversarial examples of the ground truth label. We find that the misclassified categories are highly concentrated. As shown in Figure 11 and Figure 12, we use pie charts to visualize the distributions of the misclassified categories of the adversarial attack algorithms based on the MSTAR SAR image dataset.

The misclassified category distributions of adversarial attack algorithms are shown in Figure 11 and Figure 12. The ground truth category of all adversarial examples is D7 (bulldozer). Figure 11 shows the misclassified category distributions of the adversarial examples generated by six different attack algorithms for the same SAR-ATR model based on ReNet32. Figure 12 shows the misclassified category distributions of the adversarial examples generated by the same attack algorithm for three different deep CNN-based recognition models. In the pie charts, a color denotes a misclassified category. The percentage denotes the ratio of the number of the adversarial examples misclassified as the corresponding category to the total number of adversarial examples. In Figure 11 and Figure 12, it can be seen that although the adversarial examples are generated by different adversarial attack algorithms or for different attacked recognition models, their major misclassified categories are almost the same. For example, the BRDM2 (armored personnel carrier) is the major misclassified category of the adversarial examples for the original SAR images of the D7 (bulldozer). The reasons for this phenomenon may be the homogeneity and heterogeneity among categories. As it is found in the work [40] that the misclassified categories of the adversarial examples are more probably to be the categories that are closer to them in the sample’s feature space. Meanwhile, it can be observed that the similarity among the SAR images from different categories can be well reflected by the misclassified category distributions. For example, the armored personnel carrier is the major misclassified category of the adversarial examples from the bulldozer, representing that the armored personnel carrier and bulldozer may possess a strong similarity in the feature space or original SAR image space.

5. Discussion

The experiment results of Section 4.4 demonstrate the excellent adversarial example generation speed. Compared with the C&W algorithm, the generation speed is promoted even hundreds of times. The reason is the utilization of the generative network’s fast mapping. By utilizing a large number of training SAR images to train the generative network, it can well learn the basic features existing in SAR images to help build the mapping from the SAR image space to the adversarial example space. From the experiment results of Section 4.6, we observe that the introduction of GAN makes the generated SAR image adversarial examples possess sharp target edges and explicit weak scattering centers, because the adversarial training forces the generated adversarial examples to approximate the original SAR images in the sense of data distribution. Thus, the generated adversarial examples can possess the characteristics of real SAR images and stong deception. The experiment results of Section 4.7 and Section 4.8 illustrate the UNet’s powerful extraction capabilities of separable features and basic component scattering center information, which can benefit the generation of adversarial examples and cause the SAR-ATR model to misclassify.

From the experiment results of Section 4.3, we can conclude that the Attack-UNet-GAN algorithm has a competitive performance in terms of attack success rate with the baseline algorithms, since the baseline algorithms can update the adversarial examples iteratively leveraging the test data information. However, the Attack-UNet-GAN algorithm utilizes the well-trained generative network to yield the adversarial example in real-time, which is suitable for the adversarial attack of the SAR-ATR systems requiring instant responses. Therefore, we can study the improvement of the attack algorithm’s generalization capability to make the algorithm has a higher attack success rate on the different test SAR images in the future. Moreover, the proposed algorithm can be improved further to provide help for jamming remote sensing monitoring system and deflecting important information acquisition from remote sensing images.

We also evaluate the attack performance on the measured SAR images dataset, OpenSARShip. We build the dataset by the SAR images of the ship targets, such as Cargo, Fishing, Tanker, Tug and Other-type. The numbers of the SAR images of Cargo, Fishing, Tanker, Tug and Other-type targets are 8130, 126, 1618, 172, 942, respectively. We use the half of each target’s SAR images to construct the training dataset and the other half to construct the test dataset. The used SAR-ATR model is ResNet32. The average target classification accuracy is 78.48%. Then we use the baseline attack algorithms and Attack-UNet-GAN to attack the SAR-ATR model. We find that the generated adversarial examples are obviously different from the original SAR image of the target. Moreover, the attack success rates of these attack algorithm are very low. These attack algorithms do not perform well on the OpenSARShip dataset, probably because the resolutions of these SAR images are too low and the detailed information of the targets is not obvious. The adversarial attack algorithm can not make the SAR-ATR model misclassify by only modifying the original SAR image a little. That is, these attack algorithms are more suitable for attacking the SAR-ATR models of the high-resolution SAR images.

6. Conclusions

In this paper, an adversarial attack method based on UNet and GAN for deep learning SAR-ATR models is proposed. For our Attack-UNet-GAN algorithm, once well trained, the generator can produce adversarial examples efficiently through the network mapping for the test SAR images, replacing the time-consuming iterative re-optimization. By introducing the discriminator, the generated adversarial examples possess the characteristics of SAR images and are more deceptive, with sharper target edges and more explicit weak scattering centers. Utilizing the measured SAR image dataset, we demonstrate the strong attack performance of our algorithm in attack success rate, computation efficiency based on different deep learning recognition models. There are some potential future works to be explored. In practical applications, the relevant information of the SAR-ATR model is usually unknown, so it is more practical to propose a black-box adversarial attack algorithm. We consider using the learning ability of the distillation network to construct such a black-box adversarial attack model. Moreover, the transferability of the generated adversarial examples for SAR images needs to be deeply explored. It is expected to propose an attack algorithm to generate the adversarial examples with strong transferability to attack more types of SAR-ATR models successfully.

Author Contributions

Conceptualization, C.D. and L.Z.; methodology, software, validation, writing—original draft preparation, writing—review and editing, visualization, data curation, C.D.; supervision, L.Z.; funding acquisition, C.D. and L.Z. All authors have read and agreed to the published version of the manuscript.

Funding

This work was supported in part by the National Nature Sciences Foundation of China under Grant 61771372, in part by the Shenzhen Science and Technology Program under Grant KQTD20190929172704911, and in part by the Open Fund of Science and Technology on Electromagnetic Scattering Key Laboratory under Grant 61424090112.

Institutional Review Board Statement

The study does not involve humans or animals.

Informed Consent Statement

The study does not involve humans.

Data Availability Statement

The experiment in this paper uses a public data set, so no data is reported in this work.

Acknowledgments

In this section you can acknowledge any support given which is not covered by the author contribution or funding sections. This may include administrative and technical support, or donations in kind (e.g., materials used for experiments).

Conflicts of Interest

The authors declare that they have no conflict of interest to report regarding the present study.

References

Zhang, F.; Yao, X.; Tang, H.; Yin, Q.; Hu, Y.; Lei, B. Multiple mode SAR raw data simulation and parallel acceleration for Gaofen-3 mission. IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens. 2008, 11, 2115–2126. [Google Scholar] [CrossRef]
Brown, W.M. Synthetic aperture radar. IEEE Trans. Aerosp. Electron. Syst. 1967, 3, 217–229. [Google Scholar] [CrossRef]
Moreira, A.; Prats-Iraola, P.; Younis, M.; Krieger, G.; Hajnsek, I.; Papathanassiou, K.P. A tutorial on synthetic aperture radar. IEEE Geosci. Remote Sens. Mag. 2013, 1, 6–43. [Google Scholar] [CrossRef] [Green Version]
Chiang, H.; Moses, R.L.; Potter, L.C. Model-based classification of radar images. IEEE Trans. Inf. Theory 2000, 46, 1842–1854. [Google Scholar] [CrossRef]
Sun, Y.; Liu, Z.; Todorovic, S.; Li, J. Adaptive boosting for SAR automatic target recognition. IEEE Trans. Aerosp. Electron. Syst. 2007, 43, 112–125. [Google Scholar] [CrossRef]
Srinivas, U.; Monga, V.; Raj, R.G. SAR automatic target recognition using discriminative graphical models. IEEE Trans. Aerosp. Electron. Syst. 2014, 50, 591–606. [Google Scholar] [CrossRef]
Rodger, M.; Guida, R. Classification-Aided SAR and AIS Data Fusion for Space-Based Maritime Surveillance. Remote Sens. 2021, 13, 104. [Google Scholar] [CrossRef]
Zhang, Z.; Wang, H.; Xu, F.; Jin, Y.Q. Complex-valued convolutional neural network and its application in polarimetric SAR image classification. IEEE Trans. Geosci. Remote Sens. 2017, 55, 7177–7188. [Google Scholar] [CrossRef]
Chen, S.; Wang, H.; Xu, F.; Jin, Y.Q. Target classification using the deep convolutional networks for SAR images. IEEE Trans. Geosci. Remote Sens. 2016, 54, 4806–4817. [Google Scholar] [CrossRef]
Ding, J.; Chen, B.; Liu, H.; Huang, M. Convolutional neural network with data augmentation for SAR target recognition. IEEE Geosci. Remote Sens. Lett. 2016, 13, 364–368. [Google Scholar] [CrossRef]
Du, C.; Chen, B.; Xu, B.; Guo, D.; Liu, H. Factorized discriminative conditional variational auto-encoder for radar HRRP target recognition. Signal Process. 2019, 158, 176–189. [Google Scholar] [CrossRef]
Vint, D.; Anderson, M.; Yang, Y.; Ilioudis, C.; Di Caterina, G.; Clemente, C. Automatic Target Recognition for Low Resolution Foliage Penetrating SAR Images Using CNNs and GANs. Remote Sens. 2021, 13, 596. [Google Scholar] [CrossRef]
Szegedy, C.; Zaremba, W.; Sutskever, I.; Bruna, J.; Erhan, D.; Goodfellow, I.; Fergus, R. Intriguing properties of neural networks. In Proceedings of the International Conference on Learning Representations, Banff, AB, Canada, 14–16 April 2014. [Google Scholar]
Huang, T.; Chen, Y.; Yao, B.; Yang, B.; Wang, X.; Li, Y. Adversarial attacks on deep-learning-based radar range profile target recognition. Inf. Sci. 2020, 531, 159–176. [Google Scholar] [CrossRef]
Huang, T.; Hang, Q.; Liu, Z.J.; Hou, R.; Wang, X.; Li, Y. Adversarial attacks on deep-learning-based SAR image target recognition. J. Netw. Comput. Appl. 2020, 162, 102632. [Google Scholar] [CrossRef]
Du, C.; Huo, C.; Zhang, L.; Chen, B.; Yuan, Y. Fast C&W: A Fast Adversarial Attack Algorithm to Fool SAR Target Recognition with Deep Convolutional Neural Networks. IEEE Geosci. Remote Sens. Lett. 2021. [Google Scholar] [CrossRef]
Li, C.; Xu, Z.; Li, Q.; Peng, J.; Wang, S.; Li, H. An Empirical Study of Adversarial Examples on Remote Sensing Image Scene Classification. IEEE Trans. Geosci. Remote Sens. 2021. [Google Scholar] [CrossRef]
Xu, Y.; Du, B.; Zhang, L. Assessing the Threat of Adversarial Examples on Deep Neural Networks for Remote Sensing Scene Classification: Attacks and Defenses. Trans. Geosci. Remote Sens. 2021, 59, 1604–1617. [Google Scholar] [CrossRef]
Li, H.; Huang, H.; Chen, L.; Peng, J.; Wu, G. Adversarial Examples for CNN-Based SAR Image Classification: An Experience Study. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 1333–1347. [Google Scholar] [CrossRef]
Sadeghi, M.; Larsson, E.G. Adversarial attacks on deep-learning based radio signal classification. IEEE Wireless Commun. Lett. 2018, 8, 213–216. [Google Scholar] [CrossRef] [Green Version]
Sadeghi, M.; Larsson, E.G. Physical adversarial attacks against end-to-end autoencoder communication systems. IEEE Commun. Lett. 2019, 23, 847–850. [Google Scholar] [CrossRef] [Green Version]
Goodfellow, I.; Shlens, J.; Szegedy, C. Explaining and harnessing adversarial examples. In Proceedings of the International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
Kurakin, A.; Goodfellow, I.; Bengio, S. Adversarial examples in the physical world. In Proceedings of the International Conference on Learning Representation, Toulon, France, 24–26 April 2017. [Google Scholar]
Madry, A.; Makelov, A.; Schmidt, L.; Tsipras, D.; Vladu, A. Towards deep learning models resistant to adversarial attacks. In Proceedings of the International Conference on Learning Representation, Vancouver, CB, Canada, 30 April–3 May 2018. [Google Scholar]
Moosavi-Dezfooli, S.M.; Fawzi, A.; Frossard, P. DeepFool: A simple and accurate method to fool deep neural networks. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016; pp. 2574–2582. [Google Scholar]
Papernot, N.; McDaniel, P.; Wu, X.; Jha, S.; Swami, A. Distillation as a defense to adversarial perturbations against deep neural networks. In Proceedings of the IEEE Symposium on Security and Privacy, San Jose, CA, USA, 22–26 May 2016; pp. 582–597. [Google Scholar]
Carlini, N.; Wagner, D. Towards evaluating the robustness of neural networks. In Proceedings of the IEEE Symposium on Security and Privacy, San Jose, CA, USA, 22–26 May 2017; pp. 39–57. [Google Scholar]
Gerry, M.J.; Potter, L.C.; Gupta, I.J.; Van Der Merwe, A. A parametric model for synthetic aperture radar measurements. IEEE Trans. Antennas Propag. 1999, 47, 1179–1188. [Google Scholar] [CrossRef]
Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In Proceedings of the 18th International Conference on Medical Image Computing and Computer-Assisted Intervention, Munich, Germany, 5–9 October 2015; pp. 234–241. [Google Scholar]
Goodfellow, I.J.; Pouget-Abadie, J.; Mirza, M.; Xu, B.; Warde-Farley, D.; Ozair, S.; Courville, A.; Bengio, Y. Generative adversarial nets. In Proceedings of the Neural Information Processing Systems Conference, Montréal, QC, Canada, 8–13 December 2014; pp. 2672–2680. [Google Scholar]
Keydel, E.R.; Lee, S.W.; Moore, J.T. MSTAR extended operating conditions: A tutorial. In Algorithms for Synthetic Aperture Radar Imagery III; International Society for Optics and Photonics: Bellingham, WA, USA, 1996; Volume 2757, pp. 228–242. [Google Scholar]
Ross, T.D.; Worrell, S.W.; Velten, V.J.; Mossing, J.C.; Bryant, M.L. Standard SAR ATR evaluation experiments using the MSTAR public release data set. In Algorithms for Synthetic Aperture Radar Imagery V; International Society for Optics and Photonics: Bellingham, WA, USA, 1998; Volume 3370, pp. 566–573. [Google Scholar]
Wang, C.; Pei, J.; Wang, Z.; Huang, Y.; Wu, J.; Yang, H.; Yang, J. When Deep Learning Meets Multi-Task Learning in SAR ATR: Simultaneous Target Recognition and Segmentation. Remote Sens. 2020, 12, 3863. [Google Scholar] [CrossRef]
Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. In Proceedings of the Neural Information Processing Systems Conference, Lake Tahoe, NV, USA, 3–8 December 2012; pp. 1097–1105. [Google Scholar]
Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. In Proceedings of the International Conference on Learning Representations, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 26 June–1 July 2016; pp. 770–778. [Google Scholar]
Du, C.; Xie, P.; Zhang, L.; Ma, Y.; Tian, L. Conditional Prior Probabilistic Generative Model With Similarity Measurement for ISAR Imaging. IEEE Geosci. Remote Sens. Lett. 2021. [Google Scholar] [CrossRef]
Kingma, D.P.; Ba, J.L. Adam: A method for stochastic optimization. In Proceedings of the Second International Conference on Learning Representations, Banff, AB, Canada, 14–16 April 2014. [Google Scholar]
Van Der Maaten, L.; Hinton, G. Visualizing High-Dimensional Data Using t-SNE. J. Mach. Learn. Res. 2008, 9, 2579–2605. [Google Scholar]
Chen, L.; Li, H.; Zhu, G.; Li, Q.; Zhu, J.; Huang, H.; Peng, J.; Zhao, L. Attack Selectivity of Adversarial Examples in Remote Sensing Image Scene Classification. IEEE Access 2020, 8, 137477–137489. [Google Scholar] [CrossRef]

Figure 1. The whole framework of the SAR-ATR and the adversarial attack for SAR-ATR. (a) SAR-ATR, (b) adversarial attack for SAR-ATR.

Figure 2. The comparison of the algorithms’ frameworks: (a) C&W and (b) Attack-UNet-GAN.

Figure 3. The network architecture of the discriminator D in detail.

Figure 4. The network architecture of the generator G (UNet) in detail.

Figure 5. SAR images of all the ground targets in the MSTAR dataset and their corresponding optical images. (a) 2S1, (b) BMP2, (c) BRDM2, (d) BTR60, (e) BTR70, (f) D7, (g) T62, (h) T72, (i) ZIL131, (j) ZSU234.

Figure 6. The influence of the constant

λ

on the attack performance. We plot the attack success rate and MSE loss between the adversarial example and original SAR images as a function of

λ

.

Figure 6. The influence of the constant

λ

on the attack performance. We plot the attack success rate and MSE loss between the adversarial example and original SAR images as a function of

λ

.

Figure 7. (a) Original SAR images of the targets. The first row shows the adversarial examples produced by different attack algorithms in the targeted attack mode; the second row shows the corresponding adversarial perturbations. The predicted categories and confidences of the adversarial examples are listed above them. The corresponding attack algorithms: (b) FGSM, (c) BIM, (d) PGD, (e) DeepFool, (f) C&W, (g) Attack-UNet and (h) Attack-UNet-GAN.

Figure 8. (a) Original SAR images of the targets. The first row shows the adversarial examples produced by different attack algorithms in the non-targeted attack mode; the second row shows the corresponding adversarial perturbations. The predicted categories and confidences of the adversarial examples are listed above them. The corresponding attack algorithms: (b) FGSM, (c) BIM, (d) PGD, (e) DeepFool, (f) C&W, (g) Attack-UNet and (h) Attack-UNet-GAN.

Figure 9. The display of the feature maps learned by the generator (UNet) in different layers. The first row displays the feature maps learned by the encoder of UNet; the second row displays the feature maps learned by the decoder of UNet. The corresponding layer indexes of the feature maps in Figure 3 are displayed below each feature map.

Figure 10. The separability of the original SAR images and features extracted by the generator (UNet) of Attack-UNet-GAN. The SAR images and the features are mapped to the two-dimensional subspace by T-SNE shown as the dots on the plane. Each dot represents a SAR image or the feature of a SAR image, and each color denotes a category. (a) the original SAR images, (b) the features extracted by the UNet.

Figure 11. The misclassified category distributions of the adversarial examples (ground truth category: D7) generated by different attack algorithms for the same ResNet32-based SAR-ATR model. The misclassified category distributions show the percentages of the adversarial examples mislabelled as each of others target categories to all the adversarial examples of D7. (a) FGSM, (b) BIM, (c) PGD, (d) C&W, (e) DeepFool, (f) Attack-UNet-GAN.

Figure 12. The misclassified category distributions of the adversarial examples (ground truth category: D7) generated by the same attack algorithms for different SAR-ATR models. The misclassified category distributions show the percentages of the adversarial examples mislabelled as each of others target categories to all the adversarial examples of D7. The first row shows the distribution of FGSM; the second shows the distribution of Attack-UNet-GAN. (a) AlexNet, (b) VGGNet16, (c) ResNet32.

Table 1. The Numbers and target-depression angles of the training and test SAR images for SAR-ATR before the data augmentation.

	Training		Testing
Class	Depression Angles	Number	Depression Angles	Number
2S1	$15^{\circ}$	196	$17^{\circ}$	233
BRDM2	$15^{\circ}$	196	$17^{\circ}$	233
BTR60	$15^{\circ}$	196	$17^{\circ}$	232
D7	$15^{\circ}$	195	$17^{\circ}$	256
T72	$15^{\circ}$	274	$17^{\circ}$	299
BMP2	$15^{\circ}$	274	$17^{\circ}$	298
BTR70	$15^{\circ}$	274	$17^{\circ}$	299
T62	$15^{\circ}$	273	$17^{\circ}$	299
ZIL131	$15^{\circ}$	274	$17^{\circ}$	299
ZSU234	$15^{\circ}$	274	$17^{\circ}$	299

Table 2. The performances of different adversarial attack algorithms for different SAR-ATR models in terms of attack success rate under the condition of targeted attack.

Targeted Attack	AlexNet	VGGNet16	ResNet32
FGSM	95.34	87.19	75.55
BIM	98.08	97.41	97.84
PGD	98.73	98.01	98.57
DeepFool	97.91	97.02	98.26
C&W	98.59	97.84	98.62
Attack-CNN	94.79	93.86	94.38
Attack-UNet	97.85	97.26	98.11
Attack-Unet-GAN	98.47	97.63	98.39

Table 3. The performances of different adversarial attack algorithms for different SAR-ATR models in attack success rate under the condition of non-targeted attack.

Non-Targeted Attack	AlexNet	VGGNet16	ResNet32
FGSM	96.21	88.34	77.62
BIM	98.56	97.53	98.02
PGD	98.91	98.15	98.52
DeepFool	98.31	97.25	98.17
C&W	98.77	97.92	98.69
Attack-CNN	95.03	94.45	94.91
Attack-UNet	97.93	97.14	98.02
Attack-UNet-GAN	98.59	97.74	98.57

Table 4. Time complexity of different algorithms for generating an adversarial example for a

128 \times 128

pixels SAR image.

Table 4. Time complexity of different algorithms for generating an adversarial example for a

128 \times 128

pixels SAR image.

	FGSM	BIM	PGD	C&W	DeepFool	Attack-CNN	Attack-UNet	Attack-UNet-GAN
Time (s)	0.0091	0.6152	0.4985	0.8537	0.2863	0.0025	0.0039	0.0039

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Du, C.; Zhang, L. Adversarial Attack for SAR Target Recognition Based on UNet-Generative Adversarial Network. Remote Sens. 2021, 13, 4358. https://doi.org/10.3390/rs13214358

AMA Style

Du C, Zhang L. Adversarial Attack for SAR Target Recognition Based on UNet-Generative Adversarial Network. Remote Sensing. 2021; 13(21):4358. https://doi.org/10.3390/rs13214358

Chicago/Turabian Style

Du, Chuan, and Lei Zhang. 2021. "Adversarial Attack for SAR Target Recognition Based on UNet-Generative Adversarial Network" Remote Sensing 13, no. 21: 4358. https://doi.org/10.3390/rs13214358

APA Style

Du, C., & Zhang, L. (2021). Adversarial Attack for SAR Target Recognition Based on UNet-Generative Adversarial Network. Remote Sensing, 13(21), 4358. https://doi.org/10.3390/rs13214358

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Adversarial Attack for SAR Target Recognition Based on UNet-Generative Adversarial Network

Abstract

1. Introduction

2. Preliminaries

Adversarial Attack for SAR-ATR

3. Method

3.1. Attack-UNet-GAN

3.1.1. The Design of Loss Function

3.1.2. The Introduction of UNet and GAN

3.1.3. The Detailed Network Architecture

4. Experiment

4.1. Dataset and Experimental Setup

4.1.1. Dataset

4.1.2. Baselines and Experimental Setup

4.2. Evaluation Measurements

4.3. Attack Performance Comparison

4.4. Comparation of the Generation Speed

4.5. Influence of the Constant $λ$

4.6. Visualization of the Adversarial Examples

4.7. Display of the Learned Features in UNet

4.8. Separability of the Extracted Features

4.9. Misclassified Category Distributions of the Adversarial Attack

5. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Adversarial Attack for SAR Target Recognition Based on UNet-Generative Adversarial Network

Abstract

1. Introduction

2. Preliminaries

Adversarial Attack for SAR-ATR

3. Method

3.1. Attack-UNet-GAN

3.1.1. The Design of Loss Function

3.1.2. The Introduction of UNet and GAN

3.1.3. The Detailed Network Architecture

4. Experiment

4.1. Dataset and Experimental Setup

4.1.1. Dataset

4.1.2. Baselines and Experimental Setup

4.2. Evaluation Measurements

4.3. Attack Performance Comparison

4.4. Comparation of the Generation Speed

4.5. Influence of the Constant λ

4.6. Visualization of the Adversarial Examples

4.7. Display of the Learned Features in UNet

4.8. Separability of the Extracted Features

4.9. Misclassified Category Distributions of the Adversarial Attack

5. Discussion

6. Conclusions

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4.5. Influence of the Constant $λ$