A Method of Infrared Small Target Detection in Strong Wind Wave Backlight Conditions

Ma, Dongdong; Dong, Lili; Xu, Wenhai

doi:10.3390/rs13204189

Open AccessArticle

A Method of Infrared Small Target Detection in Strong Wind Wave Backlight Conditions

by

Dongdong Ma

,

Lili Dong

^* and

Wenhai Xu

School of Information Science and Technology, Dalian Maritime University, Dalian 116026, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2021, 13(20), 4189; https://doi.org/10.3390/rs13204189

Submission received: 6 August 2021 / Revised: 10 October 2021 / Accepted: 11 October 2021 / Published: 19 October 2021

(This article belongs to the Topic High-Resolution Earth Observation Systems, Technologies, and Applications)

Download

Browse Figures

Review Reports Versions Notes

Abstract

:

How to accurately detect small targets from the complex maritime environment has been a bottleneck problem. The strong wind-wave backlight conditions (SWWBC) is the most common situation in the process of distress target detection. In order to solve this problem, the main contribution of this paper is to propose a small target detection method suitable for SWWBC. First of all, for the purpose of suppressing the gray value of the background, it is analyzed that some minimum points with the lowest gray value tend to gather in the interior of the small target. As the distance from the extreme point increases, the gray value of the pixel in all directions also increases by the same extent. Therefore, an inverse Gaussian difference (IGD) preprocessing method similar to the distribution of the target pixel value is proposed to suppress the uniform sea wave and intensity of the sky background. So as to achieve the purpose of background suppression. Secondly, according to the feature that the small target tends to “ellipse shape” in both horizontal and vertical directions, a multi-scale and multi-directional Gabor filter is applied to filter out interference without “ellipse shape”. Combined with the inter-scale difference (IsD) operation and iterative normalization operator to process the results of the same direction under different scales, it can further suppress the noise interference, highlight the significance of the target, and fuse the processing results to enrich the target information. Then, according to different texture feature distributions of the target and noise in the multi-scale feature fusion results, a cross-correlation (CC) algorithm is proposed to eliminate noise. Finally, according to the dispersion of the number of extreme points and the significance of the intensity of the small target compared with the sea wave and sky noise, a new peak significance remeasurement method is proposed to highlight the intensity of the target and combined with a binary method to achieve accurate target segmentation. In order to better evaluate the performance index of the proposed method, compared with current state-of-art maritime target detection technologies. The experimental results of multiple image sequence sets confirm that the proposed method has higher accuracy, lower false alarm rate, lower complexity, and higher stability.

Keywords:

maritime strong wind-wave backlight condition; infrared maritime target detection; multi-scale feature extraction; cross-correlation theory; peak significance remeasurement

Graphical Abstract

1. Introduction

Nowadays, the rapid development and high reliability of infrared search and tracking systems has become an increasingly urgent need in the field of maritime search and rescue [1,2,3]. How to accurately detect small targets in complex sea conditions has been the essential issue of maritime search and rescue. The main work of this paper is to solve the problem of small target detection under SWWBC. At present, detection methods for small targets in the complex maritime environment are emerging one after another.

In the research based on contrast and similarity characteristics, C. L. Philip [4] based on the robustness of HVS, proposed a multi-scale local contrast measurement method based on the derived kernel model (DK Model) to achieve infrared target detection. Li [5] proposed a local adaptive contrast detection method (LACM-LSK) based on local steering kernel reconstruction. Due to the improved local contrast method requirement that the target has a big difference in the local area compared with the clutter interference, the application scenario has greater limitations.

In the research based on texture directional features, aghaziyarati [6] proposed a method based on the cumulative directional derivative weighting coefficient to overcome the shortcomings of the average absolute gray difference (AAGD) algorithm. Moradi [7] constructed a new directional small target detection algorithm, called absolute directional mean difference (ADMD), using a concept similar to the average absolute gray difference. Wei [8] et al. decomposed the multi-scale image into horizontal direction by a wavelet transform method and define a “mutual wavelet energy composition” method (MWEC) to detect small infrared targets in the sea sky environment. Due to the weak ability of the above methods to suppress strong sea wave clutter, the false alarm rate will be higher when applied in SWWBC.

In the research based on statistical characteristics, Zhu [9] considered the inherent spatial correlation between image pixels to indicate that the background is continuous and highly correlated. On the contrary, the target is regarded as destroying the local correlation. Therefore, segmenting the target from the background can be regarded as the restoration of the low-rank matrix. Zhang [10] adopted a new non-convex low-rank constraint based on the infrared patch tensor (IPT) model, that is, the partial sum of tensor nuclear norm (PSTNN) joint weighted l1 norm to effectively suppress background. Due to the large amount of calculation in the above methods, real-time and practicality cannot be guaranteed.

In the research based on spatiotemporal characteristics, Zhao [11] proposed a small infrared moving target detection algorithm based on the spatiotemporal consistency of motion trajectory. Chiman [12] combined the optical flow method with contrast enhancement, connected component analysis, target association, and other methods to effectively perform target detection. The above method needs to calculate the corresponding relationship of feature points between different images, the detection platform must have the ability of antivibration, otherwise, the robustness to the complex noise environment is extremely poor.

In the research based on deep learning, Li [13] applied the deep learning maritime target detection model provided by Google, combined with the super-pixel segmentation algorithm to optimize the Grabcut algorithm, and discovered the deep learning marine target detection and segmentation, which can accurately extract the target contour and semantic information. Ryu [14] proposed a new far-infrared small target detection method based on deep learning and a heterogeneous data fusion method to solve the problem of the lack of semantic information due to the small target size. If the limited training samples contain not enough information, the target will not be well recognized in the deep learning method.

Recently, it has been noticed that the ability of the human visual attention system to detect objects from complex scenes of optical images is faster and more reliable [15,16]. Many excellent computational visual attention models have been proposed to simulate the structure of the human visual system.

Itti [17,18] proposed a visual attention system based on the behavior and neuron structure of the early primate visual system. First, Gaussian filter and Gabor filter and linear “center-surround” difference operation are applied to extract early visual features, then multi-scale image features under the same feature are linearly superimposed, and then the images under different features are linearly fused to form a single saliency map. Finally, Koch [19] proposed to filter the target location according to the dynamic neural network (WTA and IOR) in order from strong to weak. Dong [20] proposed a method based on the visual attention and pipeline-filtering model (VAPFM), the overall method adopts single-frame suspected target detection based on the improved visual attention model and multi-frame real target judgment based on anti-jitter (VAPFM). Wang [21] proposed a robust anti-jitter spatiotemporal saliency generation with parallel binarization (ASSGPB) method. Using the spatial saliency and time consistency of the target, the real target is separated from the cluttered area. The above-mentioned target detection method is effective in some simple scenes, if the task is to detect targets in complex scenes, the above method may lose its effect such as in SWWBC. In view of the above problems, the proposed method should have the following properties:

(1) Lower false alarm rate; (2) Lower time-consumption; (3) Higher detection rate; (4) Higher stability.

In order to realize the above four attributes, this paper proposes a small target detection method in SWWBC. The final experimental results show that the proposed method is superior to the traditional and the latest target detection methods in detection performance. The rest of this paper is organized as follows. In Section 2, a small target detection method in SWWBC is introduced. Section 3 introduces the details of the experiment and analyzes the results. Section 4 summarizes the conclusion.

2. Materials and Methods

Figure 1 is the flow chart of our proposed structure for target detection, which is mainly divided into five steps, namely background suppression, feature extraction, noise elimination, target enhancement, and target segmentation. In the first step, the acquired infrared image is processed by IDG filter to generate the preprocessed image, the purpose is to suppress background intensity. In the second step, the preprocessed image is divided into two signal streams, which are processed by horizontal and vertical Gabor filters [22,23] in multi-scale, respectively, to generate multi-scale feature images. The multi-scale feature map of each direction is generated after the IsD operation and iterative normalization operator. The multi-scale feature map is fused and iterative normalization operation to generate saliency map, so as to eliminate noise and extract texture features. The third step is to generate a result map by CC calculation to filter the noise. In the fourth step, the local maxima in eight directions are obtained from the CC result image, in order to find the subsequent target segmentation points, the peak significance is re-measurement to achieve the target enhancement. Finally, using the binary segmentation method to determine the true target.

2.1. Background Suppression

Reducing the background gray value as much as possible is a key purpose of background suppression methods. In order to achieve the best background suppression effect, the analysis of the characteristics of the target and background is an essential process. Therefore, our paper first analyzes the weak small target patch and background patch in the typical infrared image, and the analysis results are shown in Figure 2. The background patch of Figure 2a is selected at the sea wave with the typical gray distribution. In Figure 2b, it is selected at the thick clouds and sea waves. It is found from the gray value distribution of the weak small target patch in (a) T1 and (b) T1 T2, the interior of the small target tends to gather some gray minimum points which are quite different from the surrounding pixels. With the increase of the distance from the minimum point, the gray values of the pixels in all directions raise by the same extent. However, the gray value distribution of (a) B1 B2 B3 sea wave patch and (b) B1 clouds patch and B2 sea wave patch has no similarity with the target patch.

An IGD preprocessing algorithm similar to the target area gray value distribution is designed, the IGD preprocessing method is shown in Equation (1),

σ_{1}

and

σ_{2}

indicates the high-scale and low-scale filter parameters.

I G F (x, y) = \frac{1}{2 π σ_{1}^{2}} e^{- \frac{x^{2} + y^{2}}{2 σ_{1}^{2}}} - \frac{1}{2 π σ_{2}^{2}} e^{- \frac{x^{2} + y^{2}}{2 σ_{2}^{2}}}

(1)

The kernel function three-dimensional results are shown in Figure 3. From them, it can be found that the three-dimensional distribution of grayscale values in the target area is similar to the three-dimensional result of the kernel function, so the significance of the background can be better suppressed. According to the target space area in the statistical dataset is less than 80 pixels, IGD kernel size is selected as 9 × 9 in this paper. So that the spatial area of the kernel basically matches the spatial area of the target and the best background suppression effect can be obtained.

The final obtained background suppression results are shown in Figure 4. Firstly, it can be seen by comparing the global gray histogram of the original image of typical images A and B with the global gray histogram of the processing image, gray values of the background after processing are decreased and substantially smaller than those of the target.

Secondly, the local contrast is calculated by Equation (2),

g_{f}

represents the average gray level of the foreground, and

g_{b}

represents the average gray level of the background.

C o n t r a s t = \frac{m a x [g_{f}, g_{b}]}{m i n [g_{f}, g_{b}]}

(2)

For ease of observation, the competing areas with the same or higher local contrast as the target area are marked in green. The results show that the background interference area that has the same or higher local contrast as the target area after processing is significantly reduced, the reason for this result is also due to the reduction of the gray value of most backgrounds. Finally, it can also be found from the processed result image that the uniform sea waves and sky background in the original image are eliminated, which is also more beneficial to subsequent target detection tasks.

2.2. Feature Extraction

Accurate feature extraction [24,25] can better retain target information and eliminate interference information. By observing the resulting image after the background suppression, it can be found that the small target is closer to the “ellipse shape” and has strong texture characteristics in both the horizontal and vertical directions, and the Gabor filter just has the ability to extract the “ellipse shape” characteristics. In addition, the Gabor filter also has the function of multi-scale resolution [26,27]. By adjusting the size of the filter template to achieve “fine to coarse” feature extraction, we can ensure that the target region can be accurately extracted features. The mathematical expression of the Gabor filter is a deep representation in Equations (3) and (4).

θ

represent the filtering direction,

γ

represent the aspect ratio,

δ

represent the standard deviation,

λ

represent the wavelength, and

ψ

represent the phase. Figure 5 shows the result of feature extraction of background suppression images A and B.

G (θ, γ, δ, λ, ψ, x, y) = \exp (- \frac{x^{'}^{2} + γ y^{'}^{2}}{2 δ^{2}}) \cos (2 π \frac{x^{'}}{λ} + ψ)

(3)

{\begin{matrix} x^{'} = x \cos (θ) + y \sin (θ) \\ y^{'} = - x \sin (θ) + y \cos (θ) \end{matrix}

(4)

In the results of multi-scale horizontal and vertical texture feature extraction, it can be observed that the brightness of the target between adjacent scales in the same direction gradually decreases, and the brightness of the sea wave and clouds between adjacent scales in the same direction gradually increases or remains unchanged. According to the intensity variation characteristics of targets, waves and clouds, subtract the results of different scales in the same direction, which is called “IsD operation”. In the final “IsD operation” result, the position with a large difference is more likely to be the target, and the position with a small difference or negative difference is more likely to be the noise. Firstly, the negative difference is regarded as real noise and the value of the corresponding position is set to zero. Secondly, the iterative normalization operator is used to increase the intensity difference between the target and the noise and reduce the intensity of the noise to a negative number, so as to distinguish the target and the noise. The iterative normalization operator is shown in Equation (5),

c_{ex}

and

σ_{ex}

is stimulus factor and stimulus variance, respectively, which is used to further enhance the global highly significant region.

c_{inh}

and

σ_{inh}

is the suppression factor and the suppression variance respectively, which is used to further attenuate the global weak significance region. The results of multi-scale horizontal and vertical texture feature image processing are shown in Figure 6.

I N o r m (c_{ex}, σ_{ex}, c_{inh}, σ_{inh}, x, y) = \frac{c_{ex}^{2}}{2 π σ_{ex}^{2}} e^{- (x^{2} + y^{2}) / 2 σ_{ex}^{2}} - \frac{c_{inh}^{2}}{2 π σ_{inh}^{2}} e^{- (x^{2} + y^{2}) / 2 σ_{inh}^{2}}

(5)

In order to extract as much information of the target as possible, the IsD operation results with the same direction are fused, and the final fusion result is shown in Figure 7.

Algorithm 1: Get the result of multi-scale feature fusion.

Input:
Gaussian difference preprocessing results Gimg.
Output:
Multiscale fusion results MFR.

1 : G_{01}

= Gimg ⨂

Gabor(0, 0.5, 2.3333, 7, 0, 9) Equation (3)

2 : G_{02}

= G_{01}

⨂

Gabor(0, 0.5, 2.3333, 15, 0, 9) Equation (3)

3 : G_{03}

= G_{02}

⨂

Gabor(0, 0.5, 2.3333, 21, 0, 9) Equation (3)

4 : G_{901}

= Gimg ⨂

Gabor(90, 0.5, 2.3333, 7, 0, 9) Equation (3)

5 : G_{902}

= G_{901}

⨂

Gabor(90, 0.5, 2.3333, 15, 0, 9) Equation (3)

6 : G_{903}

= G_{902}

⨂

Gabor(90, 0.5, 2.3333, 21, 0, 9) Equation (3)

7 : C S_{01}

= (G_{01} - G_{02}

) ⨂