Feature Enhancement Using Multi-Baseline SAR Interferometry-Correlated Synthesis Images for Power Transmission Tower Detection in Mountain Layover Area

: The detection performance of power transmission towers in mountainous areas using SAR amplitude images is obviously influenced by the strong layover background (mainly including vegetation and soil) clutter interference around the towers. In this paper, power transmission tower detection in a mountainous layover area, using single-baseline SAR interferometry coherence images, which show better feature enhancement effectiveness compared to SAR amplitude images, is presented. Moreover, a novel feature enhancement method, that of generating multi-baseline SAR interferometry-correlated synthesis images for power transmission tower detection in a mountain layover area, is proposed. It demonstrates better feature enhancement (layover background cluster suppression) than that using single-baseline SAR interferometry coherence images. Theoretical analysis illustrates that the mountainous layover background clutter interference can be suppressed in the proposed single-baseline/multi-baseline SAR interferometry-correlated synthesis image. Experiments including over 12 repeat-pass TerraSAR-X staring spotlight mode acquisitions were conducted, and the results demonstrate that the detection performance with the use of multi-baseline SAR interferometry-correlated synthesis images showed an improvement of more than 43.6%, compared with the traditional method of using SAR amplitude images when benchmark deep learning-based detectors are used, i.e., Faster RCNN and YOLOv7.


Introduction
In China, many power transmission towers are located on mountainous soil or vegetated terrain, as shown in Figure 1. In summer, some natural disasters, such as landslides and debris flow, caused by heavy storms, occur often. For a long time, these disasters resulted in the inclination, or even collapse, of power transmission towers. Spaceborne synthetic aperture radar (SAR) is a widely used technique in remote sensing and nondestructive testing applications [1][2][3][4][5]. SAR is also a potential technique for monitoring the safety status of power transmission towers. For the safety status monitoring of power transmission towers, the inclination degree of a power transmission tower during one period could be calculated using improved differential interferometry SAR (D-InSAR) [6].
For the improved D-InSAR technique, the first step is to detect the locations of all power transmission towers in an entire SAR image. Then, the inclination degrees of every power transmission tower can be obtained using the improved D-InSAR technique. Tower detection is essentially a problem of geometric structure detection of the tower within the detection box. The amplitude of the SAR image was always selected as the feature for power transmission tower detection [7,8], which presented good detection performance in areas without much layover influence, such as relatively flat land areas. Nevertheless, Academic Editor: Dusan Gleich in steep mountain areas, there is significant layover, resulting in strong scattering values in the SAR amplitude image. Such layover, presented as strong background cluster interference around the power transmission towers, will influence the entire geometric structure detection of towers in the detection box, thus influencing the detection performance of the power transmission towers located in steep mountain areas. For the two repeat-pass spaceborne SAR acquisitions, the feature of SAR interferometry coherence has already shown better effectiveness for land cover classification than the feature of SAR amplitude [9]. For InSAR applications, the decorrelation process of the two repeat-pass spaceborne SAR acquisitions mainly includes spatial baseline decorrelation and temporal decorrelation. In general, the spatial baseline decorrelation is relatively slight if the spatial baseline of the two repeat-pass spaceborne SAR acquisitions is not very long. Most mountainous areas where power transmission towers are located include vegetation or soil. Consequently, the SAR interferometry coherence value around the tower is very low, due to the statistical temporal decorrelation from the vegetation and soil. In contrary, for a relatively short spatial baseline, the SAR interferometry coherence value of the power transmission tower itself is still high, because power transmission towers are made of metal, and thus can remain stable/persistent along the temporal dimension. Based on this, the feature enhancement method, using single-baseline SAR interferometry coherence images for power transmission tower detection in mountainous layover areas, is presented in this paper. Moreover, a novel feature enhancement method, using multibaseline SAR interferometry-correlated synthesis images for power transmission tower detection in mountainous layover areas, which demonstrates better feature enhancement than that using single-baseline SAR interferometry coherence images, is proposed.
Specifically, the geometric illustration of power transmission towers located on mountainous layover areas is analyzed, and theoretical analysis is further conducted in order to illustrate that the mountainous layover background (i.e., vegetation or soil) clutter interference can be effectively suppressed in the proposed single-baseline coherence image/multi-baseline SAR interferometry-correlated synthesis image. In recent years, deep learning methods have been widely used in object detection applications. For object detection applications, the current deep learning methods also include two-stage methods (such as Region-based CNN (R-CNN), Fast R-CNN, and Faster R-CNN) and one-stage methods (such as you only look once (YOLO) series and single-shot multibox detector (SSD)) [10][11][12][13][14][15][16][17][18][19][20]. Faster R-CNN and YOLOv7 are newer technologies among two-stage and one-stage object detection methods, and they show better comprehensive performance than previous versions. Consequently, in this paper, two benchmark object detectors, based on Faster R-CNN [12] and YOLOv7 [14], were used to demonstrate the detection performance of the proposed method.
The remainder of this paper is organized as follows. The principle of the proposed method will be elaborated upon in Section 2. The experimental results will be analyzed in Section 3 in detail. Section 4 presents the conclusion.

Power Transmission Tower Detection of Traditional Method Using SAR Amplitude Image
For one spaceborne SAR acquisition, let s denote the complex signal value of the spaceborne SAR acquisition. Figure 2a illustrates a geometric diagram of a power transmission tower located on a mountainous layover area. The layover background (LB) around the power transmission tower mainly includes vegetation and soil. Observed from Figure 2a, denotes the scattering points at different locations along the same equal distance to the radar (i.e., layover line) of the mountainous layover background around the power transmission tower. In traditional object detection applications based on SAR acquisitions, the SAR amplitude was always used as the feature to be detected.
Generally, LB s (where " " denotes the amplitude value), will be relatively large and will seriously influence the entire geometric structure detection of the tower in the detection box of the SAR amplitude image. We can define s C as where ( ) E denotes the expectation value that can be estimated using one small spatial region around the located pixels. s C can be quantitatively evaluated for the interference degree of the mountainous layover background in power transmission tower detection using a SAR amplitude image. The higher the value of s C , the stronger the interference degree of the mountainous layover background in power transmission tower detection.
The specific values of s C can be estimated statistically from the real SAR images. As analyzed statistically in Section 3.2, the typical value of s C is generally focused on the region of 0.5~2. Figure 2b is the workflow of the traditional power transmission tower detection method using a SAR amplitude image.   in this section), respectively. The noise variables are modeled as independent, and identically zero-mean complex Gaussian random distributed, with the following relationships: where ''*'' indicates complex conjugation. The signal-to-noise ratio (SNR) of the tower is defined as follows: While the scattering values of vegetation and much of the soil surface change-with different soil moisture concentrations in different weather conditions, or due to the other physical motions caused by a landslide or other event-in mountainous layover background areas around the power transmission tower, especially the phase, they will change dramatically along the temporal dimension. The layover background cluster variable can be modeled as follows: where the amplitude a is Rayleigh distributed, and  is uniformly distributed over (−π, π) [21].
_ LB x s is independently zero-mean complex Gaussian distributed, with the following relationship: For two repeat-pass spaceborne SAR acquisitions, the single-baseline interferometry coherence  is defined as follows [22,23]: where  represents the statistical stability degree along the spatial and time dimensions for the two SAR acquisitions.
Substituting (7) and (8) We can define C  as follows: From (2) and (13), we can theoretically conclude: In practical images, LB  is estimated using one small spatial region around the located pixels. Consequently, ( ) LB E  and C  are approximately, but not strictly, equal to 0.
C  can be quantitatively evaluated for the interference degree of the mountainous layover background in power transmission tower detection using single-baseline SAR interferometry coherence images. Similarly to s C , elaborated upon in Section 2.1, and as analyzed statistically in Section 3.2, from the real SAR images, the typical values of C  are generally focused in the region of 0.15~0.25.
From (14) and the above statistical analysis, compared with the use of SAR amplitude images, we can see the mean contrast between the mountainous layover background cluster around a power transmission tower and the power transmission tower itself would be obviously suppressed by using a single-baseline SAR interferometry coherence image for power transmission tower detection. Detecting the power transmission towers located at the steep mountain areas using single-baseline SAR interferometry coherence images demonstrates better detecting performance than that of detection using SAR amplitude images.

Feature Enhancement Using Multi-Baseline SAR Interferometry-Correlated Synthesis Image for Power Transmission Tower Detection in Mountainous Layover Area
For multiple repeat-pass spaceborne SAR acquisitions, the multi-baseline interferom- For the power transmission towers, (15) can be rewritten as follows: In order to obtain the highest multi-baseline interferometry-correlated function value of the power transmission tower, we need to optimize the function ( ) 13   . As previously mentioned,  and F  are estimated using a small spatial region around the located pixels, such as a 5 × 5-pixel window. Due to the phase stability of the power transmission tower along the temporal dimension, the SAR interferometry complex values (i.e., , can be estimated similarly). In practical SAR images, considering the noise influence in the k × k-pixel window, the pixel which has the highest single-baseline interferometry coherence value (i.e.,  ) is used to estimate As previously mentioned, tower SNR is very high, i.e., where " " denotes "much smaller than". Substituting (21) Different from power transmission towers, due to the phase randomness of the layover background, along the temporal dimension, the SAR interferometry complex values (i.e.,  (18) and (19) Substituting (24) and (25) The mean contrast of F  between the mountainous layover background cluster around a power transmission tower and the power transmission tower itself in a multi-baseline interferometry-correlated synthesis image is defined as follows: The specific values of ˆF C  can be estimated statistically from the real SAR images. As analyzed statistically in Section 3.2 and Figure 4, shown later, the typical values of ˆF C  are generally focused in the range of 0.05~0.08.
From (27) and the above statistical analysis, compared with single-baseline SAR interferometry coherence images, we can see that the mean contrast between mountainous layover background clustering around power transmission towers and the power transmission tower itself is further suppressed by using multi-baseline SAR interferometrycorrelated synthesis images for power transmission tower detection. Thus, multi-baseline SAR interferometry-correlated synthesis images can be used to better distinguish the power transmission tower from the layover background. Namely, detecting the power transmission towers located in the steep mountainous areas using multi-baseline SAR interferometry-correlated synthesis images demonstrates better detection performance than that from using single-baseline SAR interferometry coherence images. As a result, the workflow of the feature enhancement method and the corresponding qualitative effectiveness of using multi-baseline SAR interferometry-correlated synthesis images for power transmission tower detection in mountainous layover areas are shown in Figure 3b and Figure 3c, respectively.
The specific steps to this method are summarized as follows: Step 1: Compute the single-baseline SAR interferometry coherence value (i.e.,  ) using (9); Step 2: In the single-baseline SAR interferometry coherence image, for every k × k-pixel window, calculate the pixel line and column number which has the highest single-baseline interferometry coherence value using (18); Step 3: For every k × k-pixel window, estimate the corresponding ( Step 4: Substitute the estimated ( , i.e., multi-baseline SAR interferometry-correlated synthesis image generated using the proposed method; Step 5: Use the benchmark object detectors, e.g., Faster R-CNN and YOLO, in order to detect the power transmission tower in the generated image.
can name the generated image using this approach as "multi-baseline interferometry-correlated synthesis image based on master image approach". In order to investigate the influence of the baseline lengths of the SAR acquisitions further, we can redefine the multibaseline interferometry-correlated function F  by selecting different baseline interferometry pair combinations, as follows: The shorter the baseline of two repeat-pass spaceborne SAR acquisitions, the lower the decorrelation of SAR interferometry [3]. Consequently, in (28), we can define F  by selecting small baseline (SB) subsets as interferometry pair combinations. We name this approach "multi-baseline interferometry-correlated synthesis image based on SB approach". Similar to the generation method of multi-baseline interferometry-correlated synthesis images based on the master image approach, the multi-baseline interferometry-correlated synthesis image based on the SB approach can also be obtained.

Datasets
In order to verify the detection performance of the method proposed in this paper, 12 repeat-pass TerraSAR-X staring spotlight mode acquisitions, taken from 23 April 2016 to 2 September 2016 in Mao County, Sichuan province, China, were used in this experiment. The baselines of different repeat-pass SAR acquisitions relative to the master image acquired on 23 April 2016 are presented in Table 1. The symbol "− " on the baseline in Table 1 means that the SAR acquisition of the corresponding date is on the left side of the master image acquired on 23 April 2016. Otherwise, it is on the right side of the master image acquired on 23 April 2016. The SAR interferometry pair combinations of the SB case, and the corresponding baselines, are shown in Table 2. The resolution of these SAR data is 0.4547 m (Range direction) × 0.1685 m (Azimuth direction). The look angle of satellite is 40°. The pass/track direction is along the ascending path. In order to filter the noise effectively, multi-look preprocessing was used for the SAR data. The look was set to 5.    Considering that the inner geometric structure of power transmission towers could be clearly seen in the very high-resolution SAR image, such as in the TerraSAR-X staring spotlight mode data used in this paper, two benchmark object detectors, i.e., Faster R-CNN and YOLOv7, were adopted for learning the inner geometric structure of the tower. Particularly, the DL toolbox in Matlab version 2021a was used to implement these two algorithms using a PC with Intel single Core i7 CPU, NVIDIA RTX-3090 GPU (24 GB video memory), and 128 GB RAM. Specifically, the Faster R-CNN object detector was trained using the Matlab toolbox function, 'trainFasterRCNNObjectDetector', in which the input size, the number of anchor boxes, and the feature extraction network were 400 × 400, 3, and ResNet50 [24], respectively. For the YOLOv7 network architecture setting, the input size and the number of anchor boxes were 400 × 400 and 9, respectively. For both of these detectors, the initial learning rate of the network, the momentum, the minimum batch size, and the maximum epochs were set to 1 × 10 −3 , 0.9, 2, and 100, respectively.
The power transmission tower and background training samples were cropped manually from part of the SAR interferometric coherence image/multi-baseline interferometry-correlated synthesis image or amplitude image. Then, the cropped samples were synthesized together to generate patches with sizes of 400 × 400 pixels per patch. In patch generation, for the same tower target, the background samples around the tower could be changed and increased via geometric transformation, such as translation motion. Finally, after data augmentation based on the geometric transformation of the background samples, the training data, with 350 patches, were generated and fed into the training network. After obtaining the object detector, the entire large SAR single-baseline/multi-baseline interferometric coherence image or SAR amplitude image, with dimensions of 3674 × 7890 pixels (which could be approximately regarded as 9 × 20 = 180 patches, with a size of 400 × 400 pixels per patch), was used for testing.  Figure 5, in which the yellow boxes, red boxes, and green boxes denote the true detections, the false detections, and the missing detections, respectively. In order to quantitatively evaluate the detection performance of the obtained power transmission tower detector, the target detection probability, false detection probability, and F1 score [20] are adopted as follows: N denote the number of true detections, the total number of ground truths, the number of false detections, and the total number of detections, respectively. Considering the tradeoff between d P (target detection probability) and f P (false alarm probability), F1 can be used to evaluate the comprehensive performance of the object detector. The quantitative detection performance of Figure 5 is presented in Table 3. As can be observed from Figure 5 and Table 3, for power transmission tower detection, compared with the traditional method using the SAR amplitude image, the detection performance has been improved by using the single-baseline interferometric coherence image/multi-baseline interferometry-correlated synthesis image, no matter which detector is used. Also, multi-baseline interferometry-correlated synthesis images (including both those based on the master image-based approach and those based on the SB-based approach) demonstrate better detection performance than single-baseline SAR interferometry coherence images. The detection performance of multi-baseline interferometrycorrelated synthesis image based on the master image approach and the SB-based approach are basically the same. When Faster R-CNN is adopted as the object detector, compared with the traditional method using the SAR amplitude image, the target detection probability when using the multi-baseline interferometry-correlated synthesis image increases by more than 47.7%, while false detection probability decreases by more than 36.1%. In terms of the F1 score, the performance improvement is at least 44.8%. Similarly, when YOLOv7 is adopted, compared with the SAR amplitude image, the target detection probability when using the multi-baseline interferometry-correlated synthesis image increases by more than 52.4%, while false detection probability decreases by more than 17.5%. The final F1 score increases by at least 43.6%. The SAR amplitude images are contaminated by strong layover background cluster interference in mountainous areas, which significantly influences the detection performance of power transmission towers. However, by using the multi-baseline interferometry-correlated synthesis image, generated as proposed in this paper, such layover background cluster interference can be greatly suppressed; a more than 43.6% of improvement can be achieved, as shown in Table 3.

Conclusions
In the application of power transmission tower detection in mountainous areas, single-baseline SAR interferometry coherence images have the effectiveness of feature enhancement compared to SAR amplitude images. Moreover, the proposed multi-baseline SAR interferometry-correlated synthesis images (including both those based on the master image-based approach and those based on the SB-based approach) demonstrate better feature enhancement than that of using single-baseline SAR interferometry coherence images for power transmission tower detection in a mountainous layover area. Consequently, for power transmission tower detection in mountainous layover areas, the method of using multibaseline SAR interferometry-correlated synthesis images demonstrates better detection performance than the traditional method of using SAR amplitude images.