Abstract
The application of SAR tomography (TomoSAR) on the urban infrastructure and other man-made buildings has gained increasing popularity with the development of modern high-resolution spaceborne satellites. Urban tomography focuses on the separation of the overlaid targets within one azimuth-range resolution cell, and on the reconstruction of their reflectivity profiles. In this work, we build on the existing methods of compressive sensing (CS) and generalized likelihood ratio test (GLRT), and develop a multiple scatterers detection method named CS-GLRT to automatically recognize the number of scatterers superimposed within a single pixel as well as to reconstruct the backscattered reflectivity profiles of the detected scatterers. The proposed CS-GLRT adopts a two-step strategy. In the first step, an L1-norm minimization is carried out to give a robust estimation of the candidate positions pixel by pixel with super-resolution. In the second step, a multiple hypothesis test is implemented in the GLRT to achieve model order selection, where the mapping matrix is constrained within the afore-selected columns, namely, within the candidate positions, and the parameters are estimated by least square (LS) method. Numerical experiments on simulated data were carried out, and the presented results show its capability of separating the closely located scatterers with a quasi-constant false alarm rate (QCFAR), as well as of obtaining an estimation accuracy approaching the Cramer–Rao Low Bound (CRLB). Experiments on real data of Spotlight TerraSAR-X show that CS-GLRT allows detecting single scatterers with high density, distinguishing a considerable number of double scatterers, and even detecting triple scatterers. The estimated results agree well with the ground truth and help interpret the true structure of the complex or buildings studied in the SAR images. It should be noted that this method is especially suitable for urban areas with very dense infrastructure and man-made buildings, and for datasets with tightly-controlled baseline distribution.
1. Introduction
Synthetic aperture radar tomography (TomoSAR) [1] has successfully been used to reconstruct three-dimensional (3D) urban infrastructures [2,3], to estimate the forest biomass [4,5], and to image ice structures [6,7]. The launch of the new generation of spaceborne satellites (e.g., TerraSAR-X/Tan-DEM-X, COSMO-SkyMed, and Gaofen-3) provides SAR images with unprecedented high resolution up to meter regime and even to submeter regime, which dramatically facilitates the reconstruction and monitoring of urban infrastructures.
The TomoSAR reconstruction can be regarded as an inverse problem, which was firstly solved by the nonparametric methods such as beamforming (BF) [1,8], singular value decomposition (SVD) [9,10], and adaptive Capon [11,12,13]. BF and SVD get their popularity from their high efficiency and robustness. Both methods can preserve the full resolution of SAR images, but suffer from low resolution and high side-lobe level problems. Capon allows super-resolution imaging at the expense of a spatial resolution loss as multi-look processing is required. Parametric methods, such as multiple signal classification (MUSIC) [14], have also been introduced for TomoSAR. MUSIC can achieve super-resolution and side-lobe reduction but it requires a priori information and is sensitive to model errors. Based on the fact that target distribution along elevation is always sparse, especially in urban areas, compressive sensing (CS) [15,16,17] provides another solution for infrastructure tomographic reconstruction. It enhances the elevation resolution within the single-look archive and reduces the required number of measurements, while its main limitations stem from the off-grid effect so as to generate spurious outliers, the amplitude bias, and the inability to evaluate the probability of detection and the probability of false alarm.
While these methods provide the foundations for significant advances, technical issues of the exact number of targets or scatterers remain to be decided when applied in urban scene study. Automatic methods for detection and reconstruction of infrastructure and other man-made structures in urban area have been widely studied. According to the different methods of model order selection (MOS), existing methods can be classified into three groups: the generalized likelihood ratio test (GLRT) based [18,19,20,21,22,23,24]; information criterion based, such as Bayes information criterion (BIC) and Akaike information criterion (AIC) [25,26,27]; and no additional MOS based or iterative reweighted method [28,29,30].
An approach based on GLRT for the detection of targets in the tomographic framework was proposed in [18] (BF based GLRT) for the first time. As an extension of the work in [18], an approach focusing on the discrimination of single and double scatterers was introduced [19]. It could effectively estimate the positions and number of scatterers with constant false alarm rate (CFAR) efficiently. However, it suffered from the leakage effects related to side-lobe influence and from a low intrinsic elevation resolution, the so-called Rayleigh resolution. Rayleigh resolution is inversely proportional to the perpendicular baseline extension and is typically much worse than the resolution in azimuth and range, which makes it far from satisfactory when applied in the urban areas with high resolution and tightly-controlled TerraSAR/TanDEM-X. A support GLRT (sup-GLRT) method [20] was proposed to deal with the poor elevation resolution problem. It searches the best signal support to decide the number and positions of the significant scatterers based on nonlinear maximum for detecting at most scatterers. It is convenient when the scatterer distribution is very sparse (no more than two), while the computing complexity will increase dramatically with the increase of . To mitigate the computational burden, a fast sup-GLRT method [21], referred to as M-sup-GLRT, was proposed. It transfers the multidimensional optimal searching problem into a 1D optimal searching one, so as to enjoy the computational efficiency as well a super-resolution capability comparable to that of sup-GLRT. Most recently, M-sup-GLRT has been extended to 5D application [22] and showed the ability to not only monitor temporal deformation but also to detect the thermal dilation. Another application of sup-GLRT is the investigation of polarimetric TomoSAR (Pol-TomoSAR), which allows the reduction of the number of acquisitions required. Multilook GLRT, referred to as M-GLRT has been proposed to improve the detection capability over man-made targets as well in areas characterized by low SNRs [23]. As an extension of the method in [23], the Capon filter has been introduced in M-GLRT [24] to get a super-resolution ability.
SL1MMER [25,26] has been proposed to eliminate the outliers and get the accurate parameter estimation by introducing two steps of BIC MOS and parameter estimation. It can effectively drop the outliers by penalizing the higher orders and estimating the parameters by scale down L2 method. SL1MMER is super-resolved and has been proven perfect for the urban reconstruction [25,26]. Most recently, integrating with geographic information systems (GIS), its extended version, M-SL1MMER [31], has been proven to have an accurate tomographic reconstruction capability with even very small images. However, it greatly depends on the penalized parameters and there is no effective way to evaluate the false alarm rate, or it is not a CFAR estimation. A nonparametric iterative adaptive approach (IAA) [27] combined with an MOS of BIC, referred to as IAA-BIC, has been proposed to retrieve 3D structure infrastructures. It does not require any a priori knowledge or hyperparameter selection, and it performed well for both distributed and coherent scatterers with either a few multiple looks or single-look. However, when applied to the single-look urban study, the weighted matrix, corresponding to the inversion of data covariance matrix, is not always invertible.
An iterative reweighted CS (IR-CS) [28] method has been proposed to eliminate the spurious outliers by iterative reweighted L1 minimization. The outliers can be dropped effectively, especially at low SNR cases with doubling or tripling the calculation complexity. Another IR method called iterative reweighted alternating direction method of multipliers (IR-ADMM) [29] has been proposed to relieve the computational burden with almost the same elevation accuracy and with only a slightly worse detection rate than that of IR-CS. A two-step iterative shrinkage/thresholding (TWIST) [30] approach for TomoSAR was proposed to speed up the L1-Norm minimization procedure as well as to achieve unbiased estimation for ill-conditioned problem.
In this paper, we aim at proposing a multiple scatterers detection method in tomography with the following characteristics: (1) CFAR or quasi-CFAR; (2) super resolution capability; (3) high detection probability; (4) accurate elevation estimation; (5) full spatial resolution; and (6) acceptable computing complexity even with the increase of model order. To these ends, an approach named CS-GLRT is proposed by a two-step processing method. Firstly, a scale down support space is obtained by CS so as to super-resolve the problem as well as to provide a priori information for the next step about the candidate positions. Then, within the scaled down space, optimal research is implemented for each model order, followed by a -order hypothesis test, where each hypothesis test is conducted in a GLRT archive. The first step gives the potential positions, with which the computing complexity is greatly reduced for the second step’s optimal research. The second step separates the closely located scatterers without setting any hyperparameters with the false alarm rate (FAR) being controlled. The proposed approach is robustly applicable to triple or even higher order scatterers detection without much computational burden increase. The effectiveness was validated by simulated data as well by real data of TerraSAR-X spotlight over Shenzhen, China. Simulated results show that the proposed method can robustly super-resolve the overlaid targets with an accuracy approaching the Cramer–Rao Low Bound (). Real-data results show its capability of detecting single scatterers with high density, distinguishing a considerable number of double scatterers, and even detecting triple scatterers, whose results are consistent with the ground truth. In addition, comparisons of CS-GLRT and SL1MMER on false detection rate, and of CS-GLRT and sup-GLRT on computational burden were carried out. The results imply that CS-GLRT provides an alternative for SL1MMER and for sup-GLRT as it is a good tradeoff for robustness and efficiency. It should be noted that, to realize the sparse reconstruction, an open source software package named SparseLab [32] is used.
This paper is organized as follows. In Section 2, the signal model of TomoSAR and procedures of GLRT are reviewed. Section 3 explains the theoretical basis of CS and gives the proposed CS-GLRT methodology. Section 4 and Section 5 are devoted to the simulated and real data experiments, respectively. Section 6 addresses the additional discussions and comparisons. Finally, Section 7 gives the conclusion.
2. Signal Model and Problem Formulation TomoSAR
2.1. Acquisition Model
TomoSAR utilizes the multi-baseline sensors aligned along the elevation s over the same area, with slightly different view angles, allowing the capability to fully image the scene in the 3D space, whose imaging geometry is shown in Figure 1. Each azimuth-range pixel collects a stack of N focused images coregistered with respect to a given (master) one,
where is the reflectivity function along elevation, and denotes the spatial frequency with being the perpendicular baseline relative to the master sensor, representing the wavelength, and r being the slant range distance. Approximately discretizing the continuous reflectivity function along s and considering the noise in the real scenario, Equation (1) can be rewritten as
where is the steering matrix mapping the model space into the data space, with the generic element expressed as,
and is the N-dimensional noise vector, circular Gaussian distributed with zero mean and a covariance matrix of , where is the noise power and denotes the identity matrix. The straightforward solution of Equation (2) is the so-called BF or periodogram,
where denotes the Hermitian operation.
Figure 1.
SAR tomography geometry of the problem under study. An array with N sensors receives signal from two or more scatterers (overlaid targets are denoted by red and blue dots). Note that distances are not in scale.
For urban study, one important application is separating multiple scatters interfering within the same range-azimuth resolution cell, or layover mechanism separation, as shown in Figure 1 by red and blue dots. BF as well as other nonparametric imaging methods is limited by the grating lobes caused by the nonuniform distribution of the baselines, and also by the low Rayleigh resolution inversely proportional to the perpendicular baseline extension,
where is the perpendicular baseline extension. For the modern TerraSAR-X/TanDEM-X spaceborne mission, whose baseline is strictly controlled within 400–500 m, leading to a common of 20–30 m, which is typically only – of the resolution in range and azimuth. When dealing with the superposition of scatterers from building facade/roof and the ground, the 20–30 m resolution is far from satisfactory, leading to the desperate demand for imaging methods with super-resolution capability. Actually, for single scatterer detection, there is a for elevation (see the detailed deduction in Appendix A.1), expressed as
where is the signal-to-noise ratio and is the standard deviation of the spatial baseline. Using the parameters of TerraSAR-X (see Table 1 for systematic parameters and Figure 2 for baseline distribution, respectively), if dB (a common SNR for permanent scatterers (PS)), m can be obtained. Usually, for an unbiased estimation, whose error can be controlled within , it can be considered as an excellent estimator [33]. It should be noted that, if there are no targets located less than , BF or unmodified periodogram turns out to be the excellent estimator which outperforms any of the super-resolution methods [33].
Table 1.
Parameters of TerraSAR-X.
Figure 2.
Spatial/temporal baseline of SAR image acquisition. The red star represents the master image, while blue diamonds show the slave images.
2.2. Problem Formulation
GLRT’s first introduction to TomoSAR was devoted to deciding the presence or absence of a single target by a binary hypothesis test [18], with denoting the different hypotheses
and representing the decision, whose decision rule is
where and are the likelihood functions of under and , respectively, and T is the decision threshold, which is set according to the desired value of false alarm probability (which is defined in Section 3).
Hi: presence of i target with i = 0, 1
In the problem under study, we are interested in estimating, pixel by pixel, the number of scatterers k superimposed in a single pixel and the corresponding parameter vectors, i.e., magnitude, elevation and phase. Moreover, a maximum sparseness of K is assumed in the elevation direction, and it is convenient to cast the problem in terms of multiple statistical hypothesis testing with the signal model written as
which can be solved in the framework of the detection theory. It consists of steps where binary tests are sequentially applied in each step, as shown in Figure 3, with and being the general likelihood ratio and the decision threshold for the ith step, respectively ( and are both given in the next section).
Figure 3.
Diagram of the GLRT.
3. Problem Solution
3.1. CS
Based on the assumption that only a low number of scatterers with different elevations are present in the same range-azimuth resolution cell, CS, a popular approach for sparse signal reconstruction method, provides a new sampling theory for data acquisition and allows super-resolution using only few signal samples. It has been proven perfectly suitable for urban tomographic reconstruction [15,16]. The sparse theory says that, when the restricted isometry property (RIP) (see Equation (15)) and the incoherence property are met, K-sparse signal can be exactly recovered by L1 minimization as
where denotes the Lagrange multiplier depending on the number of samples N and the noise level [34]. is the normalized version of , i.e., . The -term forces the residual to be small, whereas the -term enforces the sparsity of the representation. controls the tradeoff between the sparsity of the spectrum and the residual norm. According to [34], if the dictionary is an orthogonal basis, the mean-squared error properties are near optimal if we set as . The readers can refer to Appendix A.2 for more details about the deviation of . Here, is the noise level, which can be estimated using several methods, e.g., SVD-Wiener [35]. According to Equation (8), in both simulations and real-data experiments, can be simply estimated as , where means the expectation operation. With CS applied in SAR tomography, super resolution capability as well as robustness to phase noise can be obtained. Nevertheless, as the RIP and incoherence property are often violated because of the predefined , the over-sampling rate along elevation and so on, spurious artifacts often exist.
3.2. CS-GLRT
Before describing the proposed method, here, some shortcomings of CS and the BF based GLRT are listed, respectively, as follows:
- CS
- Spurious artifacts
- Underestimation of magnitude
- BF based GLRT
- Low resolution
- Side-lobe effect
X.X. Zhu [25] and A. Budillon [20] have already dedicated to circumventing the mentioned limitations referred to as SL1MMER and sup-GLRT, respectively. As we know, the limitations of CS stem from the magnitude being underestimated and there being outliers around the true position of targets. However, it can be believed that the correct positions can always be detected despite the outliers and underestimated magnitude. Assume that there are up to K scatterers superimposed within a single pixel; it is reasonable to set potential positions, considering two outliers around each true position. With these candidate positions, we can greatly scale down the steering matrix to a -column mapping matrix . Then, following the guidelines of [20], we can apply GLRT with -column rather than with M-column . In our multiple scatterers case, the decision rule in Equation (7) can then be expressed as
where is the index set composed by the j arbitrary column indexes of the mapping matrix with , and is the corresponding reflectivity, with the ML estimation of the parameters listed as follows [20]
where is the corresponding sub-mapping matrix (or support space) composed by these j mapping vectors. Substituting Equation (11) into Equation (10), each testing step can be obtained as
where is the space orthogonal to the support space , denoted as
with . Then, the problem remains to decide the thresholds dependent on the desired false alarm rate.
Based on the descriptions above, here, we propose a new method named CS-GLRT, whose procedures are summarized as follows and as the flow diagram depicted in Figure 4:
Figure 4.
Diagram of the proposed CS-GLRT. Lines in red squares are true reflectivity profiles, and lines in blue dots are the estimated reflectivity profile in each step. There are artifacts for the CS imaging, and magnitudes are underestimated. The artifacts are suppressed and magnitudes are corrected by the GLRT model order selection with negligible elevation error.
- Potential positions detection by CS imaging. In this step, the nonsignificant spurious scatterers are cleaned to offer a priori information for the possible scatterers’ locations with super-resolution so as to separate the closely located targets. Often, the number of potential positions can be set aswhere K represents the predefined maximum number of scatterers, is the norm, i.e., the number of nonzero values, and means the normalized elevation profile reconstructed by CS. Here, threshold of zero and nonzero can be set as (corresponds to a noise capacity of 20 dB) and considers the other two possible outliers surrounding each scatterer. Exploiting the CS imaging, the problem to be solved is scaled down from M-dimension to -dimension. Empirically, K can be set as 3 and is around 10.
- Model order selection and parameter estimation. For each model order, , we search for the optimal from possible combinations so as to minimize the numerator in Equation (10) . After obtaining the testing value of in each step, we can do hypothesis test sequentially, as shown in the dotted box in Figure 4. Once the model order i is selected, the elevations are the ones corresponding to the minimum numerator under the decided hypothesis and the backscattered reflectivity profile can be obtained by LS means.
Some comments on the proposed CS-GLRT method are now in order. Firstly, the computational complexity is mainly caused by the iterative steps in CS imaging, thus its computational burden is comparable to that of SL1MMER. Secondly, not all the possible combinations in Step 2 are valid, i.e., the combinations of scatterers located very close to each other (e.g., ) should be dropped so as to avoid the wrong detection. Thirdly, this method suppresses the overestimation to the biggest extent as it adopts the low-to-high order detection strategy, i.e, only when the case is decided as an order higher than is there the possibility for the case being decided as . Actually, there is the other strategy adopting the high-to-low order detection [19] that false alarm rate of the high order is independent of the of the lower order scatterers. As a consequence, it is perfectly suitable for the small areas processing in the complex urban environment.
4. Simulated Results and Performance Assessment
We evaluated the performances of the proposed CS-GLRT based schemes by the properties of detection and accuracy, which were carried out resorting to Monte Carlo (MC) simulations. The simulated tomographic data were generated by exploiting the system parameters of a TerraSAR-X stack (see Table 1) of images, whose baseline distribution in the temporal/spatial domain is reported in Figure 2.
4.1. Feasibility Check
Before the implementation of the proposed algorithm, we would like to check its applicability under our systematic parameters. Firstly, the RIP is defined and tested to validate the CS imaging used here.
where v is any vector having K nonzero coefficients at the same positions as s, and is a small number. evaluates the reconstruction sparsity of mapping matrix (steering matrix here). The smaller is, the better the sparse signal can be reconstructed in the presence of noise [15]. RIP property of two scatterers versus scatterer distance is shown in Figure 5. Figure 5a,b shows that increases when the scatterers come closer than , that is to say, the closer the two scatterers are, the more sensitive the reconstruction becomes to noise. We start with two scatterers with different magnitude ratio (Figure 5a) and with constant SNR of the first scatterer, dB. It says that, in high SNR case, the system becomes less sensitive to noise with the increase of magnitude ratio. Then, RIP’s change with the alternation of phase difference is shown in Figure 5b. It says that the in-phase case () is the most disadvantaged one, while the quadrature phase case () is the most advantageous one. For these reasons, the simulations in this study adopted the most disadvantaged case, i.e., in-phase and with equal magnitude, if not specifically clarified.
Figure 5.
RIP property of two scatterers versus scatterer distance (normalized to ) under different: (a) magnitude ratio with red solid, blue dashed and black dotted-dashed denoting , respectively; and (b) phase difference with red solid, blue dashed and black dotted-dashed denoting , respectively. It should be noted that and are kept constant in (a), and in (b). refers to Equation (15).
Based on the fact that the number of superimposed targets within a single pixel is no more than 3 (also pointed out in [2]), K can be reasonably set as 3. Without loss of generality, we simulated up to three scatterers. To test whether there are any possibilities for the proposed CS-GLRT method of separating the overlaid scatterers within a single range-azimuth resolution cell, firstly the probability density function (PDF) of each test value under different hypotheses , , are reported in Figure 6. Figure 6a–c shows the PDFs of , and , respectively. It should be noted that all the experiments were conducted by Monte Carlo simulations with scatterers separation of in the hypothesis of , and of and in , if not specifically clarified. SNR was set to 10 dB here, which is the case for persistent scatterers. The red, green, blue and black lines represent the different hypotheses of –, respectively. Obviously, we can see in Figure 6a that can be distinguished from by with no overlaps. Similarly, can be recognized from by with negligible overlaps. In addition, and are well separated in archive. It indicates that there is a great possibility that the different statistical hypotheses can be distinguished by the corresponding test values. Actually, in the experiments on simulated and real data presented in the following, setting thresholds is a good choice.
Figure 6.
PDFs of under different hypotheses of (red), (green), (blue) and (black) with SNR=10dB and scatterers separation of for , and of and for : (a–c) the PDFs of , and , respectively.
4.2. Parameter Definition
To assess the performance of the proposed CS-GLRT method, some parameters are firstly defined following the descriptions in [19]. The first one is false alarm rate denoted as
Then, the (correct) detection probability and false detection rate are defined as
The last one is related to the estimation accuracy. Actually, there are three parameters, phase , magnitude a and elevation s. What we are really concerned about here is the accuracy of the elevation, which can be evaluated by the root mean square error (RMSE). We only consider the cases that the scatterer order is correctly decided, i.e., , (k = 1,2,3), and the elevation RMSE is denoted as
where and spi are the estimated and true elevation positions of the ith order scatterer of the pth simulation that fulfills (Dk/Hk).
Some remarks of the performances are now in order. Often, the detection property is evaluated by the (correct) detection probability under a certain false alarm rate , which are given in Equations (16) and (17), respectively. reflects the overestimation characteristic, and here we consider over-detection rate of each order of equal importance. Fixing to get the thresholds of each step, in turn, can be obtained. As already explained in [20], the thresholds are insensitive to different SNRs provided that is set.
4.3. Performance Assessment
The proposed CS-GLRT method is with CFAR, or with quasi-CFAR, as the false alarm rate can be controlled by setting different thresholds, and we set in the following experiments. The detection rate and RMSE of elevation versus are demonstrated in Figure 7a,b, respectively. In Figure 7a, (red, square), (blue, triangle) and (green, circle) are plotted versus from to 15 dB. We can see that the single, double and triple scatterers cases can be perfectly decided when SNR is big enough, e.g. 1.5, 3 and 5 dB, respectively. Then, the accuracy is demonstrated in Figure 7b and (red, square), (blue, triangle) and (green, circle) are plotted versus from to 15 dB as the detection probability is too small to show RMSE for dB. It can be observed that, with the increase of SNR, RMSE approaches to 0, although not definitely 0 because of some estimation bias. It should be noted that oscillates when the SNR is very small, e.g. SNR < 0 dB, because of the small hardly reflecting the RMSE.
Figure 7.
Performance of the proposed CS-GLRT for single (red, square), double (blue triangle) and triple (green, circle) scatterers, respectively: (a) detection probability; and (b) RMSE of the elevation.
In addition, the super-resolution capability was checked by varying the distance between two scatterers from to . The detection probability plotted versus is shown in Figure 8a. It shows that, when SNR is big enough (≥8 dB), the closely located targets (up to 0.6) can be well separated with nearly . The corresponding RMSE of elevations is depicted in Figure 8b, which shows that the RMSE can be limited to 3 m () when dB for closely located scatterers (up to 0.6). Figure 8a,b jointly illustrates the super-resolution capability of CS-GLRT.
Figure 8.
Super-resolution performance of the proposed CS-GLRT for the separation of (red, square), (blue triangle), (green circle), and (black, diamond), respectively: (a) detection probability; and (b) RMSE of the elevations.
To clearly see the estimation accuracy, we compared RMSE with the accuracy obtained by CRLB. Simulations of two scatterers with different separations under dB and dB were conducted (3 dB and 10 dB are the lower and upper bound of persistent scatterers, respectively [36]). The results are shown in Figure 9a,b, respectively. It should be noted that the CRLBs shown in Figure 9 are the numerical results of Equations (A8)–(A10) and the detailed deduction of CRLBs can be referred in Appendix A.1. The solid lines show the true positions with one scatterer fixed while moving the other one. As only super-resolution cases are interested, we only show here. Then, the dashed lines mark the true position ± and the dots denote the mean value of the estimated elevation with the error bar being the standard deviation. Missing points indicate the detection probability below . It should be noted that the x-axis and the y-axis are normalized to the Rayleigh resolution . The dots approaching solid lines and error bar being nearly enveloped in the corresponding dashed lines indicate that elevation can be well estimated by the proposed CS-GLRT method.
Figure 9.
Elevation accuracy of the proposed CS-GLRT compared to CRLB: (a) SNR = 3 dB; and (b) SNR = 10 dB. The y value of each dot reflects the sample mean of the estimated elevations and the corresponding error bar shows the standard deviation.
5. Results on Real Data
5.1. Test Site and Data Stack
Located in the south of China, Shenzhen is now one of the most developed cities in China, with a population of more than 10 million. It is highly urbanized with many high-rise buildings and dense infrastructures. A dataset of 26 scenes at 0.25 m-resolution staring spotlight TerraSAR-X images (see the system parameters in Table 1) were acquired on descending orbits during the period of the interval span from January to December 2016. To collaboratively process all the images, a referenced image acquired on 18 July 2017 was selected while the others were all coregistered to it. The spatial and temporal baseline distribution is shown in Figure 2, where the referenced one is labeled as a five-point star and the others are denoted as diamonds. As addressed above, because of the relative high burden of the CS imaging, we only tested the applicability of the proposed method on small areas, i.e., a single building.
5.2. Scatterers Detection
The test area covers a high-rise building in Bao’an district (Transport Bureau Building), as shown in the optical image from Google Earth in Figure 10a (the building in the red box) and the amplitude image in Figure 10b. As shown in Figure 10a, it is a circular building with a total height of about 120 m, with a low-rise podium and some lower buildings in the neighborhood. The SAR amplitude in Figure 10b shows that the roof of the main building is probably overlaid with its own building facade and the near-range vegetation, while the roof of the podium may be superimposed with its own facade, the facade of the main building and the rough ground. Schematic drawing of possible layover cases are presented in Figure 10c, where the red squares lead to two overlaid scatterers, and the blue squares result in two, three and even four overlaid scatterers.
Figure 10.
Images of the study area: (a) optical image from Google Earth under study, with the building in the red box; (b) mean intensity map of the spotlight TerraSAR-X, with the red square and blue diamond the positions of the detected triple scatterers in SAR image, whose elevations are depicted in Figure 14; and (c) schematic drawing of possible signal contributions in a single azimuth-range pixel (size of resolution cells not to scale).
Before applying the proposed CS-GLRT method, the atmospheric errors should be calibrated. For the small area under study here, atmospheric error can simply be calibrated by referencing to a ground point. For atmospheric error calibration for large areas, the methods introduced in [37,38] have proven useful. The results of detected single and double scatterers are shown in Figure 11 and Figure 12, respectively, with the color bar from hot red to pink denoting heights from 0 m to 140 m. Note that the results are relative heights with respect to a near-zero point (ground point). Obviously, in Figure 11, the profile of the building is correctly detected and the estimated height range matches the true height quite well, although a few red points are found in between the building body (green), and some outliers are found along the azimuth direction. The former phenomenon was probably caused by the interference of the ground and the reflection from the building facade happens to be weak. The latter outliers were probably affected by the side lobes in azimuth, which can also be observed in the SAR intensity image (Figure 10b). To the left of the left building facade, there should only be ground targets or no targets. However, “line” targets were detected and with the same height of the pixel (same range of the line targets) on the building facade. To illustrate it, the profile of a range bin (green line in Figure 10b) is plotted, as shown in Figure 13. We adopted an up-sampling factor of 8 and only show part of the azimuth profile for a clear illustration. Under the influence of the side lobes of the strong scatterers (with the main lobe energy as large as 74 dB), many local peaks occur and are detected as targets. The appropriate restriction of side lobes is necessary and challenging in the process of SAR imaging (from L0 to L1 datatype), which is beyond the scope of this paper and will be left for the future research of SLC data preprocessing.
Figure 11.
Study result: Height of single scatterers.
Figure 12.
Study results for double scatterers: (a) height of the higher double scatterers; and (b) height of the lower double scatterers. Color bar is the same as that in Figure 11.
Figure 13.
Normalized azimuth profile corresponds to the green line in Figure 10b. Note that only part of the profile is shown for a clear illustration.
Figure 12 shows the height of the detected double scatterers with Figure 12a the top layer and Figure 12b the lower layer. The results agree well with the guess above that the roof of the main building is overlaid with its own facade, the main building facade is overlaid with the ground and the podium roof, respectively, and the podium roof is overlaid with its own facade and the ground, respectively. Actually, it is also possible that the triple-scatterer case happens, although with low probability (see the red square and blue diamond for their positions in SAR image in Figure 10b). The corresponding estimated heights are shown in Figure 14. We can infer that the red squares and blue diamonds are probably from the interference of the facade and roof of the lower buildings, and the tall building facade. Specifically, the number of detected single, double and triple scatterers in this experiment is 59,056, 3813 and 2, respectively.
Figure 14.
Study results: Height of triple scatterers, corresponding the red square and blue diamond in Figure 10b.
6. Discussion
In this section, we discuss some possible comparisons of the proposed CS-GLRT with other super-resolved methods, namely SL1MMER and sup-GLRT. In addition, how to deal with ghost scatterers, which would appear to be at negative heights, is discussed.
6.1. CS-GLRT vs. SL1MMER
Theoretically, CS-GLRT has a similar computational burden, estimation accuracy and super-resolution capability as those of SL1MMER. Experimentally, using a desktop with 8 cores at 3.6 GHz and 32 GB RAM, the time consumed of MC simulations by SL1MMER and CS-GLRT is and s, respectively. Please note that parallel computing is not adopted here. The latter two performances are almost the same and are not shown here due to limited space. Here, we focus on the detection characteristics as shown in Figure 15. Scatterers can be correctly detected by SL1MMER in the and cases, yet the detection characteristic is not satisfactory in the case. In Figure 15b, we can see that the false detection rate of SL1MMER vibrates with , while that of CS-GLRT remains almost stable at a very low level. Thus, SL1MMER detection is satisfactory in the high order circumstances, yet, in the lower order case, it tends to be over-decided. Usually, to alleviate computational complexity, conventional GLRT method can be carried out at first so as to distinguish the presence or absence of scatterers, thus guaranteeing a low . Nevertheless, the other false detection rate cannot be controlled, thus still leading to an unsatisfactory FAR. Therefore, CS-GLRT is a more robust method with FAR controlled.
Figure 15.
Detection characteristics of SL1MMER (solid) and CS-GLRT (dashed): (a) correct detection probability of single scatterer (, red, square), double scatterers (, blue, triangle), and triple scatterers (, green, circle); and (b) false detection probability with (red, square), (blue, triangle), and ( green, circle).
6.2. CS-GLRT vs. Sup-GLRT
The explanation about the calculation burden is stated as follows. In [20], the maximization of the likelihood function means possible searches, and possible searches in total, while, in our case, it scales down to and , respectively. M is the sampling rate along elevation with a typical value of hundreds, while is around 10 in our experiment. If we increase to , the computational burden increases by approximately times for sup-GLRT, while almost remains stable for CS-GLRT. Firstly, we inspect the time consumed by sup-GLRT. Using the same desktop mentioned above, it costs 593.1 s and 112,393.6 s (31 h 13 m 13.6 s) for and , respectively, if we do MC simulations and set . As for CS-GLRT, the time consumed is 300.6 s and 306.1 s, respectively, indicating that the computational complexity remains quite stable for CS-GLRT with the change of the number of the pre-set scatterers.
The efficient version of sup-GLRT, M-sup-GLRT, can alleviate the computational burden of sup-GLRT dramatically, but its accuracy greatly depends on the correct location of the first scatterer for super-resolving. Nevertheless, the scatterers’ locations always interfere with each other and can hardly be correctly located. Thus, M-sup-GLRT is beyond the scope of this paper.
According to the analysis and experiments above, Table 2 compares the performances of different automatic tomographic imaging and detecting methods. We can conclude that CS-GLRT is robust and computationally acceptable for the different preset K.
Table 2.
Performance comparison.
6.3. Ghost Scatterers
When dealing with real data, ghost scatterers are often present, especially for large areas. There are two main sources for such ghost scatterers. The first is the limitation of estimation accuracy (CRLB), which is related to , satellite parameters, and baseline distribution (see the details in Appendix A.1). When decreases, deteriorates and it is possible for these points estimated with low height to be such ghost scatterers. Secondly, the miscalibattion of phase error can also contribute to such ghost scatterers. Often, isolated points with large negative heights are not likely to be genuine and can be masked in the final results. Then, point groups with moderate negative heights are likely to be genuine and can be validated further if the true structure is available. Actually, to avoid such ghost scatterers, a masking procedure is necessary prior to tomographic processing to cancel out the points with low and/or low correlation. Sometimes, filtering steps are useful for ghost scatterer removal. It is complicated and experience dependent, and, to our knowledge, there is no literature illustrating such a problem. Dealing with small areas, such a phenomenon is rare and easy to handle.
7. Conclusions
A multiple scatterers detection method named CS-GLRT is presented in this paper, which can super-resolve the multiple targets lying in the same azimuth-range pixel with CFAR. A two-step strategy is adopted by CS-GLRT, where the first step gets the possible candidates by super-resolution, thus scaling down the searching space in Step 2 greatly. In turn, Step 2 drops out the outliers and misdetections resorting to minimum residual of each order. Its performances can be evaluated in terms of probability of correct detection and probability of false detection. Simulations by Monte Carlo show that the proposed method can achieve a super-resolution up to with an accuracy approaching the . The merits of the CS-GLRT approach are summarized as follows:
- characteristic of CFAR, controlled by the adopted thresholds;
- accurate scatterer number detection as hypothesis test adopted;
- robustness to the nonuniform baseline distribution and super-resolution capability as CS imaging adopted; and
- small calculation increase with the increase of K as a priori information of provided by CS imaging.
The practical application of the method was carried out on spotlight TerraSAR-X data over an area of Shenzhen (China), where layover effect is common under the very high resolution data. Presented results show the effectiveness of the proposed approach in detecting reliable single, double and even triple scatterers in urban areas. The very dense detection of single scatterers guarantee the effective reconstruction of manmade structures. It should be noted that only 3D reconstruction is shown here, while the application can be extended to 4D (adding velocity) and even 5D (adding velocity and thermal) with appropriate data stack.
Author Contributions
H.L. conceived the idea, performed the experiments, produced the result, and drafted the manuscript. Z.D. and Y.Z. prepared and provided the stack of SAR images, on which the experiments were performed. Z.L. contributed to discussions of the idea and results. A.Y. and X.Z. contributed to the discussion of the results and the revision of the article. The article was improved by the contributions of all co-authors and all authors accepted the content.
Funding
This work was supported by National Natural Science Foundation of China (61771478).
Acknowledgments
The authors would like to thank the German Aerospace Center (DLR) for providing the TerraSAR-X products.
Conflicts of Interest
The authors declare no conflict of interest.
Abbreviations
The following abbreviations are used in this manuscript:
| 3D | three-dimensional |
| AIC | Akaike information criterion |
| BF | beamforming |
| BIC | Bayes information criterion |
| CFAR | constant false alarm rate |
| CRLB | Cramer–Rao Low Bound |
| CS | compressive sensing |
| FAR | false alarm rate |
| GIS | geographic information systems |
| GLRT | generalized likelihood ratio test |
| IAA | iterative adaptive approach |
| IR-CS | iterative reweighted CS |
| IR-ADMM | iterative reweighted alternating direction method of multipliers |
| LS | least square |
| MC | Monte Carlo |
| MOS | model order selection |
| MUSIC | multiple signal classification |
| probability density function | |
| Pol-TomoSAR | polarimetric TomoSAR |
| PS | permanent scatterers |
| QCFAR | quasi-constant false alarm rate |
| RIP | restricted isometry property |
| RMSE | root mean square error |
| SAR | synthetic aperture radar |
| SLC | single look complex |
| SNR | signal-to-noise Ratio |
| sup-GLRT | support GLRT |
| SVD | singular value decomposition |
| TomoSAR | synthetic aperture radar tomography |
| TWIST | two-step iterative shrinkage/thresholding |
Appendix A
Appendix A.1.
Following the CRLB of two scatterers in [26], here we take a look at the general case of K scatterers, whose parameters are magnitudes , elevation positions and phases , , with the parameter vector denoted as
According to the system model of TomoSAR, the N-pass measurement
with the mean of each pass
and covariance matrix
where is the noise power. It is easy to obtain the likelihood function as
From the Fisher information matrix,
Then, the derived . It should be noted that the derivative of single parameter is
The generic element of the Fisher information matrix J, with describing the impact of the individual isolated scatterers (i.e., in the absence of the others) and contributing to the interference between the pth and qth scatterers, are given by
in Equation (A10).
From Equation (A9), the of the single scatterer case is easily shown as
Thus, the CRLB of elevation, ,
The elevation accuracy is
It is possible to figure out the analytical expression of in multiple scatterers cases, but too lengthy to show here with so many parameters. Actually, what really interests us is the accuracy of elevation, i.e., . Therefore, in the main body, the s are all obtained by numerical results by substituting the systematic parameters into Equations (A9) and (A10) and then by matrix inversion. We should also bear in mind that in the multiple scatterers case, , is dependent on all the parameters, thus (double scatterers) is different from (triple scatterers case).
Appendix A.2.
According to [34], can be motivated as follows. A number of papers [39,40] have carefully studied an approach to denoising by so-called soft thresholding in an orthonormal basis. In detail, suppose that is an orthogonal matrix, and define empirical -coefficients by
Define the soft threshold nonlinearity by
and the thresholded empirical coefficients by
This is soft thresholding of empirical orthogonal coefficients. The papers just cited show that thresholding at has a number of optimal and near-optimal properties regarding mean-squared error. We claim that (again in the case of an ortho basis) the thresholding estimate is also the solution of Equation (9). Observe that the soft-thresholding nonlinearity solves the scalar minimum problem
Note that, because of the orthogonality of , , and so we can rewrite Equation (9) in this case as
Now, applying Equation (A17) coordinatewise establishes the claim.
Figure A1.
Profile reconstruction by CS under different . From left to right, , respectively. All figures are with two scatterers with the same magnitude but with different separations ( for the top figures and for the bottom figures). The red squares are the true locations, while the blue dots are the estimated ones. Please note that the x-axis and the y-axis are normalized to and , respectively.
In addition, the effect of can be simply demonstrated by simulated results. We set under an of 10 dB, and two scatterers located with a separation of and , respectively. Three different s are adopted with , and . The results are shown in Figure A1 with the top figures being the cases of and the bottom figures being the cases of . Please note that the blue dots are estimated results, while the red squares are the true values. From left to right, the are , and , respectively. In the figures, it is obvious that the larger results in a sparser estimation, but, if the choice of is not too far from the optimal one, i.e., , the estimation can be acceptable. Thus, the optimal choice of is quite empirical but will locate near such a .
References
- Reigber, A.; Moreira, A. First Demonstration of Airborne SAR Tomography Using Multibaseline L-Band Data. IEEE Trans. Geosci. Remote Sens. 2000, 38, 2142–2152. [Google Scholar]
- Budillon, A.; Johnsy, A.C.; Schirinzi, G. Urban Tomographic Imaging Using Polarimetric SAR Data. Remote Sens. 2019, 11, 132. [Google Scholar]
- Zhu, X.X.; Yu, A.X.; Dong, Z.; Wu, M.Q.; Li, D.X.; Zhang, Y.S. New approach for robust and efficient detection of persistent in SAR tomography. Remote Sens. 2019, 11, 356. [Google Scholar]
- Martin del Campo, G.D.; Shkvarko, Y.V.; Reigber, A.; Nannini, M. TomoSAR Imaging for the Study of Forested Areas: A Virtual Adaptive Beamforming Approach. Remote Sens. 2018, 10, 1822. [Google Scholar]
- Li, X.W.; Liang, L.; Guo, H.D.; Huang, Y. Compressive Sensing for Multibaseline Polarimetric SAR Tomography of Forested Areas. IEEE Trans. Geosci. Remote Sens. 2016, 54, 153–166. [Google Scholar]
- Banda, F.; Dall, J.; Tebaldini, S. Single and Multipolarimetric P-Band SAR Tomography of Subsurface Ice Structure. IEEE Trans. Geosci. Remote Sens. 2016, 54, 2832–2845. [Google Scholar]
- Yitayew, T.G.; Ferro-Famil, L.; Eltoft, T.; Tebaldini, S. Tomographic Imaging of Fjord Ice Using a Very High Resolution Ground-Based SAR System. IEEE Trans. Geosci. Remote Sens. 2017, 55, 698–714. [Google Scholar]
- She, Z.; Gray, D.; Bogner, R.; Homer, J. Three-dimensional SAR imaging via multiple pass processing. In Proceedings of the 1995 IEEE International Geoscience and Remote Sensing Symposium (IGARSS-95), Florance, Italy, 28 June–2 July 1995; Volume 5, pp. 2389–2391. [Google Scholar]
- Fornaro, G.; Serafino, F.; Soldovieri, F. Three-Dimensional Focusing With Multipass SAR Data. IEEE Trans. Geosci. Remote Sens. 2003, 41, 507–517. [Google Scholar]
- Fornaro, G.; Lombardini, F.; Serafino, F. Three-dimensional multipass SAR focusing: Experiments with long-term spaceborne data. IEEE Trans. Geosci. Remote Sens. 2005, 43, 702–714. [Google Scholar]
- Lombardini, F.; Pardini, M.; Fornaro, G.; Serafino, F.; Verrazzani, L.; Costantini, M. Linear and adaptive spaceborne three-dimensional SAR tomography: A comparison on real data. IET Radar Sonar Navig. 2009, 3, 424–436. [Google Scholar]
- Lombardini, F.; Pardini, M. Superresolution differential tomography: Experiments on identification of multiple scatterers in spaceborne SAR data. IEEE Trans. Geosci. Remote Sens. 2012, 50, 1117–1129. [Google Scholar]
- Kumar, S.; Joshi, S.K.; Govil, H. Spaceborne PolSAR Tomography for Forest Height Retrieval. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 5175–5185. [Google Scholar]
- Sauer, S.; Ferro-Famil, L.; Reigber, A.; Pottier, E. Three-Dimensional Imaging and Scattering Mechanism Estimation Over Urban Scenes Using Dual-Baseline Polarimetric InSAR Observations at L-Band. IEEE Trans. Geosci. Remote Sens. 2011, 49, 4616–4629. [Google Scholar]
- Zhu, X.X.; Bamler, R. Tomographic SAR Inversion by L1 Norm Regularization - The Compressive Sensing Approach. IEEE Trans. Geosci. Remote Sens. 2010, 48, 3839–3846. [Google Scholar]
- Budillon, A.; Evangelista, A.; Schirinzi, G. Three-Dimensional SAR Focusing From Multipass Signals Using Compressive Sampling. IEEE Trans. Geosci. Remote Sens. 2011, 49, 488–499. [Google Scholar]
- Budillon, A.; Ferraioli, G.; Schirinzi, G. Localization Performance of Multiple Scatterers in Compressive Sampling SAR Tomography: Results on COSMO-SkyMed Data. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2014, 7, 2902–2910. [Google Scholar]
- Maio, A.; Fornaro, G.; Pauciullo, A. Detection of Single Scatterers in Multidimensional SAR Imaging. IEEE Trans. Geosci. Remote Sens. 2009, 47, 2284–2297. [Google Scholar]
- Pauciullo, A.; Reale, D.; Maio, A.; Fornaro, G. Detection of Double Scatterers in SAR Tomography. IEEE Trans. Geosci. Remote Sens. 2012, 50, 3567–3586. [Google Scholar]
- Budillon, A.; Schirinzi, G. GLRT Based on Support Estimation for Multiple Scatterers Detection in SAR Tomography. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2016, 9, 1086–1094. [Google Scholar]
- Budillon, A.; Johnsy, A.C.; Schirinzi, G. A Fast Support Detector for Superresolution Localization of Multiple Scatterers in SAR Tomography. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2017, 10, 2768–2779. [Google Scholar]
- Budillon, A.; Johnsy, A.; Schirinzi, G. Extension of a Fast GLRT Algorithm to 5D SAR Tomography of Urban Areas. Remote Sens. 2017, 9, 844. [Google Scholar]
- Pauciullo, A.; Reale, D.; Franze, W.; Fornaro, G. Multi-Look in GLRT-Based Detection of Single and Double Persistent Scatterers. IEEE Trans. Geosci. Remote Sens. 2018, 56, 5125–5137. [Google Scholar]
- Danis, C.; Fornaro, G.; Pauciullo, A.; Reale, D.; Datcu, M. Super-Resolution Multi-Look Detection in SAR Tomography. Remote Sens. 2018, 10, 1894. [Google Scholar]
- Zhu, X.X.; Bamler, R. Demonstration of Super-Resolution for Tomographic SAR Imaging in Urban Environment. IEEE Trans. Geosci. Remote Sens. 2012, 50, 3150–3157. [Google Scholar]
- Zhu, X.X.; Bamler, R. Super-Resolution Power and Robustness of Compressive Sensing for Spectral Estimation With Application to Spaceborne Tomographic SAR. IEEE Trans. Geosci. Remote Sens. 2012, 50, 247–258. [Google Scholar]
- Peng, X.; Wang, C.C.; Li, X.W.; Du, Y.N.; Fu, H.Q.; Yang, Z.F.; Xie, Q.H. Three-Dimensional Structure Inversion of Buildings with Nonparametric Iterative Adaptive Approach Using SAR Tomography. Remote Sens. 2018, 10, 1004. [Google Scholar]
- Ma, P.; Lin, H.; Lan, H.; Chen, F. On the Performance of Reweighted L1 Minimization for Tomographic SAR Imaging. IEEE Geosci. Remote Sens. Lett. 2015, 12, 895–899. [Google Scholar]
- Wang, X.; Xu, F.; Jin, Y.Q. The Iterative Reweighted Alternating Direction Method of Multipliers for Separating Structural Layovers in SAR Tomography. IEEE Geosci. Remote Sens. Lett. 2017, 14, 1883–1887. [Google Scholar]
- Lianhuan, W.; Balz, T.; Zhang, L.; Liao, M. A Novel Fast Approach for SAR Tomography: Two-Step Iterative Shrinkage/Thresholding. IEEE Geosci. Remote Sens. Lett. 2015, 12, 1377–1381. [Google Scholar]
- Zhu, X.X.; Ge, N.; Shahzad, M. Joint Sparsity in SAR Tomography for Urban Mapping. IEEE J. Sel. Top. Signal Proces. 2015, 9, 1498–1509. [Google Scholar]
- Donoho, D.; Stodden, V.; Tsaig, Y. SparseLab. Software, Version 2.1. 2007. Available online: http://sparselab.stanford.edu (accessed on 26 May 2007).
- Stoica, P.; Moses, R. Spectral Analysis of Signals; Prentice-Hall: Englewood Cliffs, NJ, USA, 2005. [Google Scholar]
- Chen, S.S.; Donoho, D.L.; Saunders, M.A. Atomic decomposition by basis pursuit. SIAM Rev. 2001, 43, 129–159. [Google Scholar]
- Zhu, X.X.; Bamler, R. Very High Resolution Spaceborne SAR Tomography in Urban Environment. IEEE Trans. Geosci. Remote Sens. 2010, 48, 4296–4308. [Google Scholar]
- Adam, N.; Bamler, R.; Eineder, M.; Kampes, B. Parametric estimation and model selection based on amplitude-only data in ps-interferometry. In Proceedings of the ESA FRINGE Workshop, Frascati, Italy, 28 November–2 December 2005. [Google Scholar]
- Ferretti, A.; Prati, C.; Rocca, F. Permanent scatterers in SAR interferometry. IEEE Trans. Geosci. Remote Sens. 2001, 39, 8–20. [Google Scholar]
- Fornaro, G.; Pauciullo, A.; Serafino, F. Deformation monitoring over large areas with multipass differential SAR interferometry: A new approach based on the use of spatial differences. Int. J. Remote Sens. 2009, 30, 1455–1478. [Google Scholar]
- Donoho, D.L. De-Noising by soft thresholding. IEEE Trans. Inf. Theory 1995, 41, 613–627. [Google Scholar]
- Donoho, D.L.; Johnstone, I.M.; Kerkyacharian, G.; Picard, D. Wavelet shrinkage: Asymptopia? J. R. Stat. Soc. Ser. B 1995, 57, 301–369. [Google Scholar]
© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).