Single-Look SAR Tomography of Urban Areas

: Synthetic aperture radar (SAR) tomography (TomoSAR) is a multibaseline interferometric technique that estimates the power spectrum pattern (PSP) along the perpendicular to the line-of-sight (PLOS) direction. TomoSAR achieves the separation of individual scatterers in layover areas, allowing for the 3D representation of urban zones. These scenes are typically characterized by buildings of di ﬀ erent heights, with layover between the facades of the higher structures, the rooftop of the smaller ediﬁces and the ground surface. Multilooking, as required by most spectral estimation techniques, reduces the azimuth-range spatial resolution, since it is accomplished through the averaging of adjacent values, e.g., via Boxcar ﬁltering. Consequently, with the aim of avoiding the spatial mixture of sources due to multilooking, this article proposes a novel methodology to perform single-look TomoSAR over urban areas. First, a robust version of Capon is applied to focus the TomoSAR data, being robust against the rank-deﬁciencies of the data covariance matrices. Afterward, the recovered PSP is reﬁned using statistical regularization, attaining resolution enhancement, suppression of artifacts and reduction of the ambiguity levels. The capabilities of the proposed methodology are demonstrated by means of strip-map airborne data of the Jet Propulsion Laboratory (JPL) and the National Aeronautics and Space Administration (NASA), acquired by the uninhabited aerial vehicle SAR (UAVSAR) system over the urban area of Munich, Germany in 2015. Making use of multipolarization data [horizontal / horizontal (HH), horizontal / vertical (HV) and vertical / vertical (VV)], a comparative analysis against popular focusing techniques for urban monitoring (i.e., matched ﬁltering, Capon and compressive sensing (CS)) is addressed.


Introduction
Synthetic aperture radar (SAR) tomography (TomoSAR) is a multibaseline interferometric technique, whose main goal is the estimation of the power spectrum pattern (PSP) along the perpendicular to the line-of-sight (PLOS) direction [1,2]. The attainable PLOS resolution, after matched filtering, is inversely proportional to the synthesized tomographic aperture, limited at best to the Rayleigh resolution [1,3] when the spatial baselines are evenly distributed. However, the acquisition constellation must gather a sufficiently large amount of passes to prevent aliasing within the height range of interest.
In practical scenarios, the number of passes is constrained to the revisit time that leads to temporal decorrelation issues and to the viability of the individual missions to perform the numerous tracks, which translates into a limited (reduced) resolution. Furthermore, nonuniformly distributed baselines entail high sidelobes.
Taking all these matters into consideration, this article proposes a novel methodology to decrease considerably the number of processed looks necessary to perform focusing of the TomoSAR data, preserving all the advantages that the usage of statistical regularization offers.
First, we suggest utilizing a robust version of Capon beamforming, being robust against the rank deficiencies of the data covariance matrices. In this way, we achieve enhanced resolution with a reduced number of looks or even without multilooking. In this scenario, multilooking is required only to handle the multiple nondeterministic sources and in order to increase accuracy in presence of signal-dependent (multiplicative) noise [2], but not to guarantee that the data covariance matrices are invertible.
After applying one of these techniques, the recovered PSP is to be refined using statistical regularization. We make use of the abovementioned MARIA and WISE approaches, which perform regularization in the context of ML and WCF, respectively, but using now a robust version of Capon beamforming to provide them with a first estimation (zero-step iteration) of the PSP.
Both MARIA and WISE present similar advantages. The main difference is that WISE is more flexible to the assumed probability density function (pdf) of the observed data [12]; in contrast, MARIA assumes a Gaussian pdf [12,14]. Consequently, taking into account the a priori knowledge about the pdf of the observed data, the most suitable technique is to be chosen.
The proposed methodology is summarized in Figure 1. For demonstration purposes, in this article, we select DCRCB, which retrieves accurate signal estimates with reduced array output interference [16], and WISE [12], since we assume no a priori information about the pdf of the observed data. The reader, however, can easily adapt the proposed methodology to its convenience.
Remote Sensing 2020, 20, x FOR PEER REVIEW 3 of 28 First, we suggest utilizing a robust version of Capon beamforming, being robust against the rank deficiencies of the data covariance matrices. In this way, we achieve enhanced resolution with a reduced number of looks or even without multilooking. In this scenario, multilooking is required only to handle the multiple nondeterministic sources and in order to increase accuracy in presence of signal-dependent (multiplicative) noise [2], but not to guarantee that the data covariance matrices are invertible.
After applying one of these techniques, the recovered PSP is to be refined using statistical regularization. We make use of the abovementioned MARIA and WISE approaches, which perform regularization in the context of ML and WCF, respectively, but using now a robust version of Capon beamforming to provide them with a first estimation (zero-step iteration) of the PSP.
Both MARIA and WISE present similar advantages. The main difference is that WISE is more flexible to the assumed probability density function (pdf) of the observed data [12]; in contrast, MARIA assumes a Gaussian pdf [12,14]. Consequently, taking into account the a priori knowledge about the pdf of the observed data, the most suitable technique is to be chosen.
The proposed methodology is summarized in Figure 1. For demonstration purposes, in this article, we select DCRCB, which retrieves accurate signal estimates with reduced array output interference [16], and WISE [12], since we assume no a priori information about the pdf of the observed data. The reader, however, can easily adapt the proposed methodology to its convenience. The suggested strategy is intended to deal with only one of the stated downsides of Capon beamforming: the need of multilooking to become applicable. Keep in mind that radiometric accuracy is not preserved, since the related statistical regularization techniques are iterative and involve numerous matrix inversions, along with other matrix/vector operations, to accomplish superresolution.
Also, in this paper, the proposed methodology is pushed to its limits by taking into account the single-look case to perform TomoSAR focusing. Apart of preserving the range/azimuth spatial resolution, this set-up is useful for the study of the several point-type targets, situated in urban scenes, The suggested strategy is intended to deal with only one of the stated downsides of Capon beamforming: the need of multilooking to become applicable. Keep in mind that radiometric accuracy is not preserved, since the related statistical regularization techniques are iterative and involve numerous matrix inversions, along with other matrix/vector operations, to accomplish super-resolution. Also, in this paper, the proposed methodology is pushed to its limits by taking into account the single-look case to perform TomoSAR focusing. Apart of preserving the range/azimuth spatial resolution, this set-up is useful for the study of the several point-type targets, situated in urban scenes, whose behavior tends to be deterministic. This eases the search for stable scatterers, necessary to perform persistent scatterer interferometry (PSI) [6].
Finally, the capabilities of the proposed methodology are validated by means of strip-map airborne data acquired by the uninhabited aerial vehicle SAR (UAVSAR) system of the Jet Propulsion Laboratory (JPL) and the National Aeronautics and Space Administration (NASA), over the urban area of Munich, Germany in 2015 [18]. Making use of multipolarization data [horizontal/horizontal (HH), horizontal/vertical (HV) and vertical/vertical (VV)], a comparative analysis against the aforementioned focusing techniques for urban monitoring (i.e., matched filtering, Capon and CS) is addressed. We refer first to the multilooking case in order to observe the spatial mixture of sources; later on, the single-look case is treated.
The rest of the paper is organized in the following manner: the TomoSAR signal model is reviewed in Section 2; Section 3 studies a modified version of the WISE iterative statistical regularization approach, using DCRCB to retrieve its first input (zero-step iteration); Section 4 is dedicated to present the experimental results, performed using strip-map airborne UAVSAR data; lastly, Section 5 presents a discussion on the previously given experimental results, providing also the concluding remarks.

TomoSAR Signal Model
Within the framework of DOA estimation, as in the previous related studies [12][13][14], the considered TomoSAR signal model is defined by the linear equation of observation (EO) (1) For a certain acquisition constellation, composed of L passes, the data signal vector y denotes the set of focused signals for a given azimuth-range location. We assume co-registration independent on height. Vector s is composed of M samples, taken at the elevation positions {z m } M m=1 displaced along the PLOS direction. These samples describe the continuous complex random reflectivity for a specified azimuth-range position within the illuminated scene. The L × M steering matrix A is the signal formation operator (SFO) that maps the source Hilbert signal space S onto the observation Hilbert signal space Y. Vector n accounts for the additive noise.
Matrix A gathers the steering vectors {a m } M m=1 , each one of dimension L, which contain the interferometric phase information associated to a source located at the PLOS elevation position {z m } M m=1 , above the reference focusing plane. For a given elevation position z m , the corresponding steering vector is given by [4], in which is the two-way vertical wavenumber between the master track and the l-th slave track. Here, {d l } L l=2 is the PLOS-oriented baseline distance between the master track and the l-th slave track, r 1 is the slant-range distance to the target of interest, θ stands for the incidence angle and λ is the carrier wavelength.

of 29
A problem is said to be well-posed in the Hadamard sense if three conditions are accomplished [19]: (i) a solution exists, (ii) the solution is unique and (iii) the solution depends continuously on the data. The TomoSAR problem is considered ill-conditioned in the Hadamard sense, since, generally, the linear EO in (1) has infinite solutions. The number of passes L is normally (much) lower than the number of samples M, meaning that there are more unknowns M than equations L in (1). Moreover, the presence of additive and multiplicative noise adds statistical uncertainty to the problem. Thus, in order to guarantee well-conditioned solutions in the Hadamard sense, some constraints must be imposed to the solver to the EO in (1).
The complex random Gaussian zero-mean vectors s, n and y are characterized by their corresponding correlation matrices where + stands for the Hermitian conjugate, E{·} is the expectation operator and factor N 0 is the power spectral density of the white noise power [2]. The entries of vector s are assumed to be uncorrelated. This simplifies the mathematical developments that factor into the chosen (WISE) statistical regularization approach [12], presented afterward. Therefore, we refer to the same model, with the correlation matrix R s as a diagonal matrix. Vector b, at the principal diagonal of the diagonal matrix D(b), defines the second-order statistics of the complex reflectivity vector s, the backscattering power, which we refer to as the PSP.
The TomoSAR problem consists then of estimating the PSP vector b. For such a purpose, the different techniques within the DOA framework make use of the SFO (steering matrix) A and some a priori information about the statistics of the signal and noise.

First Step: DCRCB
We start by briefly reviewing the Capon beamforming technique, which refers to the solution to the spatial filter design problem [5] given by Let us consider that R y equals the sample covariance matrix There is an implicit assumption that R −1 y and Y −1 exists in both (8) and (10), respectively. This assumption sets a limit (J ≥ L) on the minimum number of independent realizations (looks) for Y in (9), as rank(Y) ≤ L. Note that we presume no multicollinearity when J = L, this is not always the case.
Each matrix Y ( j) J j = 1 in (9), having rank one, contributes through its summation to the assembly of a full-rank matrix Y, constructed from L linearly independent observations. Diagonal loading [17] is a popular approach to improve the robustness of Capon against the rank-deficiencies of the data covariance matrix Y. However, the exact amount of diagonal loading is difficult to determine. The robust versions of Capon introduced in [15][16][17] (e.g., DCRCB), accurately calculate the amount of diagonal loading based on the uncertainty set of the steering vectors.
Generally, the modelled data correlation matrix R y in (6) does not equal the sample data covariance matrix Y, as assumed by Capon. The difference between Y and the diagonal-loaded (thus invertible) matrix R y can be interpreted as due to steering vector errors. These steering vector errors occur normally because of the discrepancies between the assumed signal's DOA and the true one, or due to calibration errors, when there are discrepancies between the expected array response and the actual array response.
The DCRCB technique reduces such steering vector errors by providing an approximate solution to the following minimization problem , where · accounts for the Euclidean norm and with ε as a user specified parameter (degree-of-freedom). After solving (11), the DCRCB approach is inferred from the Capon's expression in (10), but using instead the previously computed steering vectors {â m } M m=1 and the modelled data correlation matrix R y in (6). The DCRCB solver is explicitly defined through the eigen-decomposition Y = UΓU + via (the derivation of (12) is detailed in [16]) where the columns of U contain the eigenvectors of Y, and the main diagonal of the diagonal matrix Γ gathers the respective eigenvalues µ l L l=1 . Factors {ϑ m } M m=1 correspond to the set of Lagrange multipliers obtained from with g m = U + a m M m=1 and ρ = L/(L − ε/2) 2 .

Second Step: WISE
The WCF minimization problem is defined by [20,21] with R y = R y (b) = AD(b)A + + N 0 I. Here, operator tr{·} retrieves the trace of the embraced matrix. WISE provides an approximate solution to (14), given by (the derivation of (15) is detailed in [12]) Notice that WISE is solution-dependent, since it requires of a first estimate of the PSPb [0] as input and in order to construct the correlation matrix R y in (6).

Different initial estimatesb
[0] in (15) may produce different WCF solutions; therefore, seeking convergence, WISE is implemented in an iterative fashion [12]. Nevertheless, this only alleviates the issue to a certain extent, when the discrepancies between the different estimatesb [0] are not highly significant. Moreover, the degradation or improvement of the retrieved solutionb WISE still depends on the accuracy of the first estimateb [0] .

As depicted in Algorithm 1, WISE refines a first estimate of the PSP after each iteration {b
until achieving convergenceb [i=I] . In practice, the iterative process is finished either by reaching a user tolerance control level ε TL or a maximum number of iterations i MAX . This article suggests retrieving the first estimateb [0] in (15) using DCRCB [15,16]. Being a refined robust version of Capon beamforming, DCRCB permits reducing the number of looks significantly, being applicable even for the single-look case. The algorithm in Algorithm 1 has the same structure as in [12], the main change is the replacement of Capon by DCRCB, employed to recover the first inputb [0] =b DCRCB .
The projector onto convex solution sets is integrated into the iterative solver in order to guarantee convergence [22,23]. The minimization problem to be solved is convex, thus, it converges under the conditions that {b Consistent with the methodology presented in Figure 1, the algorithm in Algorithm 1 allows for the usage of different techniques to obtain the first input, e.g., RCB, NCCB and DCRCB [15][16][17]. On the other hand, MARIA has the exact same structure as in Algorithm 1, but using a different solver [12] b rather thanb . Note that the previous warnings about the TomoSAR problem being ill-conditioned do not apply, since the whole purpose of the addressed regularization techniques is to cure that same ill-conditioning.

Experimental Results
Due to strictly regulated air traffic, the experiments conducted in urban areas primarily use space-borne data sets rather than air-borne data acquisitions. However, the high flight altitude of the UAVSAR system of the JPL/NASA eases this constraint. In 2015 UAVSAR was flown in order to acquire L-band fully polarimetric TomoSAR data collections from Munich [18], the third largest city in Germany. Figure 2 shows one single-look-complex (SLC) image out of the stack; the presence of radio frequency interference is due to the several external sources, by instance, those coming from the Munich's airport.

=
The projector onto convex solution sets is integrated into the iterative solver in order to guarantee convergence [22,23]. The minimization problem to be solved is convex, thus, it converges under the conditions that > 0 and that remains positive definite, as the number of iterations increases.
Consistent with the methodology presented in Figure 1, the algorithm in Algorithm 1 allows for the usage of different techniques to obtain the first input, e.g., RCB, NCCB and DCRCB [15][16][17]. On the other hand, MARIA has the exact same structure as in Algorithm 1, but using a different solver [12] = , rather than . Note that the previous warnings about the TomoSAR problem being ill-conditioned do not apply, since the whole purpose of the addressed regularization techniques is to cure that same illconditioning.

Experimental Results
Due to strictly regulated air traffic, the experiments conducted in urban areas primarily use space-borne data sets rather than air-borne data acquisitions. However, the high flight altitude of the UAVSAR system of the JPL/NASA eases this constraint. In 2015 UAVSAR was flown in order to acquire L-band fully polarimetric TomoSAR data collections from Munich [18], the third largest city in Germany. Figure 2 shows one single-look-complex (SLC) image out of the stack; the presence of radio frequency interference is due to the several external sources, by instance, those coming from the Munich's airport. The utilized gulfstream G-III aircraft flew at a nominal altitude of 12.5 km with a swath of 22 km and length of 60 km. The incidence angles range from 25° to 65°. For the specified microwave  Table 1. These were completed on a heading of 193 • . The employed k z values were chosen with the aim of attaining finer vertical resolution in such an urban environment; until now, the largest flown by the UAVSAR system for TomoSAR data acquisitions. Figure 3 shows the vertical wavenumber k z against incidence angle for the different baselines within the previously specified acquisition geometry. The employed values were chosen with the aim of attaining finer vertical resolution in such an urban environment; until now, the largest flown by the UAVSAR system for TomoSAR data acquisitions. Figure 3 shows the vertical wavenumber against incidence angle for the different baselines within the previously specified acquisition geometry.      The employed values were chosen with the aim of attaining finer vertical resolution in such an urban environment; until now, the largest flown by the UAVSAR system for TomoSAR data acquisitions. Figure 3 shows the vertical wavenumber against incidence angle for the different baselines within the previously specified acquisition geometry.     Table 1).
Before using the suggested methodology (as in Figure 1) to perform TomoSAR, we first define the region-of-interest (ROI), where the edifice of the Bavarian state chancellery is located. Figure 5 shows the respective intensity images, with respect to the master track, for all polarizations (HH, HV and VV). The azimuth and range indices are displayed, acting as a guide to identify the azimuth and range bounds of the ROI, which is specified through a red rectangle. The tomograms presented afterward correspond to the lines crossing the ROI, spanning about 194 m along azimuth with a 1.2 • slope.
the region-of-interest (ROI), where the edifice of the Bavarian state chancellery is located. Figure 5 shows the respective intensity images, with respect to the master track, for all polarizations (HH, HV and VV). The azimuth and range indices are displayed, acting as a guide to identify the azimuth and range bounds of the ROI, which is specified through a red rectangle. The tomograms presented afterward correspond to the lines crossing the ROI, spanning about 194 m along azimuth with a 1.2° slope. We start by considering the multilooking case, performing Boxcar filtering on the set of data covariance matrices using a 15 × 3 (range/azimuth) pixel window. More pixels in range are employed for multilooking, in order to make more evident the spatial mixture of sources when comparing to the single-look case, presented later on. One of the main challenges in SAR remote sensing for urban areas is related to the presence of several scattering mechanisms concurring in the backscattered signal. In this context, a study carried in [24], exploiting coherent scatterers, has shown that a datadriven approach can partially help in reconstructing the orientation/radiation pattern of scatterers. Generally, complex scatterers like buildings are randomly oriented w.r.t. the SAR coordinates and one cannot assume their orientation being favorable to the flight direction. Therefore, due to the required multilooking, scattering contribution (source) mixtures will be recorded in the covariance matrix impacting the final reconstruction. For this reason, although the considered building is by chance parallel to the azimuth/flight direction, range multilooking has also been included. This is also The tomographic slices correspond to the edifice of the Bavarian state chancellery, denoted in Figure 6a-c through a blue rectangle and a yellow dashed (half) circle. Figure 6a shows the Google Earth© image of the test region, whereas Figure 5b shows the respective polarimetric SLC SAR image [the colors correspond to the channels HH (red), VV (blue) and HV (green)]. As it can be observed in Figure 6c, the edifice of the Bavarian state chancellery comprises two wings (22 m high) and a central building (28 m high), the latter with two upper floors and a dome on top (52 m high at its peak). Both wings have a ceiling made of glass. The area depicted via the blue rectangle in Figure 6a-c includes the two wings and the central building of the edifice aforementioned, whereas the yellow dashed (half) circle in Figure 6a-c encloses the dome.
We start by considering the multilooking case, performing Boxcar filtering on the set of data covariance matrices using a 15 × 3 (range/azimuth) pixel window. More pixels in range are employed for multilooking, in order to make more evident the spatial mixture of sources when comparing to the single-look case, presented later on. One of the main challenges in SAR remote sensing for urban areas is related to the presence of several scattering mechanisms concurring in the backscattered signal. In this context, a study carried in [24], exploiting coherent scatterers, has shown that a data-driven approach can partially help in reconstructing the orientation/radiation pattern of scatterers. Generally, complex scatterers like buildings are randomly oriented w.r.t. the SAR coordinates and one cannot assume their orientation being favorable to the flight direction. Therefore, due to the required multilooking, scattering contribution (source) mixtures will be recorded in the covariance matrix impacting the final reconstruction. For this reason, although the considered building is by chance parallel to the azimuth/flight direction, range multilooking has also been included. This is also to show that, the methodology presented in this article, which drastically reduces the need for looks, has also the advantage to present a lower sensitivity to the source mixtures/building orientation.
Remote Sensing 2020, 20, x FOR PEER REVIEW 10 of 28 to show that, the methodology presented in this article, which drastically reduces the need for looks, has also the advantage to present a lower sensitivity to the source mixtures/building orientation. As a reference, we first apply matched spatial filtering (MSF) [13], on the sample covariance matrices, to focus the multilooked TomoSAR data. Subsequently, we refer to the area depicted via the red line in Figure 5, crossing the ROI. It can be seen in Figure 7a, that the two wings, constituting the considered edifice (enclosed by the blue rectangle in Figure 6), are more noticeable for the HH and VV polarizations. The central building and the dome (encircled by the yellow dashed line in Figure 6) can be distinguished in all polarizations, with HV offering additional information. Nonetheless, the presence of artifacts and high ambiguity levels hamper the interpretation of the results obtained by MSF. In order to avoid these issues, more sophisticated approaches are normally utilized, which attain enhanced resolution too.
Next, for comparison purposes, we refer also to some of the most popular competing state-ofthe-art nonparametric TomoSAR focusing techniques, namely, Capon [5], DCRCB [15,16] and CS [10,11]. Next, we apply WISE as in [12], using Capon beamforming to provide it with its first input.
The implemented CS technique, introduced in [10,11], raises from the solution to the covariancematching convex optimization problem As a reference, we first apply matched spatial filtering (MSF) [13], on the sample covariance matrices, to focus the multilooked TomoSAR data. Subsequently, we refer to the area depicted via the red line in Figure 5, crossing the ROI. It can be seen in Figure 7a, that the two wings, constituting the considered edifice (enclosed by the blue rectangle in Figure 6), are more noticeable for the HH and VV polarizations. The central building and the dome (encircled by the yellow dashed line in Figure 6) can be distinguished in all polarizations, with HV offering additional information. Nonetheless, the presence of artifacts and high ambiguity levels hamper the interpretation of the results obtained by MSF. In order to avoid these issues, more sophisticated approaches are normally utilized, which attain enhanced resolution too.
Next, for comparison purposes, we refer also to some of the most popular competing state-of-the-art nonparametric TomoSAR focusing techniques, namely, Capon [5], DCRCB [15,16] and CS [10,11]. Next, we apply WISE as in [12], using Capon beamforming to provide it with its first input.
The implemented CS technique, introduced in [10,11], raises from the solution to the covariance-matching convex optimization problem  Here, matrix Ω is a wavelet transform sparsifying matrix, whereas the parameters τ 1 , τ 2 ≥ 0 control the trade-off between the sparsity in Ω, model mismatch and the total variation (TV) regularization term. For the presented experiments we consider the equi-balanced adjustment τ 1 = τ 2 = 0.5 and a sparsifying basis based on the Symlets 4 wavelet family with three levels of decomposition.
Conversely, factor N 0 in R y in (6) acts as a user-specified regularization parameter for WISE [12]. In the given analyses, we set N 0 as suggested in [25]. The maximum number of iterations i MAX in Algorithm 1 is set to 10. Also, we refer to the simplest assignment γ = 0 in (16), which defines the constraining projector onto the nonnegative convex cone solution set.
In order to better appreciate the ambiguity levels reduction and suppression of artifacts achieved by the addressed focusing methods, and since most super-resolution techniques (e.g., CS and WISE) do not preserve radiometric accuracy, all tomograms shown hereafter are normalized with respect to the (pseudo) power recovered by MSF, which is known to be more accurate in this aspect. Moreover, all tomographic slices are presented in a dB scale. Figures 7-9 show then, the tomographic slices with respect to the abovementioned TomoSAR focusing techniques for three polarizations HH, HV and VV, correspondingly. Furthermore, for an easy assessment, Figure 10 presents the superimposed vertical profiles for each position composing the displayed tomograms; the (pseudo) power is presented in a linear scale. We can observe that, in comparison to MSF, the other treated focusing techniques attain higher resolution with better ambiguity levels reduction and suppression of artifacts. CS and WISE are the techniques with best response, WISE being the one achieving finer resolution. As suggested in [12], the PSP recovered using Capon beamforming serves as first input to the WISE iterative procedure. The solution given by Capon is refined via statistical regularization, by means of several iterations, attaining enhanced reconstructions. Also, we can clearly appreciate the resemblance between the Capon and DCRCB responses; as explained previously, DCRCB is design as a robust version of Capon beamforming.
Notice that, in the case of the VV polarization, CS and WISE have more difficulties to get rid of the ambiguities placed above the building, with a pseudo-power up to -6 dB and -10 dB, respectively. Consequently, the analysis of all polarizations, along with the comparison of subsequent tomograms, aids discerning the presence of ambiguities. Nonetheless, due to the spatial mixture of sources, and as consequence of multilooking, the succeeding tomographic slices may combine the same structures, providing fewer information.
For example, because of the chosen multilooking window, when performing Boxcar filtering, a single tomographic slice comprises the backscattering coming from all different parts composing the edifice of the Bavarian state chancellery (as illustrated in Figure 6), namely, both wings, at left and right hand, correspondingly, the central building and the dome. This exemplifies the spatial mixture of sources due to multilooking.
For the single-look case, it is expected the need of more tomograms to describe the complete structure of the edifice of the Bavarian state chancellery, thereby each contributing tomographic slice has significantly new meaningful information. Subsequently, we performed single-look TomoSAR for all polarizations (HH, HV and VV) on the three areas depicted by the green, red and blue lines crossing the ROI in Figure 5, but using the methodology suggested in Figure 1, i.e., DCRCB + WISE. For comparison purposes, we refer to the same TomoSAR focusing techniques as before, except for Capon, since it is no longer applicable.
This time we make use of three tomographic slices to describe the edifice of the Bavarian state chancellery. Figures 11-13 show the single-look tomographic slices corresponding to the area depicted via the green line in Figure 5. Whereas Figure 14 presents the respective superimposed vertical profiles for each position within the displayed tomograms; the (pseudo) power is presented in a linear scale. Conversely, Figures 15-17 depicts the single-look tomograms corresponding to the area depicted via the red line in Figure 5, crossing the ROI. As before, Figure 18 shows the respective superimposed vertical profiles for each position composing the displayed tomograms.      Figure 11. HH tomograms retrieved from the area depicted by the green line crossing the ROI specified in Figure 5. We perform focusing using (a) MSF, (b) DCRCB, (c) CS and (d) DCRCB + WISE. We refer to the single-look case. All tomograms are normalized with respect to the (pseudo) power recovered by MSF. Figure 11. HH tomograms retrieved from the area depicted by the green line crossing the ROI specified in Figure 5. We perform focusing using (a) MSF, (b) DCRCB, (c) CS and (d) DCRCB + WISE. We refer to the single-look case. All tomograms are normalized with respect to the (pseudo) power recovered by MSF.          The results given by Figures 11-13 depict the two wings composing the abovementioned edifice (enclosed by the blue rectangle in Figure 6), whereas the succeeding tomographic slices, given by Figures 15-17, show part of the central building, as seen in Figure 6. As earlier, MSF presents worse resolution, higher ambiguity levels and numerous artifacts. On the other hand, the other treated more sophisticated approaches improve such aspects, being CS and WISE the techniques with best The results given by Figures 11-13 depict the two wings composing the abovementioned edifice (enclosed by the blue rectangle in Figure 6), whereas the succeeding tomographic slices, given by Figures 15-17, show part of the central building, as seen in Figure 6. As earlier, MSF presents worse resolution, higher ambiguity levels and numerous artifacts. On the other hand, the other treated more sophisticated approaches improve such aspects, being CS and WISE the techniques with best performance. Note that WISE remains as the method achieving finer resolution. This time, the solutions retrieved by DCRCB are the ones refined via statistical regularization, by means of the iterative process described in Algorithm 1. After convergence, enhanced reconstructions are attained.
Note that, for MSF and DCRCB, the set of tomograms depicting the wings of the selected edifice (Figures 11-13) present high ambiguity levels among the PLOS height range from −35 m to 60 m and from −10 m to −20 m, with a pseudo-power up to −3 dB and −5 dB, respectively, in the VV polarization. CS and WISE perform reduction of the ambiguity levels; however, the presence of ambiguities is still significant in some positions, especially for VV, which may lead to false detections. The analysis of all polarizations and the succeeding tomograms helps discriminating these ambiguities. The latter makes more sense for the single-look case, since the spatial mixture of sources is avoided. Moreover, we can also compare the single-look response against the multilook response. By instance, the ambiguity levels for CS and WISE in Figure 11 (HH polarization), among the aforesaid PLOS height ranges, are weaker. Furthermore, these same ambiguities do not appear in the succeeding tomograms in Figures 15-17. Without having a priori information on the ROI, we could infer that the sources within such ranges, for the VV polarization in Figure 13, are indeed ambiguities. Thus, the PLOS height range can be set accordingly, e.g., from −10 m to 30 m.
For completeness, Figure 19 shows the single-look tomographic slices corresponding to the area depicted via the blue line in Figure 5. These tomograms represent the dome encircled by the yellow dashed line in Figure 6. We refer only to the methodology suggested in Figure 1 (i.e., DCRCB + WISE) since its capabilities have been already verified. Consistently, Figure 20 presents the corresponding superimposed vertical profiles for each position composing the displayed tomograms; the (pseudo) power is presented in a linear scale.
An interesting aspect about the results given in Figures 19 and 20 is the presence of a single dominant scatterer, which could be utilized to perform PSI. The study carried in [6] suggests combining single-look TomoSAR with a PSI approach toward the objective of improving deformation sampling in layover-affected urban areas. For such a purpose, the use of CS, as a super-resolution technique, is restricted to those pixels that contain closely spaced scatterers (classified a priori), due to the computational burden. The methodology introduced in this work is intended to replace CS; being much faster, it could be applied for the aims of [6] in a general basis, without the need of pre-classifying the pixels composing the illuminated scene.
Finally, Table 2 presents the average processing time to perform focusing of a single pixel, for all addressed TomoSAR focusing techniques. All methods are implemented in Python, using the CVXPY [26] and the CVXOPT [27] software libraries to solve the CS convex problem in (18). In order to take full advantage of the super-resolution capabilities of CS and WISE, we make use of M = 100 samples to describe the estimated PSPb (see Section 2). We consider the single-look case for all methods, excluding Capon.

Discussion
The goal of the introduced methodology is to achieve enhanced TomoSAR imaging of urban areas without the necessity of performing multilooking. In this scenario, multilooking is optionally used to deal with the random scattering responses, but not to guarantee well-conditioned (invertible) data covariance matrices.
One of the motivations of this work is to present an alternative to Capon beamforming, which results inapplicable when the data covariance matrices are rank deficient, and to CS, up to now one of the most used and reliable TomoSAR focusing techniques for urban scenarios, yet, computationally expensive.
Consequently, the introduced methodology suggests solving the nonlinear ill-conditioned TomoSAR inverse problem using statistical regularization approaches, in the context of the decision making theory. Particularly, the MARIA and WISE methods are considered; the first one inspired on ML and the second one based on the WCF criterion. These techniques offer several advantages, which include: (i) resolution enhancement; (ii) ambiguity levels reduction; (iii) suppression of artifacts; (iv) it is not necessary to know beforehand the number of sources sharing the same resolution cell.
The related ML and WCF optimization problems are solved analytically, leading to the possibility to adapt the respective solvers to specific circumstances. By instance, with the aim of preventing the spatial mixture of sources due to multilooking, this paper presents a variation of MARIA and WISE, which utilizes a robust version of Capon (i.e., DCRCB, NCCB and RCB) to provide them with its first input (zero-step iteration). In this way, we make use of the robust properties of DCRCB, NCCB and RCB, against the rank deficiencies of the involved data covariance matrices.
In order to demonstrate the capabilities of the given methodology, we refer to the worst case scenario: the single-look case. First, we refer to the area depicted via the red line in Figure 5. As a reference, we present the corresponding tomograms for the different addressed TomoSAR focusing techniques, after performing multilooking. Subsequently, we refer to the three areas depicted by the green, red and blue lines crossing the ROI in Figure 5. The corresponding tomographic slices for the single-look case are provided.
The first retrieved solutions, performing multilooking, already show the feature-enhancing properties of the selected (WISE) statistical regularization approach, in comparison to the other competing techniques: MSF, Capon, DCRCB and CS. We conclude that, when sufficient multilooking is provided, the use of Capon beamforming is recommended to provide WISE (or MARIA) with the first-input (zero-step iteration). Being the mathematical expression of Capon easier than its robust versions, it is easier to implement, having less computational burden.
On the other hand, when the data covariance matrices are not invertible, as for the single-look case, the use of a robust version of Capon is required to compute a first estimate of the PSP, which, later on, is to be refined using statistical regularization. For demonstration purposes, we make use of DCRCB as input to WISE. The respective tomograms show how WISE still performs feature-enhancement to a first PSP estimate, gotten using a different technique than Capon beamforming.
In general, the algorithm of WISE (and MARIA) allows using different techniques as an input, giving to the user the flexibility to adapt them for the different scenarios. It is recommended, however, the use of Capon beamforming and its robust versions, rather than any other methods, as explained in the following. MARIA and WISE retrieve approximate solutions to their respective minimization problems, being the "ideal" version of Capon the asymptotic. By ideal, we refer to Capon beamforming but using the true (modelled) data covariance matrix, instead of the sample covariance matrix. Therefore, for consistency reasons, it is advisable to use Capon as input, or in this case, one of its robust versions.
CS is a special case, which merits some discussion. The approach considered here is related to the covariance-matching minimization problem described in Equation (18), which is normally tackled using nonspecifically adapted convex optimization algorithms. By instance, for the presented work, the CVXPY [26] and CVXOPT [27] software libraries have been employed. Yet, not having an analytical solution causes some downsides, mentioned next: the tomograms retrieved via CS may present black fringes, since sometimes a solution to the TomoSAR problem cannot be found; the correct selection of τ 1 and τ 2 in Equation (18) produces a better response, however, the setting up of these parameters is not always straightforward.
Regularization parameter selection is customarily done using the L-curve method [28], the Morozov's discrepancy principle [28] or the Stein's unbiased risk estimate (SURE) [29]. These criteria are a posteriori, meaning that a comparison of different retrieved solutions with different regularization parameters is performed. The use of criteria like the aforementioned for tuning up the CS parameters results implausible. On one hand, CS incurs much more computation time in contrast to the other competing methods, due to the nonavailability of adapted efficient convex optimization algorithms and due to the iterative nature of such algorithms. On the other hand, modifying the different parameter selection tools to deal with the CS case, would involve having the analytical expression of the solver.
For these reasons, we consider CS impractical and we rather suggest using Capon beamforming or one of its robust versions (depending on the circumstances), along with MARIA or WISE to perform refinement of the previously retrieved solutions via statistical regularization.
Finally, future lines of research will focus on using better methods to perform multilooking rather than the standard Boxcar filtering. The use of nonlocal spatially adaptive filtering methods is a good option, which enhances the estimation of the covariance matrices, improving the scatterer separation in layover areas thanks to their smoothing and edge-preserving properties [30,31]. Another line of research is the use of ML to estimate the particular structures of the data covariance matrices, making them closer, in a statistical sense, to the true covariance matrices. Recall that most of the TomoSAR focusing algorithms are applied on an estimate of the actual data covariance matrices, obtained from a single independent realization.