Automatic Detection of Low-Rise Gable-Roof Building from Single Submeter SAR Images Based on Local Multilevel Segmentation

Chen, Jinxing; Wang, Chao; Zhang, Hong; Wu, Fan; Zhang, Bo; Lei, Wanming

doi:10.3390/rs9030263

Open AccessLetter

Automatic Detection of Low-Rise Gable-Roof Building from Single Submeter SAR Images Based on Local Multilevel Segmentation

¹

Key Laboratory of Digital Earth Science, Institute of Remote Sensing and Digital Earth, Chinese Academy of Sciences, Beijing 100094, China

²

University of Chinese Academy of Sciences, Beijing 100049, China

³

China Nanjing Research Institute of Electronics Technology, Nanjing 210039, China

^*

Authors to whom correspondence should be addressed.

Remote Sens. 2017, 9(3), 263; https://doi.org/10.3390/rs9030263

Submission received: 4 November 2016 / Revised: 18 February 2017 / Accepted: 9 March 2017 / Published: 13 March 2017

Download

Browse Figures

Versions Notes

Abstract

:

Low-rise gable-roof buildings are a typical building type in shantytowns and rural areas of China. They exhibit fractured and complex features in synthetic aperture radar (SAR) images with submeter resolution. To automatically detect these buildings with their whole and accurate outlines in a single very high resolution (VHR) SAR image for mapping and monitoring with high accuracy, their dominant features, i.e., two adjacent parallelogram-like roof patches, are radiometrically and geometrically analyzed. Then, a method based on multilevel segmentation and multi-feature fusion is proposed. As the parallelogram-like patches usually exhibit long strip patterns, the building candidates are first located using long edge extraction. Then, a transition region (TR)-based multilevel segmentation with geometric and radiometric constraints is used to extract more accurate edge and roof patch features. Finally, individual buildings are identified based on the primitive combination and the local contrast. The effectiveness of the proposed approach is demonstrated by processing a complex 0.1 m resolution Chinese airborne SAR scene and a TerraSAR-X staring spotlight SAR scene with 0.23 m resolution in azimuth and 1.02 m resolution in range. Building roofs are extracted accurately and a detection rate of ~86% is achieved on a complex SAR scene.

Keywords:

very high resolution (VHR); roof patch; roof ridge; parallelogram-like; transition region

Graphical Abstract

1. Introduction

Synthetic aperture radar (SAR) sensors with high resolution, either airborne or spaceborne, show high potential in applications of urban remote sensing such as damage assessment [1,2,3], urban environment monitoring [4], and three-dimensional (3-D) reconstruction [5]. Compared with low- or medium-resolution SAR images that are usually utilized to monitor large-scale objects, very high resolution (VHR) SAR images at meter-level and submeter-level provide more details of the targets of interest in terms of both geometric structures and radiometric features. To ensure a reliable, consistent, and fast extraction of the information from complex SAR urban scenes, robust automatic information extraction methods are essential.

As one of the most important components in urban areas, buildings attract more attention for urban studies. Many techniques for building detection from VHR SAR images have been developed. These methods can be divided into two categories according to the types of experimental data. The first category relies on the availability of ancillary or multiple image data, e.g., multidimensional data [5], multitemporal data [6], InSAR [7], and PolSAR [8], which implies that the area under investigation is imaged more than once with different viewing configurations (changed incidence and/or aspect angle). Because of the complexity of data acquisition and usage, these methods obviously have limitations in some application scenarios such as emergency response or locations with restricted data [9]. The second category consists of methods based on a single SAR image. The related methods can be grouped into model-based methods [10,11], feature-based methods [12,13], and ontological semantic-based methods [9] with the common foundation of building signature extraction [14].

In meter-level SAR images, buildings are mainly represented by strip and block signatures, which are generally composed of layovers, double bounce lines and shadows. The related signature extraction methods include five types. (1) Region-based approaches, which first generate regions that are homogeneous with respect to the extracted features, such as gray statistics and texture features, and then produce individual primitive according to the radiometric, geometric, and topologic characteristics [14]. Related approaches involve Markov random fields [15], mean-shift [16]. (2) Edge-based approaches, such as the POL_CFAR edge detector [8], ROEWA detector [17]. (3) Bright line-based approaches, which consider layovers as line features with a certain width and further extract them via line detectors under multiscales [5,12]. (4) Range profile-based methods. Chen et al. [18] proposed a 1-D detector, referred to as the “range detector,” to detect the footprints of the illuminated wall of cuboid buildings. (5) Combined methods, which merge two or more kinds of the aforementioned methods [19].

Nowadays, spaceborne SAR images with submeter-level resolution can be obtained. In 2013, a new TerraSAR-X mode named staring spotlight (ST) [20], whose azimuth resolution improved to 0.24 m, was introduced for various applications such as earthquake damage mapping [1] and archaeological remote sensing [21]. This kind of submeter-level SAR data source provides new opportunities for building detection and extraction as more details can be observed in such images in terms of both geometric structures and radiometric features, making it possible to focus an analysis on individual buildings with small sizes.

As finer scattering mechanisms become visible in submeter data, the dominant building signatures, which can be used as cues for building extraction, have changed with the improvement of SAR image resolution. Bright spots, strips, and blocks can all be dominant signatures in submeter-level SAR images [8]. The methods suited for meter-level SAR images may be extended and tuned for submeter-level data by introducing new types of primitives, composed objects, and rules [12]. Soergel et al. [22] used principles from perceptual grouping to detect building features such as long, thin roof edge lines, groups of salient point scatters, and symmetric configurations from SAR images with decimeter resolution. Lu et al. [8] proposed a new building signature extraction method based on symmetric analysis. Symmetry axes of elongated building layover and shadow are first delineated, and local radiometric features are then extracted from both the range direction and the direction normal to the symmetry axes.

The availability of richer information from submeter-level SAR image has a positive effect on building detection and extraction, but gives rise to an increase in complexity and the heterogeneity of building signatures. The appearances of buildings strongly vary due to differences in structural elements (such as windows, balconies, or roof details), the size and material of the façade, and the building height. The dominant signatures are quite different for diverse building types (e.g., isolated small buildings, large buildings, and residential buildings), so it is necessary to propose more specific and targeted building detection methods.

In this paper, based on the new characteristics of submeter-level SAR images, the authors propose a framework for the more accurate detection of low-rise, gable-roof buildings from a single submeter-level resolution SAR image. The low-rise gable-roof building is one of the most typical building types in rural areas, shantytowns, and industrial parks of China. The detection and extraction of this type of building can be used for urban reconstruction and damage assessment. Previous studies related to gable-roof building detection can be found in [9,12,23]. Gui et al [9] proposed an ontological semantic-based method to extract buildings with different orientations and different structures. The results show a high detection rate, but the building outlines are inaccurate. Methods in [12,23] are oriented to meter-level SAR images, where both the flat- and gable-roof buildings are imaged as narrow strips. These two methods are not robust for gable-roof building detection in submeter-level SAR images due to the difference in building appearances. Therefore, it is essential to improve the traditional methods or propose a new method.

The advantages of our framework are as follows. (1) New types of feature primitives (e.g., building roof ridge and two adjacent parallelogram-like roof patches) on submeter-level SAR images are utilized to identify individual buildings. (2) Since building signature features are closely related, the authors adopt a transition region (TR)-based segmentation to simultaneously extract edge and regional features, which can unify the primitive extraction process. (3) In view of the fractured nature of the linear and regional features and the instability of feature extraction in VHR images, a multilevel threshold defining strategy is used in the TR-based segmentation to analyze the image with multiple levels to obtain more informative and robust features. Linear and regional features are fused to optimize final results.

The remainder of this paper is organized as follows. The backscattering model analyses of gable-roof buildings are described in Section 2. Section 3 introduces the processing chains of the proposed framework. The experimental results are provided in Section 4. Finally, the discussion and conclusions are presented in Section 5 and Section 6, respectively.

2. Gable-Roof Building Signature Analysis

Figure 1 shows a schematic view of the backscattering model of a typical low-rise gable-roof building. The typical layover and double-bounce, which are the dominant features in meter-level resolution SAR images, are not discernable and merged into the echoes of building roof in submeter resolution SAR images. The key features of this kind of building turn into parallelogram-like roof patches, roof edges, and shadowing effects. These features are interpreted in detail following radiometric and geometric analyses.

Radiometric analysis: The building signatures are divided into three parts (A1, A2, and A3 in Figure 1c), with the mean intensity of A1 > A2 > A3. Part A1 indicates the near-sensor roof patch, including scattering areas a, b, and c in Figure 1d. Roof scattering is affected by wall scattering and ground scattering, and those contributions are mixed (i.e., layover, a + b). A2 indicates the far-sensor roof patch, which is only characterized by scattering from the roof (denoted by d), and A3 is the shadow area. The intensity and spatial relationships of A1, A2, and A3 are used to identify building objects and exclude false alarms in subsequent building detection.

Geometric analysis: As A1 and A2 usually exhibit parallelogram-like long strip patterns, the long edge lines of the strips (illustrated with green, red and yellow lines in Figure 1c) can indicate the building orientation. Long edge lines are more likely to be obtained than the short edges because edges are likely to split into several fractions due to the effects of noise and complex surface scenes. Therefore, in this paper, the long edge lines of the two parallelogram-like patches are used as the cues for building detection.

According to the sensor viewing direction and the distance between roof edges and the sensor, the long edge lines are defined into three types: near-sensor edge, far-sensor edge, and roof ridge. As shown in Figure 1c, the green, red, and yellow lines indicate the position of near-sensor edge, roof ridge, and far-sensor edge, respectively. Figure 2 shows the spatial composing relationship of edge lines and roof patches. Adjacent feature primitives are merged to compose more structured features. The intensity and spatial relationships of all these features are used to identify building objects in subsequent building detection.

3. Method

Figure 3 shows the flow chart of our approach. The processing chain consists of four main steps: edge extraction, building candidate area extraction, parallelogram-like roof patch extraction using TR-based multilevel segmentation, and selection of roof candidates and individual building identification. To mitigate speckle noise in the SAR image, the original image is filtered with a non-local algorithm proposed by Deledalle et al. [25], which has a strong denoising effect and simultaneously preserving fine structures.

3.1. Edge Extraction and Classification

The edge detection is performed on the filtered image using a gradient computation method, gradient by ratio (GR), proposed by Dellinger et al. [26]. A ratio map is obtained from the GR method, which is then thresholded and binary linear strips are obtained (see Figure 4c). A local adaptive threshold, rather than a global threshold, is used to preserve more detailed edge information and to improve the detection rate. The threshold is derived from the gradient histogram of a local sub-window (30 × 30) with the highest 12% of the response pixels remaining. Several morphological processing steps, e.g., dilation, erosion, skeleton extraction, and removal of branch points, are used to optimize the binary image. Afterward, the Douglas–Peucker algorithm [27], which reduces the number of points in a curve that is approximated by a series of binary pixels, is applied to generate segment primitives. Segment primitives meeting a certain proximity and collinearity are reshaped by least-square-fit processing, which produces a new vector representation of the segments. Finally, edge lines are obtained through an iterative calculation of the global connection of segment primitives. Edge lines beyond a certain length are reserved as roof edge candidates and short edges are eliminated. In this study, a length threshold equal to 20 is used. The corresponding result is shown in Figure 4d.

The types of the roof edge candidates are determined by a two-rectangle classifier shown in Figure 5. The two rectangular windows on both sides of an edge line are constructed for intensity computation. The window length is the same as the length of the edge line, and the width r is set to five times the resolution of the image in this paper. The classification rules composed by the mathematical relationship are shown in Figure 5b. Three parameters,

M_{r e c 1}

,

M_{r e c 2}

, and

T_{s h a d o w}

, are introduced to indicate the mean intensity of pixels in rec1 and rec2 and the shadow area of SAR image, respectively.

M_{r e c 1}

and

M_{r e c 2}

are obtained by the classifier, and

T_{s h a d o w}

is obtained by analyzing the intensity of sample pixels belonging to the shadow regions in the SAR image [16].

3.2. Building Candidate Area Extraction

Given an edge candidate

L_{i}

with two endpoints

P_{1} (x_{1}, y_{1})

and

P_{2} (x_{2}, y_{2})

, a rectangular region of interest labeled as

C_{i}

is obtained as indicated in Figure 6. The two endpoints are extended by a distance d, which is selected by analyzing the size of an individual gable-roof building in the SAR image. The region of interest should be large enough to contain a whole building, but too large of a region increases the computational cost. The post processing is based on

C_{i}

.

3.3. Parallelogram-Like Roof Patch Extraction

3.3.1. Feature Primitive Extraction and Optimizing

A TR-based image segmentation method is used to extract the two parallelogram-like roof patches. TR-based image segmentation is a recently developed single-object segmentation technique [28,29] that first extracts the image transition region between the object and the background, and then takes the mean of the pixels in the transition region as the segmentation threshold for image thresholding. In this paper, instead of using local complexity and local variance as employed by Li et al. [28], the local gradient, which was used for edge extraction in Section 3.1, is also used as a descriptor to extract the transition region. For each edge candidate

L_{i}

, the corresponding binary linear strip from the GR is extracted as transition region

R_{i}

. Example images are shown along each step for better illustration purposes. Selecting an edge candidate from Figure 4d, the extracted area of a building candidate is shown in Figure 7a. Figure 7b shows the corresponding binary linear strip, which is used to generate the edge line. The binary linear strip is extracted from Figure 4c, and in this section, it is used as a transition region for image segmentation.

Given a suitable threshold

t

computed from the transition region, such as the mean intensity of the pixels in transition region, a binary image is obtained after the image thresholding. The binary image is optimized by some morphological processing such as dilation and erosion. Then, the regions that overlap with the transition region are reserved as optimized building roof candidates

C a n d i_{i}

. The example image is shown in Figure 7c. The boundary pixels of

C a n d i_{i}

are obtained and used to generate line segments using the Douglas–Peucker algorithm. The line segments parallel to

L_{i}

are extracted as new roof edge candidates

L_{i j}, j = 1, ... N

, and

N

is the total number of parallel lines. The type of

L_{i j}

is determined using the two-rectangle classifier. Afterwards, the spatial composing relationship shown in Figure 2 is used to eliminate the false candidates. For example, if

L_{i}

is a near-sensor edge, then

L_{i j}

must be a roof ridge, or vice versa. The example image is shown in Figure 7d. Finally, the linear feature primitives (

L_{i}

,

L_{i j}

) and regional feature primitive (

C a n d i_{i}

) are fused and optimized using the following steps to obtain a parallelogram-like roof patch.

Shape-constrained feature optimizing. For each pair of parallel edges, $C a n d i_{i}$ is reduced to the region between the two parallel lines, and a parallelogram P is matched by extending the edges. The edge length is set to the midline length of $C a n d i_{i}$ . The cost function $F (P)$ can be written as

$F (P) = 1 - \frac{f (C a n d i_{i} \cap P)}{f (P)}$

(1)

where function $f$ is an area computing function. The parallelogram P with the minimum $F (P)$ is reserved. The corresponding example image is shown in Figure 7e.
Homogeneity-constrained feature optimizing. The fitted parallelogram P is fused with the mean shift segmentation in this step. First, the segmentation patches that overlap with P are extracted as $C a n d i_{i}$ , and the parallelogram matching function $F (P)$ is then computed again to obtain the final P. This processing step is not strictly necessary for the correct operation of the proposed technique but increases the accuracy of roof patch extraction. The corresponding example image is shown in Figure 7f.

3.3.2. Multilevel Thresholds

Gable-roof buildings are exhibited as two adjacent parallelogram-like roof patches with different intensity. The near-sensor roof patch, which has a higher intensity than the far-sensor one, is extracted first. Then, a multilevel quantization technique is used in the TR-based segmentation to extract the far-sensor roof patch. The multilevel quantization technique is formed by a multilevel threshold defining strategy.

In this study, the multilevel threshold is defined as

t_{m} = T_{\max} - \frac{T_{\max} - T_{s h a d o w}}{M} \times (m - 1), m \in [1, ... M]

(2)

where

T_{\max}

is the upper limit of

t_{m}

, M is the total number of levels，and

T_{s h a d o w}

is the mean intensity of shadow areas. These three parameters are set in the following way: (1)

T_{\max}

is selected according to the mean intensity of the transition region, which is a typical threshold in the TR-based segmentation. We set

T_{\max} = k t_{0}

, where k is a scaling factor and

t_{0}

is the mean intensity. k is used to enhance the ability to separate the near-sensor roof patch from the far-sensor roof patch. If there is a distinct difference in intensity between the two kinds of roof patches, a small k (e.g., k = 1) can be effective. Otherwise, a larger value is needed. In our experiments, k = 1.2 is used. (2) The multilevel quantization technique is superior to the single-level technique because it can capture the detailed variation of intensities around edges and is resistant to speckle noise. A large M can capture more robust linear and regional features but increases the computational cost. In our experiments, a similar performance has been obtained using M = 6, 8, 10, 12. A middle value M = 8 is suggested. (3)

T_{s h a d o w}

is used to separate the object from the shadow, it determines the lower bound of the threshold. In this study, shadow areas are extracted according to the approach proposed in [16]; then,

T_{s h a d o w}

can be obtained.

Based on the multilevel image segmentation, the two roof patches are concatenated at multiple levels. The structure and intensity information of the near-sensor roof patch can be used to restrain the extraction of the far-sensor one. The example image is shown in Figure 7g. A building roof candidate containing both the near-sensor roof patch and the far-sensor roof patch is extracted under a level-5 threshold. The near-sensor roof patch is removed and the rest region (Figure 7h) is used to construct the far-sensor roof patch in a way similar to that of the near-sensor one. The final roof edges are shown in Figure 7i.

3.4. Individual Building Identification

First, the spatial relationship between the shadow and the roof patches is used to eliminate false alarms. The roof patches are reserved on the condition that there is an adjacent shadow area on the far-sensor side. Then, a contrast restriction is used to identify individual building. For each quadrangle constructed by the two extracted roof patches, the local contrast value

C_{r}

is calculated from the filtered image by the following equation:

C_{r} = [\frac{1}{M_{i n}} \sum_{i \in A_{i n}} x_{i}] . [\frac{1}{M_{o u t}} \sum_{i \in A_{o u t}} (1 - x_{i})]

(3)

where

A_{i n}

and

A_{o u t}

are the inner regions of the quadrangle and the surrounding outer thick border, respectively; the thickness of

A_{o u t}

is set to half of the roof width;

M_{i n}

and

M_{o u t}

are the number of pixels contained in the regions

A_{i n}

and

A_{o u t}

, respectively; and

x_{i}

represents the pixel intensity value normalized between 0 and 1.

C_{r}

is a measure of the contrast between the pixels contained in the quadrangle and their surroundings. Higher differences between the mean intensities of the two regions produce higher

C_{r}

values. This measure was proposed by Simonetto et al. [30] and used by Ferro et al. [12] for local contrast computation.

4. Materials and Experimental Results

The proposed method was verified on a TerraSAR-X staring spotlight SAR scene and a Chinese VHR airborne SAR scene. The imaging parameters are listed in Table 1. The TerraSAR-X staring spotlight image used in this paper was taken over the southern outskirt of Suzhou, Jiangsu province in China, in August 2014. The Chinese X-band airborne VHR SAR image was acquired over a rural area near Xi’an City in the Shanxi province of China in 2013. The building orientations are different in the two scenes.

4.1. Test on TerraSAR-X Staring Spotlight SAR Scene

As shown in Figure 8a, the TerraSAR-X staring spotlight SAR scene (555 × 410) contains 23 low-rise gable-roof buildings. The roof signatures are prominent, whereas the layover and double-bounce phenomena are not discernable, making the traditional building detection methods based on layover and double-bounce not applicable in this scenario. The contrast of intensity between the near-sensor roof patch and the far-sensor roof patch is more clearly presented in a mean shift segmentation result (Figure 8b). Near-sensor roof patches correspond to bright segments, whereas far-sensor roof patches correspond to gray segments.

To locate the building candidate area, edges are extracted as shown in Figure 8c. The results are fractured while the long edges are mostly near the long strip structures of the roofs. Then, TR-based multilevel segmentation is implemented to extract parallelogram-like roof patches. After feature optimization and false alarm elimination, individual buildings are identified in Figure 8d. All 23 gable-roof buildings are detected with accurate roof patch extraction.

To validate the performance of our algorithm, the marker-controlled watershed segmentation-based method [17] is also performed on the TerraSAR-X staring spotlight SAR scene. The OS-CFAR detector is used to extract bright pixels of buildings. Since the intensity of the pixels in far-sensor roof patches is relatively low, these pixels are lost. The result is shown in Figure 8e. All 23 gable-roof buildings are partly detected, with far-sensor roof patches missing.

4.2. Test on Chinese Airborne SAR Scene

A more complex scene imaged by a Chinese airborne SAR is shown in Figure 9. Thirty-six gable-roof buildings are included in this scene. Building signatures are severely affected by the surrounding environment, such as adjacent buildings, trees, grass, and other human-made objects. The processing procedure is the same as what was used for the TerraSAR-X image. The final experimental result is shown in Figure 10. Some statistics for quantitative evaluation were manually calculated and are summarized in Table 2.

The detection rate is defined as the ratio of correctly identified buildings to the total number of gable-roof buildings in the SAR image. A building is defined as partially detected if it is detected for less than 90% of its pixels. The merged buildings indicate that two adjacent buildings are identified as one building. It can be seen that our method performs well, with 86% of buildings detected, including some buildings with fractured roof features, indicating that our method is robust and resistant to the effect of the surrounding environment. Let us consider, for example, Building B23 in Figure 11a, which has a heterogeneous roof patch. The mean shift segmentation result is shown in Figure 11b, and it is fractured. The transition region used for the extraction of roof edges and roof patches is shown in Figure 11c, and the extracted building roof candidates and parallel lines are shown in Figure 11d,e. The building roof candidates are malformed and the edge lines are partial extracted. Figure 11f is the final result. The whole roof edges are accurately obtained, which indicates that the proposed method has the ability to optimize fractured roof features.

However, there are some merged, partially extracted or undetected buildings. The merged buildings are caused by the overlap between adjacent buildings with the same orientation (Buildings B12 and B13, B24 and B25, B26 and B27, B31 and B32). Buildings B3 and B4 are partially detected as their signatures are disturbed on the left side. In the process of roof patch extraction, the near-sensor roof patch is extracted first, and its length is used as a prior knowledge to extract the far-sensor roof patch. Therefore, if the near-sensor roof patch is partially extracted, some pixels in the far-sensor one will be undetected too, thus leading to an incomplete and inaccuracy extraction of roof edges in the process of parallelogram construction. Meanwhile, the direction error of the parallel lines used to generate parallelogram affects the final result too (Building B5). Moreover, if the buildings are shadowed or their edges are too blurry to be extracted, they will be undetected (e.g., Buildings B14, B15, and B16).

5. Discussion

As shown by our results, the proposed method obtains accurate roof extraction from a TerraSAR-X staring spotlight mode SAR image and shows an 86% detection rate on a complex airborne SAR scene. The number of false alarms is limited due to the strict constraints of the radiometric and geometric characteristics of the roof signatures. Other objects such as trees, roads, and other human-made structures do not show similar radar signatures to those of gable-roof buildings. Figure 8d shows that, compared with the marker-controlled watershed segmentation-based method that oriented to meter-level SAR images, the proposed method has a distinct advantage on the extraction of building outlines.

The proposed method is feature-based, and the basis is the exploitation of the very high resolution. Meter-level SAR images from other beam modes with less spatial resolution, such as TerraSAR-X spotlight images, are not the application scenarios of our method as a low number of features are visible in such images. Except for the resolution limitation, the proposed method is sensitive to viewing configurations of the SAR sensor, which may lead to a missing of gable-roof building features. As shown in Figure 1, our method is based on the angle relationship of the incidence angle

θ

smaller than

90^{°} - α

(

α

is the roof inclination). If

θ > 90^{°} - α

, only the near-sensor roof patch can be imaged. With a decreasing number of features, the gable-roof and flat-roof buildings can be detected in a unified way, such as in [12].

In the process of parallelogram-like roof patch extraction, each parallelogram is constructed based on a pair of parallel edges. As a future refinement, the other two edge lines (corresponding to the two short edges of a parallelogram) will be located and used for parallelogram optimizing based on perceptual grouping methods. The goal of this step is to provide a more accurate result of building outlines, which can be used as a starting point for further calculations, e.g., the estimation of building widths and lengths. Moreover, by exploiting the reconstruction of the shadow areas, height retrieval techniques can be also applied to estimate building heights.

6. Conclusion

This paper addresses the problem of automatic detection of low-rise gable-roof buildings in a single submeter-level SAR image. Unlike most related methods, which focus on the usage of building layover, double-bounce, and shadow signatures, the new dominant features, i.e., two adjacent parallelogram-like roof patches and roof ridge are utilized for building detection in a single submeter-level SAR image. Features are extracted in a coarse to fine manner, using a TR-based multilevel segmentation technique. The proposed method obtains accurate roof extraction from a TerraSAR-X staring spotlight SAR scene and shows an 86% detection rate on a complex airborne SAR scene. The false alarms are limited due to the strict constraints of the radiometric and geometric characteristics of roof signatures. As future developments, the building roof will be further optimized and used for the retrieval of building widths, lengths, and heights.

Acknowledgments

This work was supported by the National Natural Science Foundation of China (Nos. 41331176, 41371413,41401514) and the National Key Research and Development Program of China (2016YFB0501501). The authors would like to thank the German Aerospace Center for providing the TerraSAR-X ST data from the TerraSAR-X AO project (LAN2456).

Author Contributions

Jinxing Chen conceived and performed the experiments; Chao Wang and Hong Zhang supervised and designed the research and contributed to the article’s organization; Fan Wu and Bo Zhang carried on the result analysis, Wanming Lei optimized some algorithms. Jinxing Chen, Chao Wang, and Hong Zhang drafted the manuscript, which was revised by all authors. All authors read and approved the final manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

Gong, L.; Wang, C.; Wu, F.; Zhang, J.; Zhang, H.; Li, Q. Earthquake-induced building damage detection with post-event sub-meter VHR TerraSAR-X staring spotlight imagery. Remote Sens. 2016, 8, 887. [Google Scholar] [CrossRef]
Brett, P.T.B.; Guida, R. Earthquake damage detection in urban areas using curvilinear features. IEEE Trans. Geosci. Remote Sens. 2013, 51, 4877–4884. [Google Scholar] [CrossRef] [Green Version]
Brunner, D.; Lemoine, G.; Bruzzone, L. Earthquake damage assessment of buildings using VHR optical and SAR imagery. IEEE Trans. Geosci. Remote Sens. 2010, 48, 2403–2420. [Google Scholar] [CrossRef]
Henderson, F.M.; Xia, Z.G. SAR applications in human settlement detection, population estimation and urban land use pattern analysis: A status report. IEEE Trans. Geosci. Remote Sens. 2002, 35, 79–85. [Google Scholar] [CrossRef]
Simonetto, E.; Oriot, H.; Garello, R. Rectangular building extraction from stereoscopic airborne radar images. IEEE Trans. Geosci. Remote Sens. 2005, 43, 2386–2395. [Google Scholar] [CrossRef]
Amitrano, D.; Belfiore, V.; Cecinati, F.; Di Martino, G.; Iodice, A.; Mathieu, P.P.; Medagli, S.; Poreh, D.; Riccio, D.; Ruello, G. Urban areas enhancement in multitemporal SAR RGB images using adaptive coherence window and texture information. IEEE J. Sel. Topics Appl. Earth Obs. Remote Sens. 2016, 9, 3740–3752. [Google Scholar] [CrossRef]
Dubois, C.; Thiele, A.; Hinz, S. Building detection and building parameter retrieval in InSAR phase images. ISPRS J. Photogramm. Rem. Sens. 2016, 114, 228–241. [Google Scholar] [CrossRef]
Xu, F.; Jin, Y.Q. Automatic reconstruction of building objects from multiaspect meter-resolution SAR images. IEEE Trans. Geosci. Remote Sens. 2007, 45, 2336–2353. [Google Scholar] [CrossRef]
Gui, R.; Xu, X.; Dong, H.; Song, C.; Pu, F. Individual building extraction from TerraSAR-X images based on ontological semantic analysis. Remote Sens. 2016, 8, 708. [Google Scholar] [CrossRef]
Franceschetti, G.; Iodice, A.; Riccio, D. A canonical problem in electromagnetic backscattering from buildings. IEEE Trans. Geosci. Remote Sens. 2002, 40, 1787–1801. [Google Scholar] [CrossRef]
Quartulli, M.; Datcu, M. Stochastic geometrical modeling for built-up area understanding from a single SAR intensity image with meter resolution. IEEE Trans. Geosci. Remote Sens. 2004, 42, 1996–2003. [Google Scholar] [CrossRef]
Ferro, A.; Brunner, D.; Bruzzone, L. Automatic detection and reconstruction of building radar footprints from single VHR SAR images. IEEE Trans. Geosci. Remote Sens. 2013, 51, 935–952. [Google Scholar] [CrossRef]
Zhang, F.L.; Shao, Y.; Zhang, X.; Balz, T. Building L-shape footprint extraction from high resolution SAR image. In Processings of the IEEE Joint Urban Remote Sensing Event, Munich, Germany, 11–13 April 2011.
Lu, P.P.; Du, K.N.; Yu, W.D.; Feng, H. New building signature extraction method from single very high-resolution synthetic aperture radar images based on symmetric analysis. J. Appl. Remote Sens. 2015, 9, 095072. [Google Scholar] [CrossRef]
Fu, X.Y.; You, H.J.; Fu, K. Building segmentation from high-resolution SAR images based on improved Markov random field. Acta Electron. Sin. 2012, 40, 1141–1147. [Google Scholar]
Cellier, F.; Oriot, H.; Nicolas, J.M. Introduction of the mean shift algorithm in SAR imagery: Application to shadow extraction for building reconstruction. In Proceedings of the Earsel 3D Remote Sensing Workshop, Porto, Portugal, 6–11 June 2005.
Zhao, L.J.; zhou, X.G.; Kuang, G.Y. Building detection from urban SAR image using building characteristics and contextual information. EURASIP J. Adv. Signal Proc. 2013, 1, 1–16. [Google Scholar] [CrossRef]
Chen, S.S.; Wang, H.P.; Xu, F.; Jin, Y.Q. Automatic recognition of isolated buildings on single-aspect SAR image using range detector. IEEE Geosci. Remote Sens. Lett. 2015, 12, 219–223. [Google Scholar] [CrossRef]
Chaabouni-Chouayakh, H.; Datcu, M. Coarse-to-fine approach for urban area interpretation using Terrasar-X data. IEEE Geosci. Remote Sens. Lett. 2010, 7, 78–82. [Google Scholar] [CrossRef]
Mittermayer, J.; Wollstadt, S.; Prats-Iraola, P.; Scheiber, R. The TerraSAR-X staring spotlight mode concept. IEEE Trans. Geosci. Remote Sens. 2014, 52, 3695–3706. [Google Scholar] [CrossRef]
Tapete, D.; Cigna, F.; Donoghue, D.N. ‘Looting marks’ in space-borne SAR imagery: Measuring rates of archaeological looting in Apamea (Syria) with TerraSAR-X Staring Spotlight. Remote Sens. Environ. 2016, 178, 42–58. [Google Scholar] [CrossRef]
Soergel, U.; Thoennessen, U.; Brenner, A.; Stilla, U. High-resolution SAR data: New opportunities and challenges for analysis of urban areas. IEEE Proc. Radar Sonar Naving. 2006, 153, 294–300. [Google Scholar] [CrossRef]
Brunner, D.; Lemoine, G.; Bruzzone, L.; Greidanus, H. Building height retrieval from VHR SAR imagery based on an iterative simulation and matching technique. IEEE Trans. Geosci. Remote Sens. 2010, 48, 1487–1504. [Google Scholar] [CrossRef]
Tang, K.; Fu, K.; Sun, X.; Sun, H.; Wang, H.Q. Signature analysis and 3-D reconstruction of rectangular building in very high resolution SAR images. J. Infrared Millim. Waves. 2013, 32, 198–204. [Google Scholar] [CrossRef]
Deledalle, C.A.; Denis, L.; Tupin, F.; Reigber, A.; Jäger, M. NL-SAR: A unified non-local framework for resolution-preserving (Pol)(In) SAR denoising. IEEE Trans. Geosci. Remote Sens. 2015, 53, 2021–2038. [Google Scholar] [CrossRef]
Dellinger, F.; Delon, J.; Gousseau, Y.; Michel, J.; Tupin, F. SAR-sift: A sift-like algorithm for SAR images. IEEE Trans. Geosci. Remote Sens. 2015, 53, 453–466. [Google Scholar] [CrossRef] [Green Version]
Douglas, D.; Peucker, T. Algorithms for the Reduction of the Number of Points Required to Represent a Digitized Line or its Caricature; John Wiley and Sons. Ltd: New York, NY, USA, 2011; pp. 15–18. [Google Scholar]
Li, Z.; Liu, G.; Zhang, D.; Xu, Y. Robust single-object image segmentation based on salient transition region. Pattern Recogn. 2016, 52, 317–331. [Google Scholar] [CrossRef]
Li, Z.; Zhang, D.; Xu, Y.; Liu, C. Modified local entropy-based transition region extraction and thresholding. Appl. Soft Comput. 2011, 11, 5630–5638. [Google Scholar] [CrossRef]
Simonetto, E.; Oriot, H.; Garello, R. Extraction of industrial structures and DEM from airborne SAR images. In Proceedings of Physics in Signal and Image Processing, Marseille, France, 23–24 January 2001.

Figure 1. (a) A typical low-rise gable-roof building example [24]. (b) Top view of the roof with a certain incidence angle and aspect angle. (c) Structures of the gable-roof building signature. The green, red, and yellow lines indicate the position of near-sensor edge, roof ridge, and far-sensor edge, respectively. (d) Scattering model in the range profile viewed from the left. Here, the incidence angle

θ

is smaller than

90^{°} - α

. (

α

is the roof inclination). The gray areas at the bottom of the figure symbolize intensity. (e) The appearance of a gable-roof building in a real Chinese airborne synthetic aperture radar (SAR) image with 0.1 m resolution.

Figure 1. (a) A typical low-rise gable-roof building example [24]. (b) Top view of the roof with a certain incidence angle and aspect angle. (c) Structures of the gable-roof building signature. The green, red, and yellow lines indicate the position of near-sensor edge, roof ridge, and far-sensor edge, respectively. (d) Scattering model in the range profile viewed from the left. Here, the incidence angle

θ

is smaller than

90^{°} - α

. (

α

is the roof inclination). The gray areas at the bottom of the figure symbolize intensity. (e) The appearance of a gable-roof building in a real Chinese airborne synthetic aperture radar (SAR) image with 0.1 m resolution.

Figure 2. The spatial composing relationship of edge lines and roof patches. Adjacent feature primitives are merged to compose more structured features.

Figure 3. Flowchart of the proposed method.

Figure 4. An example of edge extraction and classification on a submeter-level airborne SAR image containing two gable-roof buildings. (a) Airborne SAR image with resolution 0.1 m × 0.1 m. (b) Filtered image. (c) Binary image based on gradient by ratio (GR). (d) Edge lines with length more than 20.

Figure 5. Definition of the classifier. (a) Definition of the two-rectangle window. (b) Classification rules.

Figure 6. Definition of the building candidate area.

Figure 7. An example of parallelogram-like roof patch extraction. (a) The selected edge candidate (red line) and the corresponding building candidate area. (b) Transition region. (c) Building roof candidate with threshold t. (d) The extracted parallel edge lines. (e) The parallelogram P after the shape-constrained feature optimizing. (f) The final P after the homogeneity-constrained feature optimizing. (g) Building roof candidate with a level-5 threshold. (h) The near-sensor roof edges (red parallelogram) and the far-sensor roof patch candidate (binary strip). (i) The final roof edges denoted by two red parallelograms.

Figure 8. (a) TerraSAR-X staring spotlight image with viewing direction from the left. (b) The result of mean shift segmentation. (c) Results of edge extraction. (d) Final results of gable-roof building detection. (e) Results of the marker-controlled watershed segmentation method.

Figure 9. Chinese airborne SAR 0.1 m resolution image of a village with viewing direction from the left.

Figure 10. Final results of gable-roof building detection. The extracted gable roofs are overlain on the original SAR image. Buildings B2, B14, B15, B16 and B18 are undetected.

Figure 11. An example of roof extraction with fractured regional features. (a) Building B23 image. (b) The result of mean shift segmentation. (c) Transition region. (d,e) The extracted building roof candidates and parallel lines in different level. (f) Final result of building roof.

Table 1. Imaging parameters of experimental data.

**Table 1.** Imaging parameters of experimental data.
Parameter	TerraSAR-X Staring Spotlight SAR	Chinese Airborne SAR
range resolution(m)	1.02	0.1
azimuth resolution(m)	0.24	0.1
center incidence angle(deg)	33	58
polarization type	HH	HH

Table 2. Building detection results of Figure 10.

**Table 2.** Building detection results of Figure 10.
Total Buildings	Detected Buildings	Partially Detected Buildings	Merged Buildings	False Alarms	Detection Rate
36	31	2	4	0	86%

* Note: the number of detected buildings includes “partially detected buildings” and “merged buildings.” A building is defined as partially detected if it is detected for less than 90% of its pixels.

© 2017 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license ( http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Chen, J.; Wang, C.; Zhang, H.; Wu, F.; Zhang, B.; Lei, W. Automatic Detection of Low-Rise Gable-Roof Building from Single Submeter SAR Images Based on Local Multilevel Segmentation. Remote Sens. 2017, 9, 263. https://doi.org/10.3390/rs9030263

AMA Style

Chen J, Wang C, Zhang H, Wu F, Zhang B, Lei W. Automatic Detection of Low-Rise Gable-Roof Building from Single Submeter SAR Images Based on Local Multilevel Segmentation. Remote Sensing. 2017; 9(3):263. https://doi.org/10.3390/rs9030263

Chicago/Turabian Style

Chen, Jinxing, Chao Wang, Hong Zhang, Fan Wu, Bo Zhang, and Wanming Lei. 2017. "Automatic Detection of Low-Rise Gable-Roof Building from Single Submeter SAR Images Based on Local Multilevel Segmentation" Remote Sensing 9, no. 3: 263. https://doi.org/10.3390/rs9030263

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

Automatic Detection of Low-Rise Gable-Roof Building from Single Submeter SAR Images Based on Local Multilevel Segmentation

Abstract

1. Introduction

2. Gable-Roof Building Signature Analysis

3. Method

3.1. Edge Extraction and Classification

3.2. Building Candidate Area Extraction

3.3. Parallelogram-Like Roof Patch Extraction

3.3.1. Feature Primitive Extraction and Optimizing

3.3.2. Multilevel Thresholds

3.4. Individual Building Identification

4. Materials and Experimental Results

4.1. Test on TerraSAR-X Staring Spotlight SAR Scene

4.2. Test on Chinese Airborne SAR Scene

5. Discussion

6. Conclusion

Acknowledgments

Author Contributions

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI