Next Article in Journal
Moist Convection in the Giant Planet Atmospheres
Previous Article in Journal
A Comparative Study of a Typical Glacial Lake in the Himalayas before and after Engineering Management
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Research on Optimization of Processing Parcels of New Bare Land Based on Remote Sensing Image Change Detection

Land Satellite Remote Sensing Application Center, Ministry of Natural Resources, Beijing100048, China
*
Author to whom correspondence should be addressed.
Remote Sens. 2023, 15(1), 217; https://doi.org/10.3390/rs15010217
Submission received: 28 October 2022 / Revised: 23 November 2022 / Accepted: 24 November 2022 / Published: 30 December 2022

Abstract

:
To meet the demands of natural resource monitoring, land development supervision, and other applications for high-precision and high-frequency information extraction from constructed land change, this paper focused on automatic feature extraction and data processing optimization methods for newly constructed bare land based on remote sensing images. A generalized deep convolutional neural network change detection model framework integrating multi-scale information was developed for the automatic extraction of change information. To resolve the problems in the automatic extraction of new bare land parcels, such as mis-extractions and parcel fragmentation, a proximity evaluation model that integrates the confidence-based semantic distance and spatial distance between parcels and their overlapping area is proposed to perform parcel aggregation. Additionally, we propose a complete set of optimized processing techniques from pixel pre-processing to vector post-processing. The results demonstrated that the aggregation method developed in this study is more targeted and effective than ArcGIS for the automatically extracted land change parcels. Additionally, compared with the initial parcels, the total number of optimized parcels decreased by more than 50% and the false detection rate decreased by approximately 30%. These results indicate that this method can markedly reduce the overall data volume and false detection rate of automatically extracted parcels through post-processing under certain conditions of the model and samples and provide technical support for applying the results of automatic feature extraction in engineering practices.

Graphical Abstract

1. Introduction

As urbanization gains traction, illegal land occupation is becoming increasingly prominent. Consequently, newly constructed land has become a key subject of supervision and regulation in land development. To effectively improve the monitoring capacity of natural resources and achieve the “early detection and early prevention” goal of natural resource management against illegal land occupation, remote sensing image change detection technology is urgently needed to extract the features pertaining to changes in the development of different types of constructed land. According to the classification system in the land use change surveys, new bare land is among the most significant forms of change in constructed land, where a certain land type is converted from its vegetated or natural state into cleared land for construction. Therefore, obtaining automatic and accurate regional and even national data on land development changes is crucial for natural resource management.
Remote sensing change detection is the process of determining changes in land coverage based on multiple satellite images at different points in time to obtain accurate real-time information. Change detection methods for remote sensing images are mainly divided into two major categories: traditional and deep learning methods. Traditional methods can be divided into image differencing and feature-based and target-driven change detection. These traditional change detection methods have limitations such as the inability to eliminate manual intervention and low levels of automation, and they are easily affected by changes in imaging conditions, image acquisition time, image matching quality, and noise, which render the change detection results unsatisfactory. With the gradual expansion of deep learning in the field of remote sensing, the power of neural networks in feature selection and fitting has generated new ideas for image change detection tasks. Since the introduction of AlexNet [1] in 2012, convolutional neural networks (CNN) with enhanced structure and performance have been launched every year, such as Visual Geometry Group (VGG) [2], GoogLeNet [3], fully convolutional networks (FCNs) [4], U-Net [5], Residual Network (ResNet) [6], and Efficientnet [7]. Many deep learning methods for change detection have been derived based on these networks. As deep learning-based image change detection can autonomously identify high-dimensional features reflected in the changed regions of the image without human intervention, it is the mainstream method today.
Currently, most deep learning methods treat remote sensing image change detection as a semantic segmentation task by evaluating the changes in each pixel in the input image pair. Most models are based on the classic U-Net’s encoder–decoder network architecture design and have built on it, including DeepLab V3+ [8] and Semantic FPN [9]. Change detection models, such as SNUNet-CD [10] and STANet [11], have been used to study remote sensing change detection in land use based on deep learning networks, with a significant focus on changes in buildings [12,13]. Presently, relatively few relevant studies have used deep learning algorithms for high-resolution remote sensing change detection on land use in large areas (such as city- and national-level) with practical business applications. Furthermore, several studies are limited to a few test images or small areas. In this study, we address the business demand for the automatic feature extraction of new bare land at a national scale. This fully integrates the Siamese neural networks, atrous convolution, encoder–decoder, and other advanced algorithms and network structures to build a generalized deep CNN change detection model framework for automatic extraction on large-scale newly constructed bare land with accuracy and efficiency.
The accuracy of deep learning in the automatic feature extraction of land cover change has considerably improved compared with that of traditional algorithms. However, owing to the complexity of geospatial landscapes and non-uniform image quality, the automatic extraction of change parcels for large-scale, practical business applications is hindered by high false and missed detection rates and irregular parcel morphology. This results in a significant workload for subsequent manual verification, which makes it difficult to meet the need for high-precision and high-frequency monitoring.
Current research on feature extraction for the remote sensing of newly constructed land mainly focuses on the design and improvement of change detection models, rather than the post-processing of parcels that have been extracted [14]. Taking new bare land as a case in point, the original parcels extracted using deep learning models are often segmented, small, or have jagged edges or holes within them, which directly affect their practical application.
Post-processing is an operation performed after the change parcels of remote sensing images have been automatically extracted, which can further reduce the errors in the parcels. Commonly used post-processing operations include parcel aggregation, morphological processing, filtering, and region growing algorithms. Of these, parcel aggregation is an important aspect of post-processing. Most current studies focus on modeling the aggregation of multi-category land use parcels [15,16], which takes into account the spatial topology and semantic information of the land type. However, the calculation of the semantic proximity is generally closely related to the multi-level category attributes of the land type and its tenure information. In terms of single-category parcel aggregation, there are relatively more studies on the aggregation of building outlines [17], with no relevant post-processing studies on the parcels of newly constructed bare land.
Therefore, building on the foundations of a deep CNN change detection model framework, this study focuses on adaptive post-processing techniques on the change detection results—in particular, the automatic extraction of new bare land parcels, which can further improve the performance and accuracy of change detection from the initial prediction. The major contributions of this study can be summarized as follows:
  • A generalized deep CNN change detection model framework is constructed by integrating Siamese neural networks, atrous convolution, encoder–decoder, and other advanced algorithms and network structures to perform the large-scale automatic extraction of new bare land parcels with accuracy and efficiency.
  • To tackle the issue of large data volume, false extraction, and the difficulty of practical application in the automatic feature extraction of new bare land parcels, a complete parcel optimization process from the “pixel-object” pre-processing to comprehensively post-process the vector parcels is proposed by utilizing the probability distribution of pixel-level change.
  • A multi-criteria proximity evaluation model is proposed by integrating the spatial distance between parcels, their overlapping area, and confidence difference to aggregate adjacent parcels.
The rest of this paper is organized as follows. Section 2 introduces the methodology of this paper. Section 3 presents the study areas and setup of the experiments. The experimental results are shown in Section 4 and the performance of the proposed method is discussed in Section 5. Finally, the conclusions are given in Section 6.

2. Materials and Methods

2.1. Overall Workflow

This study builds a generalized deep Siamese CNN change detection model, and, based on bi-temporal optical remote sensing images, performs automatic detection to obtain the initial change results of bare land. Subsequently, an automatic parcel optimization method is introduced to address the problems of large data volume and false extractions. As shown in Figure 1, the method first performs pre-processing on the initial change detection results, such as pixel normalization and mean confidence score calculation, to derive the confidence score between parcels. Thereafter, comprehensive post-processing, including edge simplification, aggregation, hole-filling, and the area and confidence score filtering of the vector parcels, is performed to obtain the final extraction results. For the method of parcel aggregation, this study develops a comprehensive parcel proximity evaluation model by considering semantic similarity, the spatial distance between parcels, their overlapping area, and other related factors as the criteria for the adaptive aggregation of parcels of new bare land.

2.2. Deep Siamese CNN Change Detection Framework

To integrate multi-scale feature information, this study combines Siamese neural networks and encoder–decoder structures commonly used in semantic segmentation to build a deep Siamese CNN change detection framework (Figure 2).
The encoder comprises two Siamese neural network layers with shared weights for feature extraction, which support ResNet, Xception, Efficientnet, and other typical backbone networks. The input of the encoder is images of different temporalities, which are processed by the Siamese neural network and atrous spatial pyramid pooling (ASPP), to obtain multi-level difference feature maps for integration. Through deconvolution by the decoder, concatenation, upsampling, and other calculations, detailed information on the feature maps and its spatial dimension is gradually recovered. The probability distribution of change is calculated, as well as the cross-entropy loss between the change prediction map and reference value to continuously update the network parameters. Finally, a pixel-level change prediction map with the same dimensions as the input is obtained.

2.3. Pre-Processing of Change Detection Results

The bi-temporal remote sensing images are processed by the deep Siamese CNN change detection model to generate pixel-level change probability distribution maps. Based on the change probability value of each pixel, the proposed method first performs pre-processing on the results generated by the CNN model, such as probability normalization and mean confidence score calculation, to enable subsequent parcel aggregation and confidence-based post-processing.

2.3.1. Pixel-Level Probability Normalization

The final layer of the change detection neural network performs a Softmax classification of each pixel of the feature map and generates a probability distribution map. A cross-entropy loss function is then used to iterate over the model parameters. The Softmax function converts the output values of multiple classifications into a probability distribution in the range of [0, 1]. The classification probability of each pixel is calculated by the following equation:
P = Softmax ( z i ) = e z i j = 1 C e z j
where C is the total number of categories (or nodes) for classification, i denotes the change category, z i   is the output value of the ith node, and P represents the predicted change probability of the pixel. The information to be extracted in single-element change detection tasks is usually divided into two main categories: change and background.

2.3.2. Object-Level Mean Confidence Score Calculation

The bi-temporal images are firstly processed by the deep CNN change detection model. Figure 3a shows the pixel-level probability distribution map output by the Softmax classifier, with different grayscale values representing different change probabilities. The greater the brightness, the greater the probability of change in that pixel. Thereafter, a certain probability threshold is determined to obtain the initial result of the automatically detected change parcel in Figure 3b.
To facilitate the subsequent grading and filtering of the automatically extracted parcels, a confidence score is calculated for each closed parcel as a unit. A region-growing algorithm uses pixels with non-zero values as seed points to perform neighborhood expansion across the whole image. The average probabilistic value of pixels in each parcel is calculated as the confidence score of the parcel (Figure 3c).

2.4. Post-Processing of Change Parcels

After pre-processing, the detected map is processed from raster to vector format. At present, the bare land change parcels automatically extracted by deep learning algorithms often face issues, such as the segmentation of adjacent patches, fragmented distribution of small parcels, and holes within the parcels (Figure 4). These require the post-processing of the parcels, such as edge simplification, aggregation, and hole-filling, to further refine the automatic extraction results. Therefore, this study proposes a parcel proximity evaluation model that integrates spatial distance, overlapping area, and confidence scores for parcel aggregation processing.

2.4.1. Parcel Proximity Evaluation Model

According to the characteristics of automatically extracted new bare land parcels, a buffer radius of 4–8 pixels is set for each parcel. The buffer intersection region is evaluated; for each parcel within the intersection region, its proximity is calculated in pairs, and the proximity threshold is set for parcel aggregation according to the specific requirement of practical applications.
(1) Confidence-based semantic similarity
The closer that the change probability values of two parcels are, the higher the semantic similarity of their categories is. After calculating the buffer, the semantic similarity of any two intersecting parcels M, N is calculated with the following equation.
P s e m _ p r e = 1 | p M p N | / 255
where p M ,     p N is the average confidence score of the parcels with the confidence range set to [0, 255].
The semantic distance between two parcels is calculated as d s e m _ p r e = | p M p N | / 255 . The larger the semantic distance, the smaller the semantic similarity value and the smaller the semantic relationship.
(2) Spatial proximity
To calculate the shortest distance between any two parcels in a buffer region d s p a = D i s ( F M , F N ) , that is, the closest spatial distance between two polygons.The distance threshold d t h r e s h is set as 2–4-pixel width in the processing of new bare land parcels and the spatial proximity is calculated with the following formula   P s p a _ d i s :
{ 0.5 d t h e s h d s p a 0.5 d t h r e s h   , P s p a _ d i s = 1 < d s p a d t h r e s h   , P s p a d i s = 0.5 d s p a > d t h r e s h   ,   P s p a _ d i s = 0
A shorter spatial distance between two parcels indicates a higher spatial proximity.
(3) Overlap ratio of buffer areas
As shown in Figure 5, buffer areas are made for parcels M and N, and the areas where the buffer intersects with the parcel are set as   s 1 and   s 2 , respectively. The two parcels form a polygon     s 0 at the intersection, and the overlap ratio of buffer areas is calculated using the following equation:
P a r e _ l a p = s 1 + s 2 s 0 + s 1 + s 2
Integrating the confidence-based semantic similarity, spatial proximity, and buffer area overlap ratio, the following model is used to calculate the combined proximity P c o m of any two parcels with overlapping buffers:
P c o m [ M ,   N ] = δ 1 P s e m _ p r e [ M ,   N ] + δ 2 P s p a _ d i s [ M , N ] + δ 3 P a r e _ l a p [ M , N ]
where δ 1 , δ 2 , δ 3 are the weights of the variables, δ 1 + δ 2 + δ 3 = 1 , and the range of P c o m values is [0, 1]. When equal weights are applied to the three variables, δ 1 = δ 2 = δ 3 = ⅓.
In actual aggregation processing, according to the distribution characteristics of automatically extracted bare land change parcels, the priority factor to be considered is whether the two parcels belong to the change category semantically, which plays a key role in aggregation. At the same time, the parcels should be close enough in space, and the overlapping area ratio is an additional condition for spatial proximity. Therefore, the proximity model adopts weights as such: δ 1 > δ 2 > δ 3 .

2.4.2. Post-Processing

The post-processing of the bare land change parcels mainly includes edge simplification, parcel aggregation, hole-filling, and area/confidence filtering, and the methods and processing guidelines are as follows:
(1) Edge simplification: The bare land change parcels automatically extracted by the deep CNN have irregular shapes and jagged edges. To reduce the redundant nodes, the Douglas–Peucker algorithm is used for the edge simplification of the parcels, which simplifies the data by identifying and removing smaller nodes in the polygon that affect the overall shape of the line elements. The tolerance parameter is used to determine the degree of simplification of the edges, which is the maximum allowable vertical distance between each fold and the newly created line. The greater the tolerance, the greater the degree of simplification. This method uses a tolerance of a 1–2-pixel width for the simplification of the bare land change parcels.
(2) Parcel aggregation: Using the simplified parcels, the P c o m value is calculated based on the proximity evaluation model proposed in this study. The threshold P t h r e s h   is set to aggregate any two parcels larger than it. For instance, M, N, and s0 in Figure 5 are aggregated into one parcel to form a closed continuous region. The confidence score p F of the new aggregated parcel F is obtained from the confidence of the original parcels p M , p N , calculated based on area weighting.
(3) Filtering and filling: According to the requirements for practical applications of change detection, smaller change parcels are removed based on a certain area threshold, A t h r e s h . Another area threshold for holes in parcels, H t h r e s h is set, with holes smaller than the threshold being filled. Finally, a confidence range p can be specified based on business needs to output different parcels with different confidence levels.

3. Experiments

3.1. Study Areas and Data Sources

Two regions in the Shanxi Province, China, were selected as the experimental areas: Yulin and Taiyuan (Figure 6). Based on pre-temporal and post-temporal remote sensing images from the fourth quarter of 2019 and 2020 in the experimental areas, a deep Siamese CNN change detection model was used to execute the automatic feature extraction of change information of the new bare land. The image data used in the experimental area are a mosaicking dataset of Chinese multi-sensor satellite remote sensing images (Figure 7), including ZY3-02/03 optical stereo mapping satellites and the 2 m/8 m optical satellite constellation (GF-1 B, C, and D). The spatial resolution of the images is approximately 2 m.
As a large dataset is required to train the deep CNN change detection model, based on domestic multi-sensor satellite data with 2 m resolution, the training samples were collected and semi-automatically labeled based on the historical archives of land-use change parcels. The sample comprises a pre-temporal image, a post-temporal image, and a label.
As this study focuses on the automatic extraction of newly constructed bare land, the pre-temporal sample can be agricultural land, garden, forest, grass, buildings, or any other land type, whereas the post-temporal is fixed as constructed bare land. To maintain domain adaptation between the sample and test data, this experiment mainly selected sample data from the fourth quarter of recent years. In terms of sample diversity and balance, factors such as the target scale, image radiation, and distribution area were accounted for. The samples were sliced into 512 × 512 pixels, with the total number being 20,052, as shown in Figure 8.

3.2. Experimental Setup

Based on the new collected bare land samples and image data of the experimental areas, the model training and change detection experiments were carried out firstly. In order to obtain relatively ideal initial change parcels, the efficiency and accuracy of the proposed deep Siamese CNN change detection model was evaluated from the aspects of backbone network selection, hyper-parameter adjustment, etc.
The change detection results of the CNN model are evaluated in the standard metric mean intersection over union (MIoU). As shown in (6), the MIoU is the ratio between the intersection and union of the two datasets, which represent the ground truth and predicted change area, respectively:
MIoU = 1 k + 1 i = 0 k P i i j = 0 k P i j + j = 0 k P j i P i i
where k denotes the number of classes, i , j denotes a specific class, P i i represents the pixel number of TPs, and P i j and P j i denote FPs and FNs, respectively. Moreover, TP, FP, and FN are the number of true positives, false positives, and false negatives, respectively. The MIoU is calculated on a per-class basis and then averaged. In this paper, the change detection task is mainly divided into two classes, i.e., change and unchanged.
To verify the effects of the pre-processing and post-processing of new detected bare land parcels proposed in this study, experiments such as confidence calculation, proximity calculation and discrimination, and comprehensive post-processing were carried out, and then the results were analyzed. Furthermore, a comparison experiment between the proposed aggregation method and that of the ArcGIS aggregation tool was designed to analyze the effect of parcel aggregation and ascertain the advantages of the proposed proximity evaluation model and aggregation criteria.
To evaluate the improvement between the parcels processed by the proposed optimizing method and the original automatic extraction results in terms of accuracy and data volume, a comparison test between the experimental results and the true value from manually labeled was conducted. The number of parcels, false and missed detection rates before and after processing, were calculated to determine the final accuracy.
As shown in (7), the false detection rate is calculated as follows:
FDR = ( A C ) / A 100 %
Additionally, the missed detection rate is calculated as follows:
MDR = ( B C ) / A 100 %
where   A denotes the number of change parcels in different processes output by the experiments, B denotes the number of manually labeled parcels, and C denotes the number of intersecting parcels between A and B.

4. Results

4.1. Automatic Extraction Results of New Bare Land

Based on the Siamese CNN change detection framework proposed in this paper, 20,052 samples were divided into training and validation sets in the ratio of 8:2 for model training. The samples were only enhanced with rotation, mirroring, and dithering. ResNet (Residual Network) was used as the model backbone, and the multi-layer feature merging strategy in the encoding and decoding process is consistent with that in deeplabv3+, which uses the first and last layers for feature merging.
The accuracy and speed of two different model versions of ResNet, ResNet101, and ResNet152 were comparatively analyzed. After 100 iterations of training, the overall training MIoU was approximately 78% using 6 V100 GPUs (32G memory). The training time of resnet101 and resnet152 was 41 and 55 h, respectively. As their accuracy levels are similar, this study selected resnet101, which has better training and prediction efficiency, as the backbone. Additionally, the hyper-parameters were adjusted to obtain the optimal training model for the prediction of new bare land parcels in the experimental area.
Based on the trained model, the automatic feature extraction of new bare land was performed in two experimental areas in Yulin and Taiyuan (with an area of approximately 50,000 km2). The initial numbers of automatically extracted parcels were 17,733 and 2521 in Yulin and Taiyuan, respectively (Figure 9). Part of the extraction results after the vectorization of parcels are shown in Figure 10. Overall, the model can predict most new bare land types. The objects on pre-temporal images cover a variety of types, such as cultivated land, forest land, grassland, and buildings, it can be seen that the model is relatively robust. However, a large number of missed extractions, false extractions, and inaccurate target morphology have been revealed by manual visual interpretation. The number of automatically extracted parcels and manually identified parcels were counted, and the false detection rate (FDR) and missed detection rate (MDR) of automatically extracted parcels were calculated (Equations (7) and (8)) with a 50% overlap rate criterion. The results are shown in Table 1. The initial FDR of the two experimental areas was relatively high, both above 70%, and the MDR did not exceed 30%.
The initial results of automatic extraction based on CNN are less than satisfactory for several reasons.
First, the number of samples was limited, and these samples cannot ensure the complete coverage of the texture, color, shape, and other features of bare land change parcels in the experimental area, which is the main reason for the missed extraction. Second, as shown in Figure 7, for the multi-sensor mosaic images, local areas with cloud cover or large differences in radiation at the borders of the mosaic images may lead to false extraction. Third, the current change detection model based on deep learning mainly adopts supervised learning, which has several limitations. The model is usually strongly dependent on samples. Additionally, for the inconsistency of color, radiation and definition of multi-sensor remote sensing images in practical applications, the robustness of the model needs to be enhanced.
Therefore, in this paper, based on the automatically extracted parcels, we performed the optimization processing of the parcel morphology and topology to improve the initial extracted results.

4.2. Results of the Parcel Optimization Process

Based on the automatic extraction results of the two experimental areas, the confidence scores of all parcels were calculated according to the pre-processing methodology in Section 2.2. Next, the post-processing of the change parcels was performed according to the steps in Section 2.3. In the experiments, the edge simplification tolerance parameter was set to one pixel. When calculating the spatial proximity, d t h r e s h was set to a width of two pixels. When calculating the buffer overlap area ratio, the buffer radius was set to five pixels in width. In the parcel aggregation proximity model,   δ 1 = 0.5 ,     δ 2 = 0.3, and δ 3 = 0.2, were used to calculate the P c o m value for the proximity of two neighboring parcels.
The results of each process of the parcel optimization are shown in Table 2, namely pre-processed parcels, simplified parcels, aggregated results, and final results after hole-filling, removal of small parcels, and confidence filtering.
In the table above, the (a) column shows examples of the pre-processed parcel with three to four adjacent parcels in each example, and each of their confidence values (P1, P2, P3, and P4) are calculated. As shown in Table 3 below, the proximity ( P c o m ) was calculated only for pairs of adjacent parcels with intersecting buffers, and the aggregation threshold ( P t h r e s h ) was set to 0.65 in this experiment. The aggregated results are shown in column (c) of Table 2. The final results of the optimization process are shown in Table 2d after area filtering, hole-filling, and confidence filtering, where the area filter threshold ( A t h r e s h ) was set to 1200   m 2 , and the confidence score filtering range p   of the aggregated parcels was 165–255. Based on the processing results, the overall post-processing process proposed in this paper is reasonable and can effectively solve the problems of automatically extracted parcels being segmented, having holes or redundancy.
To verify the effectiveness of the aggregation method proposed in this paper, the last column in Table 2 shows the results processed using the cartographic generalization tool in ArcGIS software. The AggregatePolygons function in ArcGIS was used for aggregation. In the experiment, the aggregation distance parameter aggr_dis was set to a five-pixel width, and the parcel filtering area and size of the holes that were retained were consistent with the method in this paper. According to the post-processing examples in Table 2 and several processed results in Figure 11, the comparative analysis is as follows:
The ArcGIS processing results of Examples 1, 2, and 4 in Table 2 are slightly different from the final results obtained using the method proposed in this paper in column (d). For the case of the uneven distance between two parcels, the aggregation probability of the method proposed in this study is high and the parcels can be easily aggregated. This is due to the proposed aggregation method incorporating semantic confidence within a certain buffer range (five pixel width), and the spatial proximity only considers the nearest distance between two parcels, which is relatively loose. The ArcGIS aggregation method does not consider the semantic information of the parcels and is restrictive for the distance evaluation, resulting in some fragmentation of the final new bare land parcels. As the connectivity of bare land types tends to be wide rather than narrow, the overall aggregation method proposed in this paper is targeted and effective.
Additionally, in Example 2, the confidence score of the top-right parcel is low and does not satisfy the aggregation condition. The method proposed in this paper can effectively eliminate such discrete distribution parcels with high error rates through confidence-based filtering.

5. Discussion

The number of automatically extracted parcels of the two test areas after aggregation, area filtering, confidence filtering, and other post-processing experiments is shown in Figure 12. Aggregation processing mainly serves to reduce the degree of fragmentation of parcels, and the number is reduced by about 10%. Area filtering only removed less than 5% of the parcels, mainly due to the small number of bare land parcels with an area less than 1200   m 2 in the experimental area. In addition, the proportion of small area parcels in the training samples is relatively low.
The final confidence filtering step can significantly reduce the redundant parcels, and the final numbers were 7216 and 1109 for Yulin and Taiyuan, respectively. Compared with the initial data volume, the total number of parcels decreased by more than 50%.
Compared with the manually labeled parcels, the false and missed detection rates of the two areas were calculated. As shown in Table 4, compared with the initial extraction results in Table 1, the FDR decreased by approximately 30%, and the MDR correspondingly increased by approximately 4–6% with a significant reduction in the total numbers of extracted parcels.
Statistical analysis results suggested that the optimization processing method proposed in this paper markedly reduces the false detection of automatically extracted parcels, which can solve the current problems of large data volume and high-FDR associated with automatically extracted change parcels using deep learning and other methods. This study improves the feasibility of using automatic change detection parcels in actual monitoring applications. Furthermore, the process of confidence filtering will inevitably delete some correct parcels, and it is necessary to select the optimal confidence p value through a large number of experiments to balance the FDR and MDR of the final extraction results.

6. Conclusions

In this paper, a complete framework for the automatic feature extraction, parcel pre-processing, and comprehensive post-processing of new bare land from remote sensing images has been introduced. A general deep Siamese CNN change detection model was designed, which can perform the change detection of various land types. To address the problems of false extraction and segmented and large redundancy of automatically extracted parcels via deep learning, this paper focuses on the subsequent optimization of the processing of these parcels.
Combining the characteristics of the area distribution and shape connectivity of the bare land change parcels, a targeted aggregation model is proposed. The experimental results using the proposed method are better than those obtained by ArcGIS. A technical methodology for parcel optimization from “pixel-object” pre-processing to the comprehensive post-processing of vector parcels is proposed. The parcel statistical results revealed that the proposed method has significant effects on parcel aggregation, reducing parcel redundancy and false detection.
Although the method proposed in this paper can reduce the false detection, it erroneously removes a few correct parcels. The missed detection rates in the experiment are approximately 20–30%; thus, there is still a large room for further improvement. Further experiments will be performed to obtain the optimal parameter to balance the false and missed detection rates. Additionally, the domain adaptation study of the samples and images to be detected will be carried out to further improve the overall accuracy.

Author Contributions

L.L. provided the core idea and designed the framework, analyzed the experiments, and wrote the manuscript; Y.G. gave guidance on the extraction method and experimental setup; X.T. and S.Y. gave some useful guidance on the model framework and processing workflow; Z.L. collected some experimental data and implemented part of the experiments; L.D. and Y.H. collected some experimental data and polished the language of the paper. All authors have read and agreed to the published version of the manuscript.

Funding

This study was supported by the National Natural Science Foundation of China (42101410). Funded by Key Laboratory of Land Satellite Remote Sensing Application, Ministry of Natural Resources of the People’s Republic of China.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Krizhevsky, A.; Sutskever, I.; Hinton, G.E. ImageNet classification with deep convolutional neural networks. Int. Conf. Neural Inf. Process. Syst. 2012, 2012, 1097–1105. [Google Scholar] [CrossRef] [Green Version]
  2. Simonyan, K.; Zisserman, A. Very deep convolutional networks for large-scale image recognition. arXiv 2014, arXiv:1409.1556. [Google Scholar]
  3. Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015. [Google Scholar]
  4. Long, J.; Shelhamer, E.; Darrell, T. Fully convolutional networks for semantic segmentation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 3431–3440. [Google Scholar]
  5. Ronneberger, O.; Fischer, P.; Brox, T. U-net: Convolutional networks for biomedical image segmentation. In International Conference on Medical Image Computing and Computer-Assisted Intervention; Springer: Cham, Switzerland, 2015; pp. 234–241. [Google Scholar]
  6. He, K.; Zhang, X.; Ren, S.; Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, Las Vegas, NV, USA, 27–30 June 2016; pp. 770–778. [Google Scholar]
  7. Tan, M.; Le, Q.V. EfficientNet: Rethinking Model Scaling for Convolutional Neural Networks. In Proceedings of the 36th International Conference on Machine Learning, Long Beach, CA, USA, 10–15 June 2019. [Google Scholar]
  8. Chen, L.C.; Zhu, Y.; Papandreou, G.; Schroff, F.; Adam, H. Encoder-Decoder with Atrous Separable Convolution for Semantic Image Segmentation; Springer: Cham, Switzerland, 2018. [Google Scholar]
  9. Kirillov, A.; Girshick, R.; He, K.; Dollár, P. Panoptic feature pyramid networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, Long Beach, CA, USA, 10–15 June 2019; pp. 6399–6408. [Google Scholar]
  10. Fang, S.; Li, K.; Shao, J.; Li, Z. SNUNet-CD: A densely connected Siamese network for change detection of VHR images. IEEE Geosci. Remote Sens. Lett. 2021, 19, 1–5. [Google Scholar] [CrossRef]
  11. Chen, H.; Shi, Z. A spatial-temporal attention-based method and a new dataset for remote sensing image change detection. Remote Sens. 2020, 12, 1662. [Google Scholar] [CrossRef]
  12. Seydi, S.T.; Hasanlou, M.; Amani, M. A new end-to-end multi-dimensional CNN framework for land cover/land use change detection in multi-source remote sensing datasets. Remote Sens. 2020, 12, 2010. [Google Scholar] [CrossRef]
  13. Ji, S.; Shen, Y.; Lu, M.; Zhang, Y. Building instance change detection from large-scale aerial images using convolutional neural networks and simulated samples. Remote Sens. 2019, 11, 1343. [Google Scholar] [CrossRef] [Green Version]
  14. Lv, Z.; Liu, T.; Wan, Y.; Benediktsson, J.A.; Zhang, X. Post-processing approach for refining raw land cover change detection of very high-resolution remote sensing images. Remote Sens. 2018, 10, 472. [Google Scholar] [CrossRef] [Green Version]
  15. Li, C.; Yin, Y.; Liu, X.; Wu, P. An automated processing method for agglomeration areas. ISPRS Int. J. Geo-Inf. 2018, 7, 204. [Google Scholar] [CrossRef] [Green Version]
  16. Peng, D.; Wolff, A.; Haunert, J.-H. Using the A⋆ Algorithm to Find Optimal Sequences for Area Aggregation. In International Cartographic Conference; Springer: Cham, Switzerland, 2017; pp. 389–404. [Google Scholar]
  17. Ai, T.; Yin, H.; Shen, Y.; Yang, M.; Wang, L. A formal model of neighborhood representation and applications in urban building aggregation supported by Delaunay triangulation. PLoS ONE 2019, 14, e0218877. [Google Scholar] [CrossRef] [PubMed]
Figure 1. The overall workflow of the proposed methodology.
Figure 1. The overall workflow of the proposed methodology.
Remotesensing 15 00217 g001
Figure 2. Change detection framework based on deep Siamese CNN.
Figure 2. Change detection framework based on deep Siamese CNN.
Remotesensing 15 00217 g002
Figure 3. The process of object-level confidence calculation: (a) Probability distribution map output from the Softmax classifier. (b) Initial change detection result. (c) Object-level mean confidence.
Figure 3. The process of object-level confidence calculation: (a) Probability distribution map output from the Softmax classifier. (b) Initial change detection result. (c) Object-level mean confidence.
Remotesensing 15 00217 g003
Figure 4. Automatic extraction parcels.
Figure 4. Automatic extraction parcels.
Remotesensing 15 00217 g004
Figure 5. Schematic diagram of overlapping buffer areas between parcels.
Figure 5. Schematic diagram of overlapping buffer areas between parcels.
Remotesensing 15 00217 g005
Figure 6. Location map of experimental area.
Figure 6. Location map of experimental area.
Remotesensing 15 00217 g006
Figure 7. Bi-temporal multi-sensor mosaic images of the experimental area: (a) Images from the fourth quarter of 2019; (b) Images from the fourth quarter of 2020.
Figure 7. Bi-temporal multi-sensor mosaic images of the experimental area: (a) Images from the fourth quarter of 2019; (b) Images from the fourth quarter of 2020.
Remotesensing 15 00217 g007aRemotesensing 15 00217 g007b
Figure 8. Example of bare land change detection samples. (a,d) Pre-temporal images; (b,e) Post- temporal image; (c,f) labels.
Figure 8. Example of bare land change detection samples. (a,d) Pre-temporal images; (b,e) Post- temporal image; (c,f) labels.
Remotesensing 15 00217 g008
Figure 9. Automatic extraction results of new bare land in two experimental areas. The numbers of automatically extracted parcels in green were 17,733 and 2521 in Yulin and Taiyuan, respectively.
Figure 9. Automatic extraction results of new bare land in two experimental areas. The numbers of automatically extracted parcels in green were 17,733 and 2521 in Yulin and Taiyuan, respectively.
Remotesensing 15 00217 g009
Figure 10. Example of automatic extracted new bare land parcels.
Figure 10. Example of automatic extracted new bare land parcels.
Remotesensing 15 00217 g010
Figure 11. Post-processing results: (a) Original parcels on pre-temporal images. (b) Final results on post-temporal images. (c) ArcGIS results on post-temporal images.
Figure 11. Post-processing results: (a) Original parcels on pre-temporal images. (b) Final results on post-temporal images. (c) ArcGIS results on post-temporal images.
Remotesensing 15 00217 g011
Figure 12. The number of parcels in each process.
Figure 12. The number of parcels in each process.
Remotesensing 15 00217 g012
Table 1. Automatic feature extraction accuracy.
Table 1. Automatic feature extraction accuracy.
Experimental AreaAutomatic Extracted ParcelsManually Labeled ParcelsIntersecting ParcelsFDRMDR
Yulin17,7335748406377.1%29.3%
Taiyuan252187967173.4%23.6%
Table 2. Results of the parcel optimization process.
Table 2. Results of the parcel optimization process.
Post-Processing
Examples
Proposed MethodArcGIS Results
(a) Pre-Processed Parcels(b) Simplified Parcels(c) Aggregated Results(d) Final Results
1Remotesensing 15 00217 i001Remotesensing 15 00217 i002Remotesensing 15 00217 i003Remotesensing 15 00217 i004Remotesensing 15 00217 i005
2Remotesensing 15 00217 i006Remotesensing 15 00217 i007Remotesensing 15 00217 i008Remotesensing 15 00217 i009Remotesensing 15 00217 i010
3Remotesensing 15 00217 i011Remotesensing 15 00217 i012Remotesensing 15 00217 i013Remotesensing 15 00217 i014Remotesensing 15 00217 i015
4Remotesensing 15 00217 i016Remotesensing 15 00217 i017Remotesensing 15 00217 i018Remotesensing 15 00217 i019Remotesensing 15 00217 i020
Table 3. Confidence and proximity scores of parcels.
Table 3. Confidence and proximity scores of parcels.
ExamplesP1P2P3P4 P c o m
1158189148- P c o m 12 = 0.88 P c o m 13 = 0.77
2155176189147 P c o m 12 = 0.95 P c o m 23 = 0.90
3166163184- P c o m 12 = 0.5 0 P c o m 13 = 0.95
4194176214- P c o m 12 = 0.95 P c o m 23 = 0.75
Table 4. Parcel accuracy after processing.
Table 4. Parcel accuracy after processing.
Experimental AreaParcels after Post-ProcessingManually Labeled ParcelsIntersecting ParcelsFDRMDR
Yulin72165748382047.1%33.5%
Taiyuan110987961644.4%29.9%
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Liu, L.; Tang, X.; Gan, Y.; You, S.; Luo, Z.; Du, L.; He, Y. Research on Optimization of Processing Parcels of New Bare Land Based on Remote Sensing Image Change Detection. Remote Sens. 2023, 15, 217. https://doi.org/10.3390/rs15010217

AMA Style

Liu L, Tang X, Gan Y, You S, Luo Z, Du L, He Y. Research on Optimization of Processing Parcels of New Bare Land Based on Remote Sensing Image Change Detection. Remote Sensing. 2023; 15(1):217. https://doi.org/10.3390/rs15010217

Chicago/Turabian Style

Liu, Lirong, Xinming Tang, Yuhang Gan, Shucheng You, Zhengyu Luo, Lei Du, and Yun He. 2023. "Research on Optimization of Processing Parcels of New Bare Land Based on Remote Sensing Image Change Detection" Remote Sensing 15, no. 1: 217. https://doi.org/10.3390/rs15010217

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop