A Rapid and Automated Urban Boundary Extraction Method Based on Nighttime Light Data in China

Liu, Xiaojiang; Ning, Xiaogang; Wang, Hao; Wang, Chenggang; Zhang, Hanchao; Meng, Jing

doi:10.3390/rs11091126

Open AccessArticle

A Rapid and Automated Urban Boundary Extraction Method Based on Nighttime Light Data in China

by

Xiaojiang Liu

^1,2,

Xiaogang Ning

^1,*,

Hao Wang

¹,

Chenggang Wang

^1,3,

Hanchao Zhang

¹ and

Jing Meng

^1,2

¹

Chinese Academy of Surveying and Mapping (CASM), No. 28, Lianhuachi West Road, Beijing 100830, China

²

School of Geometrics, Liaoning Technical University, 88 Yulong Road, Fuxin 123000, China

³

College of Geomatics, Shandong University of Science, 579 Qianwangang Road, Qingdao 266590, China

^*

Author to whom correspondence should be addressed.

Remote Sens. 2019, 11(9), 1126; https://doi.org/10.3390/rs11091126

Submission received: 16 April 2019 / Revised: 26 April 2019 / Accepted: 26 April 2019 / Published: 10 May 2019

(This article belongs to the Special Issue Advances in Remote Sensing with Nighttime Lights)

Download

Browse Figures

Versions Notes

Abstract

As urbanization has progressed over the past 40 years, continuous population growth and the rapid expansion of urban land use have caused some regions to experience various problems, such as insufficient resources and issues related to the environmental carrying capacity. The urbanization process can be understood using nighttime light data to quickly and accurately extract urban boundaries at large scales. A new method is proposed here to quickly and accurately extract urban boundaries using nighttime light imagery. Three types of nighttime light data from the DMSP/OLS (US military’s defense meteorological satellite), NPP-VIIRS (National Polar-orbiting Partnership-Visible Infrared Imaging Radiometer Suite), and Luojia1-01 data sets are selected, and the high-precision urban boundaries obtained from a high-resolution image are selected as the true value. Next, 15 cities are selected as the training samples, and the Jaccard coefficient is introduced. The spatial data comparison method is then used to determine the optimal threshold function for the urban boundary extraction. Alternative high-precision urban boundary truth-values for the 13 cities are then selected, and the accuracy of the urban boundary extraction results obtained using the optimal threshold function and the mutation detection method are evaluated. The following observations are made from the results: (i) The average relative errors for the urban boundary extraction results based on the three nighttime light data sources (DMSP/OLS, NPP-VIIRS, and Luojia1-01) using the optimal threshold functions are 29%, 20%, and 39%, respectively. Compared with the mutation detection method, these relative errors are reduced by 83%, 18%, and 77%, respectively; (ii) The average overall classification accuracies of the extracted urban boundaries are 95%, 96%, and 93%, respectively, which are 5%, 1%, and 7% higher than those for the mutation detection method; (iii) The average Kappa coefficients of the extracted urban boundaries are 61%, 71%, and 61%, respectively, which are 5%, 4%, and 12% higher than for the mutation detection method.

Keywords:

urban boundary; DMSP/OLS; NPP-VIIRS; Luojia1-01; Jaccard coefficient; histogram feature

Graphical Abstract

1. Introduction

Over the past 40 years, China has experienced rapid urbanization due to its policies of reform and opening up [1]. The continuous population growth and rapid expansion of urban land use have had a significant impact on the ecological and resource environments. There have been several problems in some areas, such as the urbanization of land faster than the population growth, insufficient resources and environmental carrying capacity, and extraordinary ecological and environmental issues [2,3]. Therefore, it is of great importance to scientifically implement urban planning and its associated adjustments to rapidly and accurately access changes in urban boundaries, which helps provide a better understanding of the urbanization process.

Remote sensing imagery has become an important data source for urban boundary extraction and urban expansion monitoring due to its timeliness and economy, as well as a wide range of monitoring capabilities. Higher resolution images (such as from QuickBird in the United States, GF satellite imagery in China, etc.) have the ability to better distinguish features and can be used for urban boundary extraction. However, these data come with a high acquisition cost, a high data processing workload, a weak acquisition ability, a long renewal period, large spectral differences in ground objects, and a strong regional heterogeneity, which hinder the automation of urban boundary extraction at national or large regional scales [3,4,5,6]. The above problems have been addressed as a result of the development of nighttime light imagery. First, the global nighttime light data obtained from an operations linescan system (OLS) sensor carried by the US military’s defense meteorological satellite program (DMSP) has been proven by many scholars to be related to human activities, which can be used to study the urbanization process at regional or larger scales [7,8,9,10,11,12]. The Suomi National Polar-orbiting Partnership-Visible Infrared Imaging Radiometer Suite (NPP-VIIRS) has a higher spatial resolution than the DMSP/OLS data and can also be used to monitor human activities [13,14,15]. The world’s first professional nighttime remote sensing satellite “Luojia1-01” imagery was jointly developed by a team at Wuhan University and related institutions and can monitor the macroeconomic operations of China and the world for the National Development and Reform Commission and other departments to provide an objective basis for governmental decision-making [16]. Nighttime light data has been used as early as 1970 for urban extraction [17].

The nighttime light data can provide full coverage at the national or global scale with only a few images, while ordinary optical images require multiple frames for even a large city. In addition, nighttime light data can be automated and processed more quickly, giving it unique advantages for large-scale research. The nighttime light data can detect faint nighttime lights, which reveal only the intensity of the light without showing spectral differences [18]. While ordinary optical images can only reflect the spatial information of objects, the nighttime light data has both spatial attributes and socio-economic attributes [8,9], making it more meaningful for urbanization research. At the same time, there are some regions marked as urban areas from optical imagery, but these can be recently built without significant human activity, suggesting they should not be defined as urban areas. The use of nighttime light data can avoid this problem.

There have been many scholars that have used nighttime light data to extract urban boundaries. The methods to accomplish this are roughly divided into three categories: Threshold methods [19,20,21,22,23,24,25,26,27,28], traditional image classification [29,30,31,32], and local trimming methods combined with other data [27,33]. Of these, the threshold method is the most widely used because of its relatively simple approach. Firstly, empirical threshold methods, statistical data comparison methods, mutation detection methods and spatial data comparison methods are all included as subsets of the threshold method where urban boundaries are extracted by setting a threshold based on past experience [19]. While these methods are simple and easy to implement, their applicability is difficult to guarantee due to their strong subjectivity. Secondly, the method using statistical data comparisons takes statistics for urban land areas that are released by states and local regions as the true values to extract the urban boundaries [20,21]. In this method, the threshold is the optimal value from the urban boundary extraction that minimizes the differences between the area of the urban area extracted from the nighttime light data and the area from the statistics. However, this approach has no uniform definition for different regions where it is applied, making it difficult to compare horizontally between different cities. In addition, these statistics involve local interests, giving them a questionable reliability. Additionally, in the mutation detection method, the segmentation threshold for the urban boundary is determined when changing the threshold for the nighttime light data [21,22]. The threshold corresponding to a sudden increase in the perimeter of the urban boundary is taken as the segmentation threshold. Cities in single-center urban areas are more suitable for this method as the applicability for cities with multi-central urban areas is questionable. The extracted results are much larger than the actual urban area using this method, and each time the threshold is updated, a raster-to-vector conversion has to be performed, making this approach less efficient [23]. Finally, the final urban boundary is obtained from changing the threshold of the nighttime light data by using spatial comparison method [2,4,28]. Then, urban boundaries extracted using high-resolution optical remote sensing imagery are compared with the selected thresholds. The optimal threshold is determined when the areas of the two urban boundaries differs the least. The results obtained using this method provide some spatial characteristics.

Urban boundaries extracted using Landsat imagery are often used as true values in spatial data comparison methods. However, only the impervious surfaces or construction land can be better extracted from Landsat imagery as opposed to urban areas that reflect the range of human activity due to its coarser resolution. The accuracy of the optimal threshold will be affected if this data is selected as the true value. The accuracy of the urban boundary obtained using the spatial data comparison method still needs to be further explored through the generation of higher resolution remote sensing imagery [3,4,5,6]. In the current research, only one or a few cities can be extracted due to the difficulty in obtaining their true values. Moreover, the relationship between the urban boundaries extracted from an optical image and the thresholds for the nighttime light image has not been discovered. In addition, DMSP/OLS data are widely used in current urban extraction, while the NPP-VIIRS data have been researched and applied relatively less often, and Luojia1-01 data are used even less frequently. In this paper, three kinds of nighttime light data from the DMSP/OLS, NPP-VIIRS, and Luojia1-01 are selected, and the high-precision urban boundaries obtained from the high-resolution imagery are used as the true values. The optimal threshold function to extract urban boundaries is determined using the Jaccard coefficient and three different nighttime illumination datasets. Fifteen cities were selected as the training samples during this process. Finally, the accuracy of the urban extraction results obtained using the optimal threshold function is evaluated against other methods.

Selecting the statistical comparison method as a comparative experiment to the proposed approach requires comparing the area in the statistical data with the true urban area of the text. However, this would be unscientific because the concepts of “urban” in these two data sets are different. It is unreasonable to compare the empirical threshold method with the proposed method because it is too subjective and its accuracy is difficult to guarantee. However, the mutation detection method can be compared fairly with the proposed method as it does not introduce other information and only uses the image itself to extract the urban area. Therefore, the mutation detection method is selected for accuracy verification together with the proposed method.

2. Study Area and Data

In this paper, 15 cities (Tianjin, Chongqing, Hohhot, Jining, Luoyang, Nanjing, Qingdao, Shenzhen, Xuzhou, Wuhan, Shijiazhuang, Taiyuan, Changchun, Zhengzhou, and Yangquan) located in different regions and of various sizes were selected as the training samples for the proposed method. Then, 13 cities (Baotou, Beijing, Dalian, Fuzhou, Jinan, Kunming, Lanzhou, Nanchang, Nanning, Shenyang, Urumqi, Xi’an, and Yinchuan) were selected for accuracy verification through a comparison of the urban boundaries extracted using the mutation detection method and the proposed method.

The true values of the urban boundaries selected in this paper are extracted from high-resolution images using consistent extraction standards with a strong reliability. First, the aerial films GF1 and GF2, which generally have resolutions better than 1 m, are obtained from the Surveying and Mapping Department. Higher resolution imagery provides more obvious urban boundary characteristics, which gives a more detailed and accurate boundary decision. Moreover, manual extraction is used to further ensure the accuracy of the boundary extraction. A series of standard rules and processes is proposed to guarantee the consistent and comparative results of the urban boundaries. Finally, the accuracy of this set of boundaries has been verified, and the results are superior to the boundaries extracted when using other methods [5].

Administrative boundaries from the results of basic geography monitoring are used in this paper.

The nighttime light data comes from three sources, including the 2012 global nighttime average light data (DMSP/OLS) as obtained from sensors onboard a US military meteorological satellite, the August average image data from the 2012 monthly synthesized nighttime remote sensing imagery (NPP-VIIRS), and the 2018 Luojia1-01 satellite. The satellite imagery data are from around the month of August, and all the image data are projected using the Albers double standard latitude line. The acquisition time of the Luojia1-01 images is different for each of the considered cities. As a result, the original light intensity values have certain differences and cannot be directly compared or analyzed. Therefore, the Luojia1-01 imagery is radiometrically scaled using Equation (1), and the administrative area demarcation line is used to select the image data for each city area. Radiation corrections can limit the differences caused by images taken in different periods to make them more comparable.

L = D N^{\frac{2}{3}} * 10^{- 10}

(1)

where L is the radiance value after the absolute radiation correction and DN is the image gradation value.

3. Method

The research methodology used in this paper is illustrated in Figure 1. First, 15 cities are selected as training samples, and a set of high-precision urban boundaries based on the higher resolution images is selected as the true values. The Jaccard coefficients are introduced and used to compare the similarities and differences between the sample sets and the nighttime light data. Larger coefficient values indicate a higher sample similarity. The true values and the extracted urban results are compared using a pixel-by-pixel comparison as the nighttime light image threshold is constantly changed. The optimal threshold for the training sample is determined when the Jaccard coefficient is the largest. As a result, a set of training thresholds is obtained. A functional relationship between the histogram information for the nighttime light imagery and the optimal thresholds in the training sample set is found by extracting the histogram information from the nighttime light images. The final urban boundary threshold is calculated when the histogram feature information is placed in this functional relationship. Finally, the mutation detection method is selected as a comparative experiment, which does not introduce any additional information and only uses the image itself to extract the urban area. The urban boundaries for thirteen cities not included in the training samples were extracted and compared using the mutation detection method and the proposed method.

3.1. Sample Training

The accuracy and spatial correlation between the training results and the true values can be guaranteed when the Jaccard coefficients are introduced during the sample training. The Jaccard coefficients compare the similarities and differences between two samples. The larger the coefficient, the higher the similarity of the samples [34], which is defined as:

J c = \frac{\sum_{i = 1}^{n u m} N T L_{i} \cap T R U E_{i}}{\sum_{i = 1}^{n u m} N T L_{i} \cup T R U E_{i}}

(2)

In Equation (2),

J c

represents the degree of coincidence with the true value, where a higher

J c

suggests a better fit. The

\cap

operator represents the spatial intersection, which is recorded as 1 when both the true value pixel and the nighttime light image both indicate an urban area; otherwise, it is recorded as a 0. The

\cup

operator indicates a space-sum. That is, the operation gives a value of 1 when either a nighttime light image pixel or the truth-value cell are in the urban area. The

N T L_{i}

and

T R U E_{i}

are the i-th pixel values in the nighttime light image (0 means non-urban and 1 represents an urban area). Thus, the numerator in the formula represents the number of pixels that are determined to be an urban area in both images, and the denominator is the total number of pixels that are marked as urban.

The urban boundary of the training sample city is rasterized into a binary image. At the same time the image is then resampled to the same resolution as the nighttime light data. The urban area extracted according to the initial threshold and the true values are operated on pixel-wise according to Equation (2). In the process of changing the thresholds, the extracted urban area gradually approaches the true value. As a result, the Jaccard coefficient tends to increase first and then decrease. When the Jaccard coefficient is maximized, the corresponding value is taken to be the optimal threshold. As shown in Figure 2, the denominator in Equation (2) continues to decrease as the threshold increases, and the

J c

will continue to increase. As the threshold increases, the urban pixel values marked by the nighttime light image fall completely within the urban area marked by the true value. In this case, the denominator remains unchanged and the numerator begins to decrease, causing the JC to begin to decrease. Therefore, the maximum

J c

value can be obtained in this process, and the urban area determined from the nighttime light image corresponding to the optimal threshold is the most accurate in regards to the total number of correct pixels and the spatial position.

3.2. Method to Determine the Optimal Threshold Function

3.2.1. Urban Boundary Extraction Based on the DMSP/OLS Data

Due to the problems of light overflow and oversaturation in the DMSP/OLS data, there is more light intensity when closer to a city center. Here, it is found that there is always a sudden increase in the pixel frequency for each city (from 30 to 63) by analyzing the DMSP/OLS image histograms within a city’s jurisdiction (see Figure 3). Combined with the frequency distribution law of the DMSP/OLS data, the histogram frequency of the pixel values in the range 30–63 is calculated and subsequently subtracted from the front and back to find the group with the largest frequency difference. Then, the former is taken as the image frequency burst point, and the associated pixel values and the optimal threshold of the sample area are analyzed. As a result, there is a strong linear correlation (R² = 0.9532) between the nighttime data histograms and the thresholds, as shown in Figure 4. Therefore, the functional relationship between the optimal threshold to extract urban boundaries and the histogram features of the nighttime light data is as shown in Equation (3).

F S 1 = 1.0944 * B 1 + 5.3461

(3)

where

F S 1

represents the optimal threshold for the DMSP/OLS data extraction in the urban areas, and

B 1

represents the corresponding pixel value for the DMSP/OLS data frequency burst points.

3.2.2. Urban Boundary Extraction Based on NPP-VIIRS Data

The NPP-VIIRS data is generally superior to the DMSP/OLS data as it has a higher resolution and uses a floating-point 32-bit form to store the pixel values. This avoids data saturation problems and gives is a clear distinction between luminance pixels. In this case, there is no rule to follow when directly counting the frequency histogram information. Therefore, the maximum value of the nighttime light data in the city area of the training sample is counted first, and the ratio of the optimal threshold to the maximum is subsequently calculated. It is found through experiments that there is a strong correlation between the proportional value and the maximum value of the data. A regression analysis indicates that the two have a strong power correlation, as shown in Figure 4) (R² = 0.9379). The functional relationship between the optimal threshold for the urban boundary extraction and the histogram features of the nighttime light data is given in Equation (4).

F S 2 = 4.5441 * M 1^{- 0.807} * M 1

(4)

which simplifies to

F S 2 = 4.5441 * M 1^{0.193}

(5)

where

F S 2

represents the optimal threshold for the NPP-VIIRS data extraction in urban areas and

M 1

represents the maximum value of the NPP-VIIRS data.

3.2.3. Urban Boundary Extraction Based on the Luojia1-01 Data

The Luojia1-01 is the first professional night-vision remote sensing satellite from China. It has an image resolution up to 130 m, and the original pixel values can span ten orders of magnitude, which is more accurate for light detection. After absolute radiation correction, the brightness values of the light become floating point data, and the histogram information distribution is not balanced with the NPP-VIIRS data. The statistical image histograms (divided into five groups) are shown in Figure 5. It is found that less than 1% of the pixels are distributed in the last four groups, suggesting that the maximum may be an outlier due to excessive data dispersion. Therefore, the maximum value before the first statistical cell is taken as the true maximum, and the optimal threshold is found by extracting the urban area according to the NPP-VIIRS data. First, the ratio of the threshold from the training set to the maximum value is obtained. Then, the relationship between this ratio and the maximum value is analyzed to have a strong power correlation, as shown in Figure 4 (R² = 0.9384). Therefore, the functional relationship between the optimal threshold for urban boundary extraction and the histogram characteristics of nighttime light data is as shown in Equation (6).

F S 3 = 77.749 * M 2^{- 0.917} * M 2

(6)

which simplifies to

F S 3 = 77.749 * M 2^{0.083}

(7)

In Equation (7),

F S 3

represents the optimal threshold for extracting urban areas from the Luojia1-01 image data and

M 2

represents the maximum of the first 20% of the pixel values for the Luojia1-01 image data.

3.3. Accuracy Evaluation

The relative error, commission error, omission error, overall classification accuracy, and the Kappa coefficient based on the urban pixel classification results are all used to validate the model. The accuracy stabilities for all the cities in the experiment are tested based on their standard deviations. When the Kappa coefficient falls between 0.0 and 0.20, it is considered to have an extremely low consistency, while 0.21–0.40 indicates a general consistency, 0.41–0.60 is a medium consistency, 0.61–0.80 is a high consistency and 0.81–1 indicates they are nearly identical [35]. The calculation formulae and definitions for each evaluation indicator are shown in Table 1.

In Table 1,

x

represents the number of urban pixels in the experiment,

μ

is the number of urban pixels in the true value,

λ

is the number of cells that are simultaneously marked as urban in the experimental results and true values,

r

is the number of cells correctly classified in the experiment (urban and non-urban),

T

is the number of test samples,

X_{i}

is the sample value,

δ

is the average of the sample

X

,

N

is the total number of pixels,

x_{i i}

is the total number of diagonals of the matrix,

x_{i +}

is the total number of i rows, and

x_{+ i}

is the total number of i columns.

4. Results

4.1. Urban Boundary Extraction Results

The urban extraction from Equations (3), (5), and (7) are integrated based on the DMSP/OLS, NPP-VIIRS, and Luojia1-01 datasets to obtain Equation (8).

FS = α ∗ (HF)^β + δ

(8)

where,

F S

is the extracted optimal threshold of the urban area,

H F

is the characteristic information of the nighttime light image histogram,

α

is the highest power coefficient,

β

is the power number, and

δ

is a constant term. Based on the DMSP/OLS, NPP-VIIRS, and Luojia1-01 data extractions, the values for

α

are 1.0944, 4.5441, and 77.749;

β

are 1, 0.193, and 0.083; and

δ

are 5.3461, 0, and 0, respectively. The histogram feature information for the nighttime light imagery is directly substituted into Equation (8) to extract the urban areas for the 13 cities independently, as shown in Figure 6. At the same time, the urban area extraction for the 13 cities is performed using the mutation detection method as a control experiment.

4.2. Results of the Accuracy Evaluation

Based on the formulae for the accuracy evaluation indicators in Table 1, the accuracy of the urban extraction results based on the three nighttime light datasets are verified using the proposed method and the mutation detection method for the 13 cities that did not participate in the training. The average (AVE) and standard deviation (SD) of the overall accuracy evaluation are shown in Table 2. The accuracy verification results for each city are shown in Table 3.

From Table 2 and Table 3, when using the proposed extraction method with the DMSP/OLS data, the average relative error is 29%, the average commission error is 25%, the average omission error is 42%, the average overall classification accuracy is 95%, and the average Kappa coefficient is 61%. When using the mutation detection method, the relative changes in these indicators are 83%, 29%, −35%, −5%, and −5%, respectively. There are major advantages in all the evaluation indicators with the exception of the omission error. The low omission error for the mutation detection method is due to the fact that there are more misdivided pixels, which gives a larger extracted urban area than found in the true value. Therefore, the overall classification accuracy is lower. On the other hand, the extraction results for the proposed method have the phenomenon of “enclave” omission. However, the commission and omission errors for the results obtained using the proposed method are similar and more stable. In the accuracy evaluation of each city, the average relative error and the commission error for the results obtained using the proposed method are lower than from the mutation detection method, and the overall classification accuracy is higher. The Kappa coefficient is only slightly lower in the cities of Kunming and Lanzhou when using the proposed method and is higher for the other cities. The average Kappa coefficient is 61%, which is consistent with the true value.

Using the proposed extraction method with the NPP-VIIRS data, the average relative error is 20%, the average commission error is 27%, the average omission error is 23%, the average overall classification accuracy is 96%, and the average Kappa coefficient is 71%. In comparison, the mutation detection method shows that these indicators increase by 18%, 3%, 2%, −1%, and −4%, respectively. Compared with the mutation detection method, the proposed extraction method is advantageous for all indicators and has an improved accuracy. In the accuracy evaluation of each city, with the exception of Fuzhou, the extraction results using the proposed method are more effective over the various accuracy evaluation indicators. The overall standard deviations of each evaluation indicator are also smaller than those of the mutation detection method. It can be seen from Table 3 that the misclassification error and relative error for Fuzhou City are relatively large. Combined with the analysis of the true value for the urban boundary, it can be seen that the urban expansion of Fuzhou City is larger between 2010 and 2015. Thus, using the true value of the urban boundary from 2010 will result in certain errors for the evaluation of the nighttime light city image from 2012. Some areas in the true value have not been attributed as urban areas due to other factors, such as being far from the main city. However, the other cities are less affected by these phenomena.

Based on the Luojia1-01 data, the proposed method has an average relative error of 39%, average commission error of 44%, and average omission error of 23%, average overall classification accuracy of 93%, and average Kappa coefficient of 61%. In comparison, the mutation detection method has indicators that are increased by 77%, 9%, −2%, −7%, and −12%, respectively. Except for a slight drop in the omission error, the other indicators have higher precisions. The overall standard deviations for each evaluation indicator are smaller than those for the mutation detection method, indicating that the extraction results are more stable. In the accuracy evaluation of each city, the classification accuracy and Kappa coefficient using the proposed extraction method are higher than the corresponding results for the mutation detection method. In terms of the error evaluation, except for Shenyang and Nanjing, the omission error is slightly larger. The analysis shows that the mutation detection method has a large commission error in these two cities. The difference between these two errors is large, and the overall classification accuracy is low.

5. Discussion

There are some shortcomings regarding the proposed method, such as incomplete extraction of the urban “enclave” when using the DMSP/OLS data. The mutation detection method has a larger commission error, which makes the omission error of the proposed extraction method greater than that for the mutation detection method. The urban boundary extraction is based on the NPP-VIIRS data that was first released in 2012, while the true values used in this paper are from 2010, and some areas are impacted by the urban growth over this two-year discrepancy.

There are still additional problems that need to be solved in subsequent studies. First, there is some data that are difficult to collect, and it is difficult to implement comparative experiments and precision verifications in the strict sense. For example, the global images from the Luojia1-01 are difficult to obtain because the system was recently released, and there are differences in the image acquisition times for different regions. In addition, the true values of urban boundaries outside of China for verification experiments are difficult to obtain. Thus, obtaining these data and performing the urban boundary extraction at a global scale is the focus of future studies.

In addition, only the mutation detection method was selected in this work for the comparison experiment. This decision was made after making sufficient and scientific considerations. First, it is unfair to compare the traditional method of processing ordinary optical images with a method specifically designed for nighttime light imagery as there are completely different data features in both images. If the statistical comparison method is selected as a comparative experiment, the area in the statistical data will be compared with the true urban area of the text, which is unscientific since the concept of “urban” in these two data sets is different. In addition, it is unreasonable to compare the empirical threshold method with the proposed method because it is too subjective and its accuracy is difficult to guarantee. Therefore, we believe that only the mutation detection method is suitable as a comparative experiment.

Finally, there is no reference to the modifiable areal unit problem (MAUP). This is because the three datasets were not captured over the same period and have different resolutions. It is unscientific to discuss the scale effects of the MAUP caused by the size of the pixel. Thus, we only give a short qualitative evaluation instead of a more specific approach. Therefore, increasing the accuracy of the data to better compare the urban area extraction, setting multiple thresholds to solve the problem of loss based on the “enclave” in the DMSP/OLS data, including more cities in the training, and analysis of urban expansion changes combined with MAUP are areas of future work.

6. Conclusions

The nighttime light data for the DMSP/OLS, NPP-VIIRS and Luojia1-01 datasets are used to test the proposed urban extraction method. The high-precision urban boundary obtained from high-resolution imagery is used as the true value, and 15 cities are selected as the training samples to introduce and determine the Jaccard coefficients. The optimal threshold function for the urban boundary extraction using the three nighttime light datasets is determined through comparisons with the spatial data. Then, the high-precision urban boundary truth-values for the 13 cities are selected for testing, and the accuracy of the urban boundary extraction results obtained using the optimal threshold function and mutation detection method are evaluated and compared. The evaluation results show that the accuracy of the proposed method is better at extracting urban areas than the more conventional mutation detection method since it has good generalization capabilities.

The average relative errors for the extraction results based on the three nighttime light datasets (DMSP/OLS, NPP-VIIRS, and Luojia1-01) using the optimal threshold function are 29%, 20%, and 39%, respectively. Compared with mutation detection method, these represent reductions of 83%, 18%, and 77%, respectively. The average overall classification accuracies are 95%, 96%, and 93%, respectively, which are 5%, 1%, and 7% higher than the mutation detection method. The average Kappa coefficients are 61%, 71%, and 61%, respectively, which are 5%, 4%, and 12% higher than the mutation detection method.

Author Contributions

X.L. collected and processed the data, performed the analysis, and wrote the paper. X.N. and H.W. conceived and designed the study. H.W. and H.Z. contributed to revisions of the manuscript. C.W. and J.M. contributed to the analysis of the data. All authors reviewed and edited the draft, approved the submitted manuscript, and agreed to be listed and accepted the version for publication.

Funding

The authors thank the national key research and development plan (2016YFE0205300); the National Natural Science Foundation of China (41401513); and the central level of public welfare research institutes basic research business fee project (7771803) for financial support.

Acknowledgments

We would like to express our respect and gratitude to the anonymous reviewers and editors for their professional comments and suggestions. The authors would like to thank the “Monitoring of spatial pattern changes in cities and typical urban agglomerations above the prefecture level” for providing a nationwide high-precision urban boundary. At the same time, the authors thank the Luojia1-01 satellite nighttime light image dataset from Wuhan University and the DMSP/OLS and NPP-VIIRS nighttime light datasets from the US National Defense Meteorological Satellite. Finally, thanks for academic editor and reviewers’ comments concerning our manuscript. Those comments are all valuable and very helpful for revising and improving our paper, as well as the important guiding significance to our researches.

Conflicts of Interest

The authors declare no conflict of interest.

References

Gu, C.L.; Xu, H.X. Development of urban geography in China since 1978. Sci. Geogr. Sin. 1999, 19, 320–331. [Google Scholar]
Fang, C.L. The urbanization and urban development in China after the reform and opening-up. Econ. Geogr. 2009, 1, 19–25. [Google Scholar]
Ning, X.G.; Wang, H.; Zhang, H.C.; Liu, Y.F.; Pang, B.; Hao, M.H. High-Precision Urban Boundary Extraction and Urban Sprawl Spatial-Temporal Analysis in China’s Prefectural Cities from 2000 to 2016. Geom. Inform. Sci. Wuhan Univ. 2018, 43, 1916–1926. [Google Scholar]
Ning, X.G.; Wang, H.; Lin, X.G.; Cao, Y.X.; Du, J. Spatio-temporal Urban Sprawl Monitoring and Analysis over Beijing-Tianjin-Hebei Urban Agglomeration during 1990-2015. Acta Geod. Cartogr. Sin. 2018, 47, 1207–1215. [Google Scholar]
Wang, H.; Ning, X.G.; Zhang, H.C.; Liu, Y.F.; Yu, F. Urban Boundary Extraction and Urban Sprawl Measurement Using High-resolution Remote Sensing Images: A Case Study of China’s Provincial Capital, ISPRS—International Archives of the Photogrammetry. Remote Sens. Spat. Inform. Sci. 2018, XLII-3, 1713–1719. [Google Scholar]
Zhang, H.C.; Ning, X.G.; Wang, H.; Shao, Z.F. High accuracy urban expansion monitoring and analysis of China’s provincial capitals from 2000 to 2015 based on high-resolution remote sensing imagery. Acta Geogr. Sin. 2018, 73, 2345–2363. [Google Scholar]
Elvidge, C.D.; Baugh, K.E.; Kihn, E.A.; Kroehl, H.W.; Davis, E.R.; Davis, C.W. Relation between satellite observed visible-near infrared emissions, population, economic activity and electric power consumption. Int. J. Remote Sens. 1997, 18, 1373–1379. [Google Scholar] [CrossRef]
Forbes, D.J. Multi-scale analysis of the relationship between economic statistics and DMSP-OLS night light images. GISci. Remote Sens. 2013, 50, 483–499. [Google Scholar] [CrossRef]
Sutton, P.; Roberts, D.; Elvidge, C. A Comparison of Nighttime Satellite Imagery and Population Density for the Continental United States. Photogramm. Eng. Remote Sens. 1997, 63, 1303–1313. [Google Scholar]
Letu, H.; Hara, M.; Yagi, H.; Naoki, K.; Tana, G.; Nishio, F.; Shuhei, O. Estimating energy consumption from night-time DMPS/OLS imagery after correcting for saturation effects. Int. J. Remote Sens. 2010, 31, 4443–4458. [Google Scholar] [CrossRef]
Ma, T.; Zhou, C.H.; Pei, T.; Haynie, S.; Fan, J.F. Quantitative estimation of urbanization dynamics using time series of DMSP/OLS nighttime light data: A comparative case study from China’s cities. Remote Sens. Environ. 2012, 124, 99–107. [Google Scholar] [CrossRef]
Liu, Z.F.; He, C.Y.; Zhang, Q.F.; Huang, Q.X.; Yang, Y. Extracting the dynamics of urban expansion in China using DMSP-OLS nighttime light data from 1992 to 2008. Landsc. Urban Plan 2012, 106, 62–72. [Google Scholar] [CrossRef]
Shi, K.F.; Yu, B.L.; Hu, Y.J.; Huang, C.; Chen, Y.; Huang, Y.X.; Chen, Z.Q.; Wu, J.P. Modeling and mapping total freight traffic in China using NPP-VIIRS nighttime light composite data. Map Sci. Remote Sens. 2015, 52, 274–289. [Google Scholar] [CrossRef]
Shi, K.F.; Yu, B.L.; Huang, Y.X.; Hu, Y.J.; Yin, B.; Chen, Z.Q.; Chen, L.J.; Wu, J.P. Evaluating the Ability of NPP-VIIRS Nighttime Light Data to Estimate the Gross Domestic Product and the Electric Power Consumption of China at Multiple Scales: A Comparison with DMSP-OLS Data. Remote Sens. 2014, 6, 1705–1724. [Google Scholar] [CrossRef]
Li, X.; Xu, H.M.; Chen, X.L.; Li, C. Potential of NPP-VIIRS Nighttime Light Imagery for Modeling the Regional Economy of China. Remote Sens. 2013, 5, 3057–3081. [Google Scholar] [CrossRef]
Anonymous. “LJ01-1” nighttime light remote sensing satellite passed the demonstration. Hydrogr. Surv. Chart. 2017, 37, 82. [Google Scholar]
Imhoff, M.; Lawrence, W.T.; Stutzer, D.S.; Elvidge, C.D. A technique for using composite DMSP/OLS “City Lights” satellite data to map urban area. Remote Sens. Environ. 1997, 61, 361–370. [Google Scholar] [CrossRef]
Gallo, K.P.; Elvidge, C.D.; Yang, L.; Reed, B.C. Trends in night-time city lights and vegetation indices associated with urbanization within the conterminous USA. Int. J. Remote Sens. 2004, 25, 2003–2007. [Google Scholar] [CrossRef]
He, C.Y.; Shi, P.J.; Li, J.G.; Chen, J.; Pan, Y.Z.; Li, J.; Zhuo, L.; Ichinose, T. Restoring urbanization process in China in the 1990s by using non-radiance-calibrated DMSP/OLS nighttime light imagery and statistical data. Chin. Sci. Bull. 2006, 51, 1614–1620. [Google Scholar] [CrossRef]
Wang, Y.Y.; Xu, D.; Zhu, X.G. The Spatio-temporal Characteristics of the Urban and Town Construction Land Expansion in Jiangsu Province from DMSP-OLS Nighttime Images. Mod. Urban Res. 2010, 2, 67–73. [Google Scholar]
Shi, K.F.; Huang, C.; Yu, B.L.; Yin, B.; Huang, Y.X.; Wu, J.P. Evaluation of NPP-VIIRS night-time light composite data for extracting built-up urban areas. Remote Sens. Lett. 2014, 5, 358–366. [Google Scholar] [CrossRef]
Huang, L.; Yang, Y.B.; Zhu, Q. Expansion Research on the Build Up Area in Nanjing City Based on DMSP/OLS Data. Geosp. Inf. 2018, 16, 94–97. [Google Scholar]
Yang, Y.; He, C.Y.; Zhao, Y.Y.; Li, T.; Qiao, Y.W. Research on the layered threshold method for extracting urban land using the DMSP/OLS stable nighttime light data. J. Image Graph. 2011, 16, 666–673. [Google Scholar]
Mi, X.N.; Bai, L.Y.; Tan, X.H.; Lu, N.; Peng, G.P.; Shao, Q. A New Method of Extracing Areas of Center City Regions Based on DMSP/OLS Data. J. Geo. Inf. Sci. 2013, 25, 159–164. [Google Scholar] [CrossRef]
Wang, X.H.; Xiao, P.F.; Feng, X.Z.; Li, H.X. Extraction of large-scale urban area information in China using DMSP/OLS nighttime light data. Remote Sens. Land. Resour. 2013, 25, 159–164. [Google Scholar]
Chen, Z.; Hu, D.Y.; Zeng, W.H.; Deng, L. TM image and nighttime light data to monitoring regional urban expansion: A case study of Zhejiang Province. Remote Sens. Land. Resour. 2014, 26, 83–89. [Google Scholar]
Song, J.C.; Li, X.H.; Lin, T.; Zhang, G.Q.; Ye, H.; He, X.Y.; Ge, R.B. A Method of Extracting Urban Built-up Area Based on DMSP/OLS Nighttime Data and Google Earth. J. Geo. Inf. Sci. 2013, 25, 159–164. [Google Scholar]
Liu, X.P.; Yang, Y.C.; Fu, D.X.; Li, H.Y.; Tian, H.Z. Urban Spatial Expansion Based on DMSP_OLS Nighttime Light Data in China in 1992–2010. Sci. Geogr. Sin. 2014, 34, 129–136. [Google Scholar]
Cao, X.; Chen, J.; Imura, H.; Higashi, O. A SVM-based method to extract urban areas from DMSP-OLS and SPOT VGT data. Remote Sens. Environ. 2009, 113, 2205–2209. [Google Scholar] [CrossRef]
Zhou, Y.Y.; Smith, S.J.; Zhao, K.G.; Imhoff, M.; Thomason, A.; Ben, B.L.; Asrar, G.R.; Zhang, X.S.; He, C.Y.; Elvidge, C.D. A global map of urban extent from nightlights. Environ. Res. 2015, 10, 1748–9326. [Google Scholar] [CrossRef]
Zou, J.G.; Chen, Y.H.; Ding, G.; Xuan, W. A Clustered Threshold Method for Extracting Urban Built-up Area Using the DMSP/OLS Nighttime Light Images. Geom. Inform. Sci. Wuhan Univ. 2016, 41, 196–201. [Google Scholar]
Cao, X.; Chen, J.; Imura, H.; Higashi, O.; Elvidge, C.D.; Zhao, K.G.; Thomason, A.; Imhoff, M. A cluster-based method to map urban area from DMSP/OLS nightlights. Remote Sens. Environ. 2014, 147, 173–185. [Google Scholar]
Tang, L.B.; Cui, H.S. Improvement of Urban Construction Land Extraction Method Based on NPP-VIIRS Nighttime Light Data and Landsat-8 Data: A Case Study of Guangzhou City. Geom. Spat. Inf. Technol. 2017, 40, 69–73. [Google Scholar]
Hamers, L.; Hemeryck, Y.; Heweyers, G.; Janssen, M.; Keters, H.; Rousseau, R.; Vanhoutte, A. Similarity measures in scientometric research: The Jaccard index versus Salton’s cosine formula. Inf. Process. Manag. 1989, 25, 315–318. [Google Scholar] [CrossRef]
Yue, R.H. Research of Land Cover Classification in Mongolia Plateau Based on MODIS; Inner Mongolia Normal University: Hohhot, Inner Mongolia, China, 2010. [Google Scholar]

Figure 1. Urban boundary extraction technology roadmaps using nighttime light data.

Figure 2. Schematic diagram showing the threshold changing process (NTL is the values in the nighttime light image).

Figure 3. DMSP/OLS data frequency distribution: The first is distribution of the DMSP/OL pixel value frequency in Tianjin and the second is the distribution of the DMSP/OL pixel value frequency in Nanjing.

Figure 4. Correlation analysis of the nighttime light data histogram features and sample area optimal thresholds (The first is DMSP/OLS threshold analysis. The second is NPP-VIIRS threshold analysis. The last is Luojia1-01 threshold analysis).

Figure 5. Luojia1-01 image frequency histogram.

Figure 6. Urban boundary extraction results based on the nighttime light images for DMSP/OLS, NPP-VIIRS, and Luojia1-01 (The first line is the result of the mutation detection method, and the second line is the result of proposed method in the results of each data source.).

Table 1. Accuracy of the evaluation indicators.

Evaluation Indicators	Calculation Formula	Evaluation Indicators	Calculation Formula
Relative Error	$R E = \frac{x - μ}{μ}$	Commission Error	$C E = \frac{x - λ}{x}$
Omission Error	$O E = \frac{μ - λ}{μ}$	Standard Deviation	$S D = \sqrt{\frac{\sum_{i = 1}^{T} {(X_{i} - δ)}^{2}}{T}}$
Overall Accuracy	$O A = \frac{r - N}{N}$	Kappa Coefficient	$K C = \frac{N \sum_{i = 1}^{n} x_{i i} - \sum_{i = 1}^{n} (x_{i +} \times x_{+ i})}{N^{2} - \sum_{i = 1}^{n} (x_{i +} \times x_{+ i})}$

Table 2. Comparison of urban extraction accuracy indicators.

	Data	DMSP/OLS		NPP-VIIRS		Luojia1-01
Indicators		AVE	SD	AVE	SD	AVE	SD
Relative Error	proposed method	29%	0.21	20%	0.22	39%	0.30
Relative Error	mutation detection	112%	0.54	38%	0.27	116%	1.20
Commission Error	proposed method	25%	0.12	27%	0.08	44%	0.08
Commission Error	mutation detection	54%	0.09	30%	0.12	53%	0.17
Omission Error	proposed method	42%	0.15	23%	0.11	23%	0.12
Omission Error	mutation detection	7%	0.06	25%	0.18	21%	0.16
Overall Accuracy	proposed method	95%	0.02	96%	0.02	93%	0.03
Overall Accuracy	mutation detection	90%	0.04	95%	0.02	86%	0.15
Kappa Coefficient	proposed method	61%	0.09	71%	0.05	61%	0.07
Kappa Coefficient	mutation detection	56%	0.09	67%	0.06	49%	0.15

Table 3. Comparison of the accuracy of urban boundaries extraction (P represents the proposed method and M represents the mutation detection method).

Accuracy Based on DMSP/OLS
Cities	Relative Error		Commission Error		Omission Error		Overall Accuracy		Kappa Coefficient
Cities	P	M	P	M	P	M	P	M	P	M
Baotou	15.1%	171.7%	21.7%	65.1%	33.5%	5.2%	96.3%	87.0%	70.0%	45.3%
Beijing	63.4%	184.4%	47.9%	65.2%	15.0%	1.1%	93.9%	87.9%	61.5%	46.3%
Dalian	13.9%	233.0%	35.3%	70.9%	44.3%	3.1%	95.1%	84.4%	57.3%	38.6%
Fuzhou	43.1%	90.5%	14.1%	47.9%	51.1%	0.7%	91.9%	87.4%	58.2%	61.4%
Jinan	24.3%	89.8%	24.9%	53.3%	43.1%	11.3%	94.6%	90.2%	61.9%	56.2%
Kunming	40.0%	42.9%	11.7%	38.1%	47.0%	11.6%	95.9%	94.9%	64.2%	70.1%
Lanzhou	10.9%	62.6%	48.9%	51.5%	54.4%	21.1%	90.9%	90.3%	43.2%	54.9%
Nanchang	10.2%	73.0%	29.5%	45.2%	22.3%	5.1%	95.8%	93.6%	71.6%	66.2%
Nanning	61.8%	171.7%	13.7%	60.7%	67.1%	17.7%	98.2%	96.4%	46.9%	51.6%
Shenyang	22.8%	184.4%	18.5%	54.5%	37.1%	3.1%	94.5%	87.2%	68.0%	55.5%
Urumqi	15.6%	233.0%	23.7%	49.0%	35.6%	6.4%	97.7%	96.1%	68.7%	64.2%
Xi’an	55.2%	171.7%	12.1%	50.7%	60.6%	6.4%	92.5%	88.3%	51.0%	58.3%
Yinchuan	3.4%	184.4%	26.3%	53.9%	28.8%	0.6%	94.7%	88.6%	69.5%	57.3%
Accuracy Based on NPP-VIIRS
Baotou	17.6%	18.0%	28.5%	36.7%	41.0%	25.2%	95.4%	95.1%	62.2%	65.9%
Beijing	37.5%	86.6%	37.1%	48.9%	13.5%	4.7%	95.8%	93.8%	70.7%	63.4%
Dalian	7.6%	23.1%	31.1%	24.6%	25.8%	42.0%	96.2%	96.1%	69.4%	63.5%
Fuzhou	83.6%	18.1%	46.0%	33.3%	0.9%	21.2%	88.4%	91.8%	63.5%	67.5%
Jinan	19.6%	42.7%	15.5%	37.3%	32.1%	10.6%	96.1%	94.4%	73.2%	70.6%
Kunming	21.8%	50.5%	23.5%	19.0%	40.2%	59.9%	95.5%	94.7%	64.8%	51.2%
Lanzhou	32.7%	74.5%	31.2%	44.0%	8.7%	2.3%	95.5%	92.9%	76.0%	67.4%
Nanchang	5.7%	6.3%	30.5%	28.1%	26.5%	32.6%	95.6%	95.6%	69.0%	67.2%
Nanning	6.5%	75.7%	27.3%	46.8%	22.6%	6.5%	98.7%	97.7%	74.3%	66.7%
Shenyang	2.5%	24.9%	25.4%	32.2%	23.5%	15.3%	94.6%	94.0%	72.5%	71.9%
Urumqi	11.0%	37.1%	16.5%	10.9%	25.7%	43.9%	98.4%	97.9%	77.8%	67.9%
Xi’an	9.5%	1.9%	23.1%	20.2%	15.8%	18.7%	95.3%	95.5%	77.8%	78.1%
Yinchuan	5.9%	34.0%	18.1%	12.5%	22.9%	42.2%	96.0%	95.0%	77.2%	67.0%
Accuracy Based on Luojia1-01
Baotou	19.9%	51.5%	50.8%	54.8%	41.0%	31.6%	93.4%	92.6%	50.1%	50.6%
Beijing	77.8%	93.2%	54.5%	56.7%	19.1%	16.4%	92.2%	91.5%	54.3%	52.9%
Dalian	9.1%	25.2%	42.4%	45.9%	37.2%	32.2%	94.7%	94.3%	57.2%	57.2%
Fuzhou	91.6%	390.8%	48.8%	79.6%	1.9%	0.0%	87.0%	46.7%	60.1%	14.5%
Jinan	18.5%	36.0%	46.1%	48.8%	36.1%	30.4%	92.8%	92.3%	54.6%	54.8%
Kunming	10.5%	27.7%	31.9%	24.3%	24.7%	45.3%	95.4%	95.2%	69.0%	61.0%
Lanzhou	19.4%	30.2%	29.9%	23.9%	16.3%	46.9%	95.3%	94.3%	73.7%	59.6%
Nanchang	78.3%	56.8%	49.3%	46.3%	9.7%	15.8%	92.7%	93.4%	61.2%	62.1%
Nanning	62.9%	44.1%	51.9%	50.0%	21.6%	27.9%	97.4%	97.6%	58.4%	57.9%
Shenyang	12.5%	66.2%	42.1%	50.8%	34.8%	18.3%	92.0%	90.0%	56.9%	56.1%
Urumqi	11.4%	132.8%	34.9%	59.9%	27.4%	6.6%	97.5%	94.4%	67.3%	53.7%
Xi’an	52.4%	258.3%	42.3%	72.4%	12.1%	1.2%	91.4%	70.7%	64.9%	31.0%
Yinchuan	45.0%	296.3%	41.4%	75.2%	15.0%	1.7%	92.6%	70.5%	65.4%	28.3%

© 2019 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Liu, X.; Ning, X.; Wang, H.; Wang, C.; Zhang, H.; Meng, J. A Rapid and Automated Urban Boundary Extraction Method Based on Nighttime Light Data in China. Remote Sens. 2019, 11, 1126. https://doi.org/10.3390/rs11091126

AMA Style

Liu X, Ning X, Wang H, Wang C, Zhang H, Meng J. A Rapid and Automated Urban Boundary Extraction Method Based on Nighttime Light Data in China. Remote Sensing. 2019; 11(9):1126. https://doi.org/10.3390/rs11091126

Chicago/Turabian Style

Liu, Xiaojiang, Xiaogang Ning, Hao Wang, Chenggang Wang, Hanchao Zhang, and Jing Meng. 2019. "A Rapid and Automated Urban Boundary Extraction Method Based on Nighttime Light Data in China" Remote Sensing 11, no. 9: 1126. https://doi.org/10.3390/rs11091126

APA Style

Liu, X., Ning, X., Wang, H., Wang, C., Zhang, H., & Meng, J. (2019). A Rapid and Automated Urban Boundary Extraction Method Based on Nighttime Light Data in China. Remote Sensing, 11(9), 1126. https://doi.org/10.3390/rs11091126

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Menu

A Rapid and Automated Urban Boundary Extraction Method Based on Nighttime Light Data in China

Abstract

1. Introduction

2. Study Area and Data

3. Method

3.1. Sample Training

3.2. Method to Determine the Optimal Threshold Function

3.2.1. Urban Boundary Extraction Based on the DMSP/OLS Data

3.2.2. Urban Boundary Extraction Based on NPP-VIIRS Data

3.2.3. Urban Boundary Extraction Based on the Luojia1-01 Data

3.3. Accuracy Evaluation

4. Results

4.1. Urban Boundary Extraction Results

4.2. Results of the Accuracy Evaluation

5. Discussion

6. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI