An Optimized Approach for Extracting Urban Land Based on Log-Transformed DMSP-OLS Nighttime Light, NDVI, and NDWI

: Quantitative and accurate urban land information on regional and global scales is urgently required for studying socioeconomic and eco-environmental problems. The spatial distribution of urban land is a signiﬁcant part of urban development planning, which is vital for optimizing land use patterns and promoting sustainable urban development. Composite nighttime light (NTL) data from the Defense Meteorological Program Operational Line-Scan System (DMSP-OLS) have been proven to be effective for extracting urban land. However, the saturation and blooming within the DMSP-OLS NTL hinder its capacity to provide accurate urban information. This paper proposes an optimized approach that combines NTL with multiple index data to overcome the limitations of extracting urban land based only on NTL data. We combined three sources of data, the DMSP-OLS, the normalized difference vegetation index (NDVI), and the normalized difference water index (NDWI), to establish a novel approach called the vegetation–water-adjusted NTL urban index (VWANUI), which is used to rapidly extract urban land areas on regional and global scales. The results show that the proposed approach reduces the saturation of DMSP-OLS and essentially eliminates blooming effects. Next, we developed regression models based on the normalized DMSP-OLS, the human settlement index (HSI), the vegetation-adjusted NTL urban index (VANUI), and the VWANUI to analyze and estimate urban land areas. The results show that the VWANUI regression model provides the highest performance of all the models tested. To summarize, the VWANUI reduces saturation and blooming, and improves the accuracy with which urban areas are extracted, thereby providing valuable support and decision-making references for designing sustainable urban development. 7.35%, and 6.20% higher. These results indicate that the VWANUI provides a higher extraction accuracy than the other approaches.


Introduction
Urbanization is an important force that is shaping current development patterns [1]. Between 1950 and 2014, the proportion of the world's population living in cities rose from 30% to 54%, and the global urban population is expected to rise by an additional 2.5 billion in the next few decades, with most of this increase occurring in Asia and Africa [2]. This massive immigration into cities has resulted in rapid urban expansion [3]. Although urban land accounts for only a small part of the global surface, it has a significant impact on climate, biogeochemical cycles, biodiversity, and hydrology on local, regional, and even global scales. Rapid urban expansion has led to resource depletion, water shortages, air pollution, urban heat islands, and other environmental problems, which constitute serious challenges to global and regional sustainable development [4,5]. Urban land is increasingly recognized as an important but difficult-to-quantify component of models for global change [6]. Therefore, the timely and accurate quantification of urban information is vital for assessing the relationship between urbanization and the environmental consequences. In addition, it forms the basis for optimizing land use patterns and promoting sustainable urban development [7][8][9]. To avoid confusion, we use the term "urban land" in this work to refer to impervious land surfaces (i.e., manmade coverings and constructions) [10,11].
Traditionally, obtaining urban land information has been based on the socioeconomic statistics of administrative units [12], but this approach lacks sufficient spatial information. Nowadays, remote sensing technology can be used to observe and acquire regular, large-scale data on urban land, and satellite images are now widely used to monitor urban areas [13]. Previous studies mainly used high-or medium-spatial-resolution images to extract information on individual urban areas [14][15][16]. However, large-scale urban agglomerations require a large number of costly high-or medium-spatial-resolution images, as well as a large amount of time, labor, and computing resources for the processing and interpretation of these images, which means that such data are not suitable for extracting information on large-scale urban land expansion [17]. Thus, it remains difficult to rapidly obtain information on urban land expansion and evolution on a large scale and in a short time, especially in areas undergoing rapid urbanization, such as the Guangdong-Hong Kong-Macao Greater Bay Area of China. Therefore, improved approaches are urgently needed for extracting and analyzing the spatial distribution and evolution of urban land.
Nighttime light (NTL) data from the Defense Meteorological Satellite Program's Operational Line-Scan System (DMSP-OLS) have proven to be valuable and reliable for mapping urban areas on regional and global scales [18][19][20]. The DMSP-OLS has the requisite low-light sensitivity for detecting artificial lighting from the Earth's surface at night, and NTL has been widely used in research on urban areas, e.g., mapping [21,22], population estimation [23,24], carbon emission inventory [25,26], and monitoring urbanization [27,28]. Therefore, NTL data provide an alternative or complementary approach for drawing urban maps on regional and global scales [13,29,30]. Researchers have already used DMSP-OLS NTL images to infer the extent of urban areas on regional, national, and global scales [31][32][33][34][35]. For instance, Shi et al. [31] used the threshold method to quantify the urban land areas in 12 major Chinese cities in 2012, and Lu et al. [32] combined DMSP-OLS and terra moderate-resolution imaging spectroradiometry (MODIS) normalized difference vegetation index (NDVI) data to map urban areas in southeastern China, which improved estimates of urban areas. Bagan and Yamagata [33] investigated the spatiotemporal dynamics of urban expansion in Japan. Goldblatt et al. [21] combined NTL with Landsat-8 data to extract the urban land cover of India, Mexico, and the United States and emphasized the benefits of data fusion technology for urban land extraction. Zhang and Seto [34] mapped urbanization dynamics on regional and global scales by applying an iterative unsupervised classification approach, which showed that NTL provides an avenue for regional and, potentially, global land measurements, and Zhou et al. [35] developed consistent spatiotemporal urban maps on a global scale. Although the above research has made contributions, challenges remain for extracting urban land area using DMSP-OLS NTL data because these data are subject to blooming and saturation [36]. Blooming effects arise from incoherent light radiating in all directions from light sources on the Earth's surface. For instance, the dispersion of light brightness to surrounding areas [37,38] leads to blooming effects from the light brightness. By contrast, the saturation is due to the limited radiometric range of DMSP-OLS [6]. These two deficiencies combine to degrade the precision of urban land information extracted from DMSP-OLS NTL [39].
Blooming effects tend to lead to overestimates of larger urban areas based solely on DMSP-OLS NTL data [40,41]. Simultaneously, saturation in bright urban core areas hinders the use of NTL to accurately estimate urban land areas or to separate different types of land cover [32]. However, the combination of remote sensing data from multiple sources may provide more information than any single source because each source has different characteristics. Previous studies revealed a strong negative correlation between the vegetation index and urban impermeable surface [42], and recent approaches have attempted to combine the vegetation index and NTL images to more accurately extract urban areas [32,41,43]. For instance, Lu et al. [32] combined DMSP-OLS NTL with NDVI data to build a human settlement index (HSI) for mapping urban settlements. However, the HSI overcorrects for saturation from suburban areas, resulting in a loss of available data around urban areas [43]. The vegetation-adjusted NTL urban index (VANUI) was also developed to correct NTL saturation [43]. The NDVI is limited in its ability to distinguish urban land from nonurban land (such as water and wasteland) and cannot identify water bodies around urban land, such as coastal areas. The normalized difference water index (NDWI), however, effectively differentiates between water and nonwater features [44] and can thereby compensate for the limitations of the HSI and VANUI, which consider only NDVI contributions.
Another major obstacle to using DMSP-OLS NTL for urban studies is data saturation; that is, the NTL data for urban core areas tend to be truncated because of the limited radiometric range of DMSP-OLS. Some techniques have been developed to correct NTL saturation; for instance, Letu et al. [45] proposed a saturated nighttime light correction approach to correcting the DMSP-OLS stable nighttime light based on a cubic regression equation (CRE). Then, Letu et al. [46] again put forward a saturated DMSP-OLS nighttime light correction method based on a DMSP-OLS nighttime radiance calibration image. Ziskin et al. [47] attempted to calibrate the radiation brightness by changing the gain on the sensor. In addition, Liu et al. [17] developed a method for systematically correcting multiyear and multisatellite stable NTL data, which were used to correct the on-board calibration and influence of the differences between sensors, differences in crossing time between satellites, and degradation of the sensors during the long time series of the DMSP-OLS data [17,48]. However, the radiation calibration of NTL data is difficult and time-consuming. To improve the accuracy of urban mapping on regional and global scales, simple methods based on combining MODIS NDVI and NTL data, such as the HSI and VANUI, have been proposed for alleviating saturation [32,43]. However, the HSI and VANUI themselves have some limitations.
Thus, given the inherent problems caused by blooming and saturation in DMSP-OLS NTL data, which hinder the wide application of these data for the accurate extraction of urban land information, we propose an optimized approach, namely, the vegetationwater-adjusted NTL urban index (VWANUI), for extracting accurate and timely urban information. The innovative aspect of the VWANUI is that it significantly reduces the blooming and saturation inherent in DMSP-OLS NTL data from urban areas by combining a logarithmic transformation with the NDVI and NDWI to improve the differentiation between urban and nonurban land while better retaining surface details. In short, the aim of this study was to improve the precision of urban land extraction, and our specific objective was to reduce the blooming and saturation inherent in DMSP-OLS NTL data by the proposed approach.

Research Area
The research area was the Guangdong-Hong Kong-Macao Greater Bay Area of southern China, which has experienced unprecedented urbanization and rapid economic growth over the past 40 years. This urban agglomeration consists of 11 cities with a total area of about 56,000 km 2 ( Figure 1). In 2018, the population was 71.16 million. This area has long been the "vanguard" of China's urban development. Although the area takes up less than 1% of China's land area, it accounted for 12.17% of the national GDP in 2017.

Data Sources
This study used the DMSP-OLS NTL remote sensing data for 2012 from the National Oceanic and Atmospheric Administration National Geophysical Data Center (NOAA NGDC; https://ngdc.noaa.gov/eog/dmsp/downloadV4composites.html). The dataset is composed of the average digital numbers (DNs) of stable light images for an entire year. The DNs for light in the NTL images range from 1 to 63, whereas the background and noise are recorded as zeros.
The NDVI and NDWI were calculated from Landsat-8 OLI data with a 30 m resolution, and Landsat series satellite data were acquired from the International Scientific Data Service Platform, Computer Network Information Center, Chinese Academy of Sciences (CAS). The DMSP-OLS, NDVI, and NDWI were re-registered and projected onto the Albers Conical Equal Area Projection. The DMSP-OLS datasets were resampled by bilinear fitting to the same spatial resolution of 30 m.
The land use/cover (LUC) dataset was high-resolution data at 30 m that were derived from the Resource and Environment Data Cloud Platform of the CAS (http://www.resdc. cn/). Then, the "urban land" derived from the LUC dataset was used as reference data to verify the extraction accuracy, establish regression models, and, finally, validate the performance of the VWANUI.

Vegetation-Water-Adjusted Nighttime Light Urban Index (VWANUI)
Following the research of Zhang et al. [43] and Li et al. [49], we propose, herein, the VWANUI to improve the extraction of urban land based on DMSP-OLS NTL, the NDVI, and the NDWI. The VWANUI is calculated as follows: where NIR refers to the near-infrared band; RED, to the red band; G, to the green band; MIR, to the mid-infrared band; and NTL, to the DMSP-OLS NTL for the study area. First, the NTL data were processed by applying a logarithmic transformation of the radiation intensity. The sharp jump in radiation intensity around the urban core areas was restrained, and the difference in radiation intensity between the nonurban core areas and the suburban areas was enhanced, which contributed to amplifying the NTL difference between urban and nonurban areas [50]. Therefore, applying the function log 10 (NTL) (i.e., the logarithmic transformation of the DMSP-OLS NTL composite data) improved the extraction of urban land from NTL composite data.
Second, the NDVI was calculated from Landsat-8 OLI data. The NDVI ranges from −1 to 1, where greater positive values indicate denser vegetation-covered areas and more negative values indicate larger nonvegetated areas [51,52]. In other words, the greater the NDVI, the higher the density of green vegetation coverage. Previous studies also showed that vegetation abundance correlates negatively with urban impervious surface area [53]. Although the NDVI is affected by atmospheric scattering, advanced satellite systems have been thoroughly calibrated to obtain accurate values for the NDVI [54]. The expression 1 − NDVI represents a greater weight of nonvegetated areas in the urban core areas rather than the surrounding areas, which also leads to an increase in data variability in the urban core areas [43]. Therefore, the value of 1 − NDVI for the urban core areas approaches 1, while the value for vegetation-rich nonurban areas is close to zero. The combination of 1 − NDVI and NTL can reduce the saturation of NTL in urban core areas and facilitate the rapid identification of changes in these areas.
The NDWI also ranges from −1 to 1. Theoretically, a positive NDWI indicates water bodies, whereas a negative NDWI indicates nonwater areas [55]. However, a negative NDWI may actually represent a water body-for example, due to bare sediment in rivers, lakes, and seas. The present study shows that the threshold value of the NDWI for water bodies is −0.1. Thus, we used −0.1 as the NDWI threshold value for water bodies. Thus, in this study, NDWI < −0.1 was assigned a value of 1, and NDWI > −0.1 was assigned a value of zero. This is expressed by Equation (3), where the binary index NDWI B can remove water bodies within urban areas. The combination of NDWI B and NTL eliminates blooming from the NTL data for rivers and other water bodies.
To summarize, we combined a logarithmic transformation with the NDVI and NDWI to reduce the saturation of NTL data from urban core areas and eliminate the blooming of NTL data from water bodies and vegetation areas of urban core areas, both of which help to improve the accuracy with which urban land is extracted.
Following the method of Li et al. [49], we used the VWANUI to extract urban land areas. Specifically, the urban land derived from the LUC dataset was used as reference data to determine the optimal threshold of the VWANUI. Because the spatial resolution of the LUC dataset (30 m) is much finer than that of DMSP-OLS data (1000 m), it is feasible to use LUC to produce urban land as reference data for evaluation [41,56,57]. The optimal threshold for the VWANUI was determined as follows: where T j is the threshold of the VWANUI in city j and Kappa j is the Kappa coefficient calculated according to the urban land extracted by the VWANUI and the reference data. Specifically, the urban land was extracted by the VWANUI based on the threshold T j , and the extracted urban land and the reference data were used to calculate the Kappa coefficient. VWANU I min j and VWANU I max j are the minimum and maximum values of the VWANUI in city j, respectively. Finally, areas with a VWANUI that exceeded the optimal threshold were extracted as the urban land of a given city.
Then, according to the confusion matrix [58], the Kappa coefficient [58] was calculated as follows: where r is the number of rows in the matrix; x ij is the value at row i and column j; x i+ and x +j are the marginal totals of row i and column j, respectively; and N is the total number of observations.
In addition, the LUC dataset published by CAS has been proven to represent the actual land cover for the corresponding year [17]. According to the research of Li et al. and He et al. [17,39], the "urban land" of the LUC dataset published by CAS was used as reference data to verify the extraction results. This supports the credibility of the verification results obtained in the present study.

Accuracy of Urban Land Extraction
Shao et al. [59] and Liu et al. [60] demonstrated that if only a single index is used to verify the accuracy of map classification and extraction, the accuracy evaluation will contain errors and will be unreliable. Therefore, in this study, the accuracy and reliability were evaluated by using the overall accuracy (OA), the producer's accuracy (PA), the user's accuracy (UA), and the F-score. The OA, PA, and UA were all calculated by using the confusion matrix table [59]. The error matrix shown in Table 1 was based on the work of Shao et al. [59] and Olofsson et al. [61]. The table also contains the formulas for calculating the PA and UA.

Map Data
Reference Data Note: x jj is the fraction of correctly classified samples of class j in the diagonal of the error matrix, x ij is the fraction of samples in row i and column j, J is the total number of land-cover types, x i+ is the total fraction of class i extracted from map data, and x +j is the total fraction of class j obtained from the reference data.
According to the error matrix in Table 1, the OA was calculated as follows: where x jj represents the proportion of correctly classified samples in class j in the diagonal of the error matrix (Table 1), and J is the total number of land-cover types. Finally, the F-score evaluates the accuracy of a class using a weighted average of the producer's and user's accuracy [62,63]. The F-score was calculated as follows:

Verifying the Optimality
To thoroughly verify the optimality of the proposed approach based on logarithmic transformation and vegetation and water indexes, we also considered five NTL composite indicators: the NTL vegetation index (NVI), NTL water index (NWI), logarithmic NTL vegetation index (LNVI), logarithmic NTL water index (LNWI), and NTL vegetation-water index (NVWI). These indicators were calculated as follows: NW I = NTL × NDW I B (10) LNW I = lg(NTL) × NDW I B In this study, the overall accuracy, PA, UA, and F-score of urban land extracted based on different indicators were used to fully confirm the optimality of the approach with these indicators (Figures 2 and 3). Specifically, the mean OA obtained with the VWANUI was 4.29%, 2.96%, 3.74%, 4.77%, 3.84%, and 1.89% higher than that obtained by using the NTL, NVI, NWI, LNVI, LNWI, and NVWI, respectively. Likewise, the corresponding mean urban PA was 13.74%, 6.42%, 17.05%, 10.85%, 34.59%, and 4.10% higher; the corresponding mean urban UA was 7.02%, 5.98%, 16.72%, 10.70%, 34.19%, and 4.11% greater; and the corresponding urban F-score was 10.61%, 6.20%, 16.88%, 10.77%, 34.39%, and 4.10% greater. The results show that, for urban areas extracted using the VWANUI, the overall accuracy, PA, UA, and F-score are all larger than for urban areas extracted using the other indexes. In brief, compared with these different indicators, the VWANUI provides the highest accuracy for the OA, urban PA, urban UA, and F-score for all the cities in the study area. Therefore, the proposed approach based on the log-transformed NTL and vegetation and water indexes is the optimal approach.

Advantage of Using the VWANUI
To verify the advantages of the proposed approach, VWANUI images ( Figure 4c) and DMSP-OLS NTL images (Figure 4b) were compared with higher-resolution Landsat-8 OLI images (Figure 4a) for five rapidly urbanizing cities: Guangzhou, Foshan, Jiangmen, Huizhou, and Zhaoqing. As shown in Figure 4, the NTL of the urban core areas is constant because of saturation. In addition, the blooming effects in the NTL data give a higher DN to the dispersed images around the urban areas. However, the VWANUI images succeed in separating nonurban land cover from urban land, even in the surrounding urban areas (Figure 4c).
To further evaluate the capability of the VWANUI to reduce saturation and eliminate blooming, the saturated DMSP-OLS NTL, HSI, VANUI, and VWANUI were visually compared along a latitudinal transect through the research areas ( Figure 5). The VWANUI image changes significantly in the urban core areas, and its ground surface texture is abundant. Although the HSI, VANUI, and VWANUI can reduce the saturation of the NTL data and increase the variability in urban areas, the VWANUI image has greater variability in the core urban areas and so more clearly benefits from the reduced saturation of the NTL data.  In addition, the VWANUI can eliminate blooming in the DMSP-OLS NTL. However, neither the VANUI nor the HSI reduces blooming because they remain at high values for nonurban land cover, especially water bodies and vegetation areas in core urban areas. By contrast, the VWANUI for these areas is exceptionally low-almost or close to zero, as shown in Figure 5. The results show that the VWANUI effectively eliminates blooming.
This study not only analyzed the latitudinal transect of the research areas, but also compared cities with different economic levels. Figure 6 visually compares the corrected images obtained by applying the different approaches to the different cities. In the DMSP-OLS NTL images, the urban coastline or riverbank and lakeside are significantly affected by the blooming of the NTL. Blooming is the most obvious phenomenon in the image, and the flashing NTL entering the adjacent water bodies is apparent. Figure 6 also shows that neither the VANUI nor the HSI can distinguish water bodies from urban land areas, whereas the VWANUI accurately removes water bodies and vegetation areas within the urban areas to reveal an accurate spatial distribution of urban land. In a word, NTL blooming is significantly reduced or even eliminated in the VWANUI images.

Accuracy of the Various Approaches
The accuracy with which urban land is extracted reflects the extent to which saturation and blooming are reduced. In the present work, we selected 11 cities as samples for comparing the urban land extracted by using the DMSP, HSI, and VANUI and the proposed VWANUI to assess the accuracy with which urban land is extracted by using the VWANUI. Figure 7 compares the OA and F-score for the urban land of 11 cities in the study area, extracted by the different approaches. The mean OA obtained by using the VWANUI was 4.32%, 3.59%, and 2.98% higher than that obtained by using the DMSP, HSI, and VANUI, respectively. Likewise, the corresponding mean urban F-score was 10.66%, 7.35%, and 6.20% higher. These results indicate that the VWANUI provides a higher extraction accuracy than the other approaches.   Figure 8 compares the PA and UA obtained with the different approaches for the 11 cities in the study area. The mean urban PA obtained by using the VWANUI was 13.92%, 5.69%, and 6.42% higher than that obtained by using the DMSP, HSI, and VANUI, respectively. Likewise, the corresponding mean urban UA was 7.02%, 8.88%, and 5.98% greater, respectively. These results also show that the VWANUI more precisely extracts the urban land than the other three approaches. In short, compared with the DMSP, HSI, and VANUI, the VWANUI provides the highest accuracy in terms of the OA, urban PA, urban UA, and F-score for all the cities in the study area.
Furthermore, Guangzhou, Dongguan, and Shenzhen were selected as representative cities for comparing the extraction results (Figure 9). The results show that the spatial pattern of urban land obtained by using the VWANUI is more consistent than that obtained from the LUC dataset. Urban land extracted based on DMSP-OLS NTL usually has larger errors (in green) because of saturation in the urban core areas and blooming effects near coastal areas, rivers, vegetation cover, bare soil, and water bodies. However, the VWANUI not only reduces the saturation of urban core areas, but also removes blooming around coastal areas, rivers, large water bodies, etc., although some error remains near small rivers. Finally, the error in urban land extracted by using the VWANUI is also reduced, so the extraction accuracy for urban land is improved when using the VWANUI.

Estimation of Urban Land with VWANUI
Given that auxiliary vegetation-water information reduces the saturation and blooming of DMSP-OLS NTL data, the VWANUI can extract accurate urban information from coarse-resolution NTL data. Urban land is an important indicator for measuring the degree of urbanization, and numerous studies have shown that NTL data correlate positively with urban land areas [6,43,64].
In this work, we used a regression model to estimate urban land in the study area. In the linear regression model, urban land (UL) is obtained from a LUC dataset as reference data (dependent variable), and the various indexes serve as independent variables. We then compared the models of regression between various indexes and reference data to further evaluate the capability of the VWANUI to extract urban land. The coefficient of the correlation between the extracted results and the reference data was used to determine the performance of the regression model. In addition, the root-mean-square error (RMSE), which indicates the average size of the error, served for assessing the accuracy. The RMSE was calculated as follows: (14) where N is the number of samples, and p i and t i are the reference and estimated values, respectively. We first established the regression models for the normalized DMSP-OLS, HSI, VANUI, and VWANUI, respectively, and calculated the quantitative indicators, including the determination coefficient R 2 , the correlation coefficient R, and the RMSE, to evaluate the regression models. Table 2 provides the results for the regression model and the calculated indicators. The R 2 values for the regression models were 0.7000, 0.6352, 0.7530, and 0.8266 for the DMSP, HSI, VANUI, and VWANUI, respectively. The R 2 for the VANUI or VWANUI regression model exceeded the R 2 for the DMSP model, and the VWANUI regression model had the highest R 2 of 0.8266. This result shows that the VWANUI is the most appropriate index for estimating urban land. By contrast, the DMSP regression model had the lowest R 2 of 0.7000. The results for the correlation coefficient R are similar to those for R 2 . The VWANUI regression model had the highest R of 0.9092, whereas the R values for the DMSP, HSI, and VANUI regression models were 0.8367, 0.7970, and 0.8678, respectively. These results show that the VWANUI regression model provides the best performance for the study area. In addition, the VWANUI regression model had the lowest RMSE of 0.1425. The scatter diagram plotting the extraction results and the reference data ( Figure 10) shows that the correlation between the reference area and the extraction results for the VWANUI is higher than that between the reference area and the extraction results based on other indexes. This also indicates that the VWANUI regression model reduces saturation and blooming in DMSP-OLS NTL data, which allows it to estimate the urban land more accurately than the other models.

Analysis of Extraction Accuracy
The VWANUI has advantages in terms of reducing saturation and eliminating blooming from the DMSP-OLS NTL data. Because the NDVI and NDWI complement each other in terms of land cover, they are incorporated into the VWANUI to extract urban land information. The VWANUI introduces the NDVI and combines DMSP-OLS NTL with NDVI data. The vegetation coverage on urban land is generally less than that on nonurban land [41], so the NDVI correlates negatively with urban land areas, which can be used to reduce the saturation of NTL data in urban core areas. Thus, combining DMSP-OLS NTL with the NDVI can reduce the saturation in the NTL brightness to improve the accuracy of extraction. For example, Figure 11 shows that extraction using the VWANUI removes vegetated areas in the urban centers of Guangzhou, Foshan, and Huizhou.
Next, the NDWI is introduced into the VWANUI to remove blooming from saturated NTL intensities near water bodies inside core urban areas. As shown in Figure 11, the extraction results from using the VWANUI do not contain water bodies in the cities. In addition, the application of the logarithmic transformation enlarges the NTL difference between urban and nonurban areas, which can improve the accuracy with which urban land is extracted from NTL data. Therefore, the VWANUI constructed by combining the log-transformed DMSP-OLS NTL, the NDVI, and the NDWI reduces saturation and eliminates blooming from DMSP-OLS NTL data, thereby improving the accuracy with which urban land is extracted. In a previous study, the combination of the NDVI with NTL data reduced the saturation of NTL in urban core areas [43], which indicates the feasibility of combining NTL and vegetation data. In future studies, population data, surface-temperature data, or topographic and geomorphic data could be used to construct a more accurate method for estimating urban land areas [65].

Analysis of Regression Model
To further verify the performance of the VWANUI, we compared the R 2 for the reference data and those for the results extracted by using the DMSP, HSI, VANUA, and VWANUI ( Figure 12). In the model of regression between the extracted urban land and reference data, the R 2 values from the regression models based on the DMSP, HSI, VANUI, and VWANUI were 0.9942, 0.9926, 0.9996, and 0.9998, respectively. In the model of regression between the extracted common urban land and reference data, the R 2 values of the regression models based on the DMSP, HSI, VANUI, and VWANUI were 0.9503, 0.9229, 0.9445, and 0.9655, respectively. Common urban land refers to the common land areas between extracted areas and reference data. These results show that the VWANUI produces an R 2 for the regression model greater than that produced by the other three regression models.
This comparative analysis led to the conclusion that the VWANUI accurately extracts the spatial distribution of urban land and that the VWANUI regression model provides the most accurate estimates of urban land compared with the other three approaches. Therefore, we used the proposed VWANUI to extract urban land from the research area; the results appear in Figure 13 and show that the urban land was extracted with high precision in most cities, but the error in the extraction results was slightly larger in Huizhou, Jiangmen, and Zhaoqing, which are less economically developed. This discrepancy is attributed to the fact that the infrastructure in these cities is not sufficiently developed. The NTL brightness is lower in some areas, making the DMSP-OLS NTL data more difficult to obtain. Next, we briefly show that urban land is mainly distributed in the estuary of the Pearl River Basin and in coastal areas ( Figure 13). In particular, urban land distribution accounts for a larger proportion of the total area in Guangzhou, Foshan, Dongguan, and Shenzhen, indicating that these cities have a higher degree of urbanization.

Limitations of Data
The process used herein of fusing DMSP-OLS NTL data with the NDVI and NDWI data may introduce some errors. In addition, the limitations of NTL data are as follows. First, the uncertainty caused by the angle of atmospheric and surface optical properties, diurnal variation, and seasonal variation is the primary source of error in NTL data [66]. Surface factors such as surface reflectivity, vegetation coverage, snow cover, and surface impermeability also result in errors in NTL data [67]. 9 Figure 13. Urban land extracted based on proposed VWANUI applied to 11 cities within the research area.
Next, because DMSP-OLS data are archived historical data, they should be calibrated with the new NTL data from the Suomi National Polar-orbiting Partnership Visible Infrared Imaging Radiometer Suite (NPP-VIIRS). Recently, Hu et al. [68] and Ryan et al. [69] showed that ground-based stabilization and radiometric calibration light sources might provide a useful method for cross-calibrating NTL sensors. However, this remains a significant challenge due to differences in transit time, sensor calibration, spatial resolution, and other considerations [70,71]. Therefore, obtaining a consistent, long-term time series of NTL data across different platforms and sensors remains a challenge at present.
Third, given the lack of on-board calibration, varying atmospheric conditions, and sensor degradation, the time series of DMSP-OLS NTL data cannot be directly compared across years [72]. Finally, the degree of economic development, differences in urban and rural electricity consumption, and even differences in culture may produce variations in the brightness of urban NTL in different areas [73].

Research Plans and Prospects
An urgent need exists for timely and reliable quantitative information on urban land areas to support sustainable urban development and management [7]. First, the integration of DMSP-OLS NTL data with daytime optical remote sensing data and the fusing of DMSP-OLS with NPP-VIIRS data, or with the higher-resolution Luojia-01 data, can be further studied to develop a longer dataset for future research. In addition, a new model should be constructed by combining NTL remote sensing data with other remote sensing data for exploring the interactions between all the variables [24,74] and obtaining high-resolution data on urban land information with longer time series, which is vital for addressing the environmental problems caused by rapid and long-term urban expansion.
Next, it is necessary to make NTL data more quantifiable; ground and space measurements should be further calibrated. Some researchers have tried to calibrate images with accurate spatial resolution. For example, based on ground measurements, Hale et al. [75] calibrated aerial NTL images and obtained illuminance units. Li et al. [76] proposed using an unmanned aerial vehicle (UAV) to obtain nighttime light with a time resolution on the scale of hours and showed that the UAV is an effective tool for measuring urban NTL. Therefore, in future work, multiangle observation, high-dynamic-range imaging technology, radiation calibration, and more advanced sensors and methods available with UAVs could play a practical role in the accurate monitoring of urban NTL.
Finally, NTL observations should be integrated with other data and knowledge. Integrating NTL observations with geolocated big data, field data, and multispectral and multisource remote sensing data promises to lead to a comprehensive characterization and understanding of human activity, which will provide more opportunities for NTL remote sensing [77,78].

Conclusions
Accurate and timely information on urban land on regional and global scales is vital for discussing and studying environmental issues. The quantification of urban land is crucial for evaluating sustainable urban development. To address these needs, this work proposes using the VWANUI to describe urban characteristics. The research results show that the VWANUI reduces the saturation of NTL data from core urban areas, eliminates the blooming of NTL brightness from water bodies and vegetation areas within urban core areas, and provides more accurate and abundant surface details.
The proposed approach allows for more rapid and accurate extraction of urban land than previous methods. The mean OA obtained by using the VWANUI was 4.32%, 3.59%, and 2.98% greater than that obtained by using the DMSP, HSI, and VANUI, respectively; the corresponding mean urban PA increased by 13.92%, 5.69%, and 6.42%, respectively; and the corresponding mean urban UA increased by 7.02%, 8.88%, and 5.98%, respectively. In addition, the corresponding mean urban F-score increased by 10.66%, 7.35%, and 6.20%, respectively.
The determination coefficients, R 2 , of the regression models were 0.7000, 0.6352, 0.7530, and 0.8266 for the DMSP, HSI, VANUI, and VWANUI, respectively. The VWANUI regression model thus had the highest R 2 of 0.8266 and the lowest RMSE, at 0.1425, which indicates that the VWANUI is the most appropriate approach for estimating urban land.
In summary, the VWANUI combined with multisource remote sensing data produces a more accurate and detailed extraction of urban land than using single-source data. Our evaluation shows that the VWANUI reduces the saturation and blooming from DMSP-OLS NTL and provides more accurate urban land estimates. The VWANUI is easy to interpret and implement, facilitating the implementation of the accurate mapping of urban land over large-scale areas within a reasonable amount of time and at a low cost. In the future, the VWANUI will be applied to NPP-VIIRS NTL data or Luojia-01 NTL data with higher spatial resolution and wider radiant detection to extract accurate and timely urban information with higher resolution on national and global scales.

Institutional Review Board Statement: Not applicable.
Informed Consent Statement: Not applicable.

Data Availability Statement:
The data presented in this study are available on request from the corresponding author. The data are not publicly available because it is currently being used in relevant research projects.