Next Article in Journal
Preface: Earth Observations for Addressing Global Challenges
Next Article in Special Issue
Constructing a New Inter-Calibration Method for DMSP-OLS and NPP-VIIRS Nighttime Light
Previous Article in Journal
Estimation of the Particulate Organic Carbon to Chlorophyll-a Ratio Using MODIS-Aqua in the East/Japan Sea, South Korea
Previous Article in Special Issue
A New Fusion Approach for Extracting Urban Built-up Areas from Multisource Remotely Sensed Data
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Modeling China’s Prefecture-Level Economy Using VIIRS Imagery and Spatial Methods

1
School of Resource and Environment Science, Wuhan University, Wuhan 430079, China
2
Spatial Sciences Institute, University of Southern California, Los Angeles, CA 90089, USA
*
Author to whom correspondence should be addressed.
Remote Sens. 2020, 12(5), 839; https://doi.org/10.3390/rs12050839
Submission received: 8 December 2019 / Revised: 28 February 2020 / Accepted: 3 March 2020 / Published: 5 March 2020
(This article belongs to the Special Issue Remote Sensing of Nighttime Observations)

Abstract

:
Nighttime light (NTL) data derived from the Visible Infrared Imaging Radiometer Suite (VIIRS), carried by the Suomi National Polar Orbiting Partnership (NPP) satellite, has been widely used to evaluate gross domestic product (GDP). Nevertheless, due to the monthly VIIRS data fluctuation and missing data (excluded by producers) over high-latitude regions, the suitability of VIIRS data for longitudinal city-level economic estimation needs to be examined. While GDP distribution in China is always accompanied by regional disparity, previous studies have hardly considered the spatial autocorrelation of the GDP distribution when using NTL imagery. Thus, this paper aims to enhance the precision of the longitudinal GDP estimation using spatial methods. The NTL images are used with road networks and permanent resident population data to estimate the 2013, 2015, and 2017 3-year prefecture-level (342 regions) GDP in mainland China, based on eigenvector spatial filtering (ESF) regression (mean R2 = 0.98). The ordinary least squares (OLS) (mean R2 = 0.86) and spatial error model (SEM) (mean pseudo R2 = 0.89) were chosen as reference models. The ESF regression exhibits better performance for root-mean-square error (RMSE), mean absolute relative error (MARE), and Akaike information criterion (AIC) than the reference models and effectively eliminated the spatial autocorrelation in the residuals in all 3 years. The results indicate that the spatial economic disparity, as well as population distribution across China’s prefectures, is decreasing. The ESF regression also demonstrates that the population is crucial to the local economy and that the contribution of urbanization is growing.

1. Introduction

Gross domestic product (GDP) refers to the final outcome of the production activities of all permanent units in a certain period of time, calculated according to the market price. GDP is a crucial comprehensive statistical indicator in the accounting system and is also a core indicator in the national economic accounting system. It reflects the economic strength and market size of a city (or region). As a vital indicator, the GDP serves as an important reference function for political decision-making and national development [1]. The GDP is typically estimated by the government after a series of statistical procedures, but problems of inadequate measurement and slow publication remain [2]. Therefore, new methods are needed to provide timely and precise GDP estimates.
Nighttime light (NTL) data is an effective proxy for socioeconomic activity and has been widely used in many aspects, including socioeconomic statistical index estimation, urbanization, energy consumption, and ecological environments due to its availability and veracity [3,4,5,6,7,8,9,10,11,12]. Currently, two kinds of NTL data are widely applied to estimate GDP. The NTL data from the American Defense Meteorological Satellite Program (DMSP) Operational Linescan System (OLS) has approximately 2.7 km spatial resolution and has been used for large scale and time series GDP estimation, despite its relatively coarse spatial resolution and oversaturation problems [13,14,15,16,17,18,19,20,21,22,23]. The second is the new generation NTL data, collected by the Visible Infrared Imaging Radiometer Suite (VIIRS), carried by the Suomi National Polar Orbiting Partnership (NPP) satellite and made available for free by the Earth Observation Group (EOG) of Colorado School of Mines. With a resolution of approximately 500 m, the NPP-VIIRS data offers better data quality due to the wider radiometric detection and on-board calibration [24]. There are two configurations of VIIRS monthly composite NTL data, the “vcm” version that excludes any data impacted by stray light and the “vcmsl” version that includes the stray-light corrected data. Many scientists have used VIIRS data to map socioeconomic parameters [25,26,27,28,29,30,31], especially GDP. For example, Li et al. [32] explored the potential of NPP-VIIRS imagery for modeling China’s GDP in 31 provinces and 393 counties by linear regression. The VIIRS data also exhibits better performance than the DMSP-OLS data. Shi et al. [33] compared the effect of DMSP and VIIRS in 31 provinces and 268 prefectures using linear regression and showed that the corrected VIIRS data offers higher average R2 than DMSP in estimating GDP and electric power consumption. Zhao et al. [34] produced a pixel-level GDP map in south China and proved that the quadratic polynomial relationship offers a better fit than simple linear correlations between total nighttime light and economic activity at both the prefecture and county levels.
However, there are several problems in GDP studies using VIIRS data. First, researchers have found that the values of VIIRS imagery could be influenced by many factors, such as the observing angle and human activity intensity [35,36,37]. The VIIRS vcm version monthly composite images also have data impacted by stray light excluded over high-latitude areas of the northern hemisphere in summer. The fluctuations and “missing data” may influence the longitudinal measurements, so whether the vcm monthly VIIRS data is suitable for annual regional GDP estimation needs to be examined for smaller geographical units than the province level. Relevant experiments at the prefecture or county level for mainland China are still lacking. Second, many methods applied to estimate the GDP have not taken the spatial effect into consideration, which may decrease the accuracy and level of spatial explanation. The GDP distribution is accompanied by regional disparities, which depend on the study area and scale. The reasons NTL images shed light on economic characteristics are not only their availability and low cost but also the location information. Some of the information is lost when using NTL data to analyze the GDP distribution using non-spatial methods. Third, models using the NTL data are often difficult to interpret. Nonlinear models, such as the backpropagation neural network and quadratic polynomial model, lack explanatory information for multiple variables. Both precision and interpretation need to be considered when constructing GDP models incorporating spatial effects.
To solve the aforementioned problems, the main model we used here was eigenvector spatial filtering (ESF) regression. As a general linear model, eigenvector spatial filtering regression can reduce the spatial effects in model residuals by extracting eigenvectors from the adjacent matrix and linearly combining them in a spatial filtering function. ESF regression was developed by Griffith et al. and is becoming recognized because of its accuracy and usability [38]. ESF regression has been used to model PM2.5, the normalized difference vegetation index (NDVI), and landslide risk [39,40,41,42,43].
In this paper, a method using NTL combined with permanent resident population and OpenStreetMap (OSM) road networks based on eigenvector spatial filtering regression is proposed to estimate the GDP distribution in 342 Chinese regions (mostly prefecture-level) over 3 years: 2013, 2015, and 2017. Ordinary least squares (OLS) as a classical method and the spatial error model (SEM) as a popular spatial method were chosen as reference models. The aims of this study were to examine the efficacy of using VIIRS data to build a sustainable and precise method for estimating prefecture GDP across mainland China and how the significant variables in the ESF regression varied from one region of China to another. To illustrate the importance of NTL in GDP estimation, several experiments with and without NTL data were conducted and the effect of different versions of the VIIRS monthly data on GDP estimates was estimated.

2. Data and Methods

2.1. Study Area

The research scale focused on the mainland city level in China, including 329 prefectures, 4 municipalities, and 9 county-level cities, which are directly under the jurisdiction of the Hubei, Henan, and Xinjiang provinces. The study area excluded Hong Kong, Macao, and Taiwan. Several coastal cities and regions that lacked borders on the mainland, such as the city of Zhou Shan and the whole of Hainan province, were omitted because their geographical relations were difficult to define. Following these adjustments, the number of regions was 342. The study area and nighttime light image in 2013 are shown in Figure 1.

2.2. Dataset

2.2.1. Data Collection

The research data include population and economic statistical, nighttime light image, land cover, and OSM data. The population data refers to the actual population that lived in an area for a certain period (more than half a year). The 2013 prefecture-level city population and GDP data were relatively easy to collect as they were included in the 2014 China Statistical Yearbook for Regional Economy (http://tongji.cnki.net/kns55/Navi/HomePage.aspx?id=N2015070200&name=YZXDR&floor=1). Most of the 2015 and 2017 statistical data can be collected from the corresponding provincial statistical yearbooks, which can be downloaded from the China National Knowledge Infrastructure (CNKI) (http://tongji.cnki.net/kns55/), while population and GDP data in many remote regions were obtained from local statistics bureau websites.
The original “vcm” and “vcmsl” version monthly nighttime light (NTL) data were obtained from the Earth Observation Group, Colorado School of Mines (https://payneinstitute.mines.edu/eog/viirs/). The NTL data used here was the “vcm” NPP-VIIRS day/night band (DNB) cloud-free monthly average data, which had been filtered for stray light. Monthly composite NPP-VIIRS average radiance images for the years 2013, 2015, and 2017 were acquired to match the time window used for this project.
The 2013 and 2015 global land cover (LC) data were produced by the European Space Agency (ESA) Climate Change Initiative (CCI). The CCI-LC data has a 300 m spatial resolution from 1992 to 2015 (http://maps.elie.ucl.ac.be/CCI/viewer/download.php).
The 2014, 2016, and 2018 January road lengths were obtained from the OSM (http://download.geofabrik.de/asia/china.html) and used to represent the 2013, 2015, and 2017 end-of-year road lengths.
The administrative boundary map was downloaded from the Resource and Environment Data Cloud Platform (http://www.resdc.cn/Default.aspx), which includes some towns.

2.2.2. Dataset description

The chosen independent variables are the permanent resident population of the prefecture-level regions, denoted as Permanent; the average brightness of the region is denoted as MEAN; and the road length is denoted as Road. The calculation of the MEAN variable is shared in the next section. Our research region included 342 regions in mainland China over 3 years—2013, 2015, and 2017. The 3 datasets are summarized in Table 1, Table 2 and Table 3.
The tables show that all of the elements displayed have experienced varying rates of growth. The GDP and road length increased quickly, while the population remained more stable.

2.3. Methodology

The experiments included 3 steps: (1) the pre-processing of the input data; (2) the construction of the ESF regressions; and (3) the selection of the reference models and evaluation metrics.

2.3.1. Data Pre-Processing

To calculate the regional mean brightness, denoted as MEAN, original images need calibration. The 2013, 2015, and 2017 annual data were calculated from monthly images. Although the Colorado School of Mines Earth Observation Group (EOG) released 2015 annual NTL images, we chose to use monthly data to maximize data consistency. After assessing the quality of the NPP-VIIRS monthly data, we found that the May to August images contained missing data in high latitude regions, such as northeastern China. Since the study extent is the Chinese mainland, vcm version NPP-VIIRS data for the remaining 8 months were averaged, and the null values and negative values were converted to 0. The 12-month calibrated VIIRS data (vcmsl version) were processed in the same way and the results of the difference experiments are discussed in Section 4.1. According to previous research, a ceiling value needs to be obtained by comparing the highest value of the most economically advanced metropolitan area, such as Beijing, Shanghai, or Guangzhou [28]. The ceiling values for the 3 years of images and their corresponding city names are shown in Table 4.
The water body and snow mask were derived from CCI-LC data to eliminate the moonlight reflection effects of the snow and water, and the 2015 land cover data were used with 2015 and 2017 annual VIIRS images because it provided the closest ICC-LC data to 2017. Finally, the regional average brightness was calculated using the calibrated nighttime images:
M E A N =   1 n × 1 n D N i
where D N i refers to the region and n denotes the total pixel numbers in each of the administrative regions.
When extracting roads from the OSM, it should be specified that the whole road length includes primary roads, secondary roads, tertiary roads, cycle ways, and service roads. The OSM data were reprojected from the WGS-84 geographic coordinate system to the Asia North Albers Equal Area geographical coordinate system to calculate lengths of the roads included in the 2013 China road network data reproduced in Figure 2.
Logarithmic transformation was used to process all of the collected variables, since OLS regression requires normally distributed data. For the comparability of different variables, all of the collected independent variables in models were standardized. The 2015 and 2017 data were processed in the same way as the 2013 data.

2.3.2. ESF Regression Construction

The ESF regression uses the linear combination of selected eigenvectors calculated from the spatial weights matrix to filter out the spatial autocorrelation. The basic equation of the ESF regression is as follows:
Y   =   X β + E α + ε
where Y is an n × 1 vector of the dependent variables, X is an n × p matrix containing the independent factors, E is an n × m matrix consisting of m filtered eigenvectors, α   and   β are the corresponding regression coefficients, and ε is an independent and identically distributed random error.
Building the GDP ESF regression model includes the following procedures:
(1)
Building a spatial weights matrix according to the adjacent relationships of the 342 regions.
(2)
Centering the spatial weights matrix and calculating the eigenvectors, as well as the corresponding eigenvalues.
(3)
Selecting the eigenvectors by two methods. First, by singling out the eigenvectors, whose corresponding eigenvalues divided by the maximum eigenvalue were larger than 0.1 [43,44], which means selecting the most important eigenvectors. The second method used a stepwise procedure with all of the covariates to simplify the model [45,46]. The independent variables, such as average brightness or population, will be dislodged with this method if they are not reasonable.
(4)
Calculating the regression coefficients in the OLS estimate.
The rook criterion, meaning that the different city regions were defined as adjacent if they shared a common side, was used to construct the spatial weights matrix C0. Because of the irregular regional boundaries, the neighbor relation is almost the same, regardless of whether queen or rook adjacency rules are used. It is imperative to center the spatial weights matrix to calculate the eigenvectors. C 1 is the matrix after C 0 is centered, as follows:
C 1 = ( I 11 T n ) C 0 ( I 11 T n )
where 1 is an n × 1 vector of ones, T is the matrix transpose operator, such that 11 T equals an n × n matrix, whose elements are one and n is 342, and I is an n × n identity matrix, whose diagonal elements are 1 and the others are 0.
The final ESF regression model with the best model fitness had the following form:
ln ( G D P ) = β 1 ln ( P e r m a n e n t ) + β 2 ln ( M E A N ) + β 3 ln ( R o a d ) + β k E k + β 0 + ε  
where β 0 is the intercept, β i (i = 1,2,3) are the regression coefficients, E k is an n × k matrix consisting of the selected eigenvectors, and β k E k represents the spatial autocorrelation.

2.3.3. Reference Model and Evaluation Standards

Several models were chosen to compare model performances. The classic ordinary least squares (OLS) is a non-spatial method and was chosen to compare spatial and non-spatial models. The spatial error model (SEM) was chosen to represent spatial methods that endeavor, like ESF, to eliminate the spatial autocorrelation in the regression residuals.
The SEM produces pseudo R 2 values, which cannot be compared with R 2 . The model fitness was evaluated from five perspectives: R 2 and adjusted   R 2 , root-mean-square error (RMSE), mean absolute relative error (MARE), the Akaike information criterion (AIC), and Morans’ I for the regression residuals. Although the GDP used a logarithmic transformation, to make the results more intuitive, the calculation of RMSE and MARE used the predicted GDP values as follows:
RMSE = 1 n 1 n ( G D P i G D P i e s t i m a t e ) 2
RE i = | G D P i G D P i e s t i m a t e | G D P i
MARE = 1 n RE i   × 100 n
where G D P i e s t i m a t e is the estimated GDP value in the ith region and G D P i is the official statistical GDP value of the corresponding region. REi is the absolute value of regional relative errors. The Morans’ I of the residuals is used to measure the spatial autocorrelation in the residuals. In addition to the above standards, leave-one-out cross validation was also needed to test the model robustness.

3. Results

3.1. Correlation Analysis and Multi-Collinearity test

To test whether or not the independent variables are qualified to estimate the GDP, the Pearson correlation coefficients between the GDP and other covariates were calculated. Table 5 shows that all of the chosen independent variables passed the significance test, and the correlation results demonstrate that the covariates had a strong relationship with the GDP. The minimum correlation value was 0.68 between the GDP and road length. The average correlation values of Permanent, MEAN, and Road with the GDP were 0.83, 0.70, and 0.70, respectively, showing that the GDP and population had the closest connection. Generally, all the chosen variables were qualified to estimate the prefecture-level GDP.
Both the ESF and OLS regression models passed the multi-collinearity test in each of the 3 years, as the variance inflation factor (VIF) values shown in Table 6 and Table 7.
The results in this pair of tables shows that none of the VIF values were larger than 10. This means that the two linear models passed the multi-collinearity test in all 3 years and that these coefficients were able to explain the influence between the GDP and independent variables.

3.2. Model Performance

3.2.1. Regression results

The top 3 ESF, OLS, and SEM regression models are summarized in Equations (8)–(10), (11)–(13), and (14)–(16), respectively:
ln ( G D P ) = 0.54 × ln ( P e r m a n e n t ) + 0.45 × ln ( M E A N ) + 0.20 × ln ( R o a d ) + β k × E k + 6.99 + ε ( 2013 )
ln ( G D P ) = 0.62 × ln ( P e r m a n e n t ) + 0.49 × ln ( M E A N ) + 0.22 × ln ( R o a d ) + β k × E k + 7.08 + ε ( 2015 )
ln ( G D P ) = 0.52 × ln ( P e r m a n e n t ) + 0.61 × ln ( M E A N ) + 0.23 × ln ( R o a d ) + β k × E k + 7.19 + ε ( 2017 )
ln ( G D P ) = 0.66 × ln ( P e r m a n e n t ) + 0.45 × ln ( M E A N ) + 0.17 × ln ( R o a d ) + 6.97 + ε ( 2013 )
ln ( G D P ) = 0.70 × ln ( P e r m a n e n t ) + 0.44 × ln ( M E A N ) + 0.16 × ln ( R o a d ) + 7.07 + ε ( 2015 )
ln ( G D P ) = 0.72 × ln ( P e r m a n e n t ) + 0.47 × ln ( M E A N ) + 0.10 × ln ( R o a d ) + 7.18 + ε ( 2017 )
ln ( G D P ) = 0.58 × ln ( P e r m a n e n t ) + 0.48 × ln ( M E A N ) + 0.24 × ln ( R o a d ) + ( 1 0.59 × C 0 ) 1 + 6.94 + ε   ( 2013 )
ln ( G D P ) = 0.62 × ln ( P e r m a n e n t ) + 0.48 × ln ( M E A N ) + 0.22 × ln ( R o a d ) + ( 1 0.57 × C 0 ) 1 + 7.06 + ε   ( 2015 )
ln ( G D P ) = 0.62 × ln ( P e r m a n e n t ) + 0.52 × ln ( M E A N ) + 0.19 × ln ( R o a d ) + ( 1 0.56 × C 0 ) 1 + 7.17 + ε   ( 2017 )
where E k is the eigenvectors that were selected to filter the spatial autocorrelation, so the residual ε could satisfy the independent identity distribution requirement. The β k × E k in ESF regressions can filter the spatial effect out of the residuals and the coefficients of the collected variables included in the aforementioned results were significant ( α   =   0.001 , two-tailed test).

3.2.2. Model Fitness

The various metrics noted earlier can be used to assess different aspects of model performance and to decide which model is the best for estimating mainland China GDP at the prefecture level. The RMSE results reproduced in Table 8, Table 9 and Table 10 use 108 yuan.
Although pseudo R 2 and R 2 cannot technically be compared because their statistical methods are different, they are still included in the tables, since they can reveal the difference between the ESF and OLS regressions.
From Table 8, Table 9 and Table 10, it is difficult to obtain intuitive knowledge of regression fitness. The visualization of the models is more telling. All of the regression outputs for 3 years are displayed in Figure 3, Figure 4 and Figure 5.
Take Figure 4 as an example, it shows the three models with different regression characteristics in 2015. All of the GDP distribution images used the same segmentation scale, so the corresponding color is more illustrative, and the four images can be directly compared. The OLS fit was poor in many regions, such as the inner Mongolia region, Yunnan Province, and mid-south of China. The SEM is accurate in most regions, except those in the west. The best regression result comes from the ESF model, which, for the most part, reduces differences to the original GDP distribution with only parts of the southeastern regions not fitting reality.

3.2.3. Cross Validation.

Cross validation is necessary because the model needs to simulate the unknown situation to a certain extent, and this provides a test of model robustness. Table 11 and Table 12 show the leave-one-out cross validation (LOOCV) for the RMSE and MARE results of the OLS and ESF models, because the leave-one-out cross validation cannot be processed for the SEM model. The RMSE units are 108 Chinese yuan (as before).
The results in this pair of tables show that the ESF regression is better than OLS in terms of RMSE and MARE in all 3 years, proving the robustness of the ESF regression. Moreover, there is a trend of quickly declining accuracy for the OLS, while the MARE of the ESF regression is relatively stable.

4. Discussion

4.1. VIIRS Data Evaluation

During the NPP-VIIRS data processing, the May to August monthly average composite images were filtered for missing data and the three annual images were derived from the other months. As shown in Table 2 and Table 3, the average brightness of the 2015 annual image was slightly higher than that of the 2013 annual image. The mean value of MEAN increased rapidly from 1.16 to 1.44 in the 2017 annual image.
The NTL data was obviously important to GDP estimation. The same experiments were processed without the NTL variable and it turned out that nearly all of the accuracy metrics for the three models declined. The results are summarized in Appendix A Table A1, Table A2, Table A3, Table A4 and Table A5 and include the leave-one-out cross validations. Although the ESF regression could maintain the R 2 at a relatively high level, the MARE and RMSE of all methods increased dramatically, which indicates the importance of NTL data. Table A4 and Table A5 suggest that the model robustness dropped sharply because the minimum value of the ESF regression LOOCV MARE was >30%, while the maximum MARE value of the ESF regression with the NTL variable was only 23.92%. These differences show that the NTL data was indispensable to GDP estimation in our research.
The MEAN factor was calculated from the 12 months’ vcmsl version VIIRS data using the same procedures, and the results of the parallel experiments for 2015 and 2017 are shown in Appendix A Table A6, Table A7 and Table A8. These results reveal that the vcmsl version NTL is appropriate for evaluating the GDP distribution across China. The vcm version performed a little better than the vcmsl version in terms of RMSE and MARE. For example, in 2015 and 2017 regressions, 10 of the 12 regression accuracy metrics, as well as 5 of the 8 LOOCV metrics of vcm version’s RMSE and MARE, were a little better than the vcmsl version results. But overall, the differences between the NTL versions were much smaller than those between models and ESF regression performed better than other methods.
Although the vcm version VIIRS data contain fluctuations and “missing data”, it is still possible to estimate GDP at the prefecture scale across multiple years in combination with other variables, and it has great impact on GDP estimates. From Table 9, Table 10 and Table 11, it is evident that the three models have similar fitness for different years. Apart from the AIC, which cannot be compared over the years, and the RMSE, which increased due to the GDP growth, the other aspects of each model displayed similar accuracies. For instance, the maximum difference for R 2 of the OLS in the 3 years was only 0.02 and for A d j u s t e d   R 2 was 0.03. The spatial models were more stable. The ESF regression, for example, maintained R 2 at 0.98 in all 3 years. Consequently, the main difference of the study came from the model instead of NPP-VIIRS, proving the ability of these data to estimate GDP over multiple years at the prefecture scale over all of mainland China.

4.2. Accuracy Assesment

In terms of overall model fitness, spatial models made different degrees of progress compared to OLS, and the ESF regression achieved significant improvement over the other models. In 2013, the R 2 of the ESF regression was 0.98 and the A d j u s t e d   R 2 was 0.96. Compared to the OLS regression, the ESF regression improved 12.64% in R 2 and 10.34% in A d j u s t e d   R 2 . The SEM model showed progress compared to OLS by reducing AIC values from 366.13 to 285.75. The difference between the AIC values in the SEM and ESF regression models was only 28.49. The improvement was also noticeable in the other 2 years.
In terms of regression error, the precision of ESF regression was obvious. The average RMSEs of the OLS, SEM, and ESF models were 1135.63, 1047.96, and 463.41, respectively. The average MARE of the ESF regression over the 3 years was 12.57% compared to 29.05% for SEM. This difference means that the ESF regression was better for estimating the prefecture-level GDP in China. Interestingly, the ESF regression was very stable, given that the 3-year ESF regressions of some areas obtained similar MARE results. The MARE of over 200 regions was less than 10% for 3 years and 11 repeat regions, whose MARE was larger than 30% over the 3 years. The average regional REi (Equation (4)) values were calculated for the three models and are shown in Figure 6. All images used the same segmentation scale, and it was observed that larger the light areas were, the better the regression effect. The ESF regression obviously maintained relative errors <30% in most regions, while the other methods did not work well. In terms of the maximum relative error, the OLS and SEM both reached over 200% compared to 83.78% for the ESF regression.
The last accuracy metric was Morans’ I, which measures the spatial autocorrelation in the regression model residuals. The OLS had the largest residual Morans’ I, since the nonspatial model could not remove the spatial effects in the residuals. All of the spatial regressions had better outcomes than OLS. Over 3 years, the SEM could reduce the Morans’ I to nearly 0, and, although the ESF regression was not as good as the SEM, it still eliminated most of the spatial effects in the residuals by lowering the absolute values of Morans’ I values 82.86, 82.76, and 81.25% from OLS.
Although the ESF regression did not have the best performance in eliminating the spatial autocorrelation effects in the residuals, it was a close second. The ESF regression had great accuracy in R 2 , A d j u s t e d   R 2 , AIC, RMSE, and MARE over the 3 years that were used. To conclude, the ESF regression had the best fitness of the chosen models here.

4.3. Variable Coefficients Analysis

The three variables in all of the regressions positively correlated with prefecture GDP. However, the individual influences were very different. Different aspects can be discussed to explain the contribution or potential meaning of the independent variables. All of the chosen variables were standardized so that they could be compared using their coefficients.
The coefficient of population was the largest, with the exception of the 2017 ESF regression, which suggests that Permanent was the most important variable. The coefficients of population increased from 0.66 to 0.72 over the three OLS regressions. The population of permanent residents undoubtedly influenced local economic development. This may explain why many cities, such as Wuhan, Hangzhou, and Chengdu, have sought to improve city competitiveness by trying to attract people to work and live there.
The NTL data has a strong relationship with local urbanization [47]. The average regional brightness can reveal the regional construction activity, and we use the MEAN factor to represent urbanization. The MEAN factor has the second largest coefficient in most of the regressions, suggesting the importance of urbanization level to economic activity. According to the 3-year ESF regression results, the coefficient of the MEAN increased, which indicates that the contribution of urbanization to regional economic activity is increasing.
There is an interesting common point that the sum of the coefficients of the Mean and Permanent increase with time in all three models. The sum increased from 0.99 to 1.11 to 1.13 for the ESF regression, from 1.11 to 1.14 and then to 1.19 for the OLS regression, and from 1.06 to 1.10 and then to 1.14 for the SEM regression. This situation may not have been a coincidence. The combination of these two variables contributed to most of the variables, meaning that population and urbanization were significant in GDP development. These factors play increasingly important roles in regional economic activity, which is consistent with the reality of the politics of attracting talent, especially in second-tier cities with continuous urban construction. These increasing coefficients may demonstrate that the policy will continue to rely on talent recruitment and city construction to maintain economic growth.

4.4. Spatial Autocorrelation Analysis

The regional results point to development inequalities. The GDP is a fine indicator of regional development degree. Morans’ I was used to explore whether the GDP distribution in China is accompanied by spatial autocorrelation, and the results are summarized in Table 13.
The mean Morans’ I of China prefecture GDP for the 3 years is 0.25, showing that the distribution of prefecture GDP exhibits a positive spatial autocorrelation relationship. The economically advanced regions are clustered in the southern and eastern coastal zones, and the same is evident in the relatively underdeveloped areas, such as the Tibet and Xinjiang regions. It could explain why the two spatial models performed better than OLS, since spatial effects had an impact on model accuracy. However, the Morans’ I value declines from 0.26 to 0.23 from 2013 to 2017, indicating that China’s prefecture development inequalities are slowly declining.
The population shows the same trend as GDP (Table 14). From 2013 to 2017, the Morans’ I declined from 0.28 to 0.23. The positive autocorrelation degree of permanent resident population across the prefecture regions is declining.

5. Conclusions

Despite the presence of signal fluctuations and missing data in the vcm version NPP-VIIRS monthly cloud free average composite data, it is still suitable to estimate prefecture GDP across all of mainland China and multiple years. Our study indicates that the main differences in the results depends on the choice of model instead of the VIIRS monthly data version.
The main advantages of the ESF regression in estimating the GDP is its reliability and explanatory power. The reliability of the ESF regression in estimating the GDP includes precision and stability. The mean R 2 of the ESF regression is approximately 0.98, and the MARE varies from 11.83 to 13.83% in 2013 and 2017, respectively. The cross validation also proved the robustness of the ESF regression. In addition, the method was very stable over the 3 years, which means that the ESF regression could be used in time series studies.
With reliable results, the interpretation of the GDP distribution is important because it reveals the contribution of the variables over time. The results of the ESF regression demonstrate that the population is crucial to local economic activity and that the contribution of urbanization is growing. Furthermore, the spatial autocorrelation of China prefecture GDP distribution and permanent resident population decreased from 2013 to 2017, indicating that regional economic disparities are weakening.
Since the relationship between NTL and GDP is very complicated and the GDP distribution is influenced by various factors, future work will concentrate on improving the precision of GDP ESF regression by including more accurate independent variables to describe economic structures and land use. We also plan to use more precise data sources, such as Baidu map or Amap, not only to promote accuracy but also to provide real-time updates.

Author Contributions

J.C. collected and processed the data, performed analysis, and wrote the paper. J.P.W. helped to write and edit the article. J.C., Y.C., and H.T. analyzed the results. J.Y. and Z.X. contributed to the validation. All authors have read and agreed to the published version of the manuscript.

Funding

This work is financially supported by the National Key R&D Program of China: [grant number 2018YFB0505302] and the National Nature Science Foundation of China: [grant numbers 41671380].

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

Table A1, Table A2, Table A3, Table A4 and Table A5 summarize the results for the experiments without the NTL variable. Table A6, Table A7 and Table A8 summarize the results using the two versions of the VIIRS monthly data for GDP estimation.
Table A1. Model performance without NTL variable in 2013.
Table A1. Model performance without NTL variable in 2013.
R 2 A d j u s t e d   R 2 AICRMSEMAREMorans’ I
OLS0.740.73595.451582.3849.76%0.37
SEM0.80 (pseudo)/494.871472.4538.52%−0.03
ESF0.970.94209.94494.4115.06%−0.04
Table A2. Model performance without NTL variable in 2015.
Table A2. Model performance without NTL variable in 2015.
R 2 A d j u s t e d   R 2 AICRMSEMAREMorans’ I
OLS0.760.76577.911802.6351.36%0.33
SEM0.81 (pseudo)/499.351701.2141.43%−0.03
ESF0.970.94181.40420.6913.57%−0.01
Table A3. Model performance without NTL variable in 2017.
Table A3. Model performance without NTL variable in 2017.
R 2 A d j u s t e d   R 2 AICRMSEMAREMorans’ I
OLS0.730.73638.982457.0257.46%0.35
SEM0.79 (pseudo)/555.372286.9944.36%−0.04
ESF0.960.93287.56888.5616.53%−0.05
Table A4. LOOCV RMSE of the OLS and ESF without NTL variable.
Table A4. LOOCV RMSE of the OLS and ESF without NTL variable.
201320152017
OLS1597.791820.532478.36
ESF941.88884.181757.20
Table A5. LOOCV MARE of the OLS and ESF without NTL variable.
Table A5. LOOCV MARE of the OLS and ESF without NTL variable.
201320152017
OLS50.34%51.98%58.03%
ESF30.98%31.07%34.94%
Table A6. Model performance of vcmsl version NTL data in 2015.
Table A6. Model performance of vcmsl version NTL data in 2015.
R 2 A d j u s t e d   R 2 AICRMSEMAREMorans’ I
OLS0.870.87366.771037.0734.04%0.31
SEM0.89 (pseudo)/301.55988.4528.71%−0.01
ESF0.980.9638.41502.5512.82%−0.05
Table A7. Model performance of vcmsl version NTL data in 2017.
Table A7. Model performance of vcmsl version NTL data in 2017.
R 2 A d j u s t e d   R 2 AICRMSEMAREMorans’ I
OLS0.850.85442.181385.2539.62%0.32
SEM0.88 (pseudo)/373.061246.8632.38%−0.03
ESF0.980.9678.52626.4413.55%−0.06
Table A8. LOOCV results of the OLS and ESF for vcmsl version NTL data.
Table A8. LOOCV results of the OLS and ESF for vcmsl version NTL data.
RMSEMARE
2015201720152017
OLS1065.791420.0434.61%40.22%
ESF795.991050.3220.89%24.27%

References

  1. Henderson, J.V.; Storeygard, A.; Weil, D.N. Measuring Economic Growth from Outer Space. Am. Econ. Rev. 2012, 102, 994–1028. [Google Scholar] [CrossRef] [Green Version]
  2. Feige, E.L.; Urban, I. Measuring underground (unobserved, non-observed, unrecorded) economies in transition countries: Can we trust GDP? J. Comp. Econ. 2008, 36, 287–306. [Google Scholar] [CrossRef] [Green Version]
  3. Chen, T.-H.K.; Prishchepov, A.V.; Fensholt, R.; Sabel, C.E. Detecting and monitoring long-term landslides in urbanized areas with nighttime light data and multi-seasonal Landsat imagery across Taiwan from 1998 to 2017. Remote Sens. Environ. 2019, 225, 317–327. [Google Scholar] [CrossRef]
  4. Liu, Z.; He, C.; Zhang, Q.; Huang, Q.; Yang, Y. Extracting the dynamics of urban expansion in China using DMSP-OLS nighttime light data from 1992 to 2008. Landsc. Urban Plan. 2012, 106, 62–72. [Google Scholar] [CrossRef]
  5. Shi, K.; Chen, Y.; Yu, B.; Xu, T.; Yang, C.; Li, L.; Huang, C.; Chen, Z.; Liu, R.; Wu, J. Detecting spatiotemporal dynamics of global electric power consumption using DMSP-OLS nighttime stable light data. Appl. Energy 2016, 184, 450–463. [Google Scholar] [CrossRef]
  6. Tan, M.; Li, X.; Li, S.; Xin, L.; Wang, X.; Li, Q.; Li, W.; Li, Y.; Xiang, W. Modeling population density based on nighttime light images and land use data in China. Appl. Geogr. 2018, 90, 239–247. [Google Scholar] [CrossRef]
  7. Yu, B.; Shu, S.; Liu, H.; Song, W.; Wu, J.; Wang, L.; Chen, Z. Object-based spatial cluster analysis of urban landscape pattern using nighttime light satellite images: A case study of China. Int. J. Geogr. Inf. Sci. 2014, 28, 2328–2355. [Google Scholar] [CrossRef]
  8. Zhao, J.; Ji, G.; Yue, Y.; Lai, Z.; Chen, Y.; Yang, D.; Yang, X.; Wang, Z. Spatio-temporal dynamics of urban residential CO2 emissions and their driving forces in China using the integrated two nighttime light datasets. Appl. Energy 2019, 235, 612–624. [Google Scholar] [CrossRef]
  9. Zheng, Q.; Jiang, R.; Wang, K.; Huang, L.; Ye, Z.; Gan, M.; Ji, B. Monitoring the trajectory of urban nighttime light hotspots using a Gaussian volume model. Int. J. Appl. Earth Obs. Geoinf. 2018, 65, 24–34. [Google Scholar] [CrossRef]
  10. Yu, B.; Shi, K.; Hu, Y.; Huang, C.; Chen, Z.; Wu, J. Poverty Evaluation Using NPP-VIIRS Nighttime Light Composite Data at the County Level in China. IEEE J. Sel. Top. Appl. Earth Observ. 2015, 8, 1217–1229. [Google Scholar] [CrossRef]
  11. Yu, B.; Tang, M.; Wu, Q.; Yang, C.; Deng, S.; Shi, K.; Peng, C.; Wu, J.; Chen, Z. Urban Built-Up Area Extraction From Log- Transformed NPP-VIIRS Nighttime Light Composite Data. IEEE Geosci. Remote Sens. Lett. 2018, 15, 1279–1283. [Google Scholar] [CrossRef]
  12. Ma, T. Multi-Level Relationships between Satellite-Derived Nighttime Lighting Signals and Social Media–Derived Human Population Dynamics. Remote Sens. 2018, 10, 1128. [Google Scholar] [CrossRef] [Green Version]
  13. Cao, X.; Hu, Y.; Zhu, X.; Shi, F.; Zhuo, L.; Chen, J. A simple self-adjusting model for correcting the blooming effects in DMSP-OLS nighttime light images. Remote Sens. Environ. 2019, 224, 401–411. [Google Scholar] [CrossRef]
  14. Elvidge, C.D.; Baugh, K.E.; Kihn, E.A.; Kroehl, H.W.; Davis, E.R.; Davis, C.W. Relation between satellite observed visible-near infrared emissions, population, economic activity and electric power consumption. Int. J. Remote Sens. 2010, 18, 1373–1379. [Google Scholar] [CrossRef]
  15. Ghosh, T.; Anderson, S.; Powell, R.; Sutton, P.; Elvidge, C. Estimation of Mexico’s Informal Economy and Remittances Using Nighttime Imagery. Remote Sens. 2009, 1, 418–444. [Google Scholar] [CrossRef] [Green Version]
  16. Pandey, B.; Zhang, Q.; Seto, K.C. Comparative evaluation of relative calibration methods for DMSP/OLS nighttime lights. Remote Sens. Environ. 2017, 195, 67–78. [Google Scholar] [CrossRef]
  17. Propastin, P.; Kappas, M. Assessing Satellite-Observed Nighttime Lights for Monitoring Socioeconomic Parameters in the Republic of Kazakhstan. GISci. Remote Sens. 2013, 49, 538–557. [Google Scholar] [CrossRef]
  18. Roychowdhury, K.; Jones, S.D.; Arrowsmith, C.; Reinke, K. A Comparison of High and Low Gain DMSP/OLS Satellite Images for the Study of Socio-Economic Metrics. IEEE J. Sel. Top. Appl. Earth Observ. 2011, 4, 35–42. [Google Scholar] [CrossRef]
  19. Wu, J.; Wang, Z.; Li, W.; Peng, J. Exploring factors affecting the relationship between light consumption and GDP based on DMSP/OLS nighttime satellite imagery. Remote Sens. Environ. 2013, 134, 111–119. [Google Scholar] [CrossRef]
  20. Zhu, X.; Ma, M.; Yang, H.; Ge, W. Modeling the Spatiotemporal Dynamics of Gross Domestic Product in China Using Extended Temporal Coverage Nighttime Light Data. Remote Sens. 2017, 9, 626. [Google Scholar] [CrossRef] [Green Version]
  21. Alahmadi, M.; Atkinson, P.M. Three-Fold Urban Expansion in Saudi Arabia from 1992 to 2013 Observed Using Calibrated DMSP-OLS Night-Time Lights Imagery. Remote Sens. 2019, 11, 2266. [Google Scholar] [CrossRef] [Green Version]
  22. Imhoff, M.L.; Lawrence, W.T.; Stutzer, D.C.; Elvidge, C.D. A technique for using composite DMSP/OLS “City Lights” satellite data to map urban area. Remote Sens. Environ. 1997, 61, 361–370. [Google Scholar] [CrossRef]
  23. Wang, L.; Li, W.; Wang, P.; Liu, X.; Yang, F.; Qu, J.J. Spatiotemporal characterization of the urban sprawl and its impacts on urban island in China with DMSP/OLS and MODIS measurements. Theor. Appl. Climatol. 2019, 138, 293–303. [Google Scholar] [CrossRef]
  24. Elvidge, C.D.; Baugh, K.E.; Zhizhin, M.; Hsu, F.-C. Why VIIRS data are superior to DMSP for mapping nighttime lights. Proc. Asia Pac. Adv. Netw. 2013, 35. [Google Scholar] [CrossRef] [Green Version]
  25. Dai, Z.; Hu, Y.; Zhao, G. The Suitability of Different Nighttime Light Data for GDP Estimation at Different Spatial Scales and Regional Levels. Sustainability 2017, 9, 305. [Google Scholar] [CrossRef] [Green Version]
  26. Li, X.; Li, D.; Xu, H.; Wu, C. Intercalibration between DMSP/OLS and VIIRS night-time light images to evaluate city light dynamics of Syria’s major human settlement during Syrian Civil War. Int. J. Remote Sens. 2017, 38, 5934–5951. [Google Scholar] [CrossRef]
  27. Zhang, L.; Sun, C. Modeling Population Density using a New Index Derived from Multi-Sensor Image Data. Remote Sens. 2019, 11, 2620. [Google Scholar] [CrossRef] [Green Version]
  28. Shi, K.; Yu, B.; Hu, Y.; Huang, C.; Chen, Y.; Huang, Y.; Chen, Z.; Wu, J. Modeling and mapping total freight traffic in China using NPP-VIIRS nighttime light composite data. GISci. Remote Sens. 2015, 52, 274–289. [Google Scholar] [CrossRef]
  29. Zhang, P.; Liu, S.; Du, J. A Map Spectrum-Based Spatiotemporal Clustering Method for GDP Variation Pattern Analysis Using Nighttime Light Images of the Wuhan Urban Agglomeration. ISPRS Int. J. Geo Inf. 2017, 6, 160. [Google Scholar] [CrossRef] [Green Version]
  30. Zhou, Y.; Ma, T.; Zhou, C.; Xu, T. Nighttime Light Derived Assessment of Regional Inequality of Socioeconomic Development in China. Remote Sens. 2015, 7, 1242–1262. [Google Scholar] [CrossRef] [Green Version]
  31. Ou, J.; Liu, X.; Li, X.; Li, M.; Li, W. Evaluation of NPP-VIIRS Nighttime Light Data for Mapping Global Fossil Fuel Combustion CO2 Emissions: A Comparison with DMSP-OLS Nighttime Light Data. PLoS ONE 2015, 10, e0138310. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  32. Li, X.; Xu, H.; Chen, X.; Li, C. Potential of NPP-VIIRS Nighttime Light Imagery for Modeling the Regional Economy of China. Remote Sens. 2013, 5, 3057–3081. [Google Scholar] [CrossRef] [Green Version]
  33. Shi, K.; Yu, B.; Huang, Y.; Hu, Y.; Yin, B.; Chen, Z.; Chen, L.; Wu, J. Evaluating the Ability of NPP-VIIRS Nighttime Light Data to Estimate the Gross Domestic Product and the Electric Power Consumption of China at Multiple Scales: A Comparison with DMSP-OLS Data. Remote Sens. 2014, 6, 1705–1724. [Google Scholar] [CrossRef] [Green Version]
  34. Zhao, M.; Cheng, W.; Zhou, C.; Li, M.; Wang, N.; Liu, Q. GDP Spatialization and Economic Differences in South China Based on NPP-VIIRS Nighttime Light Imagery. Remote Sens. 2017, 9, 673. [Google Scholar] [CrossRef] [Green Version]
  35. Chen, Z.; Yu, B.; Ta, N.; Shi, K.; Yang, C.; Wang, C.; Zhao, X.; Deng, S.; Wu, J. Delineating Seasonal Relationships Between Suomi NPP-VIIRS Nighttime Light and Human Activity Across Shanghai, China. IEEE J. Sel. Top. Appl. Earth Observ. 2019, 12, 4275–4283. [Google Scholar] [CrossRef]
  36. Li, X.; Ma, R.; Zhang, Q.; Li, D.; Liu, S.; He, T.; Zhao, L. Anisotropic characteristic of artificial light at night—Systematic investigation with VIIRS DNB multi-temporal observations. Remote Sens. Environ. 2019, 233, 111357. [Google Scholar] [CrossRef]
  37. Liu, S.; Li, X.; Levin, N.; Jendryke, M. Tracing cultural festival patterns using time-series of VIIRS monthly products. Remote Sens. Lett. 2019, 10, 1172–1181. [Google Scholar] [CrossRef]
  38. Griffith, D.A. Some robustness assessments of Moran eigenvector spatial filtering. Spat. Stat. 2017, 22, 155–179. [Google Scholar] [CrossRef]
  39. Chun, Y.; Griffith, D.A. A quality assessment of eigenvector spatial filtering based parameter estimates for the normal probability model. Spat. Stat. 2014, 10, 1–11. [Google Scholar] [CrossRef]
  40. Fang, T.; Chen, Y.; Tan, H.; Cao, J.; Liao, J.; Huang, L. Flood Risk Evaluation in the Middle Reaches of the Yangtze River Based on Eigenvector Spatial Filtering Poisson Regression. Water 2019, 11, 1969. [Google Scholar] [CrossRef] [Green Version]
  41. Yang, J.; Chen, Y.; Chen, M.; Yang, F.; Yao, M. A Segmented Processing Approach of Eigenvector Spatial Filtering Regression for Normalized Difference Vegetation Index in Central China. ISPRS Int. J. Geo Inf. 2018, 7, 330. [Google Scholar] [CrossRef] [Green Version]
  42. Li, H.; Chen, Y.; Deng, S.; Chen, M.; Fang, T.; Tan, H. Eigenvector Spatial Filtering-Based Logistic Regression for Landslide Susceptibility Assessment. ISPRS Int. J. Geo Inf. 2019, 8, 332. [Google Scholar] [CrossRef] [Green Version]
  43. Zhang, J.; Li, B.; Chen, Y.; Chen, M.; Fang, T.; Liu, Y. Eigenvector Spatial Filtering Regression Modeling of Ground PM2.5 Concentrations Using Remotely Sensed Data. Int. J. Environ. Res. Public Health 2018, 15, 1228. [Google Scholar] [CrossRef] [PubMed] [Green Version]
  44. Griffith, D.A.; Paelinck, J.H.P. Spatial Filter Versus Conventional Spatial Model Specifications: Some Comparisons. In Non-Standard Spatial Statistics and Spatial Econometrics; Springer: New York, NY, USA, 2011; pp. 117–149. [Google Scholar]
  45. Chun, Y. Analyzing Space-Time Crime Incidents Using Eigenvector Spatial Filtering: An Application to Vehicle Burglary. Geogr. Anal. 2014, 46, 165–184. [Google Scholar] [CrossRef]
  46. Chun, Y.; Griffith, D.A.; Lee, M.; Sinha, P. Eigenvector selection with stepwise regression techniques to construct eigenvector spatial filters. J. Geogr. Syst. 2016, 18, 67–85. [Google Scholar] [CrossRef]
  47. Bennett, M.M.; Smith, L.C. Advances in using multitemporal night-time lights satellite imagery to detect, estimate, and monitor socioeconomic dynamics. Remote Sens. Environ. 2017, 192, 176–197. [Google Scholar] [CrossRef]
Figure 1. Study area and nighttime light in 2013.
Figure 1. Study area and nighttime light in 2013.
Remotesensing 12 00839 g001
Figure 2. 2013 OpenStreetMap (OSM) road network.
Figure 2. 2013 OpenStreetMap (OSM) road network.
Remotesensing 12 00839 g002
Figure 3. China prefecture GDP map in 2013: (a) original GDP distribution. (b) OLS fitted GDP distribution. (c) SEM fitted GDP distribution. (d) ESF regression fitted GDP distribution.
Figure 3. China prefecture GDP map in 2013: (a) original GDP distribution. (b) OLS fitted GDP distribution. (c) SEM fitted GDP distribution. (d) ESF regression fitted GDP distribution.
Remotesensing 12 00839 g003
Figure 4. China prefecture GDP map in 2015: (a) original GDP distribution. (b) OLS fitted GDP distribution. (c) SEM fitted GDP distribution. (d) ESF regression fitted GDP distribution.
Figure 4. China prefecture GDP map in 2015: (a) original GDP distribution. (b) OLS fitted GDP distribution. (c) SEM fitted GDP distribution. (d) ESF regression fitted GDP distribution.
Remotesensing 12 00839 g004
Figure 5. China prefecture GDP map in 2017: (a) original GDP distribution. (b) OLS fitted GDP distribution. (c) SEM fitted GDP distribution. (d) ESF regression fitted GDP distribution.
Figure 5. China prefecture GDP map in 2017: (a) original GDP distribution. (b) OLS fitted GDP distribution. (c) SEM fitted GDP distribution. (d) ESF regression fitted GDP distribution.
Remotesensing 12 00839 g005
Figure 6. 3-year mean values of the absolute value for the regional relative error: (a) The OLS mean relative errors. (b) SEM mean relative errors. (c) ESF mean relative errors.
Figure 6. 3-year mean values of the absolute value for the regional relative error: (a) The OLS mean relative errors. (b) SEM mean relative errors. (c) ESF mean relative errors.
Remotesensing 12 00839 g006
Table 1. 2013 summary.
Table 1. 2013 summary.
Metrics.GDPPermanentMEANRoad
( 10 8   yuan ) ( 10 4   person ) ( nanoWatts · cm 2 · sr 1 ) ( km )
Min18.607.430.0856.00
Max21,602.122415.1018.0021,909.20
Mean1898.11387.711.152684.00
Table 2. 2015 summary.
Table 2. 2015 summary.
MetricsGDPPermanentMEANRoad
( 10 8   yuan ) ( 10 4   person ) ( nanoWatts · cm 2 · sr 1 ) ( km )
Min20.957.680.07310.42
Max25,123.453016.5518.2526,101.33
Mean2158.45396.541.164689.02
Table 3. 2017 summary.
Table 3. 2017 summary.
MetricsGDPPermanentMEANRoad
( 10 8   yuan ) ( 10 4   person ) ( nanoWatts · cm 2 · sr 1 ) ( km )
Min25.517.680.24366.10
Max30,632.993075.1618.2641,192.7
Mean2504.03401.811.446744.2
Table 4. Nighttime light (NTL) image thresholds for reference cities in 2013, 2015, and 2017.
Table 4. Nighttime light (NTL) image thresholds for reference cities in 2013, 2015, and 2017.
Year201320152017
Threshold276.13264.69275.56
CityBeijingBeijingGuangzhou
Table 5. Pearson correlation coefficients between the GDP and other covariates.
Table 5. Pearson correlation coefficients between the GDP and other covariates.
TimePermanentMEANRoad
20130.82 ***0.70 ***0.73 ***
20150.84 ***0.70 ***0.69 ***
20170.83 ***0.70 ***0.68 ***
*** Significant at α   =   0.001 level (two-tailed).
Table 6. The VIF values for the ordinary least squares (OLS) regression models.
Table 6. The VIF values for the ordinary least squares (OLS) regression models.
201320152017
Permanent1.441.521.84
MEAN1.251.271.31
Road1.201.241.46
Table 7. The VIF values for the eigenvector spatial filtering (ESF) regression models.
Table 7. The VIF values for the eigenvector spatial filtering (ESF) regression models.
201320152017
Permanent5.333.636.07
MEAN2.932.252.41
Road3.602.803.79
EVs1.941.722.07
Table 8. Model performance in 2013.
Table 8. Model performance in 2013.
R 2 Adjusted   R 2 AICRMSEMAREMorans’ I
OLS0.870.87366.13998.3332.10%0.35
SEM0.89 (pseudo)/285.75915.0326.62%−0.02
ESF0.980.9628.49336.1111.83%−0.06
Table 9. Model performance in 2015.
Table 9. Model performance in 2015.
R 2 Adjusted   R 2 AICRMSEMAREMorans’ I
OLS0.880.87363.021043.0533.06%0.29
SEM0.89 (pseudo)/298.27995.3428.40%−0.01
ESF0.980.9632.66470.3712.05%−0.05
Table 10. Model performance in 2017.
Table 10. Model performance in 2017.
R 2 Adjusted   R 2 AICRMSEMAREMorans’ I
OLS0.850.85436.391367.5139.12%0.32
SEM0.88 (pseudo)/369.301233.5032.12%−0.02
ESF0.980.9675.42583.7513.83%−0.06
Table 11. Leave-one-out cross validation (LOOCV) RMSE of the OLS and ESF regression models.
Table 11. Leave-one-out cross validation (LOOCV) RMSE of the OLS and ESF regression models.
201320152017
OLS1023.641072.291399.83
ESF625.54813.391023.89
Table 12. LOOCV MARE of the OLS and ESF regression models.
Table 12. LOOCV MARE of the OLS and ESF regression models.
201320152017
OLS32.60%33.59%39.71%
ESF21.92%21.57%23.92%
Table 13. Morans’ I of China prefecture-level GDP.
Table 13. Morans’ I of China prefecture-level GDP.
Year201320152017
Morans’ I0.260.250.23
p-value2.63 × 10−152.66 × 10−149.22 × 10−13
Table 14. Morans’ I of China prefecture-level population.
Table 14. Morans’ I of China prefecture-level population.
Year201320152017
Morans’ I0.280.240.23
p-value2.2 × 10−164.77 × 10−131.48 × 10−12

Share and Cite

MDPI and ACS Style

Cao, J.; Chen, Y.; Wilson, J.P.; Tan, H.; Yang, J.; Xu, Z. Modeling China’s Prefecture-Level Economy Using VIIRS Imagery and Spatial Methods. Remote Sens. 2020, 12, 839. https://doi.org/10.3390/rs12050839

AMA Style

Cao J, Chen Y, Wilson JP, Tan H, Yang J, Xu Z. Modeling China’s Prefecture-Level Economy Using VIIRS Imagery and Spatial Methods. Remote Sensing. 2020; 12(5):839. https://doi.org/10.3390/rs12050839

Chicago/Turabian Style

Cao, Jiping, Yumin Chen, John P. Wilson, Huangyuan Tan, Jiaxin Yang, and Zhiqiang Xu. 2020. "Modeling China’s Prefecture-Level Economy Using VIIRS Imagery and Spatial Methods" Remote Sensing 12, no. 5: 839. https://doi.org/10.3390/rs12050839

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop