An Assessment of Electric Power Consumption Using Random Forest and Transferable Deep Model with Multi-Source Data
Abstract
:1. Introduction
2. Study Area and Data
2.1. Study Area
2.2. Data Source
2.2.1. GIS Data and Statistics
- Administrative boundaries. Shenzhen administrative boundaries were used for two reasons. First, they restricted the spatial scope of the data used, and second, they stratified the data, which ensured that features from the raster dataset were on a sub-district level administrative district (level 4 in China). The administrative unit exists as a polygon and is in the World Geodetic System 1984 (WGS 84) geographic coordinate system (EPSG:4326).
- Electric power consumption data. National Bureau of Statistics provides annual per capita power consumption based on power consumed by the population at the national level. Therefore, in this study, the EPC at sub-district scale was calculated by multiplying the annual per capita domestic electricity consumption and resident population. EPC statistics from 55 sub-districts were obtained for the training model. This data was used as a label for training the RFR model and was used to calibrate results for the EPC estimation. The number of electricity scales (unit: kWh) was used to present local electricity energy consumption levels. Figure 1a maps the sub-district level EPC covering Shenzhen in 2016.
- Housing price data. The house price data in Shenzhen is obtained from the Fang.com website (https://Fang.com/) (accessed on 10 February 2021), which is the largest online housing market website in China. The data is vector point format with longitude, latitude, and house price. An average total of 8519 sets of valid data are obtained after preprocessing every year.
2.2.2. Remote Sensing Imagery
- Nighttime light imagery. Nighttime light can reveal regional EPC to a certain extent [17]. Accordingly, nighttime light imagery serves as a critical data source for EPC estimation in this study. Currently, there is much multi-source nighttime light data available. However, considering the limitations of data quality, spatial resolution, and temporal continuity, Suomi NPP-VIIRS nighttime light data was selected. This data was obtained using a new generation of US polar-orbiting operating environment project satellite (National Polar-orbiting Operational Environmental Satellite System Preparatory Project, NPP). The VIIRS provided by NPP can receive nearly 22 bands of image data, among which the day/night band can identify even weak light sources. The spatial resolution of the nighttime light products is 500 m. Monthly Suomi NPP-VIIRS composite products were used in this study, which was a one-month interval from 2013 to 2019. Due to the increase in spatial resolution, noise in the original data was unavoidable. The noise here mainly refers to the light seen from the earth and not from a stable light source (such as light from flames and gas burning). Therefore, to ensure the best results, it was necessary to remove noise from the original data. The process used to do so has been described in detail in the data processing section.
- High-resolution satellite imagery. High-resolution satellite imagery contain rich semantic information about the building environment on the ground [18]. The urban landscape environment can also reflect the gap between the rich and poor, which can indirectly reflect electricity consumption [19]. Google Earth provides a rich source of high-resolution satellite imagery and is constantly updated. The high-resolution satellite imagery was utilized in our method to extract surface observation features [18]. Considering the research unit and the presence of noise in super-spatial resolution remote sensing data, the Google Static Maps covering Shenzhen city at a zoom level of 14 (spatial resolution of 17 m × 17 m) were used. Datasets can be downloaded via the Google Static Maps API (https://developers.google.com/maps/documentation/static-maps/intro?hl=zh-cn/) (accessed on 1 May 2021) with three multispectral bands (red, green, blue).
- Gridded population. WorldPop population data, which holds population density data from AfriPop, AsiaPop, and AmeriPop mapping and started in 2013 with a spatial resolution of 100 m (http://www.worldpop.org.uk/) (accessed on 1 May 2021). This open archive of spatial demographic datasets for Central and South America, Africa, and Asia is intended to support development, disaster response, and health applications.
3. Methodology
3.1. Data Processing
3.2. Feature Extraction from Multi-Source Data
3.2.1. Social-Economic and Population Variables Extraction
3.2.2. Landscape Features Extracted Using CNN-Based Transfer Learning
3.3. EPC Estimation Using RFR Model
4. Experimental Results
4.1. EPC Estimation at Pixel Scale
4.2. Accuracy Assessment for EPC Estimation
5. Discussion
5.1. Method Validation
5.2. Spatiotemporal Dynamics Analysis of EPC from 2013 to 2019
6. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Conflicts of Interest
References
- Liu, A.; Ye, Z. China statistical yearbook—2020. China Stat. Yearb. 2020, 4, 978–5037. [Google Scholar]
- United Nations. The Sustainable Development Goals Report 2018; Technical Report; United Nations: New York, NY, USA, 2018. [Google Scholar]
- Tso, G.K.; Yau, K.K. Predicting electricity energy consumption: A comparison of regression analysis, decision tree and neural networks. Energy 2007, 32, 1761–1768. [Google Scholar] [CrossRef]
- Huang, M.; He, Y.; Cen, H. Predictive analysis on electric-power supply and demand in China. Renew. Energy 2007, 32, 1165–1174. [Google Scholar] [CrossRef]
- Shi, K.; Chen, Y.; Yu, B.; Xu, T.; Yang, C.; Li, L.; Huang, C.; Chen, Z.; Liu, R.; Wu, J. Detecting spatiotemporal dynamics of global electric power consumption using DMSP-OLS nighttime stable light data. Appl. Energy 2016, 184, 450–463. [Google Scholar] [CrossRef]
- Stokes, E.C.; Seto, K.C. Characterizing urban infrastructural transitions for the Sustainable Development Goals using multi-temporal land, population, and nighttime light data. Remote. Sens. Environ. 2019, 234, 111430. [Google Scholar] [CrossRef]
- Letu, H.; Hara, M.; Yagi, H.; Naoki, K.; Tana, G.; Nishio, F.; Shuhei, O. Estimating energy consumption from night-time DMPS/OLS imagery after correcting for saturation effects. Int. J. Remote. Sens. 2010, 31, 4443–4458. [Google Scholar] [CrossRef]
- Sahoo, S.; Gupta, P.K.; Srivastav, S. Comparative analysis between VIIRS-DNB and DMSP-OLS night-time light data to estimate electric power consumption in Uttar Pradesh, India. Int. J. Remote. Sens. 2020, 41, 2565–2580. [Google Scholar] [CrossRef]
- Hu, T.; Huang, X. A novel locally adaptive method for modeling the spatiotemporal dynamics of global electric power consumption based on DMSP-OLS nighttime stable light data. Appl. Energy 2019, 240, 778–792. [Google Scholar] [CrossRef]
- Lu, L.; Weng, Q.; Xie, Y.; Guo, H.; Li, Q. An assessment of global electric power consumption using the Defense Meteorological Satellite Program-Operational Linescan System nighttime light imagery. Energy 2019, 189, 116351. [Google Scholar] [CrossRef]
- Sun, Y.; Wang, S.; Zhang, X.; Chan, T.O.; Wu, W. Estimating local-scale domestic electricity energy consumption using demographic, nighttime light imagery and Twitter data. Energy 2021, 226, 120351. [Google Scholar] [CrossRef]
- Wang, W.; Zhang, X.; Wu, Y.; Zhou, L.; Skitmore, M. Development priority zoning in China and its impact on urban growth management strategy. Cities 2017, 62, 1–9. [Google Scholar] [CrossRef] [Green Version]
- Wang, J.; Ren, Y.; Shen, L.; Liu, Z.; Wu, Y.; Shi, F. A novel evaluation method for urban infrastructures carrying capacity. Cities 2020, 105, 102846. [Google Scholar] [CrossRef]
- Jean, N.; Burke, M.; Xie, M.; Davis, W.M.; Lobell, D.B.; Ermon, S. Combining satellite imagery and machine learning to predict poverty. Science 2016, 353, 790–794. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Yeh, C.; Perez, A.; Driscoll, A.; Azzari, G.; Tang, Z.; Lobell, D.; Ermon, S.; Burke, M. Using publicly available satellite imagery and deep learning to understand economic well-being in Africa. Nat. Commun. 2020, 11, 1–11. [Google Scholar] [CrossRef]
- Chi, G.; Peiya, Z.; Junhui, X.; Youliang, X. Office for National Statistics. Local Authority Districts Boundaries Shenzhen 2020. [EB/OL]. 2020. Available online: http://pnr.sz.gov.cn/ywzy/chgl/index.html/ (accessed on 4 April 2020).
- Elvidge, C.D.; Baugh, K.E.; Kihn, E.A.; Kroehl, H.W.; Davis, E.R.; Davis, C.W. Relation between satellite observed visible-near infrared emissions, population, economic activity and electric power consumption. Int. J. Remote. Sens. 1997, 18, 1373–1379. [Google Scholar] [CrossRef]
- Malarvizhi, K.; Kumar, S.V.; Porchelvan, P. Use of high resolution Google Earth satellite imagery in landuse map preparation for urban related applications. Procedia Technol. 2016, 24, 1835–1842. [Google Scholar] [CrossRef]
- Zhao, F.; Ding, J.; Zhang, S.; Luan, G.; Song, L.; Peng, Z.; Du, Q.; Xie, Z. Estimating rural electric power consumption using NPP-VIIRS night-time light, toponym and POI data in ethnic minority areas of China. Remote. Sens. 2020, 12, 2836. [Google Scholar] [CrossRef]
- Hu, W.; Liu, C.; Zhan, Q. Synthetic method and comparison verification of NPP-VIIRS annual nighttime lighting data in China. J. Guilin. Univ. Technol 2020, 510, 1–10. [Google Scholar]
- Rousseeuw, P.J. Silhouettes: A graphical aid to the interpretation and validation of cluster analysis. J. Comput. Appl. Math. 1987, 20, 53–65. [Google Scholar] [CrossRef] [Green Version]
- Lan, F.; Gong, X.; Da, H.; Wen, H. How do population inflow and social infrastructure affect urban vitality? Evidence from 35 large- and medium-sized cities in China. Cities 2020, 100, 102454. [Google Scholar] [CrossRef]
- Liang, H.; Guo, Z.; Wu, J.; Chen, Z. GDP spatialization in Ningbo City based on NPP/VIIRS night-time light and auxiliary data using random forest regression. Adv. Space Res. 2020, 65, 481–493. [Google Scholar] [CrossRef]
- He, C.; Ma, Q.; Li, T.; Yang, Y.; Liu, Z. Spatiotemporal dynamics of electric power consumption in Chinese Mainland from 1995 to 2008 modeled using DMSP/OLS stable nighttime lights data. J. Geogr. Sci. 2012, 22, 125–136. [Google Scholar] [CrossRef]
- Xie, Y.; Weng, Q. Detecting urban-scale dynamics of electricity consumption at Chinese cities using time-series DMSP-OLS (Defense Meteorological Satellite Program-Operational Linescan System) nighttime light imageries. Energy 2016, 100, 177–189. [Google Scholar] [CrossRef]
- He, D.; Shi, Q.; Liu, X.; Zhong, Y.; Zhang, X. Deep subpixel mapping based on semantic information modulated network for urban land use mapping. IEEE Trans. Geosci. Remote. Sens. 2021, 59, 10628–10646. [Google Scholar] [CrossRef]
- He, D.; Shi, Q.; Liu, X.; Zhong, Y.; Zhang, L. Generating 2m fine-scale urban tree cover product over 34 metropolises in China based on deep context-aware sub-pixel mapping network. Int. J. Appl. Earth Obs. Geoinf. 2022, 106, 102667. [Google Scholar] [CrossRef]
- Shin, H.C.; Roth, H.R.; Gao, M.; Lu, L.; Xu, Z.; Nogues, I.; Yao, J.; Mollura, D.; Summers, R.M. Deep convolutional neural networks for computer-aided detection: CNN architectures, dataset characteristics and transfer learning. IEEE Trans. Med. Imaging 2016, 35, 1285–1298. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Xie, S.M.; Jean, N.; Burke, M.; Lobell, D.B.; Ermon, S. Transfer Learning from Deep Features for Remote Sensing and Poverty Mapping. In Proceedings of the Thirtieth AAAI Conference on Artificial Intelligence, Phoenix, AZ, USA, 12–17 February 2016; Schuurmans, D., Wellman, M.P., Eds.; AAAI Press: Palo Alto, CA, USA, 2016; pp. 3929–3935. [Google Scholar]
- Yosinski, J.; Clune, J.; Bengio, Y.; Lipson, H. How transferable are features in deep neural networks? Advances in Neural Information Processing Systems 27. In Proceedings of the Annual Conference on Neural Information Processing Systems 2014, Montreal, QC, Canada, 8–13 December 2014; Ghahramani, Z., Welling, M., Cortes, C., Lawrence, N.D., Weinberger, K.Q., Eds.; Curran Associates, Inc.: Red Hook, NY, USA, 2014; pp. 3320–3328. [Google Scholar]
- Zhou, B.; Khosla, A.; Lapedriza, A.; Torralba, A.; Oliva, A. Places: An image database for deep scene understanding. arXiv 2016, arXiv:1610.02055. [Google Scholar] [CrossRef]
- Simonyan, K.; Zisserman, A. Very Deep Convolutional Networks for Large-Scale Image Recognition. In Proceedings of the 3rd International Conference on Learning Representations, ICLR 2015, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- Yapıcı, M.M.; Tekerek, A.; Topaloğlu, N. Performance comparison of convolutional neural network models on GPU. In Proceedings of the 2019 IEEE 13th International Conference on Application of Information and Communication Technologies (AICT), Baku, Azerbaijan, 23–25 October 2019; pp. 1–4. [Google Scholar]
- Zhao, X.; Yu, B.; Liu, Y.; Chen, Z.; Li, Q.; Wang, C.; Wu, J. Estimation of Poverty Using Random Forest Regression with Multi-Source Data: A Case Study in Bangladesh. Remote Sens. 2019, 11, 375. [Google Scholar] [CrossRef] [Green Version]
- Breiman, L. Random Forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
- Ishwaran, H.; Lu, M. Standard errors and confidence intervals for variable importance in random forest regression, classification, and survival. Stat. Med. 2019, 38, 558–582. [Google Scholar] [CrossRef]
- Pedregosa, F.; Varoquaux, G.; Gramfort, A.; Michel, V.; Thirion, B.; Grisel, O.; Blondel, M.; Prettenhofer, P.; Weiss, R.; Dubourg, V.; et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 2011, 12, 2825–2830. [Google Scholar]
- Lerman, P. Fitting segmented regression models by grid search. J. R. Stat. Soc. Ser. 1980, 29, 77–84. [Google Scholar] [CrossRef]
- Wang, F.; Wang, Y.; Zhang, K.; Hu, M.; Weng, Q.; Zhang, H. Spatial heterogeneity modeling of water quality based on random forest regression and model interpretation. Environ. Res. 2021, 202, 111660. [Google Scholar] [CrossRef] [PubMed]
- Lin, B.; Omoju, O.E.; Okonkwo, J.U. Factors influencing renewable electricity consumption in China. Renew. Sustain. Energy Rev. 2016, 55, 687–696. [Google Scholar] [CrossRef]
- Shiu, A.; Lam, P.L. Electricity consumption and economic growth in China. Energy Policy 2004, 32, 47–54. [Google Scholar] [CrossRef]
- Zhang, C.; Su, B.; Zhou, K.; Yang, S. Analysis of electricity consumption in China (1990–2016) using index decomposition and decoupling approach. J. Clean. Prod. 2019, 209, 224–235. [Google Scholar] [CrossRef]
Data Description | Data Source | Spatial Resolution | Year Used |
---|---|---|---|
Monthly Suomi NPP-VIIRS nighttime lights | NOAA/NGDC (https://ngdc.noaa.gov/eog/viirs/(accessed on 10 December 2020)) | 500 m | 2013–2019 |
High-resolution satellite imagery (Level-14) | Google Static Maps API (http://www.google.cn/intl/zh-CN/earth/(accessed on 1 May 2021)) | 17 m | 2013–2019 |
WorldPop gridded population | WorldPop (https://www.worldpop.org/(accessed on 1 May 2021)) | 100 m | 2013–2019 |
Demographic at Sub-district level | National Bureau of Statistics (http://www.stats.gov.cn/(accessed on 10 May 2021)) | Sub-district level | 2013–2019 |
Per capita domestic electric power consumption | National Bureau of Statistics (http://www.stats.gov.cn/(accessed on 10 February 2021)) | National-level | 2013–2019 |
Shenzhen administrative boundaries | Geospatial Data Cloud (http://www.gscloud.cn/(accessed on 10 February 2021)) | – | 2015 |
Shenzhen house price | Fang.com website (http://Fang.com/(accessed on 10 February 2021)) | – | 2013–2019 |
Indicators | Feature Name | Description | Data Source |
---|---|---|---|
Economic variables | Nighttime light | The data value of nighttime light in each pixel; The mean data value of nighttime light in each sub-district. | Suomi NPP-VIIRS nighttime light data |
House price | The max housing price at pixel level and sub-district level, respectively. | Shenzhen house price | |
Population density | Population density | Population density at pixel level and sub-district level, respectively. | worldPop product |
Landscape features | FC0–FC24 | 25 principal components extracted from high-resolution satellite images at pixel level and sub-district level, respectively. | Google Static Maps |
Indicators | Correlation Coefficient | p-Values | Indicators | Correlation Coefficient | p-Values |
---|---|---|---|---|---|
FC0 | 0.164 * | 0.004 | FC14 | 0.107 * | 0.007 |
FC1 | −0.110 * | 0.030 | FC15 | 0.076 * | 0.039 |
FC2 | −0.153 * | 0.000 | FC16 | 0.218 * | 0.000 |
FC3 | 0.239 * | 0.005 | FC17 | 0.013 * | 0.005 |
FC4 | 0.076 * | 0.005 | FC18 | −0.228 * | 0.004 |
FC5 | 0.001 * | 0.001 | FC19 | −0.026 | 0.605 |
FC6 | −0.138 * | 0.007 | FC20 | −0.149 * | 0.003 |
FC7 | −0.326 * | 0.008 | FC21 | −0.116 * | 0.023 |
FC8 | 0.201 * | 0.000 | FC22 | −0.144 * | 0.005 |
FC9 | 0.119 * | 0.020 | FC23 | −0.207 * | 0.000 |
FC10 | 0.010 * | 0.008 | FC24 | −0.039 * | 0.005 |
FC11 | 0.204 * | 0.000 | Nighttime light | 0.264 * | 0.000 |
FC12 | 0.364 * | 0.009 | House price | 0.102 * | 0.000 |
FC13 | 0.312 * | 0.000 | Population density | 0.082 * | 0.004 |
Parameter Name | Description | Optimal Value |
---|---|---|
N_estimators | Number of trees | 289 |
Max_depth | Maximum depth of the tree | 22 |
Min_samples_split | Minimum number of samples required to split an internal node | 2 |
Min_samples_leaf | Minimum number of samples required to be at a leaf node | 2 |
Model | RMSE | MAE | RSD | |
---|---|---|---|---|
Shi’s | 72,178.27 | 52,149.42 | 0.46 | 87.30% |
Ours | 4357.10 | 3250.34 | 0.82 | 63.15% |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Cheng, L.; Feng, R.; Wang, L.; Yan, J.; Liang, D. An Assessment of Electric Power Consumption Using Random Forest and Transferable Deep Model with Multi-Source Data. Remote Sens. 2022, 14, 1469. https://doi.org/10.3390/rs14061469
Cheng L, Feng R, Wang L, Yan J, Liang D. An Assessment of Electric Power Consumption Using Random Forest and Transferable Deep Model with Multi-Source Data. Remote Sensing. 2022; 14(6):1469. https://doi.org/10.3390/rs14061469
Chicago/Turabian StyleCheng, Luxiao, Ruyi Feng, Lizhe Wang, Jining Yan, and Dong Liang. 2022. "An Assessment of Electric Power Consumption Using Random Forest and Transferable Deep Model with Multi-Source Data" Remote Sensing 14, no. 6: 1469. https://doi.org/10.3390/rs14061469
APA StyleCheng, L., Feng, R., Wang, L., Yan, J., & Liang, D. (2022). An Assessment of Electric Power Consumption Using Random Forest and Transferable Deep Model with Multi-Source Data. Remote Sensing, 14(6), 1469. https://doi.org/10.3390/rs14061469