Park Development, Potential Measurement, and Site Selection Study Based on Interpretable Machine Learning—A Case Study of Shenzhen City, China
Abstract
:1. Introduction
2. Materials and Methods
2.1. Study Area
2.2. Multi-Source Urban Data
2.3. Data Preprocessing
2.4. Feature Matrix Construction
2.5. Methodology
2.5.1. Geographically Weighted Regression
2.5.2. The Random Forest
2.5.3. The Geographically Weighted Random Forest
2.5.4. Shapley Additive Explanations
3. Results
3.1. Characterization of the Spatial Differentiation of the Current Situation
3.1.1. Differentiation of Natural Landscape Elements
3.1.2. Differentiation of Economic Activity Variations
3.2. Measurement of Park Development Potential
3.2.1. Model Training and Evaluation
3.2.2. Development Potential Measurement Results
3.2.3. Interpretability Analysis
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Jamei, E.; Rajagopalan, P.; Seyedmahmoudian, M.; Jamei, Y. Review on the impact of urban geometry and pedestrian level greening on outdoor thermal comfort. Renew. Sustain. Energy Rev. 2016, 54, 1002–1017. [Google Scholar] [CrossRef]
- Liao, W.; Guldmann, J.M.; Hu, L.; Cao, Q.; Gan, D.; Li, X. Linking urban park cool island effects to the landscape patterns inside and outside the park: A simultaneous equation modeling approach. Landsc. Urban Plan. 2023, 232, 104681. [Google Scholar] [CrossRef]
- Lai, Y.; Lu, Y.; Ding, T.; Sun, H.; Li, X.; Ge, X. Effects of Low-Impact Development Facilities (Water Systems of the Park) on Stormwater Runoff in Shallow Mountainous Areas Based on Dual-Model (SWMM and MIKE21) Simulations. Int. J. Environ. Res. Public Health 2022, 19, 14349. [Google Scholar] [CrossRef] [PubMed]
- Browning, M.H.E.M.; Rigolon, A.; McAnirlin, O. Where greenspace matters most: A systematic review of urbanicity, greenspace, and physical health. Landsc. Urban Plan. 2022, 217, 104233. [Google Scholar] [CrossRef]
- Jennings, V.; Bamkole, O. The relationship between social cohesion and urban green space: An avenue for health promotion. Int. J. Environ. Res. Public Health 2019, 16, 452. [Google Scholar] [CrossRef]
- Hussain, T.; Abbas, J.; Zou, W.; Ahmad, S.; Bi, X.; Zhu, G. Impact of urban village disamenity on neighboring residential properties: Empirical evidence from Nanjing through hedonic pricing model appraisal. J. Urban Plan. Dev. 2021, 147, 04020055. [Google Scholar] [CrossRef]
- Brandeau, M.L.; Chiu, S.S. An overview of representative problems in location research. Manag. Sci. 1989, 35, 645–674. [Google Scholar] [CrossRef]
- Yan, S.; Liang, L.; Yu, R.; Wang, W. The Indicator System for the Suitability Evaluation of Theme Park Construction in Urban Areas in China. Sci. Geogr. Sin. 2016, 36, 213–221. [Google Scholar]
- Xiao, Z.; Liu, P.; Liu, Y.; Hu, F.; Peng, H.; Zhang, Z.; Gao, P. Evaluation and Selection of Potential Scenic View-point Distribution Areas Based on GIS and Fuzzy Analytic Hierarchy Process—An Example from Jiulangshan Mountain Park in Zhuzhou. For. Resour. Manag. 2020, 01, 158–165. [Google Scholar]
- Xia, C.; Zhai, G. Research on Dynamic Mechanism of Territorial Space Evolution in Yangtze River Delta Based on GBDT Machine Learning Algorithm. Urban Dev. Stud. 2022, 29, 12–19. [Google Scholar]
- Hou, Y.; Wang, Q.; Zhou, K.; Zhang, L.; Tan, T. Integrated machine learning methods with oversampling technique for regional suitability prediction of waste-to-energy incineration projects. Waste Manag. 2024, 174, 251–262. [Google Scholar] [CrossRef]
- Yousefi, S.; Avand, M.; Yariyan, P.; Goujani, H.J.; Costache, R.; Tavangar, S.; Tiefenbacher, J.P. Identification of the most suitable afforestation sites by Juniperus excels specie using machine learning models: Firuzkuh semi-arid region, Iran. Ecol. Inform. 2021, 65, 101427. [Google Scholar] [CrossRef]
- Hu, Y.; Wen, W.; Liu, Y.; Zheng, H. Location Optimization of Urban Express Pickup Service Facilities Based on GAN. J. Tianjin Univ. Sci. Technol. 2022, 55, 1237–1248. [Google Scholar]
- Zaheer, N.; Hassan, S.U.; Ali, M.; Shabbir, M. Optimal school site selection in Urban areas using deep neural networks. J. Ambient. Intell. Humaniz. Comput. 2022, 13, 1–15. [Google Scholar] [CrossRef]
- Awadh, M.A.; Mallick, J. A decision-making framework for landfill site selection in Saudi Arabia using explainable artificial intelligence and multi-criteria analysis. Environ. Technol. Innov. 2024, 33, 103464. [Google Scholar] [CrossRef]
- Rao, S.; Mehta, S.; Kulkarni, S.; Dalvi, H.; Katre, N.; Narvekar, M. A Study of LIME and SHAP Model Explainers for Autonomous Disease Predictions. In Proceedings of the 2022 IEEE Bombay Section Signature Conference (IBSSC), Mumbai, India, 8–10 December 2022; pp. 1–6. [Google Scholar]
- Wang, X.; Xiong, F.; Wang, Z.; Chen, S.; Zhang, Z.; Peng, L. Planning and Layout of Facilities for the Elders Based on POI and Machine Learning: A Case Study of Wuhan. Econ. Geogr. 2021, 41, 49–56. [Google Scholar]
- Huang, J.; Lin, Y.; Tao, Z.; Yang, J. Evaluation of park green space accessibility in Shenzhen from the perspective of social equity. Sci. Geogr. Sin. 2022, 42, 896–906. [Google Scholar]
- Zhang, G.; Zhao, D.; Sun, L.; Li, X. The analysis of spatial suitability and multi-objective optimal location of Urtan Pocket Park. Sci. Surv. Mapp. 2022, 47, 224–234. [Google Scholar]
- Huang, Q.; Yang, B.; Xu, X.; Hao, H.; Liang, L.; Wang, M. Location selection and prediction of SexyTea store in Changsha City based on multisource spatial data and random forest model. J. Geo-Inf. Sci. 2022, 24, 723–737. [Google Scholar]
- Mei, Z.; Zhang, Y.; Huang, X.; Liu, Z. Ecological risk assessment and its influencing factors of spatial heterogeneity based on ecosystem services in Qinghai Lake Basin. Acta Ecol. Sin. 2024, 44, 4973–4986. [Google Scholar]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef]
- Georganos, S.; Grippa, T.; Niang Gadiaga, A.; Linard, C.; Lennert, M.; Vanhuysse, S.; Mboga, N.; Wolff, E.; Kalogirou, S. Geographical random forests: A spatial extension of the random forest algorithm to address spatial heterogeneity in remote sensing and population modelling. Geocarto Int. 2021, 36, 121–136. [Google Scholar] [CrossRef]
- Belgiu, M.; Drăguţ, L. Random forest in remote sensing: A review of applications and future directions. ISPRS J. Photogramm. Remote Sens. 2016, 114, 24–31. [Google Scholar] [CrossRef]
- Mangalathu, S.; Hwang, S.H.; Jeon, J.S. Failure mode and effects analysis of RC members based on machine-learning-based SHapley Additive exPlanations (SHAP) approach. Eng. Struct. 2020, 219, 110927. [Google Scholar] [CrossRef]
- Chen, Y.; Deng, M.; Wang, C. Spatial-temporal Differentiation and Influencing Mechanism of Accessibility of Park Green Spaces: A Case Study Based on Shenzhen City. South Archit. 2023, 7, 66–76. [Google Scholar]
Datatypes | Year of Data | Source of Data | Description of Data |
---|---|---|---|
POI, AOI data | 2024 | Baidu Map API Data Open Interface (https://lbsyun.baidu.com/) | Obtains 14 types of POI, and park AOI data based on the Python 3.10 crawler |
Administrative zoning data | 2024 | Shenzhen Planning and Natural Resources Bureau (https://pnr.sz.gov.cn/) | Obtains the study area through Shenzhen cadastral map to determine the study scope |
Road network, water system | 2024 | OpenStreetMap (http://www.openstreetmap.org/) | Reflects the traffic and water system conditions, and calculates the relevant characterization data. |
House price data | 2024 | Fang (https://www1.fang.com/) | Obtains 452 new house prices in Shenzhen based on Python crawler. |
Street index data | 2020 | Shenzhen Government Data Open Platform (https://opendata.sz.gov.cn/) | Obtains traffic congestion, air pollution, environmental health, and street security index of 74 streets in Shenzhen through an open data platform. |
Light intensity at night data | 2020 | Earth Observation Group (https://eogdata.mines.edu/products/vnl/) URL (accessed on 20 April 2024) | Reflects the regional economic situation, related to construction and development |
Population data | 2020 | WorldPop (https://www.worldpop.org/) Shenzhen Government Data Open Platform (https://tjj.sz.gov.cn/) | Reflects the population distribution, and corrects the WorldPop data by using 7P data to get the final population distribution data. |
Surface Cover Data | 2020 | National Geographic Information Resources Catalogue Service System (https://www.webmap.cn/) | Reflects the surface coverage, which is related to spatial construction and development. |
DEM data | 2020 | Geospatial data cloud (https://www.gscloud.cn/home) | Reflects the geographical altitude, which can be used to calculate slope and direction of the slope. |
FVC Vegetation Cover Data | 2020 | National Tibetan Plateau Science Data Centre (https://data.tpdc.ac.cn/home) | Characterizes the degree of vegetation cover, which is related to the regional landscape environment. |
NPP | 2020 | Google earth engine (https://lpdaac.usgs.gov/) | Reflects the ecological restoration capacity of the region, and related plant abundance. |
No. | Classifications | Number of POI | No. | Classifications | Number of POI |
---|---|---|---|---|---|
1 | Dining Facilities | 75,029 | 8 | Science, Education, and Culture | 26,730 |
2 | Accommodation | 12,644 | 9 | Healthcare | 20,326 |
3 | Sports and Leisure | 17,106 | 10 | Government Offices | 15,388 |
4 | Shopping Services | 4661 | 11 | Corporations | 141,861 |
5 | Financial Services | 14,164 | 12 | Real Estate | 27,480 |
6 | Park Facilities | 617 | 13 | Transportation | 37,535 |
7 | Tourist Attractions | 1897 | 14 | Life Service | 42,149 |
Factors | Feature Construction | Characterization |
---|---|---|
Natural background | Elevation | Impact on human activities, closely related to natural country parks |
Slope | Impacts on vegetation growth and human activities | |
Direction of slope | Impact on natural landscape patterns | |
Percentage of waters | Impacts on human activities, bodies of water restrict the movement of people | |
Nature of land | Impacts on development and construction, with differences in environmental carrying capacity | |
Socio-economic | Population density | Reflecting the degree of population concentration; the higher the population density, the greater the demand for parks |
House price level | Relevant to the income of the population, reflecting the level of social development | |
Light intensity at night | Reflects the regional economy and is relevant to construction activities | |
Surrounding facilities | Index of facilities | Reflects construction of regional facilities |
Traffic location | Road density | Reflects accessibility; the higher the density, the more conducive to development |
Traffic congestion index | Reflects the level of traffic congestion on the street | |
Distance from road | Reflects transportation accessibility; the closer the distance, the more accessible it is | |
Distance from water | Reflects ease of access to water sources | |
Ecological environment | Air pollution index | Use of PM2.5 data to reflect air pollution levels |
Environmental health index | Reflects the street environment and relates to park development potential | |
FVC Vegetation cover | Degree of vegetation cover, positively correlated with park development potential | |
NPP Net Primary Productivity of Vegetation | Reflecting regional ecological restoration capacity | |
Space security | Street security index | Reflects street safety and relates to crowd activity |
No. | Characteristic Factor | VIF | No. | Characteristic Factor | VIF |
---|---|---|---|---|---|
1 | Percentage of waters | 1.153010 | 16 | Accommodation POI density | 5.651371 |
2 | Population density | 5.647349 | 17 | Corporations POI density | 7.943229 |
3 | Light intensity at night | 2.348380 | 18 | Road density | 2.143956 |
4 | Elevation | 3.243215 | 19 | FVC Vegetation cover | 2.942994 |
5 | Life Service POI density | 73.027483 | 20 | House price level | 2.388707 |
6 | Tourist Attractions POI density | 1.680275 | 21 | Slope | 3.827688 |
7 | Transportation POI density | 20.176181 | 22 | Direction of slope | 1.274712 |
8 | Real Estate POI density | 10.875558 | 23 | Street security index | 4.243193 |
9 | Government Offices POI density | 11.506439 | 24 | Environmental health index | 7.218214 |
10 | Healthcare POI density | 45.646812 | 25 | Air pollution index | 3.190530 |
11 | Science, Education and Culture POI density | 18.147154 | 26 | Distance from road | 1.244019 |
12 | Financial Services POI density | 5.840311 | 27 | Nature of land | 1.172999 |
13 | Shopping Services POI density | 19.976599 | 28 | Traffic congestion index | 4.643790 |
14 | Sports and Leisure POI density | 28.988934 | 29 | Distance from water | 1.556129 |
15 | Dining Facilities POI density | 41.559060 | 30 | NPP Net Primary Productivity of Vegetation | 1.565043 |
Metrics | Formula | RF | GWRF | Interpretation | |
---|---|---|---|---|---|
Average accuracy | (5) | 0.83 | 0.87 | where TP is the true case, TN is the true negative case, FP is the false positive case and FN is the false negative case | |
Average precision | (6) | 0.82 | 0.85 | ||
Average recall | (7) | 0.47 | 0.60 | ||
Average F1 score | (8) | 0.60 | 0.70 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Published by MDPI on behalf of the International Society for Photogrammetry and Remote Sensing. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, H.; He, L. Park Development, Potential Measurement, and Site Selection Study Based on Interpretable Machine Learning—A Case Study of Shenzhen City, China. ISPRS Int. J. Geo-Inf. 2025, 14, 184. https://doi.org/10.3390/ijgi14050184
Li H, He L. Park Development, Potential Measurement, and Site Selection Study Based on Interpretable Machine Learning—A Case Study of Shenzhen City, China. ISPRS International Journal of Geo-Information. 2025; 14(5):184. https://doi.org/10.3390/ijgi14050184
Chicago/Turabian StyleLi, Haihong, and Li He. 2025. "Park Development, Potential Measurement, and Site Selection Study Based on Interpretable Machine Learning—A Case Study of Shenzhen City, China" ISPRS International Journal of Geo-Information 14, no. 5: 184. https://doi.org/10.3390/ijgi14050184
APA StyleLi, H., & He, L. (2025). Park Development, Potential Measurement, and Site Selection Study Based on Interpretable Machine Learning—A Case Study of Shenzhen City, China. ISPRS International Journal of Geo-Information, 14(5), 184. https://doi.org/10.3390/ijgi14050184