Landscape Drivers of Trail Formation in Peri-Urban Mountains: Insights from an Explainable Machine Learning Approach
Abstract
1. Introduction
- (1)
- How do key environmental variables nonlinearly structure spatial decision-making in wilderness contexts? Specifically, do identifiable threshold effects give rise to physical exclusion zones or landscape-mediated compensation mechanisms?
- (2)
- How does the integration of continuous spatial representations with ensemble learning algorithms alter the model’s capacity to approximate underlying response functions and improve generalization across heterogeneous terrains?
- (3)
- Under varying environmental pressures, how do distinct behavioral regimes emerge, and what structural differences characterize topography-dominated versus resource-dependent decision logics?
2. Materials and Methods
2.1. Study Area
2.2. Variable Construction
2.2.1. Explanatory Variables
2.2.2. Dependent Variables
2.3. Analytical Framework
2.3.1. Research Workflow
2.3.2. Predictive Model and Interpretability Framework
3. Results
3.1. Correlation
3.2. Models Comparison
3.3. SHAP
4. Discussion
4.1. Theoretical Implications with Previous Similar Studies
4.2. Methodological Contribution of the Research
4.3. Planning Implications
5. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| Abbreviation | Full Form/Description |
| RF | Random Forest |
| XGBoost | eXtreme Gradient Boosting |
| LightGBM | Light Gradient Boosting Machine |
| GBDT | Gradient Boosted Decision Trees |
| SHAP | SHapley Additive exPlanations |
| DEM | Digital Elevation Model |
| Slope | Slope |
| Prec | Annual Precipitation |
| Temp | Annual Mean Temperature |
| Water_Dist | Distance to Water Systems |
| NDVI | Normalized Difference Vegetation Index |
| NDVI_Max | NDVI Annual Maximum |
| NDVI_Min | NDVI Annual Minimum |
| NDVI_Mean | NDVI Annual Mean |
| NDVI_Range | NDVI Annual Range |
| NTL | Nighttime Light |
| NTL_Max | NTL Annual Maximum |
| NTL_Min | NTL Annual Minimum |
| NTL_Mean | NTL Annual Mean |
| NTL_Range | NTL Annual Range |
| Trail_Dens | Hiking Road Density |
| Trail_Dist | Distance to Hiking Trails |
| R2 | R-squared (Coefficient of Determination) |
| RMSE | Root Mean Square Error |
| MAE | Mean Absolute Error |
| OLS | Ordinary Least Squares |
| GPS | Global Positioning System |
| GIS | Geographic Information System |
| 2bulu | Two-step Road (Chinese outdoor hiking platform) |
Appendix A


References
- Bichler, B.; Peters, M. Soft adventure motivation: An exploratory study of hiking tourism. Tour. Rev. 2020, 76, 473–488. [Google Scholar] [CrossRef]
- Ahmed, Z.; Nihei, T. Assessing the environmental impacts of adventure tourism in the world’s highest mountains: A comprehensive review for promoting sustainable tourism in high-altitude areas. J. Adv. Res. Soc. Sci. Humanit. 2024, 9, 1–15. [Google Scholar] [CrossRef]
- Winter, P.L.; Selin, S.; Cerveny, L.; Bricker, K. Outdoor recreation, nature-based tourism, and sustainability. Sustainability 2019, 12, 81. [Google Scholar] [CrossRef]
- Nordin, M.R.; Jamal, S.A. Hiking tourism in Malaysia: Origins, benefits and post COVID-19 transformations. Int. J. Acad. Res. Bus. Soc. Sci. 2021, 11, 88–100. [Google Scholar] [CrossRef] [PubMed]
- Duc, M.D.; Duy, L.T.; Thi Thanh, T.N.; Le Minh, T. Human–nature relations in the Anthropocene: Responsible hiking and ecological balance in Thiềng Liềng. J. Outdoor Recreat. Tour. 2025, 51, 100926. [Google Scholar] [CrossRef]
- Cooke, M.T.; Xia, L. Impacts of land-based recreation on water quality. Nat. Areas J. 2020, 40, 179–188. [Google Scholar] [CrossRef]
- Cole, D.N. Impacts of hiking and camping on soils and vegetation: A review. In Environmental Impacts of Ecotourism; CABI: Wallingford, UK, 2004; pp. 41–60. [Google Scholar] [CrossRef]
- Ballantyne, M.; Gudes, O.; Pickering, C.M. Recreational trails are an important cause of fragmentation in endangered urban forests: A case-study from Australia. Landsc. Urban Plan. 2014, 130, 112–124. [Google Scholar] [CrossRef]
- Wolf, I.D.; Croft, D.B. Impacts of tourism hotspots on vegetation communities show a higher potential for self-propagation along roads than hiking trails. J. Environ. Manag. 2014, 143, 173–185. [Google Scholar] [CrossRef]
- Liedtke, R.; Barros, A.; Essl, F.; Lembrechts, J.J.; Wedegärtner, R.E.; Pauchard, A.; Dullinger, S. Hiking trails as conduits for the spread of non-native species in mountain areas. Biol. Invasions 2020, 22, 1121–1134. [Google Scholar] [CrossRef]
- Marzano, M.; Dandy, N. Recreationist behaviour in forests and the disturbance of wildlife. Biodivers. Conserv. 2012, 21, 2967–2986. [Google Scholar] [CrossRef]
- Dertien, J.S.; Larson, C.L.; Reed, S.E. Recreation effects on wildlife: A review of potential quantitative thresholds. Nat. Conserv. 2021, 44, 51–68. [Google Scholar] [CrossRef]
- Maté-González, M.Á.; Sáez Blázquez, C.; Herranz Herranz, D.; Camargo Vargas, S.A.; Martín Nieto, I. GIS-Based Assessment of Shaded Road Segments for Enhanced Winter Risk Management. Remote Sens. 2026, 18, 476. [Google Scholar] [CrossRef]
- Yulu, A.; Şekertekin, A. Why Climbers Die on Mount Ağrı (Ararat)?: Risks, gaps, and safety strategies. J. Geogr. 2025, 184–200. [Google Scholar] [CrossRef]
- Scanlon, R. Surviving the Trail: Five Essential Skills to Prepare Every Hiker for Adventure’s Most Common Perils; Simon and Schuster: New York, NY, USA, 2025. [Google Scholar]
- Wu, K.; Xing, A.; Zhou, J.; Su, L.; Zhang, S.; Yang, S. Influencing factors of sports tourism safety accidents in Tibet, China: fsQCA analysis based on the SCM. PLoS ONE 2025, 20, e0334226. [Google Scholar] [CrossRef] [PubMed]
- Rausch, L.; Limmer, M.; Pocecco, E.; Ruedl, G.; Posch, M.; Faulhaber, M. Sex-specific analysis of hiking accidents in the Austrian Alps: A follow-up from 2015 to 2021. AIMS Public Health 2024, 11, 160–175. [Google Scholar] [CrossRef]
- Fu, X.; Zhang, Y.; Ortúzar, J.d.D.; Lü, G. Activity-travel pattern inference based on multi-source big data. Transp. Rev. 2025, 45, 26–48. [Google Scholar] [CrossRef]
- Šauer, M.; Pařil, V.; Jandová, M.; Paleta, T.; Farbiak, M. How Much Does Mobile Phone Data Reveal Mobility and Tourist Behaviour? Available online: https://ssrn.com/abstract=5197715 (accessed on 10 April 2026).
- Connolly, C.; Steinbach, S.; Vo, M.; Wan, X. Harnessing Human Mobility Data for Applied Economic Research: Current Knowledge, Challenges, and Emerging Opportunities. J. Econ. Surv. 2025, early view. [Google Scholar] [CrossRef]
- Zhang, C.; Zhang, K.; Yuan, Q.; Zhang, L.; Hanratty, T.; Han, J. Gmove: Group-level mobility modeling using geo-tagged social media. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining; Association for Computing Machinery: New York, NY, USA, 2016; pp. 1305–1314. [Google Scholar]
- Liu, L.; Wang, R.; Guan, W.W.; Bao, S.; Yu, H.; Fu, X.; Liu, H. Assessing reliability of Chinese geotagged social media data for spatiotemporal representation of human mobility. ISPRS Int. J. Geo-Inf. 2022, 11, 145. [Google Scholar] [CrossRef]
- Noviansyah, A.; Uno, N.; Matsunaka, R.; Nishigaki, T. Analyzing temporal mobility pattern shifts: Dynamic origin–destination estimation from aggregated mobile phone location data. Transp. Res. Interdiscip. Perspect. 2026, 36, 101806. [Google Scholar] [CrossRef]
- Liu, E.; Wang, X.; Wang, Y.; Zhao, D.; Niu, X.; Lu, X. A hybrid deep learning model for human mobility prediction. J. Phys. Complex. 2026, 7, 015009. [Google Scholar] [CrossRef]
- Xian, Y.; Chen, M.; Hu, M.; Chen, L. Unveiling pattern and structure of inter-urban mobility: Integrating flow space and geospatial information. Cities 2026, 171, 106789. [Google Scholar] [CrossRef]
- Yang, C.; Huang, H.; Meng, S.; Zhang, Y.; Lo, J.T.; Ma, R. Typology-based spatial modeling of urban block commercial vitality: Evidence from Shenzhen for land use planning. Land Use Policy 2026, 164, 107973. [Google Scholar] [CrossRef]
- Liu, R.; Xia, H.; Li, L.; Li, Q.; Liu, S. Transit oriented development under the influence of urban air mobility: A public transit-based vertiport siting method. J. Air Transp. Manag. 2026, 133, 102962. [Google Scholar] [CrossRef]
- Liu, W.; Cai, H.; Liu, B.; Xing, H.; Gong, L. Rethinking spatial community detection in human mobility: A random walk-based method. Cities 2026, 170, 106674. [Google Scholar] [CrossRef]
- Tweed, W.C. Uncertain Path: A Search for the Future of National Parks; University of California Press: Berkeley, CA, USA, 2010. [Google Scholar]
- Peng, J.; Tang, J.; Deng, M.; Liu, H.; Hu, Z.; Jiang, X.; Xiang, J.; Ning, X.; Zhao, W. Discovering spatiotemporal patterns of human outdoor activities with crowdsourced trajectory data. Geo-Spat. Inf. Sci. 2025, 28, 1214–1236. [Google Scholar] [CrossRef]
- Chen, C.; Zhao, W.; Zhao, B. Mechanisms Underlying the Effects of Recreation Services on Tourist Satisfaction in Forest Parks: A Case Study of the Yangtze River Delta, China. Sustainability 2026, 18, 1936. [Google Scholar] [CrossRef]
- Wang, X.; Mei, J.; Mei, Z.; Cheng, H.; Li, W.; Wang, L.; Chen, D.; Wang, Y.; Gao, Z. Spatio-Temporal Dynamics, Driving Forces, and Location–Distance Attenuation Mechanisms of Beautiful Leisure Tourism Villages in China. Land 2026, 15, 250. [Google Scholar] [CrossRef]
- Kosamkar, P.; Kulkarni, V. Forecasting Influencing Parameter for CO2 and CH4 Emission from Agriculture Using Lasso and Ridge Regression. In Application of Machine Learning in Earth Sciences: A Practical Approach; Springer Nature: Cham, Switzerland, 2026; pp. 461–474. [Google Scholar] [CrossRef]
- Schroeder, M.A.; Lander, J.; Levine-Silverman, S. Diagnosing and Dealing with Multicollinearity. West. J. Nurs. Res. 1990, 12, 175–187. [Google Scholar] [CrossRef] [PubMed]
- Du, Z.; Wang, Z.; Wu, S.; Zhang, F.; Liu, R. Geographically neural network weighted regression for the accurate estimation of spatial non-stationarity. Int. J. Geogr. Inf. Sci. 2020, 34, 1353–1377. [Google Scholar] [CrossRef]
- Tang, G.; Du, X.; Wang, S. Impact mechanisms of 2D and 3D spatial morphologies on urban thermal environment in high-density urban blocks: A case study of Beijing’s Core Area. Sustain. Cities Soc. 2025, 123, 106285. [Google Scholar] [CrossRef]
- Vogel, R.M.; Papalexiou, S.M.; Lamontagne, J.R.; Dolan, F.C. When Heavy Tails Disrupt Statistical Inference. Am. Stat. 2025, 79, 221–235. [Google Scholar] [CrossRef]
- Zube, E.H.; Sell, J.L.; Taylor, J.G. Landscape perception: Research, application and theory. Landsc. Plan. 1982, 9, 1–33. [Google Scholar] [CrossRef]
- Chen, J.; Cao, X.; Yang, D.; Huang, R.; Song, H.; Huang, L.; Hu, S. Deciphering socio-spatial drivers of visitor sentiment in cultural heritage tourism: A text mining and XGBoost-based empirical study of Go Seigen in Fuzhou, China. Asia Pac. J. Tour. Res. 2026, 1–24. [Google Scholar] [CrossRef]
- Liu, X.; Wang, S.; Tang, G. Understanding nonlinear and spatially heterogeneous effects of urban residential morphology on land surface temperature: Integrating SOM, XGBoost-SHAP, and GWR models. Sustain. Cities Soc. 2026, 136, 107100. [Google Scholar] [CrossRef]
- Zhang, Y.; Ge, J.; Wang, S.; Dong, C. Optimizing urban green space configurations for enhanced heat island mitigation: A geographically weighted machine learning approach. Sustain. Cities Soc. 2025, 119, 106087. [Google Scholar] [CrossRef]
- Zhou, K.; Zheng, X.; Huang, S.; Li, H.; Yin, H. Quantifying the combined and individual impacts of climate and human activity on the urban green space carbon sink capacity in Beijing. Sustain. Cities Soc. 2025, 122, 106253. [Google Scholar] [CrossRef]
- Zhang, J.; Hong, S.; Shi, S.; Chen, B.; Wu, S. Synergistic impacts of 2D/3D grey-green spatial morphology on land surface temperature across local climate zones in the Guangdong-Hong Kong-Macao Greater Bay Area. Sustain. Cities Soc. 2026, 139, 107231. [Google Scholar] [CrossRef]
- Gao, F.; Liu, H.; Guo, Q.; Xin, Y. Explainable Machine Learning Reveals Nonlinear Human-Environment Relationships in Mid-Holocene Settlement Patterns Across China. Available online: https://ssrn.com/abstract=6149458 (accessed on 10 April 2026).
- Yuan, B.; Guo, S.; Mu, H.; Pan, X.; Li, C.; Xia, Z.; Zhang, X.; Du, P. Assessment of land surface vulnerability using time-series geospatial datasets. Ecol. Inform. 2025, 89, 103178. [Google Scholar] [CrossRef]
- Helbing, D.; Keltsch, J.; Molnár, P. Modelling the evolution of human trail systems. Nature 1997, 388, 47–50. [Google Scholar] [CrossRef]
- Chen, Z.; Yu, B.; Yang, C.; Zhou, Y.; Yao, S.; Qian, X.; Wang, C.; Wu, B.; Wu, J. An extended time series (2000–2018) of global NPP-VIIRS-like nighttime light data from a cross-sensor calibration. Earth Syst. Sci. Data 2021, 13, 889–906. [Google Scholar] [CrossRef]
- Xiao, Y.; Lin, J.; Zhang, X.; Zhang, M.; Chen, W.; Li, J. Designing outdoor emergency rescue stations based on the spatiotemporal behavior of outdoor adventure tourists using GPS trajectory data. Saf. Sci. 2024, 175, 106497. [Google Scholar] [CrossRef]
- Belkin, M.; Hsu, D.; Ma, S.; Mandal, S. Reconciling modern machine-learning practice and the classical bias–variance trade-off. Proc. Natl. Acad. Sci. USA 2019, 116, 15849–15854. [Google Scholar] [CrossRef] [PubMed]
- Lambert, D. Zero-Inflated Poisson Regression, With an Application to Defects in Manufacturing. Technometrics 1992, 34, 1–14. [Google Scholar] [CrossRef]
- Lou, L.; Ma, W.; Cheng, P.; Liu, H.; Huang, Y. Climatic and Fuel Drivers of Lightning-Induced Forest Fire Burned Area in the Da Hinggan Ling Region, Northeast China. Remote Sens. 2026, 18, 657. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (KDD ’16); Association for Computing Machinery: New York, NY, USA, 2016; pp. 785–794. [Google Scholar] [CrossRef]
- Zhang, J.; Hong, S.; Chen, B.; Wu, S. Multiscale synergistic effects of urban green space morphology on heat-pollution: A case study of Guangdong-Hong Kong-Macao Greater Bay Area, China. Ecol. Indic. 2025, 173, 113390. [Google Scholar] [CrossRef]
- Seraj, H.; Abbaspour, A.; Bahadori-Jahromi, A. Interpretable Data-Driven Models for Energy Performance Assessment in Residential Buildings. Sustainability 2026, 18, 457. [Google Scholar] [CrossRef]
- Mauldin, R.L.; Mattingly, S.P.; Jang, S.; Handique, S.; Haque, M.; Parekh, R. Quantifying the Spatial Burden of Informal Ride Provision for Older Adults Using Activity Space Analysis and GIS. ISPRS Int. J. Geo-Inf. 2026, 15, 86. [Google Scholar] [CrossRef]
- Zhong, Y.; Guo, M.; Zhang, M.; Tan, L. Identifying Street Environmental Factors That Attract Public Attention from the Jogger’s Perspective: A Multiscale Spatial Exploration. Buildings 2024, 14, 1935. [Google Scholar] [CrossRef]
- Thiene, M.; Scarpa, R. Hiking in the Alps: Exploring Substitution Patterns of Hiking Destinations. Tour. Econ. 2008, 14, 263–282. [Google Scholar] [CrossRef]
- Kołodziejczyk, K. Factors determining changes in the network of marked hiking trails in the Sudetes. J. Mt. Sci. 2024, 21, 1075–1099. [Google Scholar] [CrossRef]
- Bachinger, M.; Hafner, M.; Harprecht, P. Cultural and space-based factors influencing recreational conflicts in forests. The example of cyclists and other forest visitors in Freiburg (Germany). Eur. J. Cult. Manag. Policy 2024, 13, 12494. [Google Scholar] [CrossRef]
- Zoderer, B.M.; Tasser, E. The plurality of wilderness beliefs and their mediating role in shaping attitudes towards wilderness. J. Environ. Manag. 2021, 277, 111392. [Google Scholar] [CrossRef]
- Buijs, A.E. Lay People’s Images of Nature: Comprehensive Frameworks of Values, Beliefs, and Value Orientations. Soc. Nat. Resour. 2009, 22, 417–432. [Google Scholar] [CrossRef]
- Xu, H.; Guo, Q.; Siqin, C.; Li, Y.; Gao, F. Study of Settlement Patterns in Farming–Pastoral Zones in Eastern Inner Mongolia Using Planar Quantization and Cluster Analysis. Sustainability 2023, 15, 15077. [Google Scholar] [CrossRef]
- Ravinsky Raichel, N.; Yahel, H. Planning Challenges and Opportunities in the Conservation of National Trails: The Case of the Israel National Trail. Land 2024, 13, 1449. [Google Scholar] [CrossRef]
- Schweinsberg, S. Tourism and trails: Cultural, ecological and management issues. Ann. Leis. Res. 2017, 20, 123–124. [Google Scholar] [CrossRef]
- Upadhayaya, P.K. Sustainable Management of Trekking Trails for the Adventure Tourism in Mountains: A Study of Nepal’s Great Himalaya Trails. J. Tour. Adventure 2018, 1, 1–31. [Google Scholar] [CrossRef]














| Category | Variable Name | Abbreviation/Code |
|---|---|---|
| Topography | Elevation | DEM |
| Slope | Slope | |
| Climate | Annual Precipitation | Prec |
| Annual Mean Temperature | Temp | |
| Hydrology | Distance to Water Systems | Water_Dist |
| Vegetation (NDVI) | NDVI Annual Maximum | NDVI_Max |
| NDVI Annual Minimum | NDVI_Min | |
| NDVI Annual Mean | NDVI_Mean | |
| NDVI Annual Range | NDVI_Range | |
| Nighttime Light (NTL) | NTL Annual Maximum | NTL_Max |
| NTL Annual Minimum | NTL_Min | |
| NTL Annual Mean | NTL_Mean | |
| NTL Annual Range | NTL_Range | |
| Hiking Network | Hiking Road Density | Trail_Dens |
| Distance to Hiking Trails | Trail_Dist |
| Target Variable | Model | R2 | RMSE | MAE | n_Estimators | Max_Depth | Learning_Rate |
|---|---|---|---|---|---|---|---|
| Trail_Dens | Random Forest | 0.373 | 0.011 | 0.003 | 200 | 30 | |
| XGBoost | 0.327 | 0.012 | 0.003 | 200 | 12 | 0.05 | |
| LightGBM | 0.279 | 0.012 | 0.003 | 100 | 9 | 0.1 | |
| Trail_Dist | Random Forest | 0.828 | 980.751 | 681.712 | 200 | 30 | |
| XGBoost | 0.869 | 858.155 | 568.787 | 200 | 12 | 0.05 | |
| LightGBM | 0.824 | 992.651 | 699.637 | 200 | −1 | 0.1 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Guo, Q.; Chen, S.; Bai, X.; Zhang, Y. Landscape Drivers of Trail Formation in Peri-Urban Mountains: Insights from an Explainable Machine Learning Approach. Land 2026, 15, 715. https://doi.org/10.3390/land15050715
Guo Q, Chen S, Bai X, Zhang Y. Landscape Drivers of Trail Formation in Peri-Urban Mountains: Insights from an Explainable Machine Learning Approach. Land. 2026; 15(5):715. https://doi.org/10.3390/land15050715
Chicago/Turabian StyleGuo, Qin, Shili Chen, Xueyue Bai, and Yue Zhang. 2026. "Landscape Drivers of Trail Formation in Peri-Urban Mountains: Insights from an Explainable Machine Learning Approach" Land 15, no. 5: 715. https://doi.org/10.3390/land15050715
APA StyleGuo, Q., Chen, S., Bai, X., & Zhang, Y. (2026). Landscape Drivers of Trail Formation in Peri-Urban Mountains: Insights from an Explainable Machine Learning Approach. Land, 15(5), 715. https://doi.org/10.3390/land15050715

