An Explainable Machine-Learning Framework Based on XGBoost–SHAP and Big Data for Revealing the Socioeconomic Drivers of Population Urbanization in China
Abstract
1. Introduction
- (1)
- Identify the critical socioeconomic variables influencing population urbanization;
- (2)
- Reveal the primary driving factors across China’s eastern, central, and western regions, highlighting regional heterogeneity;
- (3)
- Analyze the nonlinear and time-varying relationships between key sensitive variables and urbanization levels;
- (4)
- Investigate how interactions among these sensitive factors shape population urbanization.
2. Literature Review and Indicator System Development
3. Materials and Methods
3.1. Overview of the XGBoost Regression Model
3.2. SHAP Value Method
3.3. Partial Dependence Plots
3.4. Data Collection
3.5. Research Framework
4. Results
4.1. Importance Analysis of Population-Urbanization-Driving Factors
4.2. Regional Heterogeneity of Driving Factors According to the SHAP Model
4.3. Partial Dependence Analysis of Regional Sensitivity Factors
4.3.1. Static-Effect Analysis
4.3.2. Temporal-Effect Analysis
4.4. Interaction Analysis of Regional Sensitivity Factors
5. Discussion
5.1. China’s Tripartite Mechanism of Population Urbanization
5.2. Structural Differentiation in Regional Population Urbanization Mechanisms
5.3. Nonlinear Dynamics of Sensitivity Factors in Population Urbanization
5.4. Divergence in the Interaction Effects of Regional Population Urbanization Sensitivity Factors
6. Conclusions
- Economically advanced regions should prioritize aligning knowledge capital with human resources and promoting high-value service industries to continuously attract skilled populations and support sustainable urban growth;
- Cities at intermediate development stages should optimize investment structures, reducing excessive reliance on real estate and encouraging integrated development across industry, population, and land to achieve more balanced urban expansion;
- Resource-dependent and less-developed regions should emphasize strengthening employment opportunities, education infrastructure, and essential public services, thereby enhancing their capacity to sustain population inflow and improve the overall urban sustainability;
- Establish threshold-based monitoring frameworks anchored in the effective intervals of key indicators, including the share of the tertiary sector, the real estate investment intensity, the proportion of the working-age population, and the GDP growth rate.
Funding
Data Availability Statement
Conflicts of Interest
References
- Kundu, D.; Pandey, A.K. World urbanisation: Trends and patterns. In Develo National Urban Policies: Ways Forward to Green and Smart Cities; Springer: Singapore, 2020; pp. 13–49. [Google Scholar]
- Zhang, X.Q. The trends, promises and challenges of urbanisation in the world. Habitat Int. 2016, 54, 241–252. [Google Scholar] [CrossRef]
- Onyango, A.O. Global and regional trends of urbanization: A critical review of the environmental and economic imprints. World Environ. 2018, 8, 47–62. [Google Scholar]
- Balula, L.; Seixas, J. Contemporary city and plural knowledge: Reframing urban planning. In Architecture and the Social Sciences: Inter-and Multidisciplinary Approaches Between Society and Space; Springer International Publishing: Cham, Switzerland, 2017; pp. 69–84. [Google Scholar]
- Buriachenko, A.; Koczar, J.; Biliavska, O.; Levchenko, K.; Kolomiiets, H. The impact of urbanization on socio-economic development: The experience of Poland, Spain and Ukraine. Sci. Bull. Natl. Min. Univ. 2024, 158–168. [Google Scholar] [CrossRef]
- Zhu, C.; Sun, S.; Li, Z. The influencing factors and spatial distribution of population urbanization in China. Geogr. Res. 2008, 27, 13–23. [Google Scholar]
- Tacoli, C.; McGranahan, G.; Satterthwaite, D. Urbanisation, Rural-Urban Migration and URBAN Poverty; Human Settlements Group, International Institute for Environment and Development: London, UK, 2015. [Google Scholar]
- Castells-Quintana, D.; Royuela, V.; Veneri, P. Inequality and city size: An analysis for OECD functional urban areas. Pap. Reg. Sci. 2020, 99, 1045–1065. [Google Scholar] [CrossRef]
- Gyawali, B.R.; Hill, A.; Banerjee, S.; Chembezi, D.; Christian, C.; Bukenya, J.; Silitonga, M. Examining rural-urban population change in the southeastern United States. J. Rural. Soc. Sci. 2013, 28, 4. [Google Scholar]
- Shangguan, Z. Investigating the spatiotemporal dynamics and interplay mechanisms between population urbanization and carbon dioxide emissions in China. J. Geochem. Explor. 2024, 266, 107571. [Google Scholar] [CrossRef]
- Shangguan, Z. Exploring the impact of population urbanization on the green economy development: A case study of 30 provincial-level administrative regions in China. Phys. Chem. Earth Parts A/B/C 2024, 136, 103727. [Google Scholar] [CrossRef]
- Zhang, P.; Yuan, H.; Tian, X. Sustainable development in China: Trends, patterns, and determinants of the “Five Modernizations” in Chinese cities. J. Clean. Prod. 2019, 214, 685–695. [Google Scholar] [CrossRef]
- Liu, Y.; Lu, S.; Chen, Y. Spatio-temporal change of urban–rural equalized development patterns in China and its driving factors. J. Rural. Stud. 2013, 32, 320–330. [Google Scholar] [CrossRef]
- Yang, K. New urbanization and coordinated regional development. Chin. J. Urban Environ. Stud. 2019, 7, 1975009. [Google Scholar] [CrossRef]
- Ahrend, R.; Lembcke, A.C.; Schumann, A. The role of urban agglomerations for economic and productivity growth. Int. Product. Monit. 2017, 32, 161–179. [Google Scholar]
- Cohen, M.; Simet, L. Macroeconomy and urban productivity. In Urban Planet: Knowledge Towards Sustainable Cities; Cambridge University Press & Assessment: Cambridge, UK, 2018; pp. 130–146. [Google Scholar]
- Cuaresma, J.C.; Doppelhofer, G.; Feldkircher, M. The determinants of economic growth in European regions. Reg. Stud. 2014, 48, 44–67. [Google Scholar] [CrossRef]
- Liao, F.H.; Wei, Y.D. Space, scale, and regional inequality in provincial China: A spatial filtering approach. Appl. Geogr. 2015, 61, 94–104. [Google Scholar] [CrossRef]
- Xing, C. Human Capital and Urbanization in the People’s Republic of China; ADBI Working Paper, No. 603; Asian Development Bank Institute: Tokyo, Japan, 2016. [Google Scholar]
- Ginsburg, C.; Bocquier, P.; Beguy, D.; Afolabi, S.; Augusto, O.; Derra, K.; Odhiambo, F.; Otiende, M.; Soura, A.B.; Zabre, P.; et al. Human capital on the move: Education as a determinant of internal migration in selected INDEPTH surveillance populations in Africa. Demogr. Res. 2016, 34, 845. [Google Scholar] [CrossRef]
- Zhiqiang, L. Human capital externalities and rural–urban migration: Evidence from rural China. China Econ. Rev. 2008, 19, 521–535. [Google Scholar] [CrossRef]
- Zhou, J.; Song, J.; Huang, X. Human capital, well-being and growth rate of rural–urban migration in China. Singap. Econ. Rev. 2021, 1–34. [Google Scholar] [CrossRef]
- Tong, Y.; Tan, C.H.; Sia, C.L.; Shi, Y.; Teo, H.-H. Rural-urban healthcare access inequality challenge: Transformative roles of information technology. Mis Q. 2022, 46, 1937–1982. [Google Scholar] [CrossRef]
- Qin, V.M.; McPake, B.; Raban, M.Z.; Cowling, T.E.; Alshamsan, R.; Chia, K.S.; Smith, P.C.; Atun, R. Rural and urban differences in health system performance among older Chinese adults: Cross-sectional analysis of a national sample. BMC Health Serv. Res. 2020, 20, 372. [Google Scholar] [CrossRef] [PubMed]
- City, B.L.; Assessment, E.; Tool, R. Urbanization and health. Bull World Health Organ 2010, 88, 245–246. [Google Scholar]
- Chindarkar, N.; Nakajima, M.; Wu, A.M. Inequality of Opportunity in Health Among Urban, Rural, and Migrant Children: Evidence from China. J. Soc. Policy 2024, 53, 950–969. [Google Scholar] [CrossRef]
- Hong, Y.; Li, X.; Stanton, B.; Lin, D. Too costly to be ill: Health care access and health seeking behaviors among rural-to-urban migrants in China. World Health Popul. 2006, 8, 22. [Google Scholar] [CrossRef] [PubMed]
- Lu, H.; Zhao, P.; Hu, H.; Zeng, L.; Wu, K.S.; Lv, D. Transport infrastructure and urban-rural income disparity: A municipal-level analysis in China. J. Transp. Geogr. 2022, 99, 103292. [Google Scholar] [CrossRef]
- Wu, W. Urban infrastructure financing and economic performance in China. Urban Geogr. 2010, 31, 648–667. [Google Scholar] [CrossRef]
- Xu, Y.; Zhu, S. Transport infrastructure, intra-regional inequality and urban-rural divide: Evidence From China’s high-speed rail construction. Int. Reg. Sci. Rev. 2024, 47, 378–406. [Google Scholar] [CrossRef]
- Chanieabate, M.; He, H.; Guo, C.; Abrahamgeremew, B.; Huang, Y. Examining the relationship between transportation infrastructure, urbanization level and rural-urban income gap in China. Sustainability 2023, 15, 8410. [Google Scholar] [CrossRef]
- Du, Y.; Yang, C. Demographic transition and labour market changes: Implications for economic development in China. China’s Econ. A Collect. Surv. 2015, 28, 25–44. [Google Scholar]
- Che, S.Y.; Chen, W.; Guo, L. Demographic dividend in china’s economic growth. Popul. Econ. 2011, 3, 16–23. [Google Scholar]
- Eggleston, K.; Oi, J.C.; Rozelle, S.; Sun, A.; Walder, A.; Zhou, X. Will demographic change slow China’s rise? J. Asian Stud. 2013, 72, 505–518. [Google Scholar] [CrossRef]
- Yin, Z.; Gang, Y. A Brief Analysis on the Development Strategies for New-Type Urbanization Simulated by Demographic Factors: Based on Real Evidence in Chongqing. Can. Soc. Sci. 2014, 10, 126. [Google Scholar]
- Zhan, P.; Ma, X.; Li, S. Migration, population aging, and income inequality in China. J. Asian Econ. 2021, 76, 101351. [Google Scholar] [CrossRef]
- Chen, M.; Zhang, H.; Gong, Y. China’s Population Aging and New Urbanization. In E-Planning and Collaboration: Concepts, Methodologies, Tools, and Applications; IGI Global Scientific Publishing: Hershey, PA, USA, 2018; pp. 382–400. [Google Scholar]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd Acm Sigkdd International Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Wang, Y. Social Security: Establishment and Equalized Provision of Basic Public Services. In The Chinese Approach: How China Has Transformed Its Economy and System? Springer: Singapore, 2021; pp. 311–351. [Google Scholar]
- Sancar, C.; Akbaş, Y.E. The Effect of Unemployment and Urbanization on Migration in Turkey: An Evaluation in terms of the Harris-Todaro Model. Sosyoekonomi 2022, 30, 215–239. [Google Scholar] [CrossRef]
- Du, Y.; Wang, M. Chapter Eight Population, Industrial Development, and Employment in Chinese Urbanization. In The China Population and Labor Yearbook; Brill: Leiden, The Netherlands, 2012; Volume 3, pp. 189–214. [Google Scholar]
- Pannell, C.W. China’s continuing urban transition. Environ. Plan. A 2002, 34, 1571–1589. [Google Scholar] [CrossRef]
- Hamnett, C. Is Chinese urbanisation unique? Urban Stud. 2020, 57, 690–700. [Google Scholar] [CrossRef]
- Huang, S.; Lin, Y. Research on High-Quality Urbanization Development and Optimization Pathways Based on the Coupling Coordination Perspective of “Population–Land–Economy–Environment”: A Case Study of Jiangsu Province, China. Land 2025, 14, 435. [Google Scholar] [CrossRef]
- Floater, G.; Rode, P.; Robert, A.; Kennedy, C.; Hoornweg, D.; Slavcheva, R.; Godfrey, N. Cities and the New Climate Economy: The Transformative Role of Global Urban Growth. New Climate Economy Contributing Paper. Retrieved from: New Climate Economy and London School of Economic and Political Science, 2014. Available online: www.newclimateeconomy.net (accessed on 15 May 2017).
- Zhang, K.H. Urbanization and industrial development in China. In China’s Urbanization and Socioeconomic Impact; Springer: Singapore, 2017; pp. 21–35. [Google Scholar]
- Song, F.; Timberlake, M. Chinese urbanization, state policy, and the world economy. J. Urban Aff. 1996, 18, 285–306. [Google Scholar] [CrossRef]
- Pannell, C. China’s urban transition. J. Geogr. 1995, 94, 394–403. [Google Scholar] [CrossRef] [PubMed]
- Wu, Y.; Luo, J.; Zhang, X.; Skitmore, M. Urban growth dilemmas and solutions in China: Looking forward to 2030. Habitat Int. 2016, 56, 42–51. [Google Scholar] [CrossRef]
- Sun, Y.; Zhu, Y.; Zheng, W. Policy simulation on national large scale investment program based on the impaction of financial crisis. Geogr. Res. 2010, 29, 789–800. [Google Scholar]
- Dang, J.; Wang, J.; Tu, B. The impact of National Forest City Construction on local employment: Evidence from China. For. Policy Econ. 2025, 172, 103421. [Google Scholar] [CrossRef]
- Shan, Z.; Wang, Y. Strategic talent development in the knowledge economy: A comparative analysis of global practices. J. Knowl. Econ. 2024, 15, 19570–19596. [Google Scholar] [CrossRef]
- Oyelaran-Oyeyinka, B.; Lal, K. Structural Transformation and Economic Development: Cross Regional Analysis of Industrialization and Urbanization; Routledge: London, UK, 2016. [Google Scholar]
- Su, H.; Wei, H.; Zhao, J. Density effect and optimum density of the urban population in China. Urban Stud. 2017, 54, 1760–1777. [Google Scholar] [CrossRef]
- Ali, A. Nonlinear effects of urbanization routes on environmental degradation, evidence from China, India, Indonesia, the United States, and Brazil. Front. Psychol. 2023, 34, 3391–3416. [Google Scholar]
- Sodhi, I.S. Urbanization in China and India. Rise India China Soc. Econ. Environ. Impacts 2020, 73, 73. [Google Scholar]
- Wang, Z.; Wang, C.; Zhang, Q. Population ageing, urbanization and housing demand. J. Serv. Sci. Manag. 2015, 8, 516–525. [Google Scholar] [CrossRef]
- Kose, M.A.; Ohnsorge, F. Falling Long-Term Growth Prospects: Trends, Expectations, and Policies; World Bank Publications: Washington, DC, USA, 2024. [Google Scholar]
- Polenske, K.R. Growth pole theory and strategy reconsidered: Domination, linkages, and distribution. In Regional Economic Development; Routledge: London, UK, 2017; pp. 91–111. [Google Scholar]
- Qi, A.; Feng, Z.; Xie, M.; Song, Y.; Guan, H.; Hao, F. Measurement and Driving Mechanisms of Coordinated Development Between Innovation Capacity and Industrial Transformation in Northeast China Under the Background of Population Shrinkage. Chin. Geogr. Sci. 2025, 35, 819–834. [Google Scholar] [CrossRef]
- Nijkamp, P.; Kourtit, K.; Krugman, P.; Moreno, C. Old wisdom and the New Economic Geography: Managing uncertainty in 21st century regional and urban development. Reg. Sci. Policy Pract. 2024, 16, 100124. [Google Scholar] [CrossRef]
- Sassen, S. The Global City: New York, London, Tokyo; Princeton University Press: Princeton, NJ, USA, 2013. [Google Scholar]
- Hunt, S.D. Economic growth: Should policy focus on investment or dynamic competition? Eur. Bus. Rev. 2007, 19, 274–291. [Google Scholar] [CrossRef]
- Wu, F. Land financialisation and the financing of urban development in China. Land Use Policy 2022, 112, 104412. [Google Scholar] [CrossRef]
- Ren, X. Urban China; John Wiley & Sons: Hoboken, NJ, USA, 2013. [Google Scholar]
- Wackernagel, M.; Rees, W.E. Perceptual and structural barriers to investing in natural capital: Economics from an ecological footprint perspective. Ecol. Econ. 1997, 20, 3–24. [Google Scholar] [CrossRef]
- Saghir, J.; Santoro, J. Urbanization in Sub-Saharan Africa. In Meeting Challenges by Bridging Stakeholders; Center for Strategic & International Studies: Washington, DC, USA, 2018. [Google Scholar]
Dimension | Variable Name | Indicator Formula | Abb. |
---|---|---|---|
Economic Development | GDP Per Person | Gross Regional Product/Total Population | GPP |
GDP Growth Rate | GDP change/Previous GDP | GGR | |
Tertiary Sector Share | Value of the Tertiary Sector/Total GDP | TSS | |
Education Resources | Higher Education Share | Higher-Education Population/Total Population | HES |
Teacher–Student Ratio | Number of Students/Number of Teachers | TSR | |
Per Capita Education | Total Education Spending/Total Population | PCE | |
Healthcare and Services | Hospital Bed Rate | Total Hospital Beds/Total Population | HBR |
Doctor Service Rate | Number of Doctors/Total Population | DSR | |
Per Capita Health | Total Health Spending/Total Population | PCH | |
Infrastructure Access | Road Density Degree | Total Road Length/Land Area | RDL |
Real Estate Investment | Total Real Estate Investment/Total GDP | REI | |
Transit Vehicle Rate | Number of Public Buses/Total Population | TVR | |
Population and Labor | Working-Age Share | Population Aged 15–64/Total Population | WAS |
Population Density Level | Total Population/Land Area | PDL | |
Urban Job Growth | Employment Change/Previous Employment | UJG |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the author. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Shangguan, Z. An Explainable Machine-Learning Framework Based on XGBoost–SHAP and Big Data for Revealing the Socioeconomic Drivers of Population Urbanization in China. Systems 2025, 13, 679. https://doi.org/10.3390/systems13080679
Shangguan Z. An Explainable Machine-Learning Framework Based on XGBoost–SHAP and Big Data for Revealing the Socioeconomic Drivers of Population Urbanization in China. Systems. 2025; 13(8):679. https://doi.org/10.3390/systems13080679
Chicago/Turabian StyleShangguan, Ziheng. 2025. "An Explainable Machine-Learning Framework Based on XGBoost–SHAP and Big Data for Revealing the Socioeconomic Drivers of Population Urbanization in China" Systems 13, no. 8: 679. https://doi.org/10.3390/systems13080679
APA StyleShangguan, Z. (2025). An Explainable Machine-Learning Framework Based on XGBoost–SHAP and Big Data for Revealing the Socioeconomic Drivers of Population Urbanization in China. Systems, 13(8), 679. https://doi.org/10.3390/systems13080679