Measuring Multi-Dimensional Urban Boundaries Influencing Theft: A Case Study of Guangzhou, China
Abstract
1. Introduction
2. Data and Methods
2.1. Study Area
2.2. Data
2.2.1. Dependent Variable
2.2.2. Social Boundaries
2.2.3. Physical Boundaries
2.2.4. Control Variables
2.3. Methods
2.3.1. ML Models
2.3.2. Model Performance Evaluation
2.3.3. The Feature Importance Analysis
3. Results
3.1. Results of the Boundary Measurement
3.2. ML Model Analysis Results
3.3. Impact of Multiple Boundary Variables on Crime
4. Discussion
4.1. Urban Boundaries as Strong Predictors of Crime Hotspots
4.2. Boundary Zones Are More Vulnerable to Theft than Interior Areas
4.3. Social Boundaries Are More Important than Physical Boundaries to Urban Crime
4.4. Limitations
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Smith, S.J. Crime, Space and Society; Cambridge University Press: Cambridge, UK, 1986; ISBN 0-521-26456-1. [Google Scholar]
- Hillier, B. Can Streets Be Made Safe? Urban Des. Int. 2004, 9, 31–45. [Google Scholar] [CrossRef]
- Kim, Y.-A.; Hipp, J.R. Both Sides of the Street: Introducing Measures of Physical and Social Boundaries Based on Differences Across Sides of the Street, and Consequences for Crime. J. Quant. Criminol. 2022, 38, 75–103. [Google Scholar] [CrossRef]
- Legewie, J. Living on the Edge: Neighborhood Boundaries and the Spatial Dynamics of Violent Crime. Demography 2018, 55, 1957–1977. [Google Scholar] [CrossRef]
- Brantingham, P.L.; Brantingham, P.J. Nodes, Paths and Edges: Considerations on the Complexity of Crime and the Physical Environment. J. Environ. Psychol. 1993, 13, 3–28. [Google Scholar] [CrossRef]
- Kim, Y.-A.; Hipp, J.R. Physical Boundaries and City Boundaries: Consequences for Crime Patterns on Street Segments? Crime Delinq. 2018, 64, 227–254. [Google Scholar] [CrossRef]
- Rengert, G.; Chakravorty, S.; Bole, T.; Henderson, K. A Geographic Analysis of Illegal Drug Markets. Crime Prev. Stud. 2000, 11, 219–240. [Google Scholar]
- Song, J.; Andresen, M.A.; Brantingham, P.L.; Spicer, V. Crime on the Edges: Patterns of Crime and Land Use Change. Cartogr. Geogr. Inf. Sci. 2017, 44, 51–61. [Google Scholar] [CrossRef]
- Lamont, M.; Molnár, V. The Study of Boundaries in the Social Sciences. Annu. Rev. Sociol. 2002, 28, 167–195. [Google Scholar] [CrossRef]
- Brantingham, P.J.; Tita, G.E.; Short, M.B.; Reid, S.E. The Ecology of Gang Territorial Boundaries. Criminology 2012, 50, 851–885. [Google Scholar] [CrossRef]
- Song, J.; Spicer, V.; Brantingham, P. The Edge Effect: Exploring High Crime Zones near Residential Neighborhoods. In Proceedings of the 2013 IEEE International Conference on Intelligence and Security Informatics, Washington, DC, USA, 4–7 June 2013; pp. 245–250. [Google Scholar]
- Kim, S.; Lee, S. Nonlinear Relationships and Interaction Effects of an Urban Environment on Crime Incidence: Application of Urban Big Data and an Interpretable Machine Learning Method. Sustain. Cities Soc. 2023, 91, 104419. [Google Scholar] [CrossRef]
- Buonanno, P.; Leonida, L. Non-Market Effects of Education on Crime: Evidence from Italian Regions. Econ. Educ. Rev. 2009, 28, 11–17. [Google Scholar] [CrossRef]
- Yin, J. Crime Prediction Methods Based on Machine Learning: A Survey. Comput. Mater. Contin. 2022, 74, 4601–4629. [Google Scholar] [CrossRef]
- Alves, L.G.A.; Ribeiro, H.V.; Rodrigues, F.A. Crime Prediction through Urban Metrics and Statistical Learning. Phys. A Stat. Mech. Its Appl. 2018, 505, 435–443. [Google Scholar] [CrossRef]
- Xia, Z.; Stewart, K.; Fan, J. Incorporating Space and Time into Random Forest Models for Analyzing Geospatial Patterns of Drug-Related Crime Incidents in a Major U.S. Metropolitan Area. Comput. Environ. Urban Syst. 2021, 87, 101599. [Google Scholar] [CrossRef]
- Bunting, R.J.; Chang, O.Y.; Cowen, C.; Hankins, R.; Langston, S.; Warner, A.; Yang, X.; Louderback, E.R.; Roy, S.S. Spatial Patterns of Larceny and Aggravated Assault in Miami–Dade County, 2007–2015. Prof. Geogr. 2018, 70, 34–46. [Google Scholar] [CrossRef]
- Mustaine, E.E.; Tewksbury, R. Predicting Risks of Larceny Theft Victimization: A Routine Activity Analysis Using Refined Lifestyle Measures. Criminology 1998, 36, 829–858. [Google Scholar] [CrossRef]
- Jiang, S.; Land, K.C.; Wang, J. Social Ties, Collective Efficacy and Perceived Neighborhood Property Crime in Guangzhou, China. Asian Criminol. 2013, 8, 207–223. [Google Scholar] [CrossRef]
- Jing, F.; Liu, L.; Zhou, S.; Song, J.; Wang, L.; Zhou, H.; Wang, Y.; Ma, R. Assessing the Impact of Street-View Greenery on Fear of Neighborhood Crime in Guangzhou, China. Int. J. Environ. Res. Public Health 2021, 18, 311. [Google Scholar] [CrossRef]
- Kounadi, O.; Ristea, A.; Araujo, A.; Leitner, M. A Systematic Review on Spatial Crime Forecasting. Crime Sci. 2020, 9, 7. [Google Scholar] [CrossRef]
- Long, D.; Liu, L.; Feng, J.; Song, G.; He, Z.; Cao, J. Comparisons of the Community Environment Effects on Burglary and Outdoor-Theft: A Case Study of ZH Peninsula in ZG City. Acta Geogr. Sin. 2017, 72, 341–355. [Google Scholar] [CrossRef]
- Zhao, N.; Liu, Y.; Cao, G.; Samson, E.L.; Zhang, J. Forecasting China’s GDP at the Pixel Level Using Nighttime Lights Time Series and Population Images. GIScience Remote Sens. 2017, 54, 407–425. [Google Scholar] [CrossRef]
- Rowe, P.G.; Forsyth, A.; Kan, H.Y. China’s Urban Communities: Concepts, Contexts, and Well-Being; Birkhäuser: Basel, Switzerland, 2016; ISBN 978-3-0356-0833-5. [Google Scholar]
- Gong, P.; Chen, B.; Li, X.; Liu, H.; Wang, J.; Bai, Y.; Chen, J.; Chen, X.; Fang, L.; Feng, S.; et al. Mapping Essential Urban Land Use Categories in China (EULUC-China): Preliminary Results for 2018. Sci. Bull. 2020, 65, 182–187. [Google Scholar] [CrossRef]
- Guo, L.; Xi, X.; Yang, W.; Liang, L. Monitoring Land Use/Cover Change Using Remotely Sensed Data in Guangzhou of China. Sustainability 2021, 13, 2944. [Google Scholar] [CrossRef]
- Feng, D.; Zhang, Z.; Shi, H. Fine extraction of urban villages in provincial capitals based on multivariate data. Gtzyyg 2021, 33, 272–278. [Google Scholar] [CrossRef]
- Boman, J.H.; Gallupe, O. Has COVID-19 Changed Crime? Crime Rates in the United States during the Pandemic. Am. J. Crim. Justice 2020, 45, 537–545. [Google Scholar] [CrossRef] [PubMed]
- Zhang, Y.; Cai, L.; Song, G.; Zhu, C. The Long-Term Theft Prediction in Beijing Using Machine Learning Algorithms: Comparison and Interpretation. Crime Delinq 2023, 71, 2061–2091. [Google Scholar] [CrossRef]
- Dev, V.A.; Eden, M.R. Formation Lithology Classification Using Scalable Gradient Boosted Decision Trees. Comput. Chem. Eng. 2019, 128, 392–404. [Google Scholar] [CrossRef]
- Kononenko, I. Estimating Attributes: Analysis and Extensions of RELIEF. In Proceedings of the Machine Learning: ECML-94; Bergadano, F., De Raedt, L., Eds.; Springer: Berlin/Heidelberg, Germany, 1994; pp. 171–182. [Google Scholar]
- Bishop, C.M. Neural Networks for Pattern Recognition; Oxford University Press: Oxford, UK, 1995. [Google Scholar]
- He, H.; Garcia, E.A. Learning from Imbalanced Data. IEEE Trans. Knowl. Data Eng. 2009, 21, 1263–1284. [Google Scholar] [CrossRef]
- Fernández, A.; Garcia, S.; Herrera, F.; Chawla, N.V. SMOTE for Learning from Imbalanced Data: Progress and Challenges, Marking the 15-Year Anniversary. J. Artif. Intell. Res. 2018, 61, 863–905. [Google Scholar] [CrossRef]
- Chen, C.; Liaw, A.; Breiman, L. Using Random Forest to Learn Imbalanced Data; Department of Statistics, University of California: Berkeley, CA, USA, 2004. [Google Scholar]
- Luu, C.; Pham, B.T.; Phong, T.V.; Costache, R.; Nguyen, H.D.; Amiri, M.; Bui, Q.D.; Nguyen, L.T.; Le, H.V.; Prakash, I.; et al. GIS-Based Ensemble Computational Models for Flood Susceptibility Prediction in the Quang Binh Province, Vietnam. J. Hydrol. 2021, 599, 126500. [Google Scholar] [CrossRef]
- Chen, W.; Xie, X.; Wang, J.; Pradhan, B.; Hong, H.; Bui, D.T.; Duan, Z.; Ma, J. A Comparative Study of Logistic Model Tree, Random Forest, and Classification and Regression Tree Models for Spatial Prediction of Landslide Susceptibility. CATENA 2017, 151, 147–160. [Google Scholar] [CrossRef]
- Fisher, A.; Rudin, C.; Dominici, F. All Models Are Wrong, but Many Are Useful: Learning a Variable’s Importance by Studying an Entire Class of Prediction Models Simultaneously. J. Mach. Learn. Res. 2019, 20, 177. [Google Scholar]
- He, S. Evolving Enclave Urbanism in China and Its Socio-Spatial Implications: The Case of Guangzhou. Soc. Cult. Geogr. 2013, 14, 243–275. [Google Scholar] [CrossRef]
- Tse, S.; Tung, V.W.S. Residents’ Discrimination against Tourists. Ann. Tour. Res. 2021, 88, 103060. [Google Scholar] [CrossRef] [PubMed]
- Guangzhou Municipal Bureau of Planning and Natural Resources. Haizhu District Branch Office Public Announcement of the Draft National Territory Spatial Planning of Haizhu District, Guangzhou City (2021–2035). 2023; p. 16. Available online: https://www.haizhu.gov.cn/hzdt/hzyw/hzzc/content/post_10199965.html (accessed on 1 May 2023).
- Kitteringham, G.; Fennelly, L.J. Chapter 19-Environmental Crime Control. In Handbook of Loss Prevention and Crime Prevention, 6th ed.; Fennelly, L.J., Ed.; Butterworth-Heinemann: Oxford, UK, 2020; pp. 207–222. ISBN 978-0-12-816459-4. [Google Scholar]
- Higgins, B.R.; Hunt, J. Collective Efficacy: Taking Action to Improve Neighborhoods. NIJ J. 2016, 277, 18–21. [Google Scholar]
- Sampson, R.J.; Raudenbush, S.W.; Earls, F. Neighborhoods and Violent Crime: A Multilevel Study of Collective Efficacy. Science 1997, 277, 918–924. [Google Scholar] [CrossRef]







| Variable | Mean | SD | Data Source |
|---|---|---|---|
| Dependent variables | |||
| Theft | 0.063 | 0.379 | China Judgements Online (https://wenshu.court.gov.cn/, accessed on 1 May 2023) |
| Physical boundaries | |||
| Retail land use | 0.814 | 0.155 | Land use dataset from [25] |
| Residential land use | 0.871 | 0.125 | |
| Office land use | 0.816 | 0.161 | |
| Industrial land use | 0.774 | 0.213 | |
| The other land use | 0.837 | 0.149 | |
| Parks | 0.808 | 0.149 | Baidu Map AOI |
| Expressways | 0.176 | 0.038 | OpenStreetMap |
| Rivers | 0.823 | 0.158 | |
| Subdistricts | 0.780 | 0.185 | NCBGI |
| Social boundaries | |||
| Urban villages | 0.871 | 0.119 | Urban village dataset from [27] |
| GDP | 0.052 | 0.213 | China’s GDP dataset from [23] |
| Home value | 0.049 | 0.206 | Property trading platform Anjuke (https://guangzhou.anjuke.com/, accessed on 1 May 2023) |
| Salary | 0.051 | 0.208 | Job recruitment platform 58 Tongcheng (https://cn.58.com/, accessed on 1 May 2023) |
| Education level | 0.050 | 0.208 | |
| Control variables | |||
| Population (logged) | 3.815 | 1.875 | WorldPop |
| Commercial facilities | 1.239 | 4.181 | Baidu Map POI |
| Entertainment facilities | 0.057 | 0.338 | |
| Security facilities | 0.009 | 0.094 | |
| The pandemic | 0.029 | 0.167 | National Health Commission of the People’s Republic of China |
| Home value (logged) | 7.062 | 4.817 | Property trading platform Anjuke (https://guangzhou.anjuke.com/, accessed on 1 May 2023) |
| GDP (logged) | 6.450 | 2.126 | China’s GDP dataset from [23] |
| Salary | 6.322 | 5.009 | Job recruitment platform 58 Tongcheng (https://cn.58.com/, accessed on 1 May 2023) |
| Education level | 2.418 | 1.827 | |
| Percent retail land use | 0.066 | 0.232 | Land use dataset from [25] |
| Percent residential land use | 0.228 | 0.382 | |
| Percent office land use | 0.041 | 0.173 | |
| Percent industrial land use | 0.122 | 0.306 | |
| Percent the other land use | 0.228 | 0.388 | |
| Percent urban villages | 0.082 | 0.246 | Urban village dataset from [27] |
| Theft Cases | Original Count | Original Percent | After SVMSMOTE Count (Per Train Fold) | After SVMSMOTE Percent |
|---|---|---|---|---|
| 0 | 33,437 | 95.00% | 15,158 | 26.83% |
| 1 | 1148 | 3.26% | 8570 | 15.17% |
| 2 | 280 | 0.80% | 7642 | 13.53% |
| 3 | 115 | 0.33% | 8377 | 14.83% |
| 4 | 74 | 0.21% | 6375 | 11.29% |
| 5 | 142 | 0.40% | 10,366 | 18.35% |
| Variable | ReliefF Score | Rank | Fisher Score | Rank | Average Rank |
|---|---|---|---|---|---|
| Population (logged) | 0.9715 | 4 | 0.0231 | 1 | 2.5 |
| Industrial land use boundaries | 1.0254 | 1 | 0.0121 | 5 | 3 |
| GDP (logged) | 0.7974 | 8 | 0.017 | 2 | 5 |
| Home value (logged) | 0.7574 | 10 | 0.0162 | 3 | 6.5 |
| Retail land use boundaries | 0.8274 | 7 | 0.0097 | 8 | 7.5 |
| Salary | 0.9923 | 3 | 0.0079 | 13 | 8 |
| Education level | 1.0045 | 2 | 0.0077 | 14 | 8 |
| Office land use boundaries | 0.6434 | 16 | 0.01 | 7 | 11.5 |
| Urban village boundaries | 0.1764 | 14 | 0.0087 | 10 | 12 |
| Commercial facilities | 0.3434 | 13 | 0.0086 | 11 | 12 |
| Residential land use boundaries | 0.677 | 18 | 0.0095 | 9 | 13.5 |
| Percent urban villages | 0.6889 | 12 | 0.007 | 15 | 13.5 |
| Percent residential land use | 0.5781 | 5 | 0.0016 | 22 | 13.5 |
| Epidemic | 0.7158 | 22 | 0.011 | 6 | 14 |
| GDP boundaries | 0.2404 | 25 | 0.0122 | 4 | 14.5 |
| Percent the other land use | 0.9264 | 9 | 0.002 | 21 | 15 |
| Rivers | 0.7647 | 6 | 0.0002 | 27 | 16.5 |
| Home value boundaries | 0.653 | 23 | 0.0081 | 12 | 17.5 |
| Subdistrict boundaries | 0.2101 | 15 | 0.002 | 20 | 17.5 |
| Park boundaries | 0.484 | 11 | 0.0008 | 25 | 18 |
| Entertainment facilities | 0.4289 | 19 | 0.0022 | 19 | 19 |
| Percent retail land use | 0.8634 | 21 | 0.0061 | 18 | 19.5 |
| Education level difference | 0.076 | 24 | 0.0068 | 16 | 20 |
| Percent industrial land use | 0.7226 | 20 | 0.0015 | 23 | 21.5 |
| Salary boundaries | 0.4706 | 27 | 0.0062 | 17 | 22 |
| The other land use boundaries | 0.6001 | 17 | 0.0001 | 28 | 22.5 |
| Percent office land use | 0.1584 | 26 | 0.0014 | 24 | 25 |
| Expressways | 0.0217 | 28 | 0.0004 | 26 | 27 |
| Model | Hyperparameter | ACC | PR AUC * | Precision * | Recall * | F1 Score * | Per-Class F1 Score (0, 1, 2, 3, 4, 5) |
|---|---|---|---|---|---|---|---|
| KNN | n_neighbors = 15 | 0.9859 ± 0.0083 (0.9756–0.9962) | 0.9289 ± 0.0084 (0.9185–0.9393) | 0.8823 ± 0.0099 (0.8700–0.8946) | 0.8803 ± 0.0091 (0.8690–0.8916) | 0.8740 ± 0.0101 (0.8615–0.8865) | 0.97, 0.95, 0.94, 0.79, 0.70, 0.89 |
| DT | max_depth = 22, min_samples_leaf = 5, min sample split = 5 | 0.9445 ± 0.0045 (0.9389–0.9501) | 0.8291 ± 0.0028 (0.8256–0.8326) | 0.7900 ± 0.0080 (0.7801–0.7999) | 0.7923 ± 0.0065 (0.7842–0.8004) | 0.7902 ± 0.0071 (0.7814–0.7990) | 0.94, 0.90, 0.87, 0.69, 0.60, 0.75 |
| RF | n_estimators = 150, max_depth = 15, min_samples_leaf = 5 | 0.9868 ± 0.0067 (0.9785–0.9951) | 0.9520 ± 0.0070 (0.9433–0.9607) | 0.9051 ± 0.0043 (0.8998–0.9104) | 0.9040 ± 0.0043 (0.8987–0.9093) | 0.9033 ± 0.0045 (0.8977–0.9089) | 0.99, 0.98, 0.95, 0.81, 0.79, 0.91 |
| Adaboost-DT | n_estimators = 40 learning rate = 0.01, max_depth = 12 | 0.7170 ± 0.0142 (0.6994–0.7346) | 0.8162 ± 0.0154 (0.7971–0.8353) | 0.7301 ± 0.0161 (0.7101–0.7501) | 0.5750 ± 0.0103 (0.5622–0.5878) | 0.5680 ± 0.0122 (0.5529–0.5831) | 0.84, 0.80, 0.64, 0.53, 0.15, 0.45 |
| Model | ACC | F1 | AUC |
|---|---|---|---|
| Boundary only model | 0.8953 ± 0.0077 (0.8857–0.9049) | 0.8944 ± 0.0079 (0.8846–0.9042) | 0.9464 ± 0.0015 (0.9445–0.9483) |
| Internal only model | 0.8296 ± 0.0078 (0.8199–0.8393) | 0.8270 ± 0.0079 (0.8172–0.8368) | 0.9297 ± 0.0021 (0.9271–0.9323) |
| Combined model | 0.9868 ± 0.0067 (0.9785–0.9951) | 0.9033 ± 0.0045 (0.8977–0.9089) | 0.9520 ± 0.0070 (0.9433–0.9607) |
| Comparison | ΔACC | ΔF1 | ΔAUC | p_bootstrap ACC | p_bootstrap F1 | p_bootstrap AUC |
|---|---|---|---|---|---|---|
| Boundary only vs. internal only | 0.0657 | 0.0674 | 0.0167 | <0.001 | <0.001 | <0.001 |
| Combined vs. boundary only | 0.0915 | 0.0089 | 0.0056 | <0.001 | <0.001 | <0.001 |
| Combine vs. internal only | 0.1572 | 0.0763 | 0.0223 | <0.001 | <0.001 | <0.001 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Chen, T.; Chen, R.; Xu, Z.; Gu, X.; Wang, C. Measuring Multi-Dimensional Urban Boundaries Influencing Theft: A Case Study of Guangzhou, China. Urban Sci. 2026, 10, 13. https://doi.org/10.3390/urbansci10010013
Chen T, Chen R, Xu Z, Gu X, Wang C. Measuring Multi-Dimensional Urban Boundaries Influencing Theft: A Case Study of Guangzhou, China. Urban Science. 2026; 10(1):13. https://doi.org/10.3390/urbansci10010013
Chicago/Turabian StyleChen, Tong, Ran Chen, Zihao Xu, Xinyue Gu, and Chengfang Wang. 2026. "Measuring Multi-Dimensional Urban Boundaries Influencing Theft: A Case Study of Guangzhou, China" Urban Science 10, no. 1: 13. https://doi.org/10.3390/urbansci10010013
APA StyleChen, T., Chen, R., Xu, Z., Gu, X., & Wang, C. (2026). Measuring Multi-Dimensional Urban Boundaries Influencing Theft: A Case Study of Guangzhou, China. Urban Science, 10(1), 13. https://doi.org/10.3390/urbansci10010013

