Towards Trustworthy Urban Land Use Classification: A Synergistic Fusion of Deep Learning and Explainable Machine Learning with a Nanning Case Study
Abstract
1. Introduction
- To evaluate the performance of machine learning models using GSE data for urban land use classification.
- To enhance model interpretability by decoding the geographic semantics of the features and dissecting the model’s decision-making mechanisms.
2. Materials and Methods
2.1. Study Area
2.2. Data Source and Preprocessing
2.2.1. World Imagery
2.2.2. OpenStreetMap (OSM) Network Data
2.2.3. Points of Interest (POI) Data
2.2.4. Google Satellite Embedding (GSE) Data
2.2.5. Sentinel-2 Multispectral Imagery
2.3. Methods
- First, a high-precision sample library was constructed by fusing multi-source geospatial data with expert visual interpretation within ArcMap 10.8.
- Second, two parallel feature sets were formulated on the GEE platform: the high-dimensional deep embedding features for model training, and a set of physical reference features for attribution.
- Finally, the modeling and analysis pipeline was implemented in Python 3.9. We trained and evaluated machine learning models using the scikit-learn (for RF) and xgboost libraries, which were then deconstructed using the shap library to analyze their internal decision-making mechanisms. This involved quantifying the contributions of key abstract features and systematically correlating them with the physical reference features using the pandas and scipy libraries.
2.3.1. Semantically Constrained Urban Parcel Generation
2.3.2. Feature Engineering
- Second, texture features, derived from the Gray-Level Co-occurrence Matrix (GLCM), were computed to describe the spatial structure and heterogeneity of the land surface, complementing the spectral signatures which are often insufficient in heterogeneous urban environments [50]. However, utilizing the full set of GLCM features often leads to high data redundancy and computational inefficiency due to strong inter-correlations [51]. Therefore, drawing upon established feature selection strategies for remote sensing classification, a specific subset of metrics was chosen to form a representative and minimally redundant set [39,52]. These metrics targeted three complementary textural dimensions critical for distinguishing urban functions: (1) Spatial Order/Disorder (ASM, Entropy) to characterize the complexity of building arrangements; (2) Local Contrast (Contrast, Dissimilarity) to capture the edge intensity typical of built-up areas; and (3) Homogeneity (IDM) to identify uniform surfaces such as open spaces and water bodies.
2.3.3. Machine Learning Modeling and Explainability Analysis
- At the global level, the most influential features were identified by calculating the mean absolute SHAP value for each feature across all samples. To ensure stable results, these importance values were averaged across the five folds of the cross-validation, and the final ranking is presented as a percentage contribution.
- At the local level, stratified SHAP beeswarm plots were employed for visual analysis. These plots intuitively display how a single feature contributes to each individual sample’s prediction, while simultaneously using color to reveal the patterns between the feature’s original value (high or low) and the direction and magnitude of its contribution [56,57].
3. Results
3.1. Performance Evaluation of Embedding-Based Models
3.1.1. Quantitative Evaluation
3.1.2. Qualitative Evaluation
3.2. Identification and Physical Interpretation of Key Features
- Features associated with the built-up environment.
- Features associated with vegetation.
3.3. SHAP-Based Insights into Classification Decision
4. Discussion
4.1. Decoding the Geospatial Semantics of Key Features
4.1.1. Features for Built-Up Area Characterization
4.1.2. Features for Vegetation Characterization
4.2. Deconstructing the Model’s Hierarchical Decision Logic
4.2.1. Convergent Strategies for Distinct Classes
4.2.2. Divergent Strategies for Ambiguous Classes
4.3. Deconstructing the Mechanisms Behind XGBoost’s Superior Performance
4.4. Major Contributions
4.5. Limitations
- First, a fundamental limitation arises from the trade-off between semantic relevance and spatial detail. The parcel-based approach, which defines the parcel as the de facto Minimum Mapping Unit (MMU), is a deliberate choice to align the analysis with the scale of urban planning. However, this aggregation inevitably results in a loss of intra-parcel heterogeneity. By assigning a single functional label to each parcel, the model cannot resolve fine-grained, mixed-use realities, such as ground-floor retail within a residential building. This limitation is intertwined with the simplified classification scheme and controlled sample size. for instance, merging commercial and industrial land overlooks nuanced functional differences. Future work should aim to develop hybrid models that not only adopt more granular classification hierarchies but also quantify the degree of functional mixture within each parcel, rather than assigning a single hard label.
- Second, a key limitation relates to the sample size and its implications for model complexity and overfitting. The dataset, consisting of 800 high-quality parcels, is modest relative to the 64-dimensional GSE feature space. This imbalance introduces the risk of the Hughes phenomenon, where classification accuracy can degrade as feature dimensionality increases relative to the number of training samples [84]. While rigorous cross-validation, hyperparameter tuning, and early stopping proactively mitigated this limitation, the limited sample size necessarily constrained the complexity of the classification scheme that could be reliably implemented. This constraint directly led to decisions such as merging commercial and industrial land into a single class. Future research with access to larger, more comprehensive ground-truth datasets would be crucial to validate the findings and explore more granular classification systems without the heightened risk of overfitting.
- Third, the practical applicability and transferability of the framework face significant barriers. The methodology’s performance is highly dependent on the availability of high-quality auxiliary data, particularly a reliable road network (e.g., from OSM) for accurate parcel delineation, which may be unavailable in many urban contexts. Relatedly, the reliance on GSE data inherently lacks direct socioeconomic information. While effective for capturing physical patterns, remote sensing data alone often struggles to distinguish morphologically similar but functionally distinct zones, a gap that social sensing data is better positioned to fill [10]. Future research could enhance transferability by exploring fusion with more universally available social sensing data. Furthermore, the current multi-step workflow requires considerable technical expertise, posing a challenge for widespread adoption. A critical direction for future work is therefore to encapsulate this entire process into an automated, user-friendly tool or cloud-based platform.
- Finally, regarding interpretability, several nuances must be acknowledged. It is essential to remember that SHAP analysis elucidates model-specific correlations, not real-world causal mechanisms [75]. Furthermore, the raw outputs of XAI methods like SHAP, while scientifically robust, can be cognitively overwhelming due to the high volume of data they present. They do not automatically generate a holistic image for non-specialists. This aligns with recent critiques in Human–Computer Interaction, which suggest that algorithmic explanations often require a human-in-the-loop to bridge the gap between technical feature importance and domain-specific semantic understanding [85]. Therefore, a critical step, demonstrated in this study, is the researcher’s role in synthesizing these complex outputs into a coherent narrative. Any future user-friendly tool based on this framework must not only provide explainability charts but also include features that guide the user through this interpretive process.
5. Conclusions
Supplementary Materials
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
| AI | Artificial Intelligence |
| GSE | Google Satellite Embedding |
| XGBoost | Extreme Gradient Boosting |
| RF | Random Forest |
| XAI | Explainable Artificial Intelligence |
| SHAP | Shapley Additive Explanations |
| OSM | OpenStreetMap |
| POI | Points of Interest |
| AEF | AlphaEarth Foundation |
| ASAN | Association of Southeast Asian Nations |
| MoHURD | Ministry of Housing and Urban-Rural Development |
| GEE | Google Earth Engine |
| GLCM | Gray-Level Co-occurrence Matrix |
| NIR | Near-Infrared |
| SWIR | Short-Wave Infrared |
| NDVI | Normalized Difference Vegetation Index |
| EVI | Enhanced Vegetation Index |
| SAVI | Soil-Adjusted Vegetation Index |
| NDBI | Normalized Difference Built-up Index |
| NDWI | Normalized Difference Water Index |
| ASM | Angular Second Moment |
| IDM | Inverse Difference Moment |
| DISS | Dissimilarity |
| OA | Overall Accuracy |
| PA | Producer’s Accuracy |
| UA | User’s Accuracy |
| PCA | Principal Component Analysis |
| MMU | Minimum Mapping Unit |
References
- Chen, M.; Chen, L.; Cheng, J.; Yu, J. Identifying interlinkages between urbanization and Sustainable Development Goals. Geogr. Sustain. 2022, 3, 339–346. [Google Scholar] [CrossRef]
- Ma, Y.; Zhang, P.; Zhao, K.; Zhou, Y.; Zhao, S. A Dynamic Performance and Differentiation Management Policy for Urban Construction Land Use Change in Gansu, China. Land 2022, 11, 942. [Google Scholar] [CrossRef]
- Chen, F.; Luo, Q.; Zhu, Z. Integrating static and dynamic analyses in a spatial management framework to enhance ecological networks connectivity in the context of rapid urbanization. Ecol. Modell. 2025, 501, 111022. [Google Scholar] [CrossRef]
- Cheng, J.; Turkstra, J.; Peng, M.; Du, N.; Ho, P. Urban land administration and planning in China: Opportunities and constraints of spatial data models. Land Use Policy 2006, 23, 604–616. [Google Scholar] [CrossRef]
- Chaturvedi, V.; de Vries, W.T. Machine Learning Algorithms for Urban Land Use Planning: A Review. Urban Sci. 2021, 5, 68. [Google Scholar] [CrossRef]
- Drici, H.; Carpio-Pinedo, J. Urban land use mix and AI: A systematic review. Cities 2025, 165, 2. [Google Scholar] [CrossRef]
- Radhakrishnan, N.; Hemalatha, S.; Devarajan, G.G.; Nachiyappan, S.; Karthick, S.; Singhal, A. Urban data fusion for spatio-temporal incident forecasting using graph attention and generative AI. Inf. Fusion 2026, 126, 103532. [Google Scholar] [CrossRef]
- Doda, S.; Kahl, M.; Ouan, K.; Obadic, I.; Wang, Y.; Taubenböck, H.; Zhu, X.X. Interpretable deep learning for consistent large-scale urban population estimation using Earth observation data. Int. J. Appl. Earth Obs. Geoinf. 2024, 128, 103731. [Google Scholar] [CrossRef]
- Gong, P.; Chen, B.; Li, X.; Liu, H.; Wang, J.; Bai, Y.; Chen, J.; Chen, X.; Fang, L.; Feng, S.; et al. Mapping essential urban land use categories in China (EULUC-China): Preliminary results for 2018. Sci. Bull. 2020, 65, 182–187. [Google Scholar] [CrossRef]
- Rosier, J.F.; Taubenböck, H.; Verburg, P.H.; van Vliet, J. Fusing Earth observation and socioeconomic data to increase the transferability of large-scale urban land use classification. Remote Sens. Environ. 2022, 278, 113076. [Google Scholar] [CrossRef]
- Li, Z.; Chen, B.; Huang, Y.; Wang, H.; Wang, Y.; Yuan, Y.; Li, X.; Chen, J.M.; Xu, B.; Gong, P. Enhanced mapping of essential urban land use categories in China (EULUC-China 2.0): Integrating multimodal deep learning with multisource geospatial data. Sci. Bull. 2025, 70, 3029–3041. [Google Scholar] [CrossRef]
- Yao, Y.; Jiang, Y.; Sun, Z.; Li, L.; Chen, D.; Xiong, K.; Dong, A.; Cheng, T.; Zhang, H.; Liang, X.; et al. Applicability and sensitivity analysis of vector cellular automata model for land cover change. Comput. Environ. Urban Syst. 2024, 109, 102090. [Google Scholar] [CrossRef]
- Wu, P.; Zhang, S.; Li, H.; Dale, P.; Ding, X.; Lu, Y. Urban Parcel Grouping Method Based on Urban Form and Functional Connectivity Characterisation. ISPRS Int. J. Geo-Inf. 2019, 8, 282. [Google Scholar] [CrossRef]
- Guo, Y.; Tang, J.; Liu, H.; Yang, X.; Deng, M. Identifying up-to-date urban land-use patterns with visual and semantic features based on multisource geospatial data. Sust. Cities Soc. 2024, 101, 105184. [Google Scholar] [CrossRef]
- Wang, J.; Bretz, M.; Dewan, M.A.A.; Delavar, M.A. Machine learning in modelling land-use and land cover-change (LULCC): Current status, challenges and prospects. Sci. Total Environ. 2022, 822, 153559. [Google Scholar] [CrossRef]
- Casali, Y.; Aydin, N.Y.; Comes, T. Machine learning for spatial analyses in urban areas: A scoping review. Sust. Cities Soc. 2022, 85, 104050. [Google Scholar] [CrossRef]
- Salman, H.A.; Kalakech, A.; Steiti, A. Random Forest Algorithm Overview. Babylon. J. Mach. Learn. 2024, 2024, 69–79. [Google Scholar] [CrossRef]
- Li, Z.; Chen, B.; Wu, S.; Su, M.; Chen, J.M.; Xu, B. Deep learning for urban land use category classification: A review and experimental assessment. Remote Sens. Environ. 2024, 311, 114290. [Google Scholar] [CrossRef]
- Fayaz, M.; Nam, J.; Dang, L.M.; Song, H.-K.; Moon, H. Land-Cover Classification Using Deep Learning with High-Resolution Remote-Sensing Imagery. Appl. Sci. 2024, 14, 1844. [Google Scholar] [CrossRef]
- Zhu, Q.; Lei, Y.; Sun, X.; Guan, Q.; Zhong, Y.; Zhang, L.; Li, D. Knowledge-guided land pattern depiction for urban land use mapping: A case study of Chinese cities. Remote Sens. Environ. 2022, 272, 112916. [Google Scholar] [CrossRef]
- Su, Y.; Zhong, Y.; Zhu, Q.; Zhao, J. Urban scene understanding based on semantic and socioeconomic features: From high-resolution remote sensing imagery to multi-source geographic datasets. ISPRS J. Photogramm. Remote Sens. 2021, 179, 50–65. [Google Scholar] [CrossRef]
- Liu, Y.; Zhong, Y.; Shi, S.; Zhang, L. Scale-aware deep reinforcement learning for high resolution remote sensing imagery classification. ISPRS J. Photogramm. Remote Sens. 2024, 209, 296–311. [Google Scholar] [CrossRef]
- Sun, E.; Cui, Y.; Liu, P.; Yan, J. A decade of deep learning for remote sensing spatiotemporal fusion: Advances, challenges, and opportunities. Inf. Fusion 2025, 126, 103675. [Google Scholar] [CrossRef]
- Barredo Arrieta, A.; Díaz-Rodríguez, N.; Del Ser, J.; Bennetot, A.; Tabik, S.; Barbado, A.; Garcia, S.; Gil-Lopez, S.; Molina, D.; Benjamins, R.; et al. Explainable Artificial Intelligence (XAI): Concepts, taxonomies, opportunities and challenges toward responsible AI. Inf. Fusion 2020, 58, 82–115. [Google Scholar] [CrossRef]
- Li, Z. Extracting spatial effects from machine learning model using local interpretation method: An example of SHAP and XGBoost. Comput. Environ. Urban Syst. 2022, 96, 101845. [Google Scholar] [CrossRef]
- Wang, S.; Liu, Y.; Wang, W.; Zhao, G.; Liang, H. Interpretable machine learning guided by physical mechanisms reveals drivers of runoff under dynamic land use changes. J. Environ. Manag. 2024, 367, 121978. [Google Scholar] [CrossRef]
- Temenos, A.; Temenos, N.; Kaselimi, M.; Doulamis, A.; Doulamis, N. Interpretable Deep Learning Framework for Land Use and Land Cover Classification in Remote Sensing Using SHAP. IEEE Geosci. Remote Sens. Lett. 2023, 20, 8500105. [Google Scholar] [CrossRef]
- Yao, Y.; Gao, R.; Wu, H.; Dong, A.; Hu, Z.; Ma, Y.; Guan, Q.; Luo, P. Explainable Mapping of the Irregular Land Use Parcel with a Data Fusion Deep-Learning Model. IEEE Trans. Geosci. Electron. 2025, 63, 5612015. [Google Scholar] [CrossRef]
- Zhai, X.; Jiang, J.; Dejl, A.; Rago, A.; Guo, F.; Toni, F.; Sivakumar, A. Heterogeneous graph neural networks with post-hoc explanations for multi-modal and explainable land use inference. Inf. Fusion 2025, 120, 103057. [Google Scholar] [CrossRef]
- Tollefson, J. Google AI model mines trillions of images to create maps of Earth ‘at any place and time’. Nature 2025, 644, 313–314. [Google Scholar] [CrossRef]
- Brown, C.F.; Kazmierski, M.R.; Pasquarella, V.J.; Rucklidge, W.J.; Samsikova, M.; Zhang, C.; Shelhamer, E.; Lahera, E.; Wiles, O.; Ilyushchenko, S.; et al. AlphaEarth Foundations: An embedding field model for accurate and efficient global mapping from sparse label data. arXiv 2025, arXiv:2507.22291. [Google Scholar] [CrossRef]
- Wang, Q.; Li, R.; Cheong, K.C. Nanning—Perils and promise of a frontier city. Cities 2018, 72, 51–59. [Google Scholar] [CrossRef]
- Du, S.; Zhang, X.; Lei, Y.; Huang, X.; Tu, W.; Liu, B.; Meng, Q.; Du, S. Mapping urban functional zones with remote sensing and geospatial big data: A systematic review. GISci. Remote Sens. 2024, 61, 2404900. [Google Scholar] [CrossRef]
- Zhu, J.; Zhu, M.; Na, J.; Lang, Z.; Lu, Y.; Yang, J. Incorporation of Spatially Heterogeneous Area Partitioning into Vector-Based Cellular Automata for Simulating Urban Land-Use Changes. Land 2023, 12, 1893. [Google Scholar] [CrossRef]
- Chen, D.; Feng, Y.; Li, X.; Qu, M.; Luo, P.; Meng, L. Interpreting core forms of urban morphology linked to urban functions with explainable graph neural network. Comput. Environ. Urban Syst. 2025, 118, 102267. [Google Scholar] [CrossRef]
- Theobald, D.M. Development and applications of a comprehensive land use classification and map for the US. PLoS ONE 2014, 9, e94628. [Google Scholar] [CrossRef]
- Liu, X.; Long, Y. Automated identification and characterization of parcels with OpenStreetMap and points of interest. Environ. Plann. B Plann. Des. 2015, 43, 341–360. [Google Scholar] [CrossRef]
- CJJ 37-2012; Code for Design of Urban Road Engineering. China Architecture & Building Press: Beijing, China, 2012.
- Yin, J.; Fu, P.; Hamm, N.A.S.; Li, Z.; You, N.; He, Y.; Cheshmehzangi, A.; Dong, J. Decision-Level and Feature-Level Integration of Remote Sensing and Geospatial Big Data for Urban Land Use Mapping. Remote Sens. 2021, 13, 1579. [Google Scholar] [CrossRef]
- Xu, S.; Qing, L.; Han, L.; Liu, M.; Peng, Y.; Shen, L. A New Remote Sensing Images and Point-of-Interest Fused (RPF) Model for Sensing Urban Functional Regions. Remote Sens. 2020, 12, 1032. [Google Scholar] [CrossRef]
- Xu, Y.; Zhou, B.; Jin, S.; Xie, X.; Chen, Z.; Hu, S.; He, N. A framework for urban land use classification by integrating the spatial context of points of interest and graph convolutional neural network method. Comput. Environ. Urban Syst. 2022, 95, 101807. [Google Scholar] [CrossRef]
- GB501377; Code for Classification of Urban Land Use and Planning Standards of Development Land. China Architecture & Building Press: Beijing, China, 2011.
- Zhang, X.; Du, S.; Zheng, Z. Heuristic sample learning for complex urban scenes: Application to urban functional-zone mapping with VHR images and POI data. ISPRS J. Photogramm. Remote Sens. 2020, 161, 1–12. [Google Scholar] [CrossRef]
- Hafner, S.; Ban, Y.; Nascetti, A. Unsupervised domain adaptation for global urban extraction using Sentinel-1 SAR and Sentinel-2 MSI data. Remote Sens. Environ. 2022, 280, 113192. [Google Scholar] [CrossRef]
- Phiri, D.; Simwanda, M.; Salekin, S.; Nyirenda, V.; Murayama, Y.; Ranagalage, M. Sentinel-2 Data for Land Cover/Use Mapping: A Review. Remote Sens. 2020, 12, 2291. [Google Scholar] [CrossRef]
- Hu, Y.; Han, Y. Identification of Urban Functional Areas Based on POI Data: A Case Study of the Guangzhou Economic and Technological Development Zone. Sustainability 2019, 11, 1385. [Google Scholar] [CrossRef]
- Kraff, N.J.; Wurm, M.; Taubenbock, H. Uncertainties of Human Perception in Visual Image Interpretation in Complex Urban Environments. IEEE J. Sel. Top. Appl. Earth Observ. Remote Sens. 2020, 13, 4229–4241. [Google Scholar] [CrossRef]
- da Silva, V.S.; Salami, G.; da Silva, M.I.O.; Silva, E.A.; Monteiro Junior, J.J.; Alba, E. Methodological evaluation of vegetation indexes in land use and land cover (LULC) classification. Geol. Ecol. Landsc. 2019, 4, 159–169. [Google Scholar] [CrossRef]
- Zheng, Y.; Tang, L.; Wang, H. An improved approach for monitoring urban built-up areas by combining NPP-VIIRS nighttime light, NDVI, NDWI, and NDBI. J. Clean. Prod. 2021, 328, 129488. [Google Scholar] [CrossRef]
- Xie, C.; Wang, J.; Haase, D.; Wellmann, T.; Lausch, A. Measuring spatio-temporal heterogeneity and interior characteristics of green spaces in urban neighborhoods: A new approach using gray level co-occurrence matrix. Sci. Total Environ. 2023, 855, 158608. [Google Scholar] [CrossRef]
- Hall-Beyer, M. Practical guidelines for choosing GLCM textures to use in landscape classification tasks over a range of moderate spatial scales. Int. J. Remote Sens. 2017, 38, 1312–1338. [Google Scholar] [CrossRef]
- Park, Y.; Guldmann, J.-M. Measuring continuous landscape patterns with Gray-Level Co-Occurrence Matrix (GLCM) indices: An alternative to patch metrics? Ecol. Indic. 2020, 109, 105802. [Google Scholar] [CrossRef]
- Jafarzadeh, H.; Mahdianpari, M.; Gill, E.; Mohammadimanesh, F.; Homayouni, S. Bagging and Boosting Ensemble Classifiers for Classification of Multispectral, Hyperspectral and PolSAR Data: A Comparative Evaluation. Remote Sens. 2021, 13, 4405. [Google Scholar] [CrossRef]
- Georganos, S.; Grippa, T.; Vanhuysse, S.; Lennert, M.; Shimoni, M.; Wolff, E. Very High Resolution Object-Based Land Use–Land Cover Urban Classification Using Extreme Gradient Boosting. IEEE Geosci. Remote Sens. Lett. 2018, 15, 607–611. [Google Scholar] [CrossRef]
- Ramezan, C.A.; Warner, T.A.; Maxwell, A.E. Evaluation of Sampling and Cross-Validation Tuning Strategies for Regional-Scale Machine Learning Classification. Remote Sens. 2019, 11, 185. [Google Scholar] [CrossRef]
- Guo, R.; Yang, B.; Guo, Y.; Li, H.; Li, Z.; Zhou, B.; Hong, B.; Wang, F. Machine learning-based prediction of outdoor thermal comfort: Combining Bayesian optimization and the SHAP model. Build. Sci. 2024, 254, 111301. [Google Scholar] [CrossRef]
- Antonini, A.S.; Tanzola, J.; Asiain, L.; Ferracutti, G.R.; Castro, S.M.; Bjerg, E.A.; Ganuza, M.L. Machine Learning model interpretability using SHAP values: Application to Igneous Rock Classification task. Appl. Comput. Geosci. 2024, 23, 100178. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Erion, G.; Chen, H.; DeGrave, A.; Prutkin, J.M.; Nair, B.; Katz, R.; Himmelfarb, J.; Bansal, N.; Lee, S.-I. From local explanations to global understanding with explainable AI for trees. Nat. Mach. Intell. 2020, 2, 56–67. [Google Scholar] [CrossRef]
- Bengio, Y.; Courville, A.; Vincent, P. Representation Learning: A Review and New Perspectives. IEEE Trans. Pattern Anal. Mach. Intell. 2013, 35, 1798–1828. [Google Scholar] [CrossRef]
- Zhang, J.; Li, P.; Wang, J. Urban Built-Up Area Extraction from Landsat TM/ETM+ Images Using Spectral Information and Multivariate Texture. Remote Sens. 2014, 6, 7339–7359. [Google Scholar] [CrossRef]
- Kumari, S.; Lal, P.; Kumar, A. Spatial heterogeneity for urban built-up footprint and its characterization using microwave remote sensing. Adv. Space Res. 2022, 70, 3822–3832. [Google Scholar] [CrossRef]
- Kebede, T.A.; Hailu, B.T.; Suryabhagavan, K.V. Evaluation of spectral built-up indices for impervious surface extraction using Sentinel-2A MSI imageries: A case of Addis Ababa city, Ethiopia. Environ. Chall. 2022, 8, 100568. [Google Scholar] [CrossRef]
- Alvarez, C.I.; Ulloa Vaca, C.A.; Echeverria Llumipanta, N.A. Machine Learning for Urban Air Quality Prediction Using Google AlphaEarth Foundations Satellite Embeddings: A Case Study of Quito, Ecuador. Remote Sens. 2025, 17, 3472. [Google Scholar] [CrossRef]
- Huang, S.; Tang, L.; Hupy, J.P.; Wang, Y.; Shao, G. A commentary review on the use of normalized difference vegetation index (NDVI) in the era of popular remote sensing. J. For. Res. 2020, 32, 1–6. [Google Scholar] [CrossRef]
- Ridd, M.K. Exploring a V-I-S (vegetation-impervious surface-soil) model for urban ecosystem analysis through remote sensing: Comparative anatomy for cities†. Int. J. Remote Sens. 2007, 16, 2165–2185. [Google Scholar] [CrossRef]
- Franke, J.; Roberts, D.A.; Halligan, K.; Menz, G. Hierarchical Multiple Endmember Spectral Mixture Analysis (MESMA) of hyperspectral imagery for urban environments. Remote Sens. Environ. 2009, 113, 1712–1723. [Google Scholar] [CrossRef]
- Guindon, B.; Zhang, Y.; Dillabaugh, C. Landsat urban mapping based on a combined spectral–spatial methodology. Remote Sens. Environ. 2004, 92, 218–232. [Google Scholar] [CrossRef]
- Rashed, T.; Weeks, J.R.; Roberts, D.; Rogan, J.; Powell, R. Measuring the Physical Composition of Urban Morphology Using Multiple Endmember Spectral Mixture Models. Photogramm. Eng. Remote Sens. 2003, 69, 1011–1020. [Google Scholar] [CrossRef]
- De Luca, G.; Pancorbo, J.L.; Carotenuto, F.; Gioli, B.; Modica, G.; Genesio, L. PRISMA imaging for land covers and surface materials composition in urban and rural areas adopting multiple endmember spectral mixture analysis (MESMA). ISPRS-J. Photogramm. Remote Sens. 2025, 225, 196–220. [Google Scholar] [CrossRef]
- Small, C.; Sousa, D. Spectral Characteristics of the Dynamic World Land Cover Classification. Remote Sens. 2023, 15, 575. [Google Scholar] [CrossRef]
- Gao, X.; Huete, A.R.; Ni, W.; Miura, T. Optical–Biophysical Relationships of Vegetation Spectra without Background Contamination. Remote Sens. Environ. 2000, 74, 609–620. [Google Scholar] [CrossRef]
- Strobl, C.; Boulesteix, A.-L.; Zeileis, A.; Hothorn, T. Bias in random forest variable importance measures: Illustrations, sources and a solution. BMC Bioinform. 2007, 8, 25. [Google Scholar] [CrossRef]
- Darst, B.F.; Malecki, K.C.; Engelman, C.D. Using recursive feature elimination in random forest to account for correlated variables in high dimensional data. BMC Genet. 2018, 19, 65. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. XGBoos: A Scalable Tree Boosting System. arXiv 2016. [Google Scholar] [CrossRef]
- Lundberg, S.; Lee, S.-I. A Unified Approach to Interpreting Model Predictions. arXiv 2017. [Google Scholar] [CrossRef]
- Abdi, A.M. Land cover and land use classification performance of machine learning algorithms in a boreal landscape using Sentinel-2 data. GISci. Remote Sens. 2019, 57, 1–20. [Google Scholar] [CrossRef]
- Gregorutti, B.; Michel, B.; Saint-Pierre, P. Correlation and variable importance in random forests. Stat. Comput. 2016, 27, 659–678. [Google Scholar] [CrossRef]
- Ghazizade-Fard, M.; Koupaie, E.H. Anaerobic co-digestion of wastewater sludge and food waste: A machine learning approach to process modeling and optimization. J. Environ. Manage. 2025, 393, 126985. [Google Scholar] [CrossRef]
- Zhu, X.X.; Tuia, D.; Mou, L.; Xia, G.-S.; Zhang, L.; Xu, F.; Fraundorfer, F. Deep Learning in Remote Sensing: A Comprehensive Review and List of Resources. IEEE Geosci. Remote Sens. Mag. 2017, 5, 8–36. [Google Scholar] [CrossRef]
- Reichstein, M.; Camps-Valls, G.; Stevens, B.; Jung, M.; Denzler, J.; Carvalhais, N.; Prabhat, F. Deep learning and process understanding for data-driven Earth system science. Nature 2019, 566, 195–204. [Google Scholar] [CrossRef]
- Lei, C.; Wagner, P.D.; Fohrer, N. Identifying the most important spatially distributed variables for explaining land use patterns in a rural lowland catchment in Germany. J. Geogr. Sci. 2019, 29, 1788–1806. [Google Scholar] [CrossRef]
- Chen, H.; Yang, L.; Wu, Q. Enhancing Land Cover Mapping and Monitoring: An Interactive and Explainable Machine Learning Approach Using Google Earth Engine. Remote Sens. 2023, 15, 4585. [Google Scholar] [CrossRef]
- Wu, W.; Xia, Y.; Jin, W. Predicting Bus Passenger Flow and Prioritizing Influential Factors Using Multi-Source Data: Scaled Stacking Gradient Boosting Decision Trees. IEEE Trans. Intell. Transp. Syst. 2021, 22, 2510–2523. [Google Scholar] [CrossRef]
- Hughes, G. On the mean accuracy of statistical pattern recognizers. IEEE Trans. Inf. Theory 1968, 14, 55–63. [Google Scholar] [CrossRef]
- Miller, T. Explanation in artificial intelligence: Insights from the social sciences. Artif. Intell. 2019, 267, 1–38. [Google Scholar] [CrossRef]








| Classes | Road Descriptions | Road Widths (m) |
|---|---|---|
| Level 1 | Motorway, motorway_link, primary, primary_link, trunk, trunk_link, rail | 40 |
| Level 2 | Secondary, secondary_link | 20 |
| Level 3 | Tertiary, tertiary_link, residential, service, unclassified, unknown, other | 10 |
| Classification | Main Subcategories | Description |
|---|---|---|
| Institution | Governmental organization; Medical service; Culture and Education. | This class primarily includes land for public administration, public services, and education. |
| Open space | Tourist attraction | This category encompasses public green spaces and other open areas intended for recreation, ecological functions, or public access. |
| Business | Enterprises; Shopping; Accommodation service | This class represents areas dedicated to commercial activities, financial services, corporate functions, and some light industry. |
| Residence | Commercial house; Daily life service | This category includes land primarily used for housing. |
| Feature Types | Indices | Description |
|---|---|---|
| Spectral features | Normalized Difference Vegetation Index (NDVI) | Indicates the presence and density of green vegetation |
| Enhanced Vegetation Index (EVI) | Quantifies vegetation with reduced sensitivity to soil and atmospheric noise | |
| Soil-Adjusted Vegetation Index (SAVI) | Minimizes soil brightness effects to accurately map sparse vegetation | |
| Normalized Difference Built-up Index (NDBI) | Highlights impervious surfaces and built-up areas | |
| Normalized Difference Water Index (NBVI) | Delineates open water bodies and aids in identifying building shadows | |
| Textural features | Entropy | Measures the disorder or complexity of the spatial texture |
| Contrast | Captures the intensity of local variations and edges | |
| Angular Second Moment (ASM) | Measures textural uniformity and order | |
| Inverse Difference Moment (IDM) | Quantifies the local homogeneity of the image texture | |
| Dissimilarity (DISS) | Describes the distinctiveness of spatial patterns and local contrast |
| Evaluation Metrics | RF | XGBoost | |
|---|---|---|---|
| OA | 81.87% ± 1.72% | 85.00% ± 2.24% | |
| Macro F1 | 81.72% ± 1.73% | 84.81% ± 2.32% | |
| Kappa | 0.7583 ± 0.0230 | 0.8000 ± 0.0298 | |
| UA | Institution | 69.00% ± 4.36% | 74.50% ± 5.10% |
| Open space | 98.00% ± 1.87% | 99.00% ± 1.22% | |
| Business | 84.50% ± 6.60% | 87.00% ± 7.81% | |
| Residence | 76.00% ± 7.84% | 79.50% ± 8.72% | |
| PA | Institution | 72.95% ± 6.08% | 80.02% ± 4.21% |
| Open space | 96.15% ± 2.84% | 96.18% ± 2.42% | |
| Business | 81.97% ± 5.68% | 84.11% ± 4.50% | |
| Residence | 76.89% ± 1.20% | 80.32% ± 5.65% | |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2026 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license.
Share and Cite
Zheng, Y.; Huang, X.; Yao, H. Towards Trustworthy Urban Land Use Classification: A Synergistic Fusion of Deep Learning and Explainable Machine Learning with a Nanning Case Study. Land 2026, 15, 158. https://doi.org/10.3390/land15010158
Zheng Y, Huang X, Yao H. Towards Trustworthy Urban Land Use Classification: A Synergistic Fusion of Deep Learning and Explainable Machine Learning with a Nanning Case Study. Land. 2026; 15(1):158. https://doi.org/10.3390/land15010158
Chicago/Turabian StyleZheng, Yusheng, Xinying Huang, and Huanmei Yao. 2026. "Towards Trustworthy Urban Land Use Classification: A Synergistic Fusion of Deep Learning and Explainable Machine Learning with a Nanning Case Study" Land 15, no. 1: 158. https://doi.org/10.3390/land15010158
APA StyleZheng, Y., Huang, X., & Yao, H. (2026). Towards Trustworthy Urban Land Use Classification: A Synergistic Fusion of Deep Learning and Explainable Machine Learning with a Nanning Case Study. Land, 15(1), 158. https://doi.org/10.3390/land15010158

