Next Article in Journal
Analysis on Durability of Bentonite Slurry–Steel Slag Foam Concrete Under Wet–Dry Cycles
Previous Article in Journal
Risk Prioritization of RC Buildings in Bitlis (Türkiye) in the Light of the 2023 Kahramanmaraş Earthquakes
Previous Article in Special Issue
The Correlation Between Crime Frequency and Urban Spatial Hierarchy in Busan
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Review

Machine Learning in Land Use Prediction: A Comprehensive Review of Performance, Challenges, and Planning Applications

1
State Key Laboratory for Ecological Security of Regions and Cities, Institute of Urban Environment, Chinese Academy of Sciences, Xiamen 361021, China
2
College of Harbour and Environmental Engineering, Jimei University, Xiamen 361021, China
3
School of Environment, Hangzhou Institute for Advanced Study, University of Chinese Academy of Sciences, Hangzhou 310024, China
*
Author to whom correspondence should be addressed.
Buildings 2025, 15(19), 3551; https://doi.org/10.3390/buildings15193551
Submission received: 27 August 2025 / Revised: 19 September 2025 / Accepted: 28 September 2025 / Published: 2 October 2025
(This article belongs to the Special Issue Advances in Urban Planning and Design for Urban Safety and Operations)

Abstract

The accelerated global urbanization process has positioned land use/land cover change modeling as a critical component of contemporary geographic science and urban planning research. Traditional approaches face substantial challenges when addressing urban system complexity, multiscale spatial interactions, and high-dimensional data associations, creating urgent demand for sophisticated analytical frameworks. This review comprehensively evaluates machine learning applications in land use prediction through systematic analysis of 74 publications spanning 2020–2024, establishing a taxonomic framework distinguishing traditional machine learning, deep learning, and hybrid methodologies. The review contributes a comprehensive methodological assessment identifying algorithmic evolution patterns and performance benchmarks across diverse geographic contexts. Traditional methods demonstrate sustained reliability, while deep learning architectures excel in complex pattern recognition. Most significantly, hybrid methodologies have emerged as the dominant paradigm through algorithmic complementarity, consistently outperforming single-algorithm implementations. However, contemporary applications face critical constraints including computational complexity, scalability limitations, and interpretability issues impeding practical adoption. This review advances the field by synthesizing fragmented knowledge into a coherent framework and identifying research trajectories toward integrated intelligent systems with explainable artificial intelligence.

1. Introduction

The accelerated global urbanization process has positioned land use/land cover change (LULC) modeling [1] as a critical component of contemporary geographic science and urban planning research [2]. Traditional LULC modeling approaches encounter substantial challenges when addressing the inherent complexity of urban systems [3,4], multiscale spatial interactions [5,6], and high-dimensional data associations, creating urgent demand for more sophisticated analytical frameworks [7,8]. Contemporary research indicates that machine learning technologies, through intelligent algorithm design and automated pattern recognition, provide unprecedented capabilities for solving complex urban applications across multiple scales, from land use coverage analysis to building environment configuration and architectural design optimization [9,10].
Land use prediction methodologies have evolved through distinct phases reflecting technological advancement and computational capabilities. Early applications relied on traditional statistical approaches and cellular automata models that established foundational frameworks for spatial simulation. The emergence of machine learning introduced sophisticated algorithmic capabilities, with traditional methods including random forest algorithms achieving overall accuracy exceeding 90% and support vector machine approaches demonstrating superior performance in spatial prediction tasks [11,12]. Advanced neural network implementations marked a significant methodological transition, with multilayer perceptron models achieving exceptional performance, significantly outperforming conventional approaches [13]. The deep learning revolution introduced unprecedented capabilities through convolutional-recurrent neural network integration, achieving remarkable performance metrics (overall accuracy of 99.18% and Kappa coefficient of 0.88) [14], while long short-term memory networks demonstrated superior capability in capturing temporal dependencies for urban expansion modeling [15]. Contemporary applications have progressed toward hybrid methodologies that systematically integrate multiple algorithmic components, with ensemble approaches utilizing random forest and Markov chain models demonstrating reliable performance [16,17] and advanced integration strategies, achieving substantial performance gains through comprehensive utilization of spatial and temporal information [18,19].
Machine learning applications in land use prediction have evolved significantly from conventional statistical approaches toward integrated intelligent systems. Random forest algorithms consistently achieve overall accuracy exceeding 90% with a Kappa coefficient around 0.88 in metropolitan applications [11], while support vector machine approaches demonstrate superior performance in spatial prediction tasks [12]. Advanced neural network implementations, including multilayer perceptron models, achieve exceptional performance with a Kappa coefficient of 0.9025, significantly outperforming traditional approaches in comparative assessments [13]. Deep learning technologies have introduced methodological innovations, with convolutional-recurrent neural network integration achieving exceptional performance metrics, including an overall accuracy of 99.18% and a Kappa coefficient of 0.88 [14]. Long short-term memory networks demonstrate superior capability in capturing temporal dependencies for urban expansion modeling [15], while cellular automata-artificial neural network integration achieves substantial accuracy improvements over traditional methods [20,21]. Ensemble approaches utilizing random forest and Markov chain models demonstrate reliable performance in understanding metropolitan urban growth trajectories [16,17].
Contemporary applications demonstrate increasing sophistication in addressing multidimensional urban complexity through algorithmic integration and optimization techniques [22,23]. Hybrid methodologies consistently outperform single-algorithm implementations, with advanced integration strategies achieving substantial performance gains through comprehensive utilization of spatial and temporal information [18,19]. Vector-based cellular automata enhanced with convolutional neural networks achieve superior performance in urban land use change simulation [24], while temporal-enhanced approaches achieve precision improvements through fine spatial granularity utilization [25]. Deep learning architectures, including U-Net, enable accurate urban land use prediction with minimal parameterization requirements [26], and patch-generating land use simulation models demonstrate effectiveness in multi-scenario urban growth assessment [27,28]. Optimization algorithms integrated with cellular automata provide enhanced parameter optimization capabilities [29], while artificial intelligence applications demonstrate promising potential for pioneering urban spatial planning through synergistic integration approaches [30,31].
Three-dimensional modeling extensions address both horizontal expansion and vertical development complexity, representing significant methodological frontiers [32]. Enhanced simulation models achieve comprehensive 3D urban dynamics modeling under shared socioeconomic pathways [33] while building height prediction approaches demonstrate robust applicability for complex urban planning scenarios [34,35]. Large-scale applications demonstrate the capability to process comprehensive parcel-level datasets while maintaining robust performance across different spatial and temporal scales [36,37]. Global applications processing over 10,000 urban settlements demonstrate the feasibility of consistent boundary delineation using automated remote sensing approaches [38]. Cross-regional transferability studies indicate that advanced models can maintain performance across different geographic contexts [39,40]. Specialized modeling frameworks address diverse urban challenges, including carbon storage dynamics assessment [41], ecosystem service value prediction [42,43], and integrated disaster risk evaluation [44,45]. Advanced applications encompass flood risk assessment, achieving high accuracy in urban contexts [46,47], and a comprehensive landscape ecological risk assessment using driving mechanisms [48,49].
Machine learning applications in land use prediction have demonstrated substantial maturity and technical sophistication across diverse methodological implementations and geographic contexts. The field has evolved from experimental studies to operational applications, achieving consistently high-performance standards, with sophisticated approaches enabling comprehensive analysis of complex urban systems while maintaining robust performance [50,51]. Advanced modeling frameworks incorporate multi-objective optimization and uncertainty analysis, with integrated approaches combining multiple analytical capabilities, achieving superior performance in environmental assessment applications [52]. Specialized applications have expanded to address critical sustainability challenges, including carbon emission prediction for administrative regions [53], and a comprehensive urban thermal field assessment [54,55]. Global-scale validation across 11,581 cities in 61 countries confirms consistent urban scaling relationships, with remote sensing providing uniform standards for cross-regional comparisons [56]. International applications demonstrate broad geographic applicability, with successful implementations across diverse contexts, including European land cover change simulation [40], and developing economy applications, achieving substantial accuracy improvements [57,58]. These technological advances indicate that machine learning applications have transitioned from research innovations to essential tools for evidence-based urban planning, supporting sustainable development through comprehensive scenario analysis and adaptive management frameworks [59,60].
Despite substantial technical progress, contemporary applications face multifaceted constraints, including algorithm complexity trade-offs, scalability limitations, cross-regional transferability challenges, and model interpretability issues that limit widespread adoption in planning practice. Implementation challenges include varying complexity-accuracy trade-offs where advanced methods require larger computational resources compared to traditional approaches [61]. Model interpretability requirements for planning applications necessitate a balance between technical advancement and practical applicability [59,62]. This review addresses these challenges through comprehensive analysis of 74 research publications spanning 2020–2024, with three main objectives: (1) to characterize current applications, technical characteristics, and development prospects of machine learning technologies in land use prediction; (2) to elucidate implementation barriers including technical constraints, data quality issues, and model interpretability problems; (3) to identify future development trajectories toward integrated intelligent systems while providing recommendations for technology transfer, capacity building, and institutional adaptation in planning practice.

2. Research Methodology

2.1. Literature Search and Selection

This systematic review was conducted following the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA, Detailed information can be found in the Appendix A) 2020 guidelines. The review protocol was not registered. The analysis examined machine learning applications in land use prediction through a comprehensive examination of 74 research publications spanning 2020–2024. The final results were independently obtained by the three authors (Figure 1). Literature search was conducted using the Web of Science Core Collection database with the following search strategy:
TS = ((“machine learning” OR “deep learning” OR “artificial intelligence” OR “random forest” OR “neural network” OR “support vector machine” OR “gradient boosting” OR “XGBoost” OR “decision tree”) AND (“land use change prediction” OR “land use simulation” OR “land use modeling” OR “land use forecasting” OR “urban growth prediction” OR “urban growth modeling” OR “spatial prediction” OR “LUCC prediction” OR “cellular automata” OR “CA-Markov” OR “PLUS model” OR “CLUE-S” OR “FLUS” OR “urban expansion modeling”) AND (“spatial planning” OR “urban planning” OR “city planning” OR “urban development”)) AND PY = (2020–2024) AND LA = (English) AND DT = (Article OR Review).
Study selection followed a two-stage screening process. Three reviewers (T.S., T.L., and J.L.) independently screened titles and abstracts, followed by full-text assessment of potentially eligible articles. Disagreements were resolved through discussion, with final decisions made by the lead author (C.L.) when consensus could not be reached.
Inclusion Criteria: (1) Studies focusing on land use/land cover change prediction, urban growth modeling, or spatial land use simulation; (2) Studies investigating land use change prediction or urban expansion forecasting using quantitative approaches; (3) Studies employing machine learning methods either standalone or integrated with spatial models; (4) Studies addressing spatial planning applications or urban development planning; (5) Peer-reviewed articles and reviews published between 2020 and 2024 in English.
Exclusion Criteria: (1) Studies employing only traditional statistical methods without machine learning approaches; (2) Studies focusing primarily on land use classification rather than prediction; (3) Studies using machine learning solely for data preprocessing; (4) Studies lacking quantitative validation or accuracy assessment; (5) Studies with insufficient methodological details; (6) Studies where land use prediction was not the primary objective.

2.2. Data Extraction

A standardized data extraction form was developed to systematically collect information from included studies. Data extraction was conducted by one reviewer (C.L.) and independently verified by three additional reviewers (T.S., T.L., and J.L.). Extracted variables included study characteristics (geographical location, temporal scope, and data sources), methodological details (algorithm types, model parameters, and validation approaches), and performance outcomes (accuracy metrics, computational efficiency indicators). Discrepancies were resolved through consensus discussion.
Given the methodological diversity of included studies and the absence of standardized assessment tools for machine learning applications in spatial modeling, formal risk of bias evaluation was not conducted. Study quality was assessed based on methodological rigor, validation procedures, and reporting transparency.

2.3. Methodological Classification Framework

To address the complexity and diversity of machine learning approaches in land use prediction, this review establishes a systematic classification framework based on technological maturity and implementation complexity. The framework categorizes methodologies into three distinct groups that reflect both algorithmic sophistication and practical deployment considerations.
Traditional Machine Learning Methods encompass established algorithmic foundations with proven institutional adoption patterns. This category includes classical statistical learning approaches (Random Forest and Support Vector Machine) and conventional neural network architectures implemented through standard training procedures (Multi-Layer Perceptron and Artificial Neural Networks). The defining characteristic involves reliance on well-established mathematical foundations with predictable implementation requirements and institutional familiarity, operating as standalone algorithms or with minimal computational enhancement. Deep Learning Architectures represent advanced neural network innovations characterized by sophisticated automatic feature learning capabilities and complex spatial-temporal pattern recognition. These methods leverage recent advances in neural network design, including specialized convolutional architecture (U-Net and ConvLSTM), and demonstrate state-of-the-art performance through architectural sophistication rather than conventional training paradigms. Hybrid and Ensemble Methodologies constitute systematic algorithmic integration that deliberately combines multiple distinct computational paradigms through explicit coordination mechanisms. These approaches include neural network-cellular automata coupling, multi-algorithm fusion frameworks, and ensemble methods that demonstrate emergent capabilities exceeding individual component performance through strategic multi-paradigm integration and comprehensive optimization frameworks.
Classification decisions prioritize the core algorithmic innovation over implementation details. Methods involving systematic integration of multiple distinct algorithmic paradigms are categorized as hybrid methodologies regardless of individual component complexity, while computational enhancements of established algorithms remain within their base category when the fundamental algorithmic approach is unchanged.

3. Results

3.1. Current Status and Characteristics of ML Applications in Land Use Prediction

A total of 74 studies met the inclusion criteria and were included in this review (Figure 1). The included studies demonstrated diverse geographical coverage and methodological approaches in machine learning applications for land use prediction.

3.1.1. Application Distribution and Method Selection Patterns

Figure 2 presents the global distribution of machine learning applications in land use prediction based on our analysis of 74 studies, revealing significant regional concentrations and methodological preferences across different continents. Contemporary machine learning applications demonstrate clear evolutionary trajectories across three distinct methodological categories, each exhibiting unique performance characteristics and application preferences. Traditional machine learning methods maintain substantial adoption through proven reliability and interpretability, with random forest algorithms consistently achieving overall accuracy exceeding 90% and a Kappa coefficient around 0.88 in metropolitan-scale applications [11]. The Random Forest model demonstrates superior performance with AUC-ROC values of 99.5%, Kappa of 98%, and accuracy of 90% in landslide susceptibility mapping applications, significantly outperforming other approaches, including decision tree (93% AUC), XGBoost (98% AUC), and k-nearest neighbor (91% AUC) methods [46]. Support vector machine applications achieve mixture discriminant analysis performance with 83.2% AUC, followed by random forest (80.6%), boosted regression tree (78.0%), and multivariate adaptive regression spline (75.5%) in groundwater spring potential mapping [12].
Deep learning architecture represents the most sophisticated category, demonstrating exceptional capabilities in complex pattern recognition and multiscale analysis. Convolutional-recurrent neural network integration achieves remarkable performance metrics, including 99.18% overall accuracy, a Kappa coefficient of 0.88, and a figure of merit of 0.13, with improvements attributed to multiscale neighborhood information powered by U-Net and time series information of historical urban expansion uncovered by LSTM [14]. The Beijing–Tianjin–Hebei application demonstrates urban land projected to peak at 8736–9155 km2 during 2039–2043, representing increases in the range of 10.99–16.31% compared to 2020 levels. Three-dimensional modeling innovations achieve Figure of Merit values ranging from 0.21 to 0.35 for horizontal expansion, overall accuracy values of 83% for refinement of urban functional types, and root mean squared error values of 5–7 m for built-up height simulations across Beijing–Tianjin–Hebei, Yangtze River Delta, and Pearl River Delta regions [33].
Hybrid and ensemble methodologies have emerged as the dominant paradigm through algorithmic complementarity that systematically addresses limitations of individual approaches while amplifying respective strengths. The PST-CA model integrates self-organizing map partitioning with spatiotemporal 3D convolutional neural networks, achieving overall accuracy improvements of 4.66–6.41% compared to four traditional models, including logistic regression-CA, support vector machine-CA, random forest-CA, and artificial neural network-CA [19]. Advanced integration strategies demonstrate substantial performance gains, with K-means-CNN-LSTM-CA approaches improving FoM index by 9.86–19.43% compared to traditional logistic regression-CA and artificial neural network-CA models, while temporal dependency consideration increases FoM index by 0.98–3.51% and spatial heterogeneity consideration contributes 1.08–5.15% improvements [18]. The Temporal-VCA framework achieves precision improvements up to 22.12% through comprehensive utilization of fine spatial and temporal granularity information of cadastral plot temporal data, outperforming regular VCA models and traditional raster CA models [25].

3.1.2. Technical Integration and Innovation Strategies

Technical integration strategies demonstrate increasing sophistication in addressing multidimensional urban complexity through intelligent algorithm fusion and adaptive optimization frameworks. Spatial partitioning methodologies employ advanced clustering techniques where a self-organizing map, coupled with hierarchical clustering and patch generation land use simulation models, automatically determine optimal partition schemes, achieving FoM improvements compared to traditional approaches [63]. The approach addresses spatial heterogeneity by dividing study areas into homogeneous sub-regions using machine learning-based partitioning strategies, with spatiotemporal 3D convolutional neural networks extracting spatiotemporal neighborhood features for enhanced land use change simulation.
Optimization techniques have evolved beyond traditional parameter tuning toward intelligent algorithm fusion with real-time adaptation capabilities. The artificial fish swarm algorithm integrated with cellular automata demonstrates superior performance for optimizing parameters of variables in urban growth models, outperforming genetic algorithm-cellular automata and other competing models [29]. Dynamic spatiotemporal rolling prediction models achieve an average adjusted R2 of 0.89 through integrated historical trends, neighborhood status, and spatial proximity analysis, representing paradigm shifts from static macroscopic to dynamic microscopic prediction approaches that offer valuable insights for future urban development and planning decisions [50].
Three-dimensional modeling innovations represent a significant methodological advancement through vertical urban development integration with traditional horizontal expansion analysis. The enhanced Future Land Use Simulation model in a 3D version simulates continuous 3D dynamics of real-world urban development, with the distinctive characteristic of concurrently updating 3D information of developed land grids during simulation processes [33]. Building height prediction utilizes random forest regression algorithms to predict building height growth in the vertical direction, with crucial factors affecting building heights, and simulation results of future urban 3D expansion hotspot areas providing scientific support for urban spatial planning decisions [34]. The R2 was 0.8628, RMSE was 2.0180, MAE was 1.3707, and EV was 0.8632 for building height prediction accuracy in Shenzhen applications.

3.2. ML Performance in Land Use Change Driving Mechanism Identification

3.2.1. Driving Factor Identification Capability Assessment

Machine learning algorithms demonstrate exceptional capabilities in identifying and quantifying complex driving mechanisms underlying land use change processes through advanced feature importance analysis and comprehensive relationship discovery techniques. Margin-based random forest approaches prove more reliable and sensitive than traditional importance measures when detecting driving mechanisms behind land use change, with importance values and ranking orders remaining stable regardless of similarity measure selection [64]. The methodology successfully identifies core driving factors of urban land use change using multitemporal global land cover products and point-of-interest data, revealing that topographic conditions persistently affect urban land use change while transportation factors gradually become the most important human driving factors.
Advanced ensemble learning frameworks enhance driving factor analysis through integrated importance ranking and cross-validation approaches across multiple algorithmic components. Random forest algorithms achieve the best prediction accuracy among selected machine learning methods for regional-level urban growth prediction, with the methodology demonstrating high prediction accuracy that confirms most variability of urban growth can be described by past observations of self and neighboring changes [65]. The role of random forest and Markov chain models in understanding metropolitan urban growth trajectory reveals significant shifts in land use and land cover influenced largely by socioeconomic factors, with overall accuracy exceeding 90% and a Kappa coefficient around 0.88 [11].
Deep learning architecture enables the discovery of latent driving relationships previously undetectable through conventional statistical approaches, demonstrating particular strength in processing unstructured spatial data and complex geographic patterns. Convolutional neural networks integrated with vector-based cellular automata extract high-level features of driving factors within neighborhoods of irregularly shaped cells, discovering relationships between multiple land use changes and driving factors at the neighborhood level while obtaining more details on morphological characteristics of land parcels [24]. Deep forest algorithms integrated with vector-based cellular automata achieve best simulation performance at parcel-level (Figure of Merit = 39.88%), pattern-level (similarity = 96.47%), and community-level (correlation coefficient = 0.9269) through capacity to directly mine high-level features from unstructured data [66].

3.2.2. Complex Relationship Modeling and Discovery

Deep learning architecture excels in modeling nonlinear relationships and complex interaction patterns through sophisticated feature extraction and temporal dependency analysis frameworks. Integrated convolutional neural networks and recurrent neural networks capture multiscale neighborhood information powered by U-Net and time series information of historical urban expansion uncovered by LSTM, with improvements attributed to these complementary capabilities, enabling simultaneous simulation of edge expansion and leapfrog expansion [14]. U-Net deep learning algorithms automatically learn complex urban development patterns and processes, such as neighborhood influence, gravity effects of large cities, and the tendency for linear development, without requiring explicit parameterization or forcing data [26].
Sophisticated hybrid modeling approaches enable comprehensive relationship discovery through algorithmic complementarity and multidimensional analysis frameworks. The coupling of cellular automata with area partitioning and spatiotemporal convolution extracts spatiotemporal neighborhood features using a machine learning-based partitioning strategy and spatiotemporal 3D convolutional neural network, achieving overall accuracy improvements of 4.66–6.41% compared to traditional models [19]. K-means-CNN-LSTM-CA models demonstrate that considering temporal dependencies increases FoM index by 0.98–3.51%, while considering spatial heterogeneity increases FoM index by 1.08–5.15%, with comprehensive integration improving FoM index by 9.86–19.43% compared to traditional approaches [18].
Advanced relationship discovery techniques utilize generative approaches and attention mechanisms to uncover complex urban transformation patterns. GAN-based land use and land cover change prediction models using multiscale local spatial information achieve the highest accuracy in both short-time interval and long-time interval scenarios, with results closest to ground truth from a landscape pattern perspective [67]. Temporal vector cellular automata frameworks utilizing coupled temporal data and vector cellular automata achieve precision improvements up to 22.12%, outperforming regular VCA models and traditional raster CA models while revealing complex nonlinear temporal patterns within cadastral-scale urban development processes [25].

3.3. ML Prediction Performance and Method Applicability Assessment

Contemporary machine learning applications demonstrate distinct performance hierarchies across methodological categories, with comprehensive comparative assessments revealing clear accuracy-complexity trade-offs (Table 1). Traditional approaches provide reliable baseline performance, with Random Forest achieving consistent metropolitan applications (OA: 90.89–91.19%, Kappa: 0.87–0.88) and multilayer perceptron demonstrating superior comparative results (Kappa: 0.9025 vs. CA-MARKOV: 0.6941) [7,27]. Deep learning architectures achieve exceptional accuracy through sophisticated pattern recognition, with convolutional-recurrent integration reaching peak performance (OA: 99.18%, Kappa: 0.88, FoM: 0.13) and 3D modeling maintaining reliability across multiple metropolitan regions (OA: 83%, FoM: 0.21–0.35) [68,69]. Hybrid methodologies consistently outperform individual algorithms, with integrated approaches improving performance indices by 9.86–19.43% through systematic algorithmic complementarity [26].
Computational efficiency and practical applicability vary significantly across approaches, with critical trade-offs influencing implementation decisions. GPU acceleration enables substantial scalability improvements, achieving 16,000× spatial processing acceleration while maintaining 92% accuracy for large-scale parcel analysis [2]. However, complexity-performance analysis reveals that simpler approaches often provide better practical outcomes, with CA-Markov achieving a 97.20% Kappa coefficient compared to ConvLSTM’s 94.50% while requiring substantially lower computational resources [63]. Parameter-free approaches like U-Net algorithms enable accurate predictions with minimal forcing data requirements, offering practical solutions for resource-constrained applications [42]. Implementation success correlates strongly with methodological selection based on institutional capacity, data availability, and computational constraints rather than peak performance metrics alone, with mixed-cell approaches providing effective solutions for balancing simulation performance and computational intensity in large-scale applications [70].
Table 1. Algorithm Performance Comparison here.
Table 1. Algorithm Performance Comparison here.
CategoriesAlgorithm/
Method
Study AreaPerformance MetricsTechnical CapabilitiesImplementation Requirements
Traditional Machine Learning MethodsRandom
Forest [11]
IslamabadOA: 90.89–91.19%
Kappa: 0.87–0.88
  • Large-scale reliability
  • Stable performance over decades
  • Traditional ML limits
  • Slower processing
Multi-Layer Perceptron [13]KolkataOA: 92.78%
Kappa: 0.9025
vs. CA-MARKOV: 0.6941
vs. STCHOICE: 0.5392
  • Nonlinear mapping
  • Superior performance
  • Best performer among methods
  • Data requirements
  • Complex architecture
GPU-Accelerated ANN [36]FloridaOA: 92.0%
AUC: 0.902
FoM: 0.053
F1: 0.373
  • Massive scalability
  • 16,000× faster processing
  • GPU acceleration
  • Hardware needs
  • Technical expertise
Support
Vector
Machine [12]
Wadi Az-zarqaAUC: 80.2% (SVM)
MDA: 83.2% (best)
RF: 80.5%
BRT: 78.0%
  • High-dimensional data processing
  • Robust classification
  • Highest MDA accuracy
  • Parameter tuning
  • Computational cost
Deep Learning ArchitecturesCNN-RNN Integration (U-Net + LSTM) [14]BTHUAOA: 99.18%
Kappa: 0.8812
FoM: 0.1323
  • Multiscale feature
  • Temporal information
  • Highest accuracy combination
  • High computational cost
  • Large training data
3D-FLUS Model [33]BTH, YRD, PRDOA: 83%
Kappa: >0.77
FoM: 0.21–0.35
Height RMSE: 5–7 m
  • 3D simulation capability
  • First concurrent modeling
  • Horizontal–vertical integration
  • Complex calibration
  • High data needs
ConvLSTM [61]CasablancaKappa: 94.50%
vs. CA-Markov: 97.20%
vs. MLP-Markov: 89.40%
  • Advanced deep learning
  • Spatiotemporal feature
  • Regional transferability
  • Higher complexity
  • Large data volumes
U-Net Architecture [26]ShenzhenOA: 91.0% (median)
AUC: 0.81 (median)
FoM: 0.20
Hit rate: 0.33
  • Parameter-free approach
  • Minimal tuning requirements
  • Auto-learned patterns
  • Limited interpretability
  • Data quality dependent
Hybrid and Ensemble MethodologiesANN + CA-Markov [20]IrbidOA: 90.04%
vs. CA-MC: 86.29%
  • Integration benefit
  • Accuracy improvement
  • Performance enhancement
  • Implementation complexity
  • Training needs
CNN + Vector CA [24]ShenzhenFoM: 0.361
  • Vector representation
  • Irregular neighborhoods
  • High-level features
  • Implementation complexity
  • Limited metrics
PST-CA (SOM + 3D CNN) [19]ShanghaiOA: +4.66–6.41%
vs. 4 traditional models
  • Spatial partitioning
  • Temporal extraction
  • SOM partitioning strategy
  • Complex implementation
  • Parameter sensitive
K-means-CNN-LSTM-CA [18]HangzhouFoM: +9.86–19.43%
Temporal: +0.98–3.51%
Spatial: +1.08–5.15%
  • Multi-algorithm fusion
  • Highest accuracy gains
  • Comprehensive integration
  • High complexity
  • Coordination challenges
Temporal-VCA Framework [25]ShenzhenPrecision: +22.12%
vs. regular VCA & raster CA
  • Cadastral precision
  • Fine temporal patterns
  • Comprehensive utilization
  • Data requirements
  • Limited availability
Random Forest + CA [17]ShanghaiOA: 94.88%
Kappa: 0.8772
  • Scenario modeling
  • High consistency
  • Pattern simulation
  • RF limitations
  • Pattern aggregation
LSTM-CA Integration [71]LanzhouOA: 91.01%
  • Long-term effects
  • Temporal modeling
  • LSTM transition rules
  • Data requirements
  • Training complexity
CA(GBDT) Ensemble [72]YRDOA: >89%
FoM: >27%
  • Ensemble strength
  • Nonlinear patterns
  • Gradient boosting integration
  • Model complexity
  • Interpretability issues
Deep Forest-VCA [66]ShenzhenFoM: 39.88%
Similarity: 96.47%
Correlation: 0.9269
  • Multi-level performance
  • Deep ensemble
  • Best simulation performance
  • Computational intensity
  • Parameter tuning

3.4. Translation Mechanisms of ML Applications in Planning Decision Support

3.4.1. Decision Support Effectiveness and Planning Applications

Machine learning applications demonstrate substantial effectiveness in translating technical predictions into actionable planning insights through comprehensive scenario-based frameworks and policy-relevant modeling capabilities. Multi-scenario approaches enable systematic evaluation of alternative development strategies, with 3D future land use simulation models successfully applied to simulate future evolution of 3D urban dynamics under Shared Socioeconomic Pathways, effectively illustrating the influence of each SSP on 3D urban development [33]. The predicted urban land use shows planned development scenario projections of 242.10, 312.69, 363.80, and 400.72 km2 compared to unplanned development scenario projections of 242.91, 314.31, 366.23, and 403.98 km2 during 2021, 2031, 2041, and 2051 [73].
Decision support translation mechanisms demonstrate particular strength in addressing integrated urban challenge assessment through comprehensive technical frameworks. Contemporary applications across major urban regions demonstrate these capabilities through diverse case implementations. The Beijing–Tianjin–Hebei urban agglomeration exemplifies advanced applications through convolutional-recurrent neural network integration, achieving 99.18% overall accuracy and a Kappa coefficient of 0.88. Modeling results project urban land expansion to peak at 8736–9155 km2 during 2039–2043, representing increases of 10.99–16.31% compared to 2020 baseline levels [14]. Comparative framework applications for spatially explicit urban growth modeling successfully monitor urban land use efficiency and sustainable urban development, with multilayer perceptron neural network models enabling projection of urban expansion dynamics while providing a scientific basis for technical selection and method optimization [13]. Three-dimensional simulation models provide scientific support for decisions in urban spatial planning by effectively simulating horizontal expansion and vertical growth of real cities in 3D space, with crucial factors affecting building heights, and simulation results of future urban 3D expansion hotspot areas [34].
Temporal–spatial decision support capabilities enable dynamic planning adaptation through sophisticated rolling prediction frameworks. Multidimensional effects modeling using dynamic spatiotemporal rolling prediction models achieves an average adjusted R2 of 0.89, representing shifts from static macroscopic to dynamic microscopic prediction of urban land demand, offering valuable insights for future urban development and planning decisions [50]. Multi-scenario-based urban growth modeling provides promising guidelines for urban planners and conservation scientists to implement robust artificial intelligence-based hybrid geo-simulation models for compact, organized, and integrated land use-transportation development [73].

3.4.2. Implementation Challenges and Standardization Requirements

Implementation challenges persist despite technical advances, particularly regarding model interpretability, computational requirements, and institutional capacity limitations affecting widespread adoption. Deep learning approaches, while achieving superior prediction performance with 99.18% overall accuracy and a 0.88 Kappa coefficient, often function as complex systems requiring substantial computational resources and technical expertise [14]. Convolutional LSTM models demonstrate higher complexity due to the number of elementary operations and require larger data volumes as deep learning models, making them more demanding in terms of data volume compared to cellular automata–Markov approaches [61].
Data standardization and integration challenges significantly constrain machine learning adoption in planning contexts, requiring comprehensive datasets encompassing multiple temporal periods and diverse driving factors. Successful margin-based random forest implementations require systematic analysis of core driving factors while maintaining stability across different similarity measures, demanding extensive data collection and preprocessing capabilities [64]. Large-scale applications processing parcel-level land use changes demonstrate both analytical potential and computational demands, requiring substantial data infrastructure and processing capabilities for effective implementation [36].
Institutional integration requirements encompass technical capacity development, workflow adaptation, and stakeholder engagement frameworks necessary for effective machine learning implementation in planning practice. Successful translation requires addressing the balance between prediction accuracy and computational intensity, with mixed-cell cellular automata models offering practical solutions for large-scale applications while achieving balance between simulation performance and computational intensity [74]. Scalability assessment suggests that cellular automata–Markov-based methods show great potential for urban growth forecasting, especially for short-term forecasting when computational resources are limited, while sophisticated approaches require substantial infrastructure investment for effective implementation. The translation from research innovations to practical applications necessitates comprehensive capacity building programs, standardized implementation protocols, and institutional frameworks supporting cross-disciplinary collaboration between technical specialists and planning practitioners.

4. Discussion

4.1. Implementation Challenges and Performance Trade-Offs

Contemporary machine learning applications face fundamental trade-offs between algorithmic sophistication and practical implementation feasibility, with computational complexity presenting significant adoption barriers across diverse institutional contexts. The Beijing–Tianjin–Hebei application demonstrates this paradox, where convolutional-recurrent integration achieved 99.18% overall accuracy and projected urban expansion to 8736–9155 km2 during 2039–2043, yet demands substantial computational resources often exceeding institutional capabilities [14]. Comparative analysis reveals critical performance-complexity relationships: Casablanca applications showed ConvLSTM achieving 94.50% Kappa coefficient while requiring larger data volumes compared to CA-Markov approaches reaching 97.20% Kappa with greater computational efficiency [61], illustrating how sophisticated methods may underperform simpler approaches in resource-constrained environments where computational demands exceed institutional capabilities.
Institutional and socio-political constraints compound these technical barriers through decision-making requirements that conflict with algorithmic characteristics. Deep learning architectures achieve exceptional performance with an overall accuracy of 99.18% and a Kappa coefficient of 0.88, but their “black box” nature undermines transparency requirements essential for planning decision-making contexts [14]. Democratic planning processes require explainable AI approaches that balance predictive performance with decision support transparency [59,62], creating institutional resistance where stakeholder engagement frameworks struggle to incorporate algorithmic predictions into participatory planning processes. Implementation complexity varies substantially across organizational capabilities, where Kolkata applications demonstrated multilayer perceptron performance (Kappa: 0.9025) significantly outperforming CA-MARKOV (0.6941) and STCHOICE (0.5392) [13]; successful deployment requires technical expertise often unavailable in planning organizations. Professional capacity limitations significantly constrain effective translation from research innovations to planning practice, where practitioners may lack the technical background necessary to properly implement sophisticated analytical tools that require professional development and institutional adaptation that many agencies are unprepared to undertake [50].
Cross-regional transferability remains limited despite claims of model generalizability, with performance variations reflecting underlying differences in geographic and institutional contexts. The Temporal-VCA framework achieved precision improvements up to 22.12% through comprehensive utilization of fine spatial and temporal information [31], but such detailed datasets are rarely available outside specific institutional contexts. European implementations achieve 88–95% accuracy through standardized data protocols and mature planning frameworks [40,75], while developing economies demonstrate different performance characteristics requiring region-specific adaptation strategies [57]. Data quality and integration challenges significantly constrain machine learning adoption in planning contexts, requiring comprehensive datasets encompassing multiple temporal periods and diverse driving factors that demand extensive data collection and preprocessing capabilities often exceeding available technical resources [64]. Multi-source integration challenges, temporal consistency issues, and validation costs significantly complicate model development while creating barriers to widespread adoption in resource-constrained environments.

4.2. Future Development and Application Pathways

Machine learning applications demonstrate substantial potential for market-level decision support and economic development optimization, extending beyond traditional planning applications to address comprehensive urban challenges. Real-world implementations demonstrate this transformation: Shanghai’s multi-scenario planning framework exemplifies policy-relevant applications, where Random Forest–CA integration achieved 94.88% overall accuracy while enabling systematic comparison between planned development scenarios (projecting 242.10–400.72 km2 for 2021–2051) versus unplanned scenarios (242.91–403.98 km2) [17,73]. These quantitative scenario comparisons directly inform municipal land use policy decisions and development control strategies. Beijing–Tianjin–Hebei applications demonstrate large-scale regional planning capabilities, where CNN–RNN integration projected urban expansion to 8736–9155 km2 during 2039–2043, representing 10.99–16.31% increases that directly support infrastructure investment planning and resource allocation decisions across the mega-region [14].
Cost–benefit considerations for resource-constrained agencies reveal critical implementation priorities that balance technological sophistication with practical applicability. Cellular automata–Markov-based methods show great potential for urban growth forecasting, especially for short-term applications when computational resources are limited, while sophisticated approaches require substantial infrastructure investment for effective implementation. Mixed-cell cellular automata models offer practical solutions for large-scale applications while achieving a balance between simulation performance and computational intensity [74], providing cost-effective alternatives to GPU-accelerated implementations that enable 16,000× spatial processing acceleration but require substantial infrastructure investment [36]. Successful translation requires addressing the balance between prediction accuracy and computational intensity, where simpler approaches may achieve better practical outcomes than sophisticated but resource-intensive alternatives requiring specialized technical support.
Contemporary research necessitates standardized benchmarking frameworks addressing methodological fragmentation and performance comparability limitations across diverse planning contexts. Essential framework components should include: standardized reference datasets incorporating multi-resolution urban data across representative geographic contexts; unified performance metrics combining traditional measures with planning-specific indicators; computational efficiency benchmarks establishing processing time standards for different institutional capacity levels; cross-regional validation protocols requiring model testing across multiple geographic contexts; reproducibility standards mandating code availability and detailed parameter specifications. Implementation requirements encompass systematic professional development programs addressing technical knowledge gaps constraining widespread adoption, comprehensive data governance frameworks ensuring quality consistency, and institutional capacity building initiatives balancing methodological sophistication with practical applicability.
The translation from research innovations to practical applications necessitates comprehensive capacity-building programs, standardized implementation protocols, and institutional frameworks supporting cross-disciplinary collaboration between technical specialists and planning practitioners. Future applications demand integrated decision support platforms combining multiple analytical capabilities within unified operational frameworks that facilitate non-technical user engagement with sophisticated modeling systems. Critical implementation priorities include coordinated training frameworks integrating theoretical understanding with implementation experience through collaborative institutional partnerships, with development trajectories emphasizing integrated intelligent systems combining multiple analytical approaches, 3D modeling capabilities addressing vertical development complexity, and comprehensive decision support platforms. The research demonstrates that machine learning technologies can enable more responsive and adaptive urban management through real-time monitoring and scenario-based planning capabilities, but successful implementation depends critically on addressing institutional capacity, stakeholder engagement, and transparency requirements while maintaining democratic legitimacy in planning processes.

4.3. Limitations

This review has several limitations. First, we limited our search to English-language publications in Web of Science, which may have introduced language and database bias. Second, the rapid evolution of machine learning technologies means that some recent developments may not be fully captured. Third, the lack of standardized evaluation metrics across studies made direct comparisons challenging. Fourth, we did not conduct a formal quality assessment due to the absence of appropriate tools for this field.

5. Conclusions

This systematic review of 74 publications spanning 2020–2024 reveals that machine learning applications in land use prediction have achieved remarkable technological advancement while encountering persistent implementation barriers limiting widespread adoption in planning practice. Our analysis establishes clear methodological evolution across three categories: traditional machine learning approaches achieving consistent reliability with random forest algorithms reaching over 90% overall accuracy, deep learning architectures delivering exceptional performance with convolutional-recurrent integration achieving 99.18% overall accuracy, and hybrid ensemble methods consistently outperforming single-algorithm approaches through sophisticated algorithmic complementarity.
Previous reviews have focused primarily on algorithmic performance without systematically addressing implementation barriers constraining professional adoption in planning practice. Our research identifies significant disparities between technical capabilities and practical implementation requirements, with critical implications for planning policy development. While sophisticated algorithms demonstrate superior analytical performance, their complexity often exceeds institutional technical capacity, suggesting that planning agencies should prioritize graduated implementation strategies aligning technological sophistication with organizational capabilities. Cross-regional transferability remains limited despite claims of model generalizability, indicating that regulatory frameworks must establish model validation standards and region-specific adaptation protocols while maintaining transparency requirements essential for democratic planning processes. Data quality and availability constraints emerge as fundamental limiting factors, highlighting the need for systematic investment in data infrastructure and professional capacity building rather than advanced algorithmic acquisition.
Future development trajectories should emphasize integrated intelligent systems combining multiple analytical approaches, 3D modeling capabilities, and comprehensive decision support platforms. However, successful implementation requires policy frameworks that balance innovation adoption with accountability mechanisms, ensuring algorithmic predictions support rather than replace professional judgment. Budget allocation strategies should prioritize long-term capacity development over short-term technology acquisition, with cost–benefit analysis revealing that simpler, well-implemented approaches often provide better practical value than sophisticated systems requiring substantial ongoing technical support. Critical research priorities include standardized benchmarking frameworks and explainable artificial intelligence approaches for transparent planning decision-making. The research demonstrates that machine learning technologies can enable more responsive and adaptive urban management through real-time monitoring and scenario-based planning capabilities, but sustainable deployment depends critically on phased implementation strategies, cross-sector partnerships, and continuous evaluation mechanisms that maintain public trust while advancing technical capabilities in planning practice.

Author Contributions

Conceptualization, C.L. and L.N.; methodology, C.L.; literature search, C.L.; study selection, T.S., T.L. and J.L.; data extraction, C.L.; data validation, C.W.; validation, T.S., T.L., J.L. and W.Y.; formal analysis, C.L.; investigation, C.L.; writing—original draft preparation, C.L.; writing—review and editing, C.L., L.N., C.W., T.S., T.L., J.L., W.Y. and H.W.; visualization, C.L.; supervision, C.W., L.N. and H.W.; project administration, L.N.; funding acquisition, H.W. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by the Natural Science Foundation of Xiamen, China [grant numbers 3502Z202372054 and 3502Z202472039], the Fujian Provincial Natural Science Foundation of China [grant number 2023J05080], the Science and Technology Planning Project of Fujian Province, China [grant number 2022H0044], and the Science and Technology Planning Project of Fujian Province, China [grant number 2023H0048].

Data Availability Statement

The data extraction forms and list of included studies are available from the corresponding author upon reasonable request.

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. PRISMA 2020 Main Checklist.
Table A1. PRISMA 2020 Main Checklist.
TopicNo.ItemLocation Where Item Is Reported
TITLE
Title1Identify the report as a systematic review. Page 1, title
ABSTRACT
Abstract2See the PRISMA 2020 for Abstracts checklist
INTRODUCTION
Rationale3Describe the rationale for the review in the context of existing knowledge. Section 1 (Introduction)
Objectives4Provide an explicit statement of the objective(s) or question(s) the review addresses.End of Section 1
METHODS
Eligibility criteria5Specify the inclusion and exclusion criteria for the review and how studies were grouped for the syntheses.Section 2.1
Information sources6Specify all databases, registers, websites, organizations, reference lists, and other sources searched or consulted to identify studies. Specify the date when each source was last searched or consulted.Section 2.1
Search strategy7Present the full search strategies for all databases, registers, and websites, including any filters and limits used.Section 2.1
Selection process8Specify the methods used to decide whether a study met the inclusion criteria of the review, including how many reviewers screened each record and each report retrieved, whether they worked independently, and, if applicable, details of automation tools used in the process.Section 2.1
Data collection process9Specify the methods used to collect data from reports, including how many reviewers collected data from each report, whether they worked independently, any processes for obtaining or confirming data from study investigators, and if applicable, details of automation tools used in the process. Section 2.2
Data items10aList and define all outcomes for which data were sought. Specify whether all results that were compatible with each outcome domain in each study were sought (e.g., for all measures, time points, analyses), and if not, the methods used to decide which results to collect.Section 2.2
10bList and define all other variables for which data were sought (e.g., participant and intervention characteristics, funding sources). Describe any assumptions made about any missing or unclear information.Section 2.2
Study risk of bias assessment11Specify the methods used to assess risk of bias in the included studies, including details of the tool(s) used, how many reviewers assessed each study and whether they worked independently, and if applicable, details of automation tools used in the process. Section 2.2
Effect measures12Specify for each outcome the effect measure(s) (e.g., risk ratio, mean difference) used in the synthesis or presentation of results.Results section
Synthesis methods13aDescribe the processes used to decide which studies were eligible for each synthesis (e.g., tabulating the study intervention characteristics and comparing against the planned groups for each synthesis (item 5)).Section 2.3
13bDescribe any methods required to prepare the data for presentation or synthesis, such as handling of missing summary statistics or data conversions.Not applicable—qualitative synthesis
13cDescribe any methods used to tabulate or visually display the results of individual studies and syntheses.Not applicable—no statistical synthesis
13dDescribe any methods used to synthesize results and provide a rationale for the choice(s). If meta-analysis was performed, describe the model(s), method(s) to identify the presence and extent of statistical heterogeneity, and software package(s) used.Not conducted—qualitative review
13eDescribe any methods used to explore possible causes of heterogeneity among study results (e.g., subgroup analysis, meta-regression).Not applicable—no statistical synthesis
13fDescribe any sensitivity analyses conducted to assess robustness of the synthesized results.Not applicable—qualitative synthesis
Reporting bias assessment14Describe any methods used to assess the risk of bias due to missing results in a synthesis (arising from reporting biases).Not conducted—qualitative review
Certainty assessment15Describe any method used to assess certainty (or confidence) in the body of evidence for an outcome.Not conducted—qualitative review
RESULTS
Study selection16aDescribe the results of the search and selection process, from the number of records identified in the search to the number of studies included in the review, ideally using a flow diagram.Figure 1, Section 3.1
16bCite studies that might appear to meet the inclusion criteria, but which were excluded, and explain why they were excluded.Figure 1, Section 3.1
Study characteristics17Cite each included study and present its characteristics.Section 3.1
Risk of bias in studies18Present assessments of risk of bias for each included study.Not conducted—see Methods Section 2.2
Results of individual studies19For all outcomes, present, for each study: (a) summary statistics for each group (where appropriate) and (b) an effect estimate and its precision (e.g., confidence/credible interval), ideally using structured tables or plots.Table 1, Section 3.3
Results of syntheses20aFor each synthesis, briefly summarize the characteristics and risk of bias among contributing studies.Section 3.1, Section 3.2 and Section 3.3
20bPresent results of all statistical syntheses conducted. If meta-analysis was performed, present for each the summary estimate and its precision (e.g., confidence/credible interval) and measures of statistical heterogeneity. If comparing groups, describe the direction of the effect.Not applicable—qualitative synthesis
20cPresent results of all investigations of possible causes of heterogeneity among study results.Not applicable—no statistical synthesis
20dPresent results of all sensitivity analyses conducted to assess the robustness of the synthesized results.Not applicable—qualitative synthesis
Reporting biases21Present assessments of risk of bias due to missing results (arising from reporting biases) for each synthesis assessed.Not conducted—qualitative review
Certainty of evidence22Present assessments of certainty (or confidence) in the body of evidence for each outcome assessed.Not conducted—qualitative review
DISCUSSION
Discussion23aProvide a general interpretation of the results in the context of other evidence.Section 4 (Discussion)
23bDiscuss any limitations of the evidence included in the review.Section 4.1
23cDiscuss any limitations of the review processes used.Section 4.3 (Limitations)
23dDiscuss implications of the results for practice, policy, and future research.Section 4.2, Conclusions
OTHER INFORMATION
Registration and protocol24aProvide registration information for the review, including register name and registration number, or state that the review was not registered. Other Information section
24bIndicate where the review protocol can be accessed, or state that a protocol was not prepared.Other Information section
24cDescribe and explain any amendments to information provided at registration or in the protocol.Not applicable—no protocol prepared
Support25Describe sources of financial or non-financial support for the review, and the role of the funders or sponsors in the review.Funding section
Competing interests26Declare any competing interests of review authors.Conflicts of Interest section
Availability of data, code and other materials27Report which of the following are publicly available and where they can be found: template data collection forms; data extracted from included studies; data used for all analyses; analytic code; any other materials used in the review.Data Availability section
Table A2. PRIMSA Abstract Checklist.
Table A2. PRIMSA Abstract Checklist.
TopicNo.ItemReported?
TITLE
Title1Identify the report as a systematic review.Yes
BACKGROUND
Objectives2Provide an explicit statement of the main objective(s) or question(s) the review addresses.Yes
METHODS
Eligibility criteria3Specify the inclusion and exclusion criteria for the review.Yes
Information sources4Specify the information sources (e.g., databases, registers) used to identify studies and the date when each was last searched. Yes
Risk of bias5Specify the methods used to assess risk of bias in the included studies.Yes
Synthesis of results6Specify the methods used to present and synthesize results. Yes
RESULTS
Included studies7Give the total number of included studies and participants and summarize relevant characteristics of studies.Yes
Synthesis of results8Present results for main outcomes, preferably indicating the number of included studies and participants for each. If meta-analysis was performed, report the summary estimate and confidence/credible interval. If comparing groups, indicate the direction of the effect (i.e., which group is favored).Yes
DISCUSSION
Limitations of evidence9Provide a brief summary of the limitations of the evidence included in the review (e.g., study risk of bias, inconsistency, and imprecision).Yes
Interpretation10Provide a general interpretation of the results and important implications.Yes
OTHER
Funding11Specify the primary source of funding for the review.Yes
Registration12Provide the register name and registration number.Yes
From: Page MJ, McKenzie JE, Bossuyt PM, Boutron I, Hoffmann TC, Mulrow CD et al. [76]. For more information, visit: www.prisma-statement.org (accessed on 27 September 2025).

References

  1. Azizi, P.; Soltani, A.; Bagheri, F.; Sharifi, S.; Mikaeili, M. An Integrated Modelling Approach to Urban Growth and Land Use/Cover Change. Land 2022, 11, 1715. [Google Scholar] [CrossRef]
  2. Gaur, S.; Singh, R. A Comprehensive Review on Land Use/Land Cover (LULC) Change Modeling for Urban Development: Current Status and Future Prospects. Sustainability 2023, 15, 903. [Google Scholar] [CrossRef]
  3. Aryal, J.; Sitaula, C.; Frery, A.C. Land use and land cover (LULC) performance modeling using machine learning algorithms: A case study of the city of Melbourne, Australia. Sci. Rep. 2023, 13, 13510. [Google Scholar] [CrossRef]
  4. Weng, Q. Remote Sensing and GIS Integration: Theories, Methods, and Applications; McGraw-Hill: New York, NY, USA, 2010. [Google Scholar]
  5. Wang, J.; Bretz, M.; Dewan, M.A.A.; Delavar, M.A. Machine learning in modelling land-use and land cover-change (LULCC): Current status, challenges and prospects. Sci. Total Environ. 2022, 822, 153559. [Google Scholar] [CrossRef]
  6. Zhang, C.; Li, X. Land Use and Land Cover Mapping in the Era of Big Data. Land 2022, 11, 1692. [Google Scholar] [CrossRef]
  7. Koutra, S.; Ioakimidis, C.S. Unveiling the Potential of Machine Learning Applications in Urban Planning Challenges. Land 2023, 12, 83. [Google Scholar] [CrossRef]
  8. Guan, X.; Li, J.; Yang, C.; Xing, W. Development Process, Quantitative Models, and Future Directions in Driving Analysis of Urban Expansion. ISPRS Int. J. Geo-Inf. 2023, 12, 174. [Google Scholar] [CrossRef]
  9. Ding, S.; Liu, S.; Chang, M.; Lin, H.; Lv, T.; Zhang, Y.; Zeng, C. Spatial Optimization of Land Use Pattern toward Carbon Mitigation Targets-A Study in Guangzhou. Land 2023, 12, 1903. [Google Scholar] [CrossRef]
  10. Liu, C.; Lu, S.; Tian, J.; Yin, L.; Wang, L.; Zheng, W. Research Overview on Urban Heat Islands Driven by Computational Intelligence. Land 2024, 13, 2176. [Google Scholar] [CrossRef]
  11. Badshah, M.T.; Hussain, K.; Rehman, A.U.; Mehmood, K.; Muhammad, B.; Wiarta, R.; Silamon, R.F.; Khan, M.A.; Meng, J. The role of random forest and Markov chain models in understanding metropolitan urban growth trajectory. Front. For. Glob. Change 2024, 7, 1345074. [Google Scholar] [CrossRef]
  12. Al-Fugara, A.; Pourghasemi, H.R.; Al-Shabeeb, A.R.; Habib, M.; Al-Adamat, R.; Ai-Amoush, H.; Collins, A.L. A comparison of machine learning models for the mapping of groundwater spring potential. Environ. Earth Sci. 2020, 79, 206. [Google Scholar] [CrossRef]
  13. Mithun, S.K.; Sahana, M.; Chattopadhyay, S.; Chatterjee, S.; Islam, J.; Costache, R. Comparative framework for spatially explicit urban growth modeling for monitoring urban land-use efficiency and sustainable urban development (SDG 11.3.1): A study on Kolkata metropolitan area, India. Geocarto Int. 2022, 37, 17933–17970. [Google Scholar] [CrossRef]
  14. Pan, X.; Liu, Z.; He, C.; Huang, Q. Modeling urban expansion by integrating a convolutional neural network and a recurrent neural network. Int. J. Appl. Earth Obs. Geoinf. 2022, 112, 102977. [Google Scholar] [CrossRef]
  15. Zhou, Z.; Chen, Y.; Liu, X.; Zhang, X.; Zhang, H. A maps-to-maps approach for simulating urban land expansion based on convolutional long short-term memory neural networks. Int. J. Geogr. Inf. Sci. 2024, 38, 503–526. [Google Scholar] [CrossRef]
  16. Liao, Z.; Zhang, L. Spatio-temporal evolution and future simulation of urban agglomeration expansion in the Guangdong-Hongkong-Macau Greater Bay Area. Humanit. Soc. Sci. Commun. 2023, 10, 468. [Google Scholar] [CrossRef]
  17. Zhou, L.; Dang, X.; Sun, Q.; Wang, S. Multi-scenario simulation of urban land change in Shanghai by random forest and CA-Markov model. Sustain. Cities Soc. 2020, 55, 102045. [Google Scholar] [CrossRef]
  18. Zhou, Y.; Huang, C.; Wu, T.; Zhang, M. A novel spatio-temporal cellular automata model coupling partitioning with CNN-LSTM to urban land change simulation. Ecol. Model. 2023, 482, 110394. [Google Scholar] [CrossRef]
  19. Qian, Y.; Xing, W.; Guan, X.; Yang, T.; Wu, H. Coupling cellular automata with area partitioning and spatiotemporal convolution for dynamic land use change simulation. Sci. Total Environ. 2020, 722, 137738. [Google Scholar] [CrossRef]
  20. Gharaibeh, A.; Shaamala, A.; Obeidat, R.; Al-Kofahi, S. Improving land-use change modeling by integrating ANN with Cellular Automata-Markov Chain model. Heliyon 2020, 6, e05092. [Google Scholar] [CrossRef] [PubMed]
  21. Uddin, M.S.; Mahalder, B.; Mahalder, D. Assessment of Land Use Land Cover Changes and Future Predictions Using CA-ANN Simulation for Gazipur City Corporation, Bangladesh. Sustainability 2023, 15, 12329. [Google Scholar] [CrossRef]
  22. Bettencourt, L.M.A.; Lobo, J.; Helbing, D.; Kühnert, C.; West, G.B. Growth, innovation, scaling, and the pace of life in cities. Proc. Natl. Acad. Sci. USA 2007, 104, 7301–7306. [Google Scholar] [CrossRef] [PubMed]
  23. Batty, M. The Size, Scale, and Shape of Cities. Science 2008, 319, 769–771. [Google Scholar] [CrossRef] [PubMed]
  24. Zhai, Y.; Yao, Y.; Guan, Q.; Liang, X.; Li, X.; Pan, Y.; Yue, H.; Yuan, Z.; Zhou, J. Simulating urban land use change by integrating a convolutional neural network with vector-based cellular automata. Int. J. Geogr. Inf. Sci. 2020, 34, 1475–1499. [Google Scholar] [CrossRef]
  25. Yao, Y.; Zhou, K.; Liu, C.; Sun, Z.; Chen, D.; Li, L.; Cheng, T.; Guan, Q. Temporal-VCA: Simulating urban land use change using coupled temporal data and vector cellular automata. Cities 2024, 149, 104975. [Google Scholar] [CrossRef]
  26. Wang, J.; Hadjikakou, M.; Hewitt, R.J.; Bryan, B.A. Simulating large-scale urban land-use patterns and dynamics using the U-Net deep learning architecture. Comput. Environ. Urban Syst. 2022, 97, 101855. [Google Scholar] [CrossRef]
  27. Liu, X.; Wei, M.; Li, Z.; Zeng, J. Multi-scenario simulation of urban growth boundaries with an ESP-FLUS model: A case study of the Min Delta region, China. Ecol. Indic. 2022, 135, 108538. [Google Scholar] [CrossRef]
  28. Liu, X.; Li, Y.; Zhang, S.; Niu, Q. Spatiotemporal patterns, driving mechanism, and multi-scenario simulation of urban expansion in Min Delta Region, China. Ecol. Indic. 2024, 158, 111312. [Google Scholar] [CrossRef]
  29. Huang, X.; Xu, G.; Xiao, F. Optimization of a Novel Urban Growth Simulation Model Integrating an Artificial Fish Swarm Algorithm and Cellular Automata for a Smart City. Sustainability 2021, 13, 2338. [Google Scholar] [CrossRef]
  30. Cheng, W.; Chu, Y.; Xia, C.; Zhang, B.; Chen, J.; Jia, M.; Wang, W. UrbanGenoGAN: Pioneering urban spatial planning using the synergistic integration of GAN.; GA.; and GIS. Front. Environ. Sci. 2023, 11, 1287858. [Google Scholar] [CrossRef]
  31. Yue, W.; Qin, C.; Su, M.; Teng, Y.; Xu, C. Simulation and prediction of land use change in Dongguan of China based on ANN cellular automata—Markov chain model. Environ. Sustain. Indic. 2024, 22, 100355. [Google Scholar] [CrossRef]
  32. Opgenhaffen, L.; Sepers, M.H. 3D MODELLING: CROSSING TRADITIONAL BOUNDARIES BETWEEN DIFFERENT RESEARCH AREAS. In Proceedings of the 3D-ARCH 2015—3D VIRTUAL RECONSTRUCTION AND VISUALIZATION OF COMPLEX ARCHITECTURES, Ávila, Spain, 25–27 February 2015; pp. 411–414. [Google Scholar] [CrossRef]
  33. Xu, X.; Ding, D.; Liu, X. A three-dimensional future land use simulation (FLUS-3D) model for simulating the 3D urban dynamics under the shared socio-economic pathways. Landsc. Urban Plan. 2024, 250, 105135. [Google Scholar] [CrossRef]
  34. Zhao, L.; Liu, X.; Xu, X.; Liu, C.; Chen, K. Three-Dimensional Simulation Model for Synergistically Simulating Urban Horizontal Expansion and Vertical Growth. Remote Sens. 2022, 14, 1503. [Google Scholar] [CrossRef]
  35. Wang, K.; He, T.; Xiao, W.; Yang, R. Projections of future spatiotemporal urban 3D expansion in China under shared socioeconomic pathways. Landsc. Urban Plan. 2024, 247, 105043. [Google Scholar] [CrossRef]
  36. Tepe, E.; Safikhani, A. Spatio-temporal modeling of parcel-level land-use changes using machine learning methods. Sustain. Cities Soc. 2023, 90, 104390. [Google Scholar] [CrossRef]
  37. Gomez, J.A.; Patino, J.E.; Duque, J.C.; Passos, S. Spatiotemporal Modeling of Urban Growth Using Machine Learning. Remote Sens. 2020, 12, 109. [Google Scholar] [CrossRef]
  38. Xu, Z.; Jiao, L.; Lan, T.; Zhou, Z.; Cui, H.; Li, C.; Xu, G.; Liu, Y. Mapping hierarchical urban boundaries for global urban settlements. Int. J. Appl. Earth Obs. Geoinf. 2021, 103, 102480. [Google Scholar] [CrossRef]
  39. Zhai, H.; Lv, C.; Liu, W.; Yang, C.; Fan, D.; Wang, Z.; Guan, Q. Understanding Spatio-Temporal Patterns of Land Use/Land Cover Change under Urbanization in Wuhan, China 2000–2019. Remote Sens. 2021, 13, 3331. [Google Scholar] [CrossRef]
  40. Mozaffaree Pour, N.; Karasov, O.; Burdun, I.; Oja, T. Simulation of land use/land cover changes and urban expansion in Estonia by a hybrid ANN-CA-MCA model and utilizing spectral-textural indices. Environ. Monit. Assess. 2022, 194, 584. [Google Scholar] [CrossRef]
  41. Wang, C.; Li, M.; Wang, X.; Deng, M.; Wu, Y.; Hong, W. Spatio-Temporal Dynamics of Carbon Storage in Rapidly Urbanizing Shenzhen, China: Insights and Predictions. Land 2024, 13, 1566. [Google Scholar] [CrossRef]
  42. Luan, C.; Liu, R. A Comparative Study of Various Land Use and Land Cover Change Models to Predict Ecosystem Service Value. Int. J. Environ. Res. Public Health 2022, 19, 16484. [Google Scholar] [CrossRef]
  43. Peng, K.; Jiang, W.; Ling, Z.; Hou, P.; Deng, Y. Evaluating the potential impacts of land use changes on ecosystem service value under multiple scenarios in support of SDG reporting: A case study of the Wuhan urban agglomeration. J. Clean. Prod. 2021, 307, 127321. [Google Scholar] [CrossRef]
  44. Zhao, H.; Gu, T.; Tang, J.; Gong, Z.; Zhao, P. Urban flood risk differentiation under land use scenario simulation. iScience 2023, 26, 106479. [Google Scholar] [CrossRef]
  45. Wang, Q.; Zhao, G.; Zhao, R. Resilient urban expansion: Identifying critical conflict patches by integrating flood risk and land use predictions: A case study of Min Delta Urban Agglomerations in China. Int. J. Disaster Risk Reduct. 2024, 100, 104192. [Google Scholar] [CrossRef]
  46. Leggesse, E.; Derseh, W.A.; Zimale, F.A.; Admasu, S.T.; Meshesha, M.A. Urban flash flood hazard mapping using machine learning, Bahir Dar, Ethiopia. J. Hydroinformatics 2024, 26, 2124–2145. [Google Scholar] [CrossRef]
  47. Norallahi, M.; Kaboli, H.S. Urban flood hazard mapping using machine learning models: GARP.; RF.; MaxEnt and NB. Nat. Hazards 2021, 106, 119–137. [Google Scholar] [CrossRef]
  48. Liu, L.; Wei, J.; Luo, P.; Zhang, Y.; Wang, Y.; Elbeltagi, A.; Zainol, M.R.R.M.A. A novel quantity assessment of landscape ecological risk using human-nature driving mechanism for sustainable society. Sci. Total Environ. 2024, 947, 173892. [Google Scholar] [CrossRef]
  49. He, B.; Cao, Y.; Lin, K.; Wu, M.; Zhu, Y.; Cui, X.; Hu, L.; Yang, Y.; Liu, X. Enhanced bulk and interfacial charge transfer in Fe:VOPO4 modified Mo:BiVO4 photoanodes for photoelectrochemical water splitting. Escience 2025, 5, 100242. [Google Scholar] [CrossRef]
  50. Ren, Y.; Yang, J.; Shen, Y.; Wang, L.; Zhang, Z.; Zhao, Z. Multidimensional effects of history, neighborhood, and proximity on urban land growth: A dynamic spatiotemporal rolling prediction model (STRM). Trans. Gis 2024, 28, 1928–1956. [Google Scholar] [CrossRef]
  51. Shen, C.; Hou, H.; Zheng, Y.; Murayama, Y.; Wang, R.; Hu, T. Prediction of the future urban heat island intensity and distribution based on landscape composition and configuration: A case study in Hangzhou. Sustain. Cities Soc. 2022, 83, 103992. [Google Scholar] [CrossRef]
  52. Chen, C.; Liu, Y. Spatiotemporal changes of ecosystem services value by incorporating planning policies: A case of the Pearl River Delta, China. Ecol. Model. 2021, 461, 109777. [Google Scholar] [CrossRef]
  53. Luo, H.; Li, Y.; Gao, X.; Meng, X.; Yang, X.; Yan, J. Carbon emission prediction model of prefecture-level administrative region: A land-use-based case study of Xi’an city, China. Appl. Energy 2023, 348, 121488. [Google Scholar] [CrossRef]
  54. Mokarram, M.; Taripanah, F.; Pham, T.M. Spatial-temporal analysis of atmospheric environment in urban areas using remote sensing and neural networks. Sustain. Comput. -Inform. Syst. 2024, 42, 100987. [Google Scholar] [CrossRef]
  55. Zhang, M.; Tan, S.; Zhang, C.; Chen, E. Machine learning in modelling the urban thermal field variance index and assessing the impacts of urban land expansion on seasonal thermal environment. Sustain. Cities Soc. 2024, 106, 105345. [Google Scholar] [CrossRef]
  56. Xu, Z.; Xu, G.; Lan, T.; Li, X.; Chen, Z.; Cui, H.; Zhou, Z.; Wang, H.; Jiao, L.; Small, C. Global consistency of urban scaling evidenced by remote sensing. PNAS Nexus 2025, 4, pgaf037. [Google Scholar] [CrossRef]
  57. Rahman, F.; Rahman, M.T.U. Use of cellular automata-based artificial neural networks for detection and prediction of land use changes in North-Western Dhaka City. Environ. Sci. Pollut. Res. 2023, 30, 1428–1450. [Google Scholar] [CrossRef]
  58. Sarif, M.O.; Gupta, R.D.; Sharifi, A. Predicting Prayagraj’s Urbanization Trajectory using CA-ANN Modelling: Population Pressures and Land Use Dynamics. J. Environ. Manag. 2024, 370, 122469. [Google Scholar] [CrossRef]
  59. Hao, H.; Wang, Y.; Chen, J. Empowering Scenario Planning with Artificial Intelligence: A Perspective on Building Smart and Resilient Cities. Engineering 2024, 43, 272–283. [Google Scholar] [CrossRef]
  60. Roy, S.K.; Alam, M.T.; Mojumder, P.; Mondal, I.; Kafy, A.A.; Dutta, M.; Ferdous, M.N.; Mamun, M.A.A.; Mahtab, S.B. Dynamic assessment and prediction of land use alterations influence on ecosystem service value: A pathway to environmental sustainability. Environ. Sustain. Indic. 2024, 21, 100319. [Google Scholar] [CrossRef]
  61. Yaagoubi, R.; Lakber, C.-E.; Miky, Y. A comparative analysis on the use of a cellular automata Markov chain versus a convolutional LSTM model in fore-casting urban growth using sentinel 2A images. J. Land Use Sci. 2024, 19, 258–277. [Google Scholar] [CrossRef]
  62. Peng, Z.-R.; Lu, K.-F.; Liu, Y.; Zhai, W. The Pathway of Urban Planning AI: From Planning Support to Plan-Making. J. Plan. Educ. Res. 2024, 44, 2263–2279. [Google Scholar] [CrossRef]
  63. He, F.; Yang, J.; Zhang, Y.; Yu, W.; Xiao, X.; Xia, J. Does partition matter? A new approach to modeling land use change. Comput. Environ. Urban Syst. 2023, 106, 102041. [Google Scholar] [CrossRef]
  64. Wu, H.; Lin, A.; Xing, X.; Song, D.; Li, Y. Identifying core driving factors of urban land use change from global land cover products and POI data using the random forest method. Int. J. Appl. Earth Obs. Geoinf. 2021, 103, 102475. [Google Scholar] [CrossRef]
  65. Kim, Y.; Safikhani, A.; Tepe, E. Machine learning application to spatio-temporal modeling of urban growth. Comput. Environ. Urban Syst. 2022, 94, 101801. [Google Scholar] [CrossRef]
  66. Zhuang, H.; Liu, X.; Yan, Y.; Zhang, D.; He, J.; He, J.; Zhang, X.; Zhang, H.; Li, M. Integrating a deep forest algorithm with vector-based cellular automata for urban land change simulation. Trans. Gis 2022, 26, 2056–2080. [Google Scholar] [CrossRef]
  67. Sun, S.; Mu, L.; Feng, R.; Wang, L.; He, J. GAN-Based LUCC Prediction via the Combination of Prior City Planning Information and Land-Use Probability. IEEE J. Sel. Top. Appl. Earth Obs. Remote Sens. 2021, 14, 10189–10198. [Google Scholar] [CrossRef]
  68. Tan, X.; Deng, M.; Chen, K.; Shi, Y.; Zhao, B.; Liu, Q. A spatial hierarchical learning module based cellular automata model for simulating urban expansion: Case studies of three Chinese urban areas. GIScience Remote Sens. 2024, 61, 2290352. [Google Scholar] [CrossRef]
  69. He, F.; Yang, J.; Zhang, Y.; Sun, D.; Wang, L.; Xiao, X.; Xia, J. Offshore Island Connection Line: A new perspective of coastal urban development boundary simulation and multi-scenario prediction. GIScience Remote Sens. 2022, 59, 801–821. [Google Scholar] [CrossRef]
  70. Rabby, Y.W.; Li, Y.; Abedin, J.; Sabrina, S. Impact of Land Use/Land Cover Change on Landslide Susceptibility in Rangamati Municipality of Rangamati District, Bangladesh. iSprs Int. J. Geo-Inf. 2022, 11, 89. [Google Scholar] [CrossRef]
  71. Liu, J.; Xiao, B.; Li, Y.; Wang, X.; Bie, Q.; Jiao, J. Simulation of Dynamic Urban Expansion under Ecological Constraints Using a Long Short Term Memory Network Model and Cellular Automata. Remote Sens. 2021, 13, 1499. [Google Scholar] [CrossRef]
  72. Li, P.; Feng, Y.; Tong, X.; Wang, R.; Zhai, S.; Tang, Y.; Liu, W. Spatial planning-constrained modeling of urban growth in the Yangtze River Delta considering the element flows. GIScience Remote Sens. 2022, 59, 1491–1508. [Google Scholar] [CrossRef]
  73. Mustak, S.; Baghmar, N.K.; Singh, S.K.; Srivastava, P.K. Multi-scenario based urban growth modeling and prediction using earth observation datasets towards urban policy improvement. Geocarto Int. 2022, 37, 18275–18303. [Google Scholar] [CrossRef]
  74. Liang, Z.; Liang, X.; Jiang, X.; Li, T.; Guan, Q. Balancing simulation performance and computational intensity of CA models for large-scale land-use change simulations. Environ. Model. Softw. 2025, 185, 106293. [Google Scholar] [CrossRef]
  75. Adam, E.; Masupha, N.E.E.; Xulu, S. Spatial Assessment and Prediction of Urbanization in Maseru Using Earth Observation Data. Appl. Sci. 2023, 13, 5854. [Google Scholar] [CrossRef]
  76. Page, M.J.; McKenzie, J.E.; Bossuyt, P.M.; Boutron, I.; Hoffmann, T.C.; Mulrow, C.D.; Shamseer, L.; Tetzlaff, J.M.; Akl, E.A.; Brennan, S.E.; et al. The PRISMA 2020 statement: An updated guideline for reporting systematic reviews. BMJ 2021, 372, n71. [Google Scholar] [CrossRef] [PubMed]
Figure 1. PRISMA flowchart for the systematic literature selection process.
Figure 1. PRISMA flowchart for the systematic literature selection process.
Buildings 15 03551 g001
Figure 2. Distribution map of land use prediction application cases. Different shapes and colors represent different levels of case selection: the pentagon represents the core area point of the regional research case point; the triangle represents the central area point of the case study for state-level provincial regions; the circle represents the central coordinates of the urban-level case points; the size of the pattern represents the number of selected research articles.
Figure 2. Distribution map of land use prediction application cases. Different shapes and colors represent different levels of case selection: the pentagon represents the core area point of the regional research case point; the triangle represents the central area point of the case study for state-level provincial regions; the circle represents the central coordinates of the urban-level case points; the size of the pattern represents the number of selected research articles.
Buildings 15 03551 g002
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

Share and Cite

MDPI and ACS Style

Li, C.; Wang, C.; Sun, T.; Lin, T.; Liu, J.; Yu, W.; Wang, H.; Nie, L. Machine Learning in Land Use Prediction: A Comprehensive Review of Performance, Challenges, and Planning Applications. Buildings 2025, 15, 3551. https://doi.org/10.3390/buildings15193551

AMA Style

Li C, Wang C, Sun T, Lin T, Liu J, Yu W, Wang H, Nie L. Machine Learning in Land Use Prediction: A Comprehensive Review of Performance, Challenges, and Planning Applications. Buildings. 2025; 15(19):3551. https://doi.org/10.3390/buildings15193551

Chicago/Turabian Style

Li, Cui, Cuiping Wang, Tianlei Sun, Tongxi Lin, Jiangrong Liu, Wenbo Yu, Haowei Wang, and Lei Nie. 2025. "Machine Learning in Land Use Prediction: A Comprehensive Review of Performance, Challenges, and Planning Applications" Buildings 15, no. 19: 3551. https://doi.org/10.3390/buildings15193551

APA Style

Li, C., Wang, C., Sun, T., Lin, T., Liu, J., Yu, W., Wang, H., & Nie, L. (2025). Machine Learning in Land Use Prediction: A Comprehensive Review of Performance, Challenges, and Planning Applications. Buildings, 15(19), 3551. https://doi.org/10.3390/buildings15193551

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop