Machine Learning-Based Land Cover Mapping of Nanfeng Village with Emphasis on Landslide Detection

Nguyen, Kieu Anh; Huang, Chiao-Shin; Chen, Walter

doi:10.3390/su17188250

by Kieu Anh Nguyen, Chiao-Shin Huang and Walter Chen *

Department of Civil Engineering, National Taipei University of Technology, Taipei 10608, Taiwan

* Author to whom correspondence should be addressed.

Sustainability 2025, 17(18), 8250; https://doi.org/10.3390/su17188250

Submission received: 23 June 2025 / Revised: 9 September 2025 / Accepted: 11 September 2025 / Published: 14 September 2025

(This article belongs to the Special Issue Sustainable Assessment and Risk Analysis on Landslide Hazards)

Abstract

Landslides pose a significant threat to Taiwan’s mountainous regions, particularly after extreme weather events such as typhoons. This study introduces a machine learning framework for post-disaster land use-land cover (LULC) classification and landslide detection in Nanfeng Village, central Taiwan, following Typhoon Khanun in August 2023. Using high-resolution Pléiades imagery and 22 environmental and spectral factors, a Random Forest classifier was developed. To address class imbalance, the Synthetic Minority Oversampling Technique (SMOTE) was systematically evaluated across multiple variants. The Distance_SMOTE method yielded the best results, increasing overall accuracy from 74% to 85% and the Kappa coefficient from 0.69 to 0.82. F1-scores for landslides, roads, and grassland improved markedly, reaching 0.97, 0.85, and 0.78, respectively. The optimized model produced accurate pre- and post-typhoon LULC maps, revealing significant expansion of landslide zones after the event. This study demonstrates the practical value of combining SMOTE-based resampling with Random Forest for rapid, reliable post-disaster assessment, offering actionable insights for disaster response and land management in data-imbalanced conditions. By enabling timely mapping of hazard-affected areas and informing targeted recovery actions, the approach supports disaster risk reduction, sustainable land use planning, and ecosystem restoration. These outcomes contribute to the Sustainable Development Goals, particularly SDG 11 (Sustainable Cities and Communities), SDG 13 (Climate Action), and SDG 15 (Life on Land), by strengthening community resilience, promoting climate adaptation, and protecting terrestrial ecosystems in hazard-prone regions.

Keywords: Nanfeng Village; Typhoon Khanun; post-disaster land cover mapping; landslide detection; Pléiades imagery; distance-SMOTE; random forest; Taiwan

1. Introduction

Landslides are among the most destructive natural hazards, causing significant damage to infrastructure, ecosystems, and human life, particularly in mountainous regions with steep terrain, fragile geology, and heavy rainfall [1]. Between 2004 and 2016, over 55,000 fatalities were attributed to landslides worldwide, with the highest concentration in tropical and subtropical Asia [2]. In Taiwan, vulnerability is amplified by its location along the Pacific Ring of Fire and frequent typhoons, which deliver intense seasonal rainfall [3]. Combined with active tectonics, these conditions create a high risk of slope failure.

Historic events, including the 1999 Chi-Chi Earthquake and Typhoon Morakot in 2009, underscore this susceptibility. Recently, Typhoon Khanun (August 2023) triggered widespread shallow landslides in central Taiwan. Long-term analyses using multi-seasonal Landsat imagery and nighttime light data indicate persistent landslide expansion in urban and peri-urban areas from 1998 to 2017 [4], reflecting increasing interactions between hazard processes and human settlements.

Conventional landslide mapping methods, such as field surveys and aerial photo interpretation, are labor-intensive and limited in spatial coverage. Consequently, remote sensing (RS) and digital terrain analysis integrated with geographic information systems (GIS) have become essential for large-scale landslide detection [5]. Recent studies highlight the growing use of object-based image analysis and machine learning for landslide mapping [6,7]. Algorithms like Support Vector Machines (SVM) [8], Artificial Neural Networks (ANN) [9], and Random Forest (RF) [10] have demonstrated strong performance in geospatial applications [11,12,13]. RF, in particular, is widely favored for its high accuracy, robustness to noise, and ability to model nonlinear relationships through ensemble decision trees [10].

RF has emerged as one of the most reliable and widely adopted machine learning algorithms for landslide detection, susceptibility mapping, and risk assessment, owing to its ability to capture complex, nonlinear relationships among environmental, geological, and anthropogenic factors [14,15,16,17]. Unlike traditional statistical models, RF builds an ensemble of decision trees that collectively reduce overfitting risk and improve prediction stability, even when datasets contain noise or redundant variables. Numerous studies have shown that RF achieves higher classification accuracy and predictive power than conventional methods such as logistic regression, single decision trees, and other non-ensemble algorithms [15,18]. A key strength of RF lies in its capacity to process large volumes of multi-source, heterogeneous data, thereby enabling comprehensive modeling of landslide conditioning factors and enhancing the physical relevance of susceptibility maps [16,17]. RF has also demonstrated robustness when applied to highly imbalanced datasets, which are common in landslide mapping, particularly when combined with oversampling strategies such as Synthetic Minority Oversampling Technique (SMOTE) variants [19,20]. Its versatility has been demonstrated across diverse geographic contexts, from regional-scale assessments in Kerala, India [21] to highland tropical regions such as Cameron Highlands, Malaysia [14]. Collectively, these findings highlight RF’s position as a leading tool for landslide hazard assessment, capable of integrating diverse data sources and advanced optimization techniques to deliver accurate, reliable, and site-specific predictions.

While classical machine learning models like RF remain widely used, recent studies have shown that deep learning (DL) approaches—especially convolutional neural networks (CNNs), recurrent models (e.g., long short-term memory (LSTM)), and hybrid architectures—have become state-of-the-art in landslide susceptibility mapping. These methods excel at learning hierarchical spatial patterns from high-resolution imagery and large datasets. Notable recent examples include CNN–BiLSTM–Attention models [22], DL-based landslide detection using U-Net [23,24], DL-based rainfall-induced landslide prediction [25], and deep ensembles integrating vision transformers [26]. However, despite achieving strong accuracy in benchmark studies, DL approaches often require extensive labeled datasets, long training times, and substantial computational resources—conditions that are rarely met immediately after a disaster. In such time-critical contexts, rapid-response mapping demands models that can be trained quickly and effectively on smaller datasets. In addition, recent work has introduced advanced modular intelligence models, such as the Hybrid Block Neural Network (HBNN), which integrates modular neural structures with genetic algorithms to further enhance landslide susceptibility mapping [27]. Against this backdrop, the present study adopts classical ML methods—specifically RF combined with advanced oversampling techniques—as a practical compromise that balances accuracy and efficiency, making it particularly suitable for small-area, data-limited, post-disaster applications.

A persistent challenge in these tasks is class imbalance (also referred to as skewed class distribution, non-uniform class distribution, or disproportionate class representation), where minority classes such as landslides are underrepresented, leading to biased predictions. Technically, any dataset with unequal class distributions can be considered imbalanced; however, the community generally reserves this term for cases of significant or extreme imbalance [28]. This issue is particularly critical in post-disaster mapping, where rapid detection of landslide zones is essential despite their small spatial extent. Synthetic oversampling methods like SMOTE address this problem by generating new minority class samples via interpolation [29]. Variants such as Borderline-SMOTE, Adaptive Synthetic Sampling (ADASYN), and Geometric SMOTE (G-SMOTE) further refine sample generation near class boundaries [30,31]. These techniques have been successfully applied in environmental and geospatial studies [32,33,34], including landslide susceptibility mapping [19,35] and land use/land cover (LULC) classification in soil erosion modeling [36]. However, comparative evaluations across a broad range of SMOTE variants remain scarce, as most studies consider only a few variants and primarily target large-scale susceptibility mapping rather than detailed, site-specific classification after recent disasters. Given that SMOTE algorithms can perform differently depending on dataset characteristics, a systematic side-by-side comparison is especially important for localized, high-resolution post-disaster applications, where time-sensitive and operationally reliable mapping is essential but largely untested.

This study applies RF and 65 SMOTE-based oversampling variants to classify land cover and detect landslides in Nanfeng Village, Nantou County, following Typhoon Khanun (August 2023). It focuses on shallow rainfall-induced landslides, which are common in Taiwan’s mountainous terrain during typhoons and are characterized by rapid soil and colluvium movement on steep slopes. Differentiating landslide types is beyond the current scope but remains an important direction for future research to refine predictive variables for specific failure mechanisms. The inputs are high-resolution Pléiades imagery, obtained from the Center for Space and Remote Sensing Research (CSRSR) at National Central University, and topographic derivatives (slope, aspect, curvature) derived from a 20 m Digital Elevation Model (DEM) provided by Taiwan’s National Land Surveying and Mapping Center (NLSC). The DEM was resampled to 2 m for alignment with the satellite imagery. Although resampling does not improve inherent resolution, it ensures spatial consistency for integrated analysis.

This approach combines high-resolution imagery, terrain factors, and advanced oversampling to enhance landslide detection under severe class imbalance. The integration of SMOTE with RF demonstrates practical potential for timely, accurate post-disaster mapping in complex mountainous terrain. This research is highly relevant to hazard mapping teams, local governments, and emergency management agencies in Taiwan and other mountainous regions. By determining which SMOTE variant most effectively improves landslide detection performance under severe imbalance, the findings offer an evidence-based guide for operational mapping workflows. This supports faster and more accurate post-disaster assessments, facilitates the targeted allocation of recovery resources, and strengthens the resilience of vulnerable communities. The objectives of this study are as follows:

To develop an RF-based land cover classification model capable of detecting landslides from high-resolution imagery and terrain features.
To evaluate the performance of 65 SMOTE variants in mitigating class imbalance and improving sensitivity to minority classes.
To generate a detailed, high-resolution LULC map emphasizing landslide distribution for disaster response and planning.

2. Materials and Methods

This study was conducted in Nanfeng Village, Nantou County, Taiwan, an area characterized by steep terrain and frequent slope failures (Figure 1). Elevation ranges from approximately 606 m to 2419 m above sea level, and the landscape is dissected by Mei Creek and its tributary, Nanshan Creek, forming a network of valleys and ridges. The region spans both subtropical and temperate monsoon zones, with an average annual rainfall of about 2100 mm and significant diurnal temperature variation. These conditions, combined with abundant water resources, support diverse agricultural activities, including high-mountain tea, plums, and other horticultural crops.

Nanfeng Village is highly susceptible to natural hazards due to its fragile geology, seismic history, and frequent typhoon exposure. The 1999 Chi-Chi Earthquake destabilized slopes across the region, and subsequent typhoons—such as Mindulle (2004), Sinlaku (2008), and most recently Typhoon Khanun on 3 August 2023—have repeatedly triggered severe landslides, damaging infrastructure and threatening residents.

To address the research objectives—developing an accurate post-disaster land cover classification model, assessing the effectiveness of SMOTE-based oversampling techniques, and generating detailed landslide detection maps—a workflow integrating satellite imagery, DEM-derived terrain features, and land cover information was implemented (Figure 2).

The core dataset comprises high-resolution Pléiades imagery and topographic data. The Pléiades images were acquired before and after Typhoon Khanun (26 June 2023 and 25 August 2023), providing valuable temporal context for mapping (Figure 3). Each image includes four multispectral bands—Blue (B), Green (G), Red (R), and Near-Infrared (NIR)—at 2 m resolution and a panchromatic band at 0.5 m, enabling detailed land cover discrimination and landslide identification.

A 20 m-resolution DEM obtained from the NLSC was resampled to 2 m using bilinear interpolation for alignment with Pléiades imagery. While this resampling ensured spatial consistency, it may introduce interpolation artifacts that could affect terrain derivatives such as slope and curvature [37,38]. All datasets were standardized to the TWD97/TM2 coordinate reference system to ensure consistency across layers. Additionally, preprocessing steps were applied to guarantee spatial alignment of all data sources, including high-resolution Pléiades imagery, DEM-derived topographic layers, and vegetation indices. Temporal consistency was addressed by selecting satellite images immediately before (26 June 2023) and after (25 August 2023) Typhoon Khanun, minimizing seasonal variation and allowing accurate assessment of typhoon-induced changes. From the DEM, key topographic attributes were derived, including elevation, slope, aspect, curvature indices (general, profile, and plan), and terrain metrics such as the Terrain Ruggedness Index (TRI), Topographic Position Index (TPI), and roughness. These variables capture the geomorphological context influencing land cover and landslide occurrence.
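The terrain attributes described above can be sketched with plain NumPy. The example below is a minimal illustration and not the authors' processing chain: it assumes a north-up DEM array with square cells, computes slope from finite differences, and uses common 3×3-window definitions of TPI (cell value minus the mean of its eight neighbours) and roughness (window maximum minus window minimum). The source does not state which definitions or software were used, so the function name `terrain_derivatives` and these formulas are assumptions.

```python
import numpy as np
from numpy.lib.stride_tricks import sliding_window_view


def terrain_derivatives(dem, cellsize):
    """Return slope (degrees), TPI and roughness for a 2-D DEM array.

    Assumed definitions (not confirmed by the source):
      slope     = arctan of the finite-difference gradient magnitude
      TPI       = cell value minus the mean of its 8 neighbours
      roughness = max minus min elevation in the 3x3 window
    """
    dem = np.asarray(dem, dtype=float)

    # Finite-difference gradients along rows (y) and columns (x).
    dz_dy, dz_dx = np.gradient(dem, cellsize)
    slope_deg = np.degrees(np.arctan(np.hypot(dz_dx, dz_dy)))

    # Pad by one cell (edge replication) so every cell has a 3x3 window.
    padded = np.pad(dem, 1, mode="edge")
    windows = sliding_window_view(padded, (3, 3))  # rows x cols x 3 x 3

    neighbour_mean = (windows.sum(axis=(2, 3)) - dem) / 8.0
    tpi = dem - neighbour_mean
    roughness = windows.max(axis=(2, 3)) - windows.min(axis=(2, 3))
    return slope_deg, tpi, roughness


# Synthetic inclined plane: elevation rises 2 m per 2 m cell (a 45 degree slope).
cellsize = 2.0
dem = np.tile(np.arange(6) * cellsize, (5, 1))
slope_deg, tpi, roughness = terrain_derivatives(dem, cellsize)
```

On this synthetic plane the slope is uniform and the interior TPI is zero, which gives a simple sanity check before applying the same function to a resampled DEM, where interpolation artifacts may show up as banding in slope or roughness.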

In total, 22 features were prepared for classification, encompassing spectral bands, vegetation indices such as the Normalized Difference Vegetation Index (NDVI) and Soil Adjusted Vegetation Index (SAVI), a water index (Normalized Difference Water Index, NDWI), band ratios, and DEM-derived terrain variables (Table 1). Figure 4 illustrates examples of these inputs, including elevation, slope, NDVI, and NDWI.
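The spectral indices named above follow standard band-ratio formulas, sketched here with NumPy. This is an illustrative example under stated assumptions: the SAVI soil-adjustment factor is set to the commonly used 0.5, and NDWI is written in its green/NIR form, because the text does not report which values or formulation were applied. The function name `spectral_indices` and the sample reflectances are hypothetical.

```python
import numpy as np


def spectral_indices(green, red, nir, soil_factor=0.5, eps=1e-12):
    """Compute NDVI, SAVI and NDWI from reflectance arrays.

    soil_factor (L in the SAVI formula) defaults to 0.5, a commonly used
    value; the source does not report the value it used.
    eps guards against division by zero on dark pixels.
    """
    green, red, nir = (np.asarray(b, dtype=float) for b in (green, red, nir))
    ndvi = (nir - red) / (nir + red + eps)
    savi = (1.0 + soil_factor) * (nir - red) / (nir + red + soil_factor)
    # NDWI in the green/NIR form; an assumption, since the source does not
    # state which NDWI formulation was applied.
    ndwi = (green - nir) / (green + nir + eps)
    return ndvi, savi, ndwi


# Hypothetical reflectances: first pixel vegetated, second pixel open water.
green = np.array([0.08, 0.10])
red = np.array([0.05, 0.06])
nir = np.array([0.45, 0.02])
ndvi, savi, ndwi = spectral_indices(green, red, nir)
```

Vegetated pixels should give high NDVI and negative NDWI, while water pixels show the opposite pattern, which is why the two indices are complementary inputs for separating water, vegetation, and freshly exposed soil.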

LULC data were obtained from the 2023 NLSC dataset and reclassified into seven categories: farmland, forest, roads, water bodies, built-up areas, grassland, and landslides (Figure 5). The landslide class was manually delineated through visual interpretation of a post-disaster Pléiades image acquired on 25 August 2023, following Typhoon Khanun, as no official post-typhoon inventory was available. Landslides were identified directly from the image, focusing on indicators such as newly exposed bare soil, vegetation loss, and slope disturbances. A standardized interpretation protocol was applied, and random spot checks were conducted against high-resolution Google Earth imagery to reduce manual interpretation errors. Small ambiguous patches were excluded during digitization to minimize potential false positives. These annotations were used for both training and validation, ensuring that the dataset captured both typical land cover patterns and event-specific landslide disturbances.

2.1. Random Forest Classification Framework

The LULC classification model was developed to detect landslides by integrating high-resolution Pléiades imagery, DEM-derived terrain attributes, and machine learning techniques, following the workflow shown in Figure 2. The process included image preprocessing, sample preparation, handling class imbalance, RF model training, and performance evaluation.

To support model development, two datasets were constructed: a stratified training dataset and a balanced test dataset. The stratified sampling approach ensured that the training set reflected the actual land cover distribution, while the balanced test set enabled unbiased performance evaluation across all categories.

The training dataset contained 1000 samples selected proportionally to class area from the reclassified LULC classes. Class proportions were based on the 2023 national land use database, and the allocation of training and test samples by class is presented in Table 2. In contrast, the test dataset consisted of 210 samples (30 per class) selected through random sampling to achieve class balance. This dual sampling strategy provided a realistic basis for model training and a fair evaluation framework for minority classes, particularly landslides.
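The dual sampling strategy can be sketched as follows. This is a minimal illustration rather than the authors' procedure: the helper names `proportional_allocation` and `draw_per_class` are hypothetical, the label vector is synthetic, and drawing the test samples only from pixels not used for training is an assumption that the text does not confirm.

```python
import numpy as np


def proportional_allocation(class_counts, n_total):
    """Split n_total samples across classes in proportion to class_counts.

    Uses the largest-remainder method so the allocations sum to n_total.
    """
    counts = np.asarray(class_counts, dtype=float)
    quotas = counts * n_total / counts.sum()
    alloc = np.floor(quotas).astype(int)
    shortfall = int(n_total - alloc.sum())
    # Hand the remaining samples to the classes with the largest remainders.
    order = np.argsort(quotas - alloc)[::-1]
    alloc[order[:shortfall]] += 1
    return alloc


def draw_per_class(labels, n_per_class, rng, exclude=None):
    """Randomly draw indices per class; n_per_class maps class id -> count."""
    available = np.ones(labels.size, dtype=bool)
    if exclude is not None:
        available[exclude] = False
    picked = []
    for cls, n in n_per_class.items():
        candidates = np.flatnonzero((labels == cls) & available)
        picked.append(rng.choice(candidates, size=int(n), replace=False))
    return np.concatenate(picked)


# Synthetic label vector. The shares of classes 0, 1 and 2 mimic the forest
# (87.3%), landslide (0.5%) and road (0.8%) shares reported in the text; the
# other four counts are placeholders, not values from Table 2.
pixel_counts = [87300, 500, 800, 6000, 1400, 1500, 2500]
labels = np.repeat(np.arange(7), pixel_counts)

rng = np.random.default_rng(0)
train_alloc = proportional_allocation(pixel_counts, 1000)
train_idx = draw_per_class(labels, dict(enumerate(train_alloc)), rng)
# Balanced test set: 30 per class, drawn from pixels not used for training
# (the non-overlap rule is an assumption).
test_idx = draw_per_class(labels, {c: 30 for c in range(7)}, rng, exclude=train_idx)
```

With these placeholder counts, proportional allocation leaves the landslide class with only a handful of training samples, which is the imbalance that the oversampling step is meant to correct.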

The RF classifier was implemented in Python 3.9 using the RandomForestClassifier from sklearn.ensemble. RF constructs multiple decision trees, each trained on a bootstrap sample of the training set, and determines splits based on the Gini impurity criterion [10]. Predictions are aggregated by majority vote, improving generalization and reducing overfitting, which makes RF suitable for heterogeneous geospatial data.

In this study, the RF model was configured with the following:

n_estimators: 100 (number of trees)
max_depth: 5 (tree depth to control complexity)

These parameters were fixed across all experiments for consistency when comparing SMOTE variants.
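The configuration above maps directly onto scikit-learn's `RandomForestClassifier`. The sketch below uses random synthetic data in place of the real samples, and `random_state` is an added assumption for reproducibility that the text does not mention. The split criterion is left at its default, which in scikit-learn is Gini impurity.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Synthetic stand-in for the real samples: 1000 training and 210 test rows
# with 22 features and 7 classes (values are random, for illustration only).
rng = np.random.default_rng(42)
X_train = rng.normal(size=(1000, 22))
y_train = rng.integers(0, 7, size=1000)
X_test = rng.normal(size=(210, 22))

# Hyperparameters reported in the text; random_state is an added assumption
# so that repeated runs give the same forest.
clf = RandomForestClassifier(n_estimators=100, max_depth=5, random_state=42)
clf.fit(X_train, y_train)

y_pred = clf.predict(X_test)
# Confirm that the depth limit is respected by every tree in the ensemble.
deepest_tree = max(tree.get_depth() for tree in clf.estimators_)
```

Keeping the hyperparameters fixed means that differences between runs can be attributed to the oversampling method rather than to model tuning.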

The RF model used the 22 input features previously described in Table 1, including Pléiades spectral bands, band ratios, vegetation and water indices, and DEM-derived terrain attributes. These features were selected for their relevance to land cover discrimination and landslide susceptibility.

Feature importance analysis was performed to quantify the contribution of each input variable to the classification process. The analysis utilized the built-in functionality of the RF model implemented in the scikit-learn library. Feature importance was computed using the Gini importance metric [10], also referred to as mean decrease in impurity. This approach measures the total decrease in Gini impurity attributable to each feature across all decision trees in the ensemble. The impurity decrease is accumulated for each feature over all internal nodes where the feature is used and is then averaged across the forest. The resulting scores were normalized such that the total importance across all features equals 1, allowing for direct comparison of relative contributions. In the broader domain of intelligent modeling, feature analysis is often conducted using the weight database of the optimum model in combination with sensitivity analysis techniques and feature selection approaches [39,40,41]. By contrast, in this study, we relied on the Random Forest’s built-in implementation, where feature importance is derived directly from the Gini importance function in scikit-learn [10].
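The normalized Gini importance described above is exposed by scikit-learn as the `feature_importances_` attribute of a fitted forest. The following sketch uses synthetic data in which only one feature is informative; the feature names are hypothetical and the data are not from this study.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier

# Synthetic data: only the first feature carries class information.
rng = np.random.default_rng(0)
feature_names = ["informative", "noise_a", "noise_b", "noise_c"]  # hypothetical
X = rng.normal(size=(500, 4))
y = (X[:, 0] > 0).astype(int)

clf = RandomForestClassifier(n_estimators=100, max_depth=5, random_state=0)
clf.fit(X, y)

# Mean decrease in impurity, normalized by scikit-learn to sum to 1.
importances = clf.feature_importances_
ranking = sorted(zip(feature_names, importances), key=lambda item: item[1], reverse=True)
for name, score in ranking:
    print(f"{name}: {score:.3f}")
```

Because the scores are relative and sum to 1, they are best read as a ranking within one model rather than as absolute effect sizes.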

2.2. Class Imbalance Mitigation Using SMOTE

Dataset imbalance is characterized by disproportionate sample sizes across classes, which can bias machine learning models towards majority classes, reducing detection accuracy for minority classes. Severe class imbalance posed a key challenge in this study. Landslides accounted for only 0.5% of the training samples (5 out of 1000), while forest dominated with over 87% (Table 2). Without correction, the model would likely favor majority classes, underperforming in detecting landslides.

To address this, we applied synthetic oversampling techniques using the smote-variants Python package (version 0.7.3) [42], which implements over 65 SMOTE-based algorithms. These include the original SMOTE [29], Borderline-SMOTE [30], ADASYN [31], and hybrid approaches such as SMOTE-Tomek Links [43]. Each method was applied to the same imbalanced training dataset, and the resulting models were evaluated on the fixed balanced test set (210 samples).

For each SMOTE variant, an RF model was trained with identical hyperparameters (n_estimators = 100, max_depth = 5) and evaluated using class-wise F1-scores. The variant that produced the best overall and minority-class performance was selected for final map generation.
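To make the oversampling step concrete, the sketch below implements the interpolation idea of the original SMOTE [29] in NumPy. It is not Distance_SMOTE and does not use the smote-variants API; the function name `smote_oversample`, the neighbour count `k`, and the random feature values are assumptions chosen for illustration. The sample counts (5 landslide samples raised to the 873 samples of the forest class) follow the figures reported in Section 3.2.

```python
import numpy as np


def smote_oversample(X_min, n_synthetic, k=3, rng=None):
    """Generate synthetic minority samples by SMOTE-style interpolation.

    Each synthetic point lies on the segment between a randomly chosen
    minority sample and one of its k nearest minority neighbours.
    Requires at least two minority samples.
    """
    rng = np.random.default_rng() if rng is None else rng
    X_min = np.asarray(X_min, dtype=float)
    n = len(X_min)
    k = min(k, n - 1)  # cannot have more neighbours than other samples

    # Pairwise Euclidean distances among minority samples.
    dists = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)  # a point is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]

    base = rng.integers(0, n, size=n_synthetic)       # seed samples
    chosen = neighbours[base, rng.integers(0, k, size=n_synthetic)]
    gap = rng.random((n_synthetic, 1))                # position along the segment
    return X_min[base] + gap * (X_min[chosen] - X_min[base])


# Five hypothetical landslide samples with 22 features, oversampled until the
# class matches the 873 samples of the largest class.
rng = np.random.default_rng(1)
X_landslide = rng.normal(size=(5, 22))
X_new = smote_oversample(X_landslide, n_synthetic=873 - 5, rng=rng)
X_balanced = np.vstack([X_landslide, X_new])
```

Since every synthetic point is a convex combination of two real samples, the new samples stay inside the range spanned by the observed minority data. Variants differ mainly in how they choose the seed samples, the neighbours, and the interpolation weights.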

2.3. Performance Evaluation Metrics

To comprehensively assess classification performance, this study employed widely used metrics: Overall Accuracy (OA), Kappa coefficient (κ), Producer’s Accuracy (PA), User’s Accuracy (UA), and F1-score. OA measures the proportion of correctly classified samples, while κ (Cohen’s kappa index [44]) accounts for chance agreement. PA and UA indicate class-level recall and precision, respectively, and the F1-score combines both in a harmonic mean, offering a balanced measure of accuracy for each class.

Overall Accuracy (OA): The proportion of correctly classified samples to the total number of samples N:

$\mathrm{OA} = \frac{\sum_{i=1}^{n} X_{ii}}{N}$ (1)

where $X_{ii}$ is the number of correctly classified samples in class i, and n is the number of classes.

Kappa Coefficient ($\kappa$): A measure of agreement corrected for chance, calculated as:

$\kappa = \frac{N \sum_{i=1}^{n} X_{ii} - \sum_{i=1}^{n} X_{i+} X_{+i}}{N^{2} - \sum_{i=1}^{n} X_{i+} X_{+i}}$ (2)

where $X_{i+}$ and $X_{+i}$ represent the total number of samples predicted as and actually belonging to class i, respectively.

Producer’s Accuracy (PA): Also known as recall, it measures the proportion of correctly predicted samples out of all actual samples in class i:

$\mathrm{PA}_{i} = \frac{X_{ii}}{X_{+i}}$ (3)

User’s Accuracy (UA): Also known as precision, it measures the proportion of correct predictions in class i among all samples predicted as class i:

$\mathrm{UA}_{i} = \frac{X_{ii}}{X_{i+}}$ (4)

F1-Score: The harmonic mean of PA and UA for class i:

$\mathrm{F1}_{i} = 2 \cdot \frac{\mathrm{PA}_{i} \cdot \mathrm{UA}_{i}}{\mathrm{PA}_{i} + \mathrm{UA}_{i}}$ (5)

All metrics were calculated using the confusion matrix from the balanced test set to provide unbiased performance comparisons. Particular attention was given to the landslide class F1-score, reflecting the model’s capability to detect this critical minority class.
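Equations (1) to (5) can be evaluated directly from a confusion matrix, as in the following sketch. It assumes the orientation implied by the definitions above (rows are predicted classes, columns are actual classes); the function name `accuracy_metrics` and the two-class example matrix are hypothetical and are not taken from Table 3 or Table 4.

```python
import numpy as np


def accuracy_metrics(cm):
    """Compute OA, kappa, PA, UA and F1 from a confusion matrix.

    Follows the notation of Equations (1)-(5): cm[i, j] counts samples
    predicted as class i that actually belong to class j, so row sums are
    X_{i+} (predicted totals) and column sums are X_{+i} (actual totals).
    """
    cm = np.asarray(cm, dtype=float)
    n_total = cm.sum()
    diag = np.diag(cm)
    row_sums = cm.sum(axis=1)  # X_{i+}
    col_sums = cm.sum(axis=0)  # X_{+i}

    oa = diag.sum() / n_total
    chance = (row_sums * col_sums).sum()
    kappa = (n_total * diag.sum() - chance) / (n_total**2 - chance)
    pa = diag / col_sums       # recall
    ua = diag / row_sums       # precision (undefined if a class is never predicted)
    f1 = 2 * pa * ua / (pa + ua)
    return oa, kappa, pa, ua, f1


# Hypothetical two-class matrix, used only to exercise the formulas.
cm = [[40, 10],
      [5, 45]]
oa, kappa, pa, ua, f1 = accuracy_metrics(cm)
print(f"OA={oa:.2f}, kappa={kappa:.2f}, F1={np.round(f1, 3)}")
```

If a confusion matrix is stored with the opposite orientation (rows as actual classes), PA and UA swap roles, so the orientation should be checked before comparing class-level values.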

3. Results and Discussion

This section presents the evaluation of the LULC classification and landslide detection models. We begin by assessing model performance on the original dataset, followed by a comparison with SMOTE-based oversampling methods. Lastly, we examine changes in landslide distribution before and after Typhoon Khanun.

3.1. Model Performance Using Original Dataset

The RF model was initially evaluated using the original dataset, which contained 1000 samples and exhibited class imbalance across the seven land cover categories. To ensure robust assessment of the model’s generalization capability, particularly for underrepresented classes, a balanced test set was constructed containing 210 samples, with 30 instances from each land cover class. This balanced evaluation approach is critical in land cover classification studies, as it provides unbiased estimates of model performance across all classes regardless of their representation in the training data. The classification results without applying SMOTE are summarized in the confusion matrix (Table 3), indicating an OA of 0.74 and a κ of 0.69, reflecting substantial agreement beyond chance. Although these values suggest acceptable overall reliability, they mask significant variability among individual land cover classes. This variability is largely driven by the inherent class imbalance in the training dataset and spectral similarity between certain land cover types.

The landslide class achieved the highest accuracy, with a PA of 0.90 and a UA of 0.96, resulting in an F1-score of 0.93. This indicates the model was highly effective at both detecting and correctly predicting landslides. Water bodies also performed strongly, with PA and UA values contributing to an F1-score of 0.86, reflecting consistent model reliability for this class. The forest class showed perfect PA (1.00), meaning all actual forest pixels were correctly classified; however, its lower UA (0.60) indicates a notable rate of false positives, suggesting that some non-forest areas were incorrectly classified as forest.

The built-up class showed moderate performance, with a PA of 0.70 and a UA of 0.72, leading to an F1-score of 0.71. In contrast, grassland exhibited the weakest performance: despite a perfect UA of 1.00 (indicating high precision), its low PA of 0.30 reflects poor recall, meaning that many actual grassland instances were missed. The road class displayed a similar imbalance, with a relatively high UA of 0.89 but a lower PA of 0.53. These discrepancies indicate that while some classes were easily distinguishable (e.g., water and landslides), others, particularly grassland and roads, suffered from misclassification, likely due to overlapping spectral signatures with farmland and forest. This finding underscores the limitations of using an imbalanced dataset for land cover classification and highlights the need for an effective resampling strategy to improve performance for underrepresented classes.
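As a quick arithmetic check, substituting the reported PA and UA values into the F1 definition (the harmonic mean of PA and UA) reproduces the landslide F1-score of 0.93 and gives about 0.46 for grassland. The inputs are the rounded values quoted in the text, so the last digit may differ slightly from scores computed on the raw confusion matrix.

$\mathrm{F1}_{\text{landslide}} = 2 \cdot \frac{0.90 \cdot 0.96}{0.90 + 0.96} = \frac{1.728}{1.86} \approx 0.93$

$\mathrm{F1}_{\text{grassland}} = 2 \cdot \frac{0.30 \cdot 1.00}{0.30 + 1.00} = \frac{0.60}{1.30} \approx 0.46$

The grassland case shows why F1 is informative here: a perfect UA cannot compensate for a PA of 0.30, because the harmonic mean is pulled toward the smaller of the two values.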

3.2. Effectiveness of SMOTE-Based Oversampling for Model Enhancement

To improve model performance under imbalanced data conditions, this study evaluated a comprehensive set of 65 SMOTE-based oversampling techniques implemented in the smote-variants Python library. The objective was to identify the most effective method for enhancing classification accuracy, especially for the landslide class, without degrading performance on majority classes.

All 65 SMOTE variants were applied to the same original dataset, and RF models were trained using each oversampled dataset. Model performance was evaluated on a fixed, balanced test dataset containing 30 samples per class. Figure 6 presents the OA and κ for all variants, providing a comparative view of their effectiveness.

Among all tested methods, Distance_SMOTE emerged as the best-performing technique with an OA of 0.85 and a κ of 0.82. These values represent a substantial improvement compared with the baseline model without oversampling, which achieved an OA of 0.74 and κ of 0.69. Other top-performing methods included NT_SMOTE and G_SMOTE, each achieving OA values above 0.83 and κ above 0.80. This analysis clearly demonstrates that synthetic oversampling can substantially enhance both overall classification accuracy and agreement beyond chance.

To further illustrate the benefits of synthetic balancing, we compared two RF training scenarios: (i) the original imbalanced dataset and (ii) a balanced dataset generated using Distance_SMOTE. Prior to applying SMOTE, the training data exhibited severe imbalance, with the forest class containing 873 samples, whereas the landslide and road classes had only 5 and 8 samples, respectively. After applying Distance_SMOTE, all classes were balanced to 873 samples, ensuring equal representation during model building.

Table 4 summarizes the confusion matrix of the RF model trained on the Distance_SMOTE-enhanced dataset. Compared with the original RF model (Table 3), the OA increased from 0.74 to 0.85, and κ improved from 0.69 to 0.82, indicating stronger overall agreement and reduced misclassification rates.

Class-level improvements were significant. The grassland class, previously the weakest performer, showed an increase in F1-score from 0.46 to 0.78, demonstrating markedly better recall and precision. Similarly, the roads class improved from 0.67 to 0.85, and farmland rose from 0.68 to 0.81. The forest class improved from 0.75 to 0.90, while the built-up class remained stable with moderate gains. Even minority classes such as landslides, which initially performed well, exhibited a slight increase in F1-score from 0.93 to 0.97, and water maintained a high level of accuracy. These enhancements confirm that Distance_SMOTE not only elevated performance for underrepresented classes but also preserved strong performance for majority classes.

Our results compare favorably with recent studies that addressed skewed class distributions in landslide mapping. For example, Lu et al. [19] applied four resampling methods to an imbalanced landslide dataset on Penang Island and reported that an RF + SMOTE-ENN (Edited Nearest Neighbor) model achieved a recall of 0.844 and an F2-score of 0.756, underscoring the value of oversampling for sensitivity to landslides. Similarly, Gupta and Shukla [35] used EasyEnsemble and BalanceCascade with SVM/ANN and reported AUC values up to 0.923 for the BCANN model, demonstrating substantial gains after rebalancing. In our case, the optimized RF combined with Distance_SMOTE achieved an overall accuracy of 0.85, a kappa of 0.82, and an F1-score of 0.97 for landslides, which are comparable to or exceed those reported in the literature. This indicates that systematically evaluating a wide range of SMOTE variants can yield substantial improvements for post-disaster, high-resolution landslide mapping where rapid and reliable results are critical.

In summary, applying Distance_SMOTE-based oversampling resulted in substantial accuracy improvements across all land cover categories, particularly for classes that were previously underrepresented. These findings underscore the critical role of advanced data balancing techniques in improving land cover classification in heterogeneous and complex terrains.

Feature importance analysis of the RF model trained with Distance_SMOTE (Figure 7) reveals clear patterns in variable contributions to classification accuracy. As explained in Section 2.1, the feature importance values shown in Figure 7 are derived from the built-in Gini importance metric of the Random Forest model. The NIR band emerged as the most influential feature, followed closely by Roughness and TRI, underscoring the critical role of both spectral and topographic variables in capturing landslide-prone areas. Slope and the Red band ranked next, highlighting the importance of terrain gradients and visible spectrum information for differentiating land cover classes. Vegetation-related indices such as NDVI and SAVI, along with band ratios (e.g., Red/NIR), also exhibited strong influence, indicating their value in detecting vegetation disturbance and bare soil exposure commonly associated with landslides. In contrast, curvature-based metrics (general, profile, and plan curvature) and TPI contributed little, suggesting that micro-topographic variations are less informative than broader terrain and spectral attributes. This ranking indicates that a combination of NIR reflectance, terrain ruggedness, and vegetation indicators provides the most discriminative power for accurate LULC classification with landslide detection, while features with low importance may be candidates for dimensionality reduction in future modeling efforts.

3.3. LULC Prediction Map and Landslide Distribution Change Before and After Typhoon Khanun

Building on the model selection results, the RF classifier trained with Distance_SMOTE was used to predict LULC classes and landslide areas for Nanfeng Village. To assess the model’s capability for landslide detection before and after a major event, predictions were performed on two Pléiades satellite images acquired on 26 June 2023 and 25 August 2023 (Figure 3)—capturing the landscape before and after Typhoon Khanun.

Figure 8 illustrates the spatial distribution of the six selected sample areas (A–F) within the study site. These boxes were chosen to represent locations where landslides were observed or likely to occur due to factors such as proximity to creek corridors, tributary junctions, and steep slopes. Boxes A, B, C, and D are situated in the northern part of Nanfeng Village, while Boxes E and F are in the southern section. This selection ensured representation of diverse geomorphic settings most affected by intense rainfall events.

Figure 9 provides a detailed side-by-side comparison of each box before and after Typhoon Khanun, with red overlays representing landslides predicted by the model. The left panels (a, c, e, g, i, k) correspond to pre-typhoon imagery from 26 June 2023, and the right panels (b, d, f, h, j, l) show post-typhoon imagery from 25 August 2023. This visual comparison highlights both the persistence of pre-existing landslides and the occurrence of new failures caused by the typhoon.

Notable patterns include the following:

Box A: Significant lateral expansion of an existing landslide along a creek corridor.
Box B: Enlargement of scars near drainage lines, merging into broader disturbed zones.
Box C: Multiple small slides coalescing into elongated failures along steep slopes adjacent to creeks.
Box D: Formation of new landslides on previously undisturbed vegetated slopes.
Box E: Fresh landslides near slope toes adjacent to tributary streams.
Box F: Extensive new failures forming elongated scars along steep southern slopes.

Applying the Distance_SMOTE + RF model to the two Pléiades images indicates that the classifier generalizes across acquisition dates: in the visual comparison, it detected both pre-existing landslides and areas that were new or enlarged following Typhoon Khanun. Observations from Boxes A, C, and F show that the model captures incremental changes as well as abrupt slope failures, and they point to a strong geomorphic response of the terrain to the typhoon, which destabilized slopes in both headwater regions and lower channels. The substantial post-typhoon increase in landslide extent within these boxes supports the model’s effectiveness in identifying both the expansion and initiation of landslides under extreme rainfall and highlights the suitability of the Distance_SMOTE + RF approach for dynamic landslide hazard assessment.
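The distinction between persistent, new or enlarged, and no-longer-mapped landslide pixels can be made explicit with simple mask arithmetic. The sketch below is illustrative only: the two small boolean arrays are hypothetical stand-ins for the pre- and post-typhoon landslide predictions within one box, and the 2 m pixel size is assumed from the resampled grid described in the text.

```python
import numpy as np

PIXEL_SIZE_M = 2.0  # assumed grid resolution (2 m)

# Hypothetical binary landslide masks (True = predicted landslide) for one box.
pre = np.array([[0, 1, 1, 0],
                [0, 1, 0, 0],
                [0, 0, 0, 0]], dtype=bool)
post = np.array([[0, 1, 1, 1],
                 [0, 1, 1, 0],
                 [1, 0, 0, 0]], dtype=bool)

persistent = pre & post          # landslide on both dates
new_or_expanded = ~pre & post    # landslide only after the typhoon
no_longer_mapped = pre & ~post   # landslide only before (recovery or model error)


def area_ha(mask):
    """Convert a count of True pixels to hectares."""
    return mask.sum() * PIXEL_SIZE_M ** 2 / 10_000


for label, mask in [("persistent", persistent),
                    ("new or expanded", new_or_expanded),
                    ("no longer mapped", no_longer_mapped)]:
    print(f"{label:17s} {int(mask.sum()):3d} px  {area_ha(mask):.4f} ha")
```

Reporting these three areas per box would complement the visual comparison with a quantitative change estimate.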

3.4. Limitations and Future Research Directions

Our findings confirm that the application of SMOTE variants can significantly enhance the accuracy of landslide susceptibility models by addressing class imbalance issues. This improvement aligns with earlier works (e.g., [19,32]), which demonstrated that oversampling techniques effectively boost model sensitivity and overall performance. In this study, we conducted a comprehensive evaluation of 65 SMOTE variants—far exceeding the scope of most previous research—to provide a broader understanding of their relative effectiveness. The results indicate that certain variants, such as Distance_SMOTE, achieved higher accuracy than many commonly used alternatives. This systematic comparison is rare in the existing literature and highlights the importance of selecting appropriate oversampling methods to improve predictive performance in localized post-disaster landslide mapping. Although the combination of Distance_SMOTE and RF improved post-disaster LULC and landslide mapping, several limitations remain.

First, the landslide class in the original training data was extremely underrepresented, requiring synthetic oversampling. While SMOTE-based methods mitigated imbalance, synthetic samples may not fully capture real-world heterogeneity, especially in complex mountainous terrain, which could affect generalizability. Another limitation arises from the manual extraction of landslide inventories from Pléiades imagery, which, despite cross-checking with Google Earth, may still be subject to interpreter bias. Such biases could influence the accuracy of the reference data used for training and evaluation.
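The concern about synthetic samples can be seen in how SMOTE-style interpolation works. The sketch below implements the classic SMOTE idea (interpolating between a minority sample and one of its nearest minority neighbours), not Distance_SMOTE itself; the function name `smote_sketch`, the five two-feature samples, and the choice of `k` are all hypothetical.

```python
import numpy as np


def smote_sketch(X_min, n_new, k=3, seed=0):
    """Create n_new synthetic points, each lying on the segment between a
    randomly chosen minority sample and one of its k nearest minority neighbours."""
    rng = np.random.default_rng(seed)
    # Pairwise Euclidean distances within the minority class.
    dists = np.linalg.norm(X_min[:, None, :] - X_min[None, :, :], axis=2)
    np.fill_diagonal(dists, np.inf)            # a point is not its own neighbour
    neighbours = np.argsort(dists, axis=1)[:, :k]

    base = rng.integers(0, len(X_min), size=n_new)
    picked = neighbours[base, rng.integers(0, k, size=n_new)]
    gap = rng.random((n_new, 1))               # position along the segment, in [0, 1)
    return X_min[base] + gap * (X_min[picked] - X_min[base])


# Five hypothetical landslide samples with two made-up features.
X_landslide = np.array([[0.30, 35.0], [0.32, 38.0], [0.28, 40.0],
                        [0.35, 33.0], [0.31, 42.0]])
X_synth = smote_sketch(X_landslide, n_new=100)

print("original range :", X_landslide.min(axis=0), X_landslide.max(axis=0))
print("synthetic range:", X_synth.min(axis=0), X_synth.max(axis=0))
```

Because every synthetic point is a convex combination of two real samples, the oversampled class cannot extend beyond the feature ranges of the original samples, which is one way to see why synthetic data may under-represent real-world heterogeneity.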

Second, despite efforts to maintain spatial independence between training and test sets, residual spatial clustering might still influence model performance. Future studies should apply spatial cross-validation and incorporate physically relevant variables—such as rainfall, lithology, soil properties, and historical landslide inventories—to better represent underlying processes.
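One common way to set up spatial cross-validation is to group samples into spatial blocks and keep each block entirely within either the training or the test fold. The sketch below uses scikit-learn's `GroupKFold` for this; the coordinates, features, labels, and the 1 km block size are synthetic assumptions chosen only for illustration.

```python
import numpy as np
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GroupKFold, cross_val_score

rng = np.random.default_rng(0)
n = 400
# Hypothetical sample coordinates (metres) and features; synthetic values only.
xy = rng.uniform(0, 4000, size=(n, 2))
X = rng.normal(size=(n, 5))
y = (X[:, 0] > 0).astype(int)

# Assign each sample to a 1 km x 1 km block; the block id is the CV group.
BLOCK_M = 1000
block_id = (xy[:, 0] // BLOCK_M).astype(int) * 100 + (xy[:, 1] // BLOCK_M).astype(int)

cv = GroupKFold(n_splits=4)
for train_idx, test_idx in cv.split(X, y, groups=block_id):
    # No spatial block appears on both sides of a split.
    assert set(block_id[train_idx]).isdisjoint(block_id[test_idx])

scores = cross_val_score(RandomForestClassifier(n_estimators=100, random_state=0),
                         X, y, groups=block_id, cv=cv)
print("mean accuracy across spatial folds:", round(scores.mean(), 3))
```

With real data, the block size would need to be chosen relative to the range of spatial autocorrelation in the predictors.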

Third, this analysis focused on a single event (Typhoon Khanun) in one locality, which limits transferability. Expanding the approach to other regions and events is essential to assess scalability. Multi-event training datasets and the inclusion of lithology, soil properties, and hydrological factors are proposed to improve model robustness and generalizability; the absence of geological and geotechnical variables in this study further constrains applicability to diverse terrains. In addition, the study does not distinguish between different landslide types; future research should incorporate detailed classifications of landslide mechanisms (e.g., shallow vs. deep-seated failures, debris flows) to enhance the physical relevance of predictive models. This study also did not benchmark against deep learning (DL) or hybrid models, which may offer improved performance. Comparative analyses involving cost-sensitive learning, ensemble resampling, or Generative Adversarial Network (GAN)-based augmentation could provide further insights.

Fourth, while performance metrics were reported, no spatial error analysis was conducted. Misclassifications were probably concentrated near class boundaries or in shadowed terrain, for example the confusion between farmland and grassland and the occasional road misclassification attributable to narrow road geometry. Mapping false positives and negatives would help link statistical errors to geomorphic conditions.

Fifth, parameter tuning for SMOTE (e.g., number of nearest neighbors) and RF (e.g., number of trees, maximum depth) was not explored. Default settings were retained for consistency across the 65 SMOTE variants. However, parameter optimization and sensitivity analysis could further improve model robustness and reduce overfitting.
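A basic form of such tuning is a cross-validated grid search over the RF settings. The sketch below uses scikit-learn's `GridSearchCV` on a synthetic 22-feature dataset from `make_classification`; the grid values, the macro-F1 scoring choice, and the data are assumptions for illustration and do not reflect settings used in the study.

```python
from sklearn.datasets import make_classification
from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import GridSearchCV

# Synthetic stand-in: 22 features, 3 classes (not the study's data).
X, y = make_classification(n_samples=300, n_features=22, n_informative=6,
                           n_classes=3, random_state=0)

# Hypothetical search grid over number of trees and maximum depth.
param_grid = {"n_estimators": [50, 200], "max_depth": [None, 10]}

search = GridSearchCV(RandomForestClassifier(random_state=0), param_grid,
                      scoring="f1_macro", cv=3)
search.fit(X, y)

print("best parameters:", search.best_params_)
print("best macro-F1  :", round(search.best_score_, 3))
```

When oversampling is part of the workflow, it should be applied inside each training fold rather than before the split, so that synthetic samples derived from validation data do not leak into training.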

Sixth, DEM resampling from 20 m to 2 m ensured alignment with Pléiades imagery but introduced potential interpolation artifacts, which may affect terrain derivatives such as slope and curvature. Future research should evaluate the impact of DEM quality on classification accuracy.
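To illustrate how resampling can distort terrain derivatives, the sketch below upsamples a synthetic 20 m ramp to 2 m by nearest-neighbour replication and recomputes slope. The interpolation method actually used in the study is not stated here, so nearest-neighbour is an assumption chosen because it makes the artifact easy to see; smoother interpolators produce milder effects.

```python
import numpy as np

COARSE_M, FINE_M = 20.0, 2.0
factor = int(COARSE_M / FINE_M)   # 10 fine pixels per coarse pixel

# Hypothetical 20 m DEM: a uniform ramp rising 10 m per cell along the x axis.
coarse = np.tile(np.arange(10) * 10.0, (10, 1))
true_slope_deg = np.degrees(np.arctan(10.0 / COARSE_M))

# Nearest-neighbour upsampling: every coarse cell becomes a flat 10 x 10 block.
fine = np.repeat(np.repeat(coarse, factor, axis=0), factor, axis=1)

# Slope from finite differences on the 2 m grid.
dz_dy, dz_dx = np.gradient(fine, FINE_M)
slope_deg = np.degrees(np.arctan(np.hypot(dz_dx, dz_dy)))

flat_fraction = np.mean(slope_deg == 0)
print(f"true slope        : {true_slope_deg:.1f} deg")
print(f"max derived slope : {slope_deg.max():.1f} deg")
print(f"flat pixel share  : {flat_fraction:.2f}")
```

In this toy case a uniform slope turns into flat terraces separated by narrow, overly steep steps, which is the kind of artifact that could propagate into slope, curvature, and roughness layers.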

Overall, this study provides a timely demonstration of post-disaster mapping in Taiwan using widely available satellite and elevation data. Addressing the above limitations is essential for developing more physically grounded and transferable machine learning frameworks for landslide susceptibility assessment. In particular, incorporating spatial uncertainty quantification and benchmarking interpretable models against deep or hybrid architectures may help balance predictive accuracy with explainability in geospatial hazard applications. Looking forward, novel approaches such as automated hybrid ensemble-based deep learning, integration with 3D geo-models, and explicit uncertainty analysis [45] represent promising directions for enhancing both accuracy and reliability in post-disaster landslide mapping.

4. Conclusions

This study developed a machine-learning-based framework for LULC classification and landslide mapping under severe class imbalance using high-resolution RS and terrain data. An initial RF model trained on an imbalanced dataset achieved acceptable overall accuracy but underperformed for minority classes. To address this, 65 SMOTE-based oversampling methods were systematically evaluated. Distance_SMOTE produced the best results, raising OA from 0.74 to 0.85 and improving F1-scores for minority classes such as landslides, roads, and grassland.

The optimized Distance_SMOTE + RF model was applied to Pléiades images acquired before and after Typhoon Khanun (June and August 2023), enabling detailed post-disaster assessment. The model successfully captured both the expansion of existing landslides and the initiation of new failures, verified through visual analysis in six sample areas.

Rather than introducing a new algorithm, this work demonstrates the operational value of systematically comparing SMOTE variants for rapid post-disaster mapping in data-scarce contexts. The approach highlights how oversampling can enhance interpretability and performance without requiring complex deep learning models, making it suitable for time-sensitive disaster response. These findings provide practical guidance for improving classification workflows in mountainous terrain and underscore the role of data augmentation in supporting geospatial hazard analysis.

Author Contributions

Conceptualization, W.C. and K.A.N.; data curation, W.C., K.A.N. and C.-S.H.; funding acquisition, W.C. and K.A.N.; investigation, W.C. and K.A.N.; methodology, W.C., K.A.N. and C.-S.H.; project administration, W.C. and K.A.N.; resources, W.C. and K.A.N.; software, W.C., K.A.N. and C.-S.H.; supervision, W.C.; validation, W.C., K.A.N. and C.-S.H.; visualization, K.A.N. and C.-S.H.; writing—original draft, W.C. and K.A.N.; writing—review and editing, W.C., K.A.N. and C.-S.H. All authors have read and agreed to the published version of the manuscript.

Funding

This study was partially supported by the National Science and Technology Council (Taiwan) under Research Project Grant Numbers NSTC 113-2121-M-008-004 and NSTC 113-2625-M-027-007.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data used in this study are not publicly available due to restrictions imposed by the data owner or source. Therefore, the data cannot be disseminated or shared as part of this publication. Interested researchers can request access to the data directly from the data owner or source, subject to their terms and conditions. The authors confirm that they do not have the right to distribute the data used in this study.

Acknowledgments

This study is derived from the Master’s thesis of Chiao-Shin Huang. While the central idea remains the same, the input variables used in this study have been substantially expanded, and the analyses have been entirely redone. As a result, the findings presented here differ from those in the original thesis. The authors acknowledge the use of ChatGPT 5, a large language model developed by OpenAI, to improve the readability and language of the manuscript. All AI-generated content was thoroughly reviewed and revised by the authors, who assume full responsibility for the final version of the publication. The authors gratefully acknowledge the Center for Space and Remote Sensing Research (CSRSR) at National Central University (NCU) for providing the satellite imagery used in this study. Pléiades imagery: © CNES 2023–2023, Distribution Airbus DS. The imagery data were sublicensed by Airbus DS from CSRSR, NCU. The authors would also like to thank the National Science and Technology Council (NSTC) and CSRSR/NCU for supplying the satellite imagery data.

Conflicts of Interest

The authors declare no conflicts of interest.

References

1. Tuganishuri, J.; Yune, C.-Y.; Kim, G.; Lee, S.W.; Adhikari, M.D.; Yum, S.-G. Prediction of the volume of shallow landslides due to rainfall using data-driven models. Nat. Hazards Earth Syst. Sci. 2025, 25, 1481–1499.
2. Froude, M.J.; Petley, D.N. Global fatal landslide occurrence from 2004 to 2016. Nat. Hazards Earth Syst. Sci. 2018, 18, 2161–2181.
3. Huang, C.Y.; Lin, Y.H.; Yang, C.H.; Tseng, C.M. Hazard Assessment of Potential Large-Scale Landslides in the Watershed of the Chenyulan River. Water 2022, 14, 3692.
4. Chen, T.-H.K.; Prishchepov, A.V.; Fensholt, R.; Sabel, C.E. Detecting and monitoring long-term landslides in urbanized areas with nighttime light data and multi-seasonal Landsat imagery across Taiwan from 1998 to 2017. Remote Sens. Environ. 2019, 225, 317–327.
5. Guzzetti, F.; Mondini, A.C.; Cardinali, M.; Fiorucci, F.; Santangelo, M.; Chang, K.T. Landslide inventory maps: New tools for an old problem. Earth-Sci. Rev. 2012, 112, 42–66.
6. Zhao, C.; Lu, Z. Remote sensing of landslides—A review. Remote Sens. 2018, 10, 279.
7. Mohan, A.; Singh, A.K.; Kumar, B.; Dwivedi, R. Review on remote sensing methods for landslide detection using machine and deep learning. Trans. Emerg. Telecommun. Technol. 2021, 32, e3998.
8. Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297.
9. Rumelhart, D.E.; Hinton, G.E.; Williams, R.J. Learning representations by back-propagating errors. Nature 1986, 323, 533–536.
10. Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32.
11. Pradhan, B. Application of an advanced fuzzy logic model for landslide susceptibility analysis. Int. J. Comput. Intell. Syst. 2010, 3, 370–381.
12. Chowdhury, M.S. Comparison of accuracy and reliability of random forest, support vector machine, artificial neural network and maximum likelihood method in land use/cover classification of urban setting. Environ. Challenges 2024, 14, 100800.
13. Trucchia, A.; Izadgoshasb, H.; Isnardi, S.; Fiorucci, P.; Tonini, M. Machine-learning applications in geosciences: Comparison of different algorithms and vegetation classes’ importance ranking in wildfire susceptibility. Geosciences 2022, 12, 424.
14. Nhu, V.H.; Mohammadi, A.; Shahabi, H.; Ahmad, B.B.; Al-Ansari, N.; Shirzadi, A.; Nguyen, H. Landslide detection and susceptibility modeling on Cameron Highlands (Malaysia): A comparison between random forest, logistic regression and logistic model tree algorithms. Forests 2020, 11, 830.
15. Tanyu, B.F.; Abbaspour, A.; Alimohammadlou, Y.; Tecuci, G. Landslide susceptibility analyses using Random Forest, C4.5, and C5.0 with balanced and unbalanced datasets. Catena 2021, 203, 105355.
16. Liu, W.; Zhang, Y.; Liang, Y.; Sun, P.; Li, Y.; Su, X.; Meng, X. Landslide risk assessment using a combined approach based on InSAR and random forest. Remote Sens. 2022, 14, 2131.
17. Li, M.; Wang, H.; Chen, J.; Zheng, K. Assessing landslide susceptibility based on the random forest model and multi-source heterogeneous data. Ecol. Indic. 2024, 158, 111600.
18. Abdelkader, M.M.; Csámer, Á. Comparative assessment of machine learning models for landslide susceptibility mapping: A focus on validation and accuracy. Nat. Hazards 2025, 121, 10299–10321.
19. Lu, M.; Tay, L.T.; Mohamad-Saleh, J. Landslide susceptibility analysis using random forest model with SMOTE-ENN resampling algorithm. Geomat. Nat. Hazards Risk 2024, 15, 2314565.
20. Lv, M.-z.; Li, K.-l.; Cai, J.-z.; Mao, J.; Gao, J.-j.; Xu, H. Evaluation of landslide susceptibility based on SMOTE-Tomek sampling and machine learning algorithm. PLoS ONE 2025, 20, e0323487.
21. Badapalli, P.K.; Nakkala, A.B.; Kottala, R.B.; Gugulothu, S.; Hasher, F.F.B.; Mishra, V.N.; Zhran, M. Landslide susceptibility level mapping in Kozhikode, Kerala, using machine learning-based random forest, remote sensing, and GIS techniques. Land 2025, 14, 1453.
22. Ju, X.; Li, J.; Sun, C.; Li, B. Landslide susceptibility assessment using a CNN–BiLSTM–AM model. Sustainability 2024, 16, 9476.
23. Nigelesh, T.M.; Shruthik, V.S.; Reddy, V.S.; Singh, R.P. Landslide Detection in Satellite Images using InceptionU-Net and Convolutional Block Attention Module. Procedia Comput. Sci. 2025, 258, 4301–4310.
24. Hussaine, S.M.; Mu, L.; Lu, Y.; Hussain, S.S. Landslide Image Segmentation with Attention Residual U-Net: A Hybrid Deep Learning Model. Procedia Comput. Sci. 2025, 258, 2029–2039.
25. Liu, Y.; Ma, S.; Dong, L.; Xiao, R.; Huang, J.; Zhou, P. A comparative study of regional rainfall-induced landslide early warning models based on RF, CNN, and MLP algorithms. Front. Earth Sci. 2024, 12, 1419421.
26. Bao, S.; Liu, J.; Wang, L.; Konečný, M.; Che, X.; Xu, S.; Li, P. Landslide susceptibility mapping by fusing convolutional neural networks and vision transformer. Sensors 2023, 23, 88.
27. Abbaszadeh Shahri, A.; Maghsoudi Moud, F. Landslide susceptibility mapping using hybridized block modular intelligence model. Bull. Eng. Geol. Environ. 2021, 80, 267–284.
28. He, H.; Garcia, E.A. Learning from imbalanced data. IEEE Trans. Knowl. Data Eng. 2009, 21, 1263–1284.
29. Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic Minority Over-sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357.
30. Han, H.; Wang, W.Y.; Mao, B.H. Borderline-SMOTE: A new over-sampling method in imbalanced data sets learning. In Proceedings of the International Conference on Intelligent Computing, ICIC 2005, Hefei, China, 23–26 August 2005; pp. 878–887.
31. He, H.; Bai, Y.; Garcia, E.A.; Li, S. ADASYN: Adaptive synthetic sampling approach for imbalanced learning. In Proceedings of the IEEE International Joint Conference on Neural Networks (IEEE World Congress on Computational Intelligence), Hong Kong, China, 1–8 June 2008; pp. 1322–1328.
32. Douzas, G.; Bacao, F. Improving imbalanced learning through a heuristic oversampling method based on k-means and SMOTE. Inf. Sci. 2018, 465, 1–20.
33. Kumar, P.; Priyanka, P.; Uday, K.V.; Dutt, V. Addressing class imbalance in soil movement predictions. Nat. Hazards Earth Syst. Sci. 2024, 24, 1913–1928.
34. Tang, L.; Yu, X.; Jiang, W.; Zhou, J. Comparative study on landslide susceptibility mapping based on unbalanced sample ratio. Sci. Rep. 2023, 13, 5823.
35. Gupta, S.K.; Shukla, D.P. Handling data imbalance in machine learning based landslide susceptibility mapping: A case study of Mandakini River Basin, North-Western Himalayas. Landslides 2023, 20, 933–949.
36. Nguyen, K.A.; Chen, W. Enhancing Cover Management Factor Classification Through Imbalanced Data Resolution. Environments 2024, 11, 250.
37. Shahri, A.A.; Spross, J.; Johansson, F.; Larsson, S. Landslide susceptibility hazard map in southwest Sweden using artificial neural network. Catena 2019, 183, 104225.
38. Mishra, V.K.; Nareti, U.; Kumar, R.; Pant, T.; Aleem, A.; Singh, A.; Biable, S.E. GDF: A Novel Image Fusion Approach for Compelling Depiction of Earthly Features. J. Sens. 2023, 2023, 9429505.
39. Zhang, P. A novel feature selection method based on global sensitivity analysis with application in machine learning-based prediction model. Appl. Soft Comput. 2019, 85, 105859.
40. Naik, D.L.; Kiran, R. A novel sensitivity-based method for feature selection. J. Big Data 2021, 8, 128.
41. Yuan, Z.; Liang, P.; Silva, T.; Yu, K.; Mottershead, J.E. Parameter selection for model updating with global sensitivity analysis. Mech. Syst. Signal Process. 2019, 115, 483–496.
42. Kovács, G. Smote-variants: A Python Implementation of 85 Minority Oversampling Techniques. Neurocomputing 2019, 366, 352–354.
43. Batista, G.E.A.P.A.; Prati, R.C.; Monard, M.C. A Study of the Behavior of Several Methods for Balancing Machine Learning Training Data. SIGKDD Explor. Newsl. 2004, 6, 20–29.
44. Cohen, J. A coefficient of agreement for nominal scales. Educ. Psychol. Meas. 1960, 20, 37–46.
45. Abbaszadeh Shahri, A.; Chunling, S.; Larsson, S. A hybrid ensemble-based automated deep learning approach to generate 3D geo-models and uncertainty analysis. Eng. Comput. 2024, 40, 1501–1516.

Figure 1. Study area: Location of Nanfeng Village in Taiwan. The left panel shows Taiwan with the study site marked in red, while the right panel provides a detailed view of Nanfeng Village outlined in red.

Figure 2. Workflow of the RF model incorporating SMOTE-based oversampling for land cover mapping with an emphasis on landslide detection. The process includes data collection (LULC, satellite imagery, and topographical factors), preprocessing, dataset preparation (original and upsampled), and model training for generating the final LULC classification.

Figure 3. Pléiades satellite imagery of Nanfeng Village before and after Typhoon Khanun (3 August 2023). (a) Image acquired on 26 June 2023, prior to significant landslide occurrences. (b) Image acquired on 25 August 2023, showing post-typhoon conditions and visible landslide features.

Figure 4. Input feature maps of Nanfeng Village: (a) elevation and (b) slope, derived from the resampled 2 m DEM; (c) NDVI and (d) NDWI, derived from Pléiades imagery. These features were used as input variables for land cover classification and landslide detection.

Figure 5. LULC map of Nanfeng Village in 2023. Landslide areas were manually delineated based on visual interpretation of a Pléiades image acquired on 25 August 2023, after Typhoon Khanun.

Figure 6. Comparison of OA and κ across 65 SMOTE variants applied to the RF model for land cover classification with emphasis on landslide detection.

Figure 7. Feature importance ranking of the RF model after applying Distance_SMOTE, showing the contribution of spectral bands, vegetation indices, and topographic factors to land cover classification with emphasis on landslide detection. The importance values were computed using the built-in Gini importance (mean decrease in impurity) of the Random Forest model.

Figure 8. Spatial distribution of six selected sample areas (A–F) within Nanfeng Village used for visual comparison of landslide patterns before and after Typhoon Khanun. Each red box highlights a region of significant change: (A) expansion of existing landslides along steep slopes near a creek corridor; (B) enlargement of pre-existing landslide scars; (C) multiple adjacent landslides coalescing into a larger disturbed area; (D) newly triggered landslides on previously stable slopes; (E) additional landslide activity along drainage channels; and (F) extensive new failures on steep southern slopes. These areas illustrate both the expansion of existing landslides and the initiation of new ones, providing visual evidence of the typhoon’s geomorphic impact.

Figure 9. Zoomed-in visual comparison of landslide extent within the six sample areas (A–F) identified in Figure 8, showing pre- and post-Typhoon Khanun conditions. The red overlays indicate landslide-affected areas. (a,b) Box A: Expansion of a pre-existing landslide near a creek corridor, with notable lateral spread. (c,d) Box B: Enlargement of landslide scars along upper slopes adjacent to drainage lines, merging into a broader disturbed zone. (e,f) Box C: Multiple small failures coalescing into elongated scars along steep slopes. (g,h) Box D: Emergence of new landslides on previously undisturbed vegetated slopes. (i,j) Box E: Development of landslides near drainage channels and slope toes. (k,l) Box F: Significant new landslide activity forming elongated scars on steep southern slopes. These observations highlight both the initiation of new landslides and the expansion of existing ones, illustrating the geomorphic impact of Typhoon Khanun on the terrain.

Table 1. List of input factors used in the RF model for land cover classification and landslide detection, including topographic attributes from DEM, terrain indices, vegetation indices, spectral bands, and band ratios derived from Pléiades imagery.

No.	Category	Factor
1	DEM-Topographic	Elevation
2	DEM-Topographic	Slope
3	DEM-Topographic	Aspect
4	DEM-Curvature	General Curvature
5	DEM-Curvature	Profile Curvature
6	DEM-Curvature	Plan Curvature
7	DEM-Terrain Index	TRI
8	DEM-Terrain Index	TPI
9	DEM-Terrain Index	Roughness
10	Pléiades-Vegetation Index	NDVI
11	Pléiades-Vegetation Index	SAVI
12	Pléiades-Water Index	NDWI
13	Pléiades-Spectral Band	Red Band
14	Pléiades-Spectral Band	Green Band
15	Pléiades-Spectral Band	Blue Band
16	Pléiades-Spectral Band	Near-Infrared (NIR) Band
17	Pléiades-Band Ratio	Red/Green
18	Pléiades-Band Ratio	Red/Blue
19	Pléiades-Band Ratio	Blue/Green
20	Pléiades-Band Ratio	Red/NIR
21	Pléiades-Band Ratio	Blue/NIR
22	Pléiades-Band Ratio	Green/NIR

Table 2. Original sample allocation by land cover class for training and test datasets, including area (ha), area percentage, and sample counts for each class.

Land Cover Class	Area (ha)	Area (%)	Training Dataset	Test Dataset
Farmland	254.0	6.0	60	30
Forest	3701.0	87.3	873	30
Transportation (Roads)	36.1	0.9	8	30
Water Bodies	75.3	1.8	18	30
Built-up Areas	106.2	2.5	25	30
Grassland	46.1	1.1	11	30
Landslides	21.2	0.5	5	30
Total	4239.8	100.0	1000	210

Table 3. Confusion matrix for land cover classification using the RF model (without SMOTE), highlighting performance across classes with a focus on landslide detection. Metrics reported include UA, PA, F1-score, OA, and κ.

	Actual
		Farmland	Forest	Roads	Water	Built-up	Grassland	Landslides	Row Total	UA
Predicted	Farmland	24	0	5	1	5	4	2	41	0.59
	Forest	3	30	0	0	0	17	0	50	0.60
	Roads	0	0	16	1	0	0	1	18	0.89
	Water	1	0	2	28	4	0	0	35	0.80
	Built-up	2	0	6	0	21	0	0	29	0.72
	Grassland	0	0	0	0	0	9	0	9	1.00
	Landslides	0	0	1	0	0	0	27	28	0.96
	Column Total	30	30	30	30	30	30	30	210
	PA	0.80	1.00	0.53	0.93	0.70	0.30	0.90
	F1-score	0.68	0.75	0.67	0.86	0.71	0.46	0.93
	OA	0.74
	κ	0.69

Table 4. Confusion matrix for the RF model with Distance_SMOTE, showing improved performance in landslide detection and overall classification accuracy. Reported metrics include UA, PA, F1-score, OA, and κ.

	Actual
		Farmland	Forest	Roads	Water	Built-up	Grassland	Landslides	Row Total	UA
Predicted	Farmland	26	0	0	0	2	5	1	34	0.76
	Forest	1	30	0	0	0	6	0	37	0.81
	Roads	0	0	23	1	0	0	0	24	0.96
	Water	1	0	2	29	5	0	1	38	0.76
	Built-up	2	0	5	0	23	0	0	30	0.77
	Grassland	0	0	0	0	0	19	0	19	1.00
	Landslides	0	0	0	0	0	0	28	28	1.00
	Column Total	30	30	30	30	30	30	30	210
	PA	0.87	1.00	0.77	0.97	0.77	0.63	0.93
	F1-score	0.81	0.90	0.85	0.85	0.77	0.78	0.97
	OA	0.85
	κ	0.82
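The summary metrics in Table 4 can be recomputed directly from the confusion matrix. The sketch below copies the counts from Table 4 (rows are predicted classes, columns are actual classes) and applies the standard definitions of UA, PA, F1-score, OA, and Cohen's κ; the variable names are illustrative.

```python
import numpy as np

classes = ["Farmland", "Forest", "Roads", "Water",
           "Built-up", "Grassland", "Landslides"]

# Rows = predicted class, columns = actual class (counts copied from Table 4).
cm = np.array([
    [26,  0,  0,  0,  2,  5,  1],
    [ 1, 30,  0,  0,  0,  6,  0],
    [ 0,  0, 23,  1,  0,  0,  0],
    [ 1,  0,  2, 29,  5,  0,  1],
    [ 2,  0,  5,  0, 23,  0,  0],
    [ 0,  0,  0,  0,  0, 19,  0],
    [ 0,  0,  0,  0,  0,  0, 28],
])

n = cm.sum()
diag = np.diag(cm)
row_tot = cm.sum(axis=1)          # samples predicted as each class
col_tot = cm.sum(axis=0)          # reference samples of each class

ua = diag / row_tot               # user's accuracy (precision)
pa = diag / col_tot               # producer's accuracy (recall)
f1 = 2 * ua * pa / (ua + pa)
oa = diag.sum() / n               # overall accuracy
pe = (row_tot * col_tot).sum() / n ** 2   # chance agreement
kappa = (oa - pe) / (1 - pe)

for name, u, p, f in zip(classes, ua, pa, f1):
    print(f"{name:11s} UA={u:.2f} PA={p:.2f} F1={f:.2f}")
print(f"OA={oa:.2f} kappa={kappa:.2f}")
```

Running the same calculation on the counts in Table 3 gives the corresponding baseline values, which makes the two tables easy to cross-check.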

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

