1. Introduction
Forests are among the most vital ecosystems on Earth, serving not only as key habitats for global biodiversity but also as essential components of the hydrological cycle, carbon cycle, and soil conservation processes [
1]. By absorbing carbon dioxide, regulating climate, and releasing oxygen, forests play an irreplaceable role in maintaining global climate stability [
2]. However, under the intensification of global climate change and human disturbances, forest ecosystems are increasingly threatened, among which forest wildfires stand out as one of the most severe and widespread natural hazards with profound ecological consequences [
3,
4]. Wildfires are triggered by both natural factors (e.g., lightning) and human activities (e.g., agricultural burning), and they can lead to significant ecological damage, including the loss of forest resources, deterioration of air quality, increased carbon emissions, and reduced soil stability. When fires occur too frequently or with high intensity, the resilience and regeneration capacity of forest ecosystems are weakened, ultimately resulting in long-term functional degradation [
5]. Each year, forest wildfires cause substantial ecological and economic losses globally, severely disrupting the carbon cycle and climate system and imposing considerable environmental and societal costs [
6]. In recent decades, the scale, intensity, and frequency of forest wildfires have continued to increase worldwide, particularly in forest-rich regions such as North America, South America, and Australia [
7]. In China, wildfires are predominantly concentrated in the northeast and southwest, especially in the northeastern forest zone and high-risk regions such as Yunnan and Sichuan, where changing climatic conditions are driving an upward trend in wildfire occurrence and severity, posing growing challenges to local ecosystems, economic development, and social security [
8].
Cloud removal remains a major challenge in satellite-based Earth observation. Common approaches—such as Savitzky–Golay (SG) smoothing [
9] and the Harmonic Analysis of Time Series (HANTS) [
10]—can effectively suppress clouds and their shadows. However, they often perform poorly when the objective is to detect short-lived or abrupt surface changes, such as those associated with forest wildfires. Although HANTS efficiently smooths multi-temporal imagery, misclassification may still occur because clouds and active or burned surfaces exhibit similar spectral characteristics in the infrared domain, particularly during fire events [
11]. Previous studies have also demonstrated that cloud and cloud-shadow contamination can easily be mistaken for burned surfaces, reducing the accuracy of post-fire mapping [
12]. Furthermore, harmonic-based gap-filling algorithms such as HANTS may be unreliable for capturing short-duration and rapidly evolving land-surface dynamics [
13]. Conventional spatial interpolation methods typically infill missing pixels using imagery from other dates, based on the assumption that surface conditions change gradually over time. However, because wildfires occur abruptly and unpredictably, substituting imagery from different periods rarely captures the true fire conditions. Prior work has highlighted that during active fire periods, rapid surface changes and cloud obstruction limit the reliability of temporal gap-filling or compositing, leading to uncertainties in burned area monitoring outputs [
14,
15,
16]. Taken together, these issues indicate that single cloud-removal or interpolation strategies are inadequate for wildfire monitoring, as the fast-changing surface signals and persistent cloud effects during fire episodes exceed the capability of traditional approaches. Sentinel-2 offers high spatial resolution and a short revisit interval; nevertheless, extensive cloud cover remains problematic, especially over plateau regions [
17]. Hofmeister et al. [
18] highlighted that, despite the nominal 5-day revisit, acquiring cloud-free scenes in high-elevation areas within short time windows is difficult, which constrains the timeliness of fire monitoring. Landsat 8 revisits less frequently (8 days) but provides stable imagery at 30 m resolution, enabling precise burned area delineation over broad extents. Therefore, to overcome the limitations of any single sensor, integrating Landsat 8 and Sentinel-2 imagery—thereby combining finer spatial detail with denser temporal sampling—can substantially improve both the accuracy and the timeliness of wildfire monitoring. In current wildfire studies, many efforts rely on incident records from ground agencies, compiled from field reports, suppression activities or aerial patrols [
19,
20]. While useful, these datasets are constrained in spatial coverage and latency, leading to omissions that fail to capture the full distribution and dynamics of fires. Researchers often select a few typical events for analysis to validate single-fire identification [
21,
22,
23]. Although this helps assess feature extraction and classifier suitability, the limited spatial focus hampers recognition of multiple or spatially dispersed fires at regional scales and can bias monitoring outcomes. At the regional scale, sample construction remains pivotal. Most studies still depend on manual labelling, in which researchers delineate burn perimeters by visual interpretation and select representative samples for training [
8,
24,
25,
26]. This works for large, clearly scarred fires but is labor-intensive, subjective and difficult to scale to large areas and multi-temporal monitoring.
To address these constraints, recent work has explored automated or semi-automated sample generation to enhance efficiency and objectivity [
27,
28]. Recent studies have demonstrated the potential of automated or semi-automated sample generation to improve burned-area mapping, yet important limitations remain. For example, Kulinan et al. [
29] applied an object-based workflow to derive training samples from Sentinel-2 imagery for the Uljin wildfire in South Korea, showing that automated sampling can achieve high classification accuracy. Similarly, Silva and Modica [
5] integrated dNBR with radar backscatter features to construct burned and unburned samples through a semi-automated procedure, illustrating the effectiveness of combining multisource indicators. Together, these studies suggest that automation can enhance mapping reliability, but their methods still depend on event-specific conditions and empirical thresholds, limiting generalisability across heterogeneous landscapes. Therefore, developing an approach capable of producing accurate and scalable regional fire recognition under limited training samples remains an important challenge—one that this study seeks to address.
The integrated use of multi-source remote sensing provides rich spectral and index features for burned area detection [
30]. However, expanding the feature set often introduces high-dimensional redundancy and multicollinearity, which increases computational cost and can degrade classification accuracy and stability [
31]. Establishing an efficient feature-selection strategy is therefore essential. Given the heterogeneity of burn signatures across spectral responses, terrain context and vegetation types, identifying the most discriminative variables is a central challenge. Logistic Regression (LR) [
32], Random Forest (RF) [
33] and Boruta [
34] are widely used for feature selection and importance assessment. LR quantifies linear relations between predictors and class membership and facilitates significance testing; RF, as an ensemble learner, evaluates importance within non-linear spaces and is robust to noise and outliers; Boruta augments RF with shadow features to iteratively isolate globally significant variables. Combined, these methods reduce redundancy and systematically identify features that contribute most to burned area classification, enabling models that are both interpretable and accurate.
With the maturation of machine-learning methods, data-driven classifiers show strong potential for wildfire monitoring and burned area mapping [
35]. Pixel-based classification, however, is prone to noise in high-resolution scenes—especially in southwestern mountainous regions and heterogeneous landscapes—leading to fuzzy boundaries and reduced accuracy. Object-based image analysis (GEOBIA) has therefore gained traction [
36]. The Simple Non-Iterative Clustering (SNIC) algorithm aggregates spectrally similar, spatially contiguous pixels into super pixels by fusing spectral and spatial cues, thereby improving spatial consistency and boundary precision [
37]. Although SNIC’s segmentation strengths are well documented, its deeper integration with machine-learning classifiers remains underexplored. In particular, under multi-source data and southwestern mountainous environments, the synergistic performance and generalisation of SNIC-based segmentation with machine-learning classifiers lack systematic validation. Accordingly, this study integrates multi-source imagery with SNIC segmentation and multiple machine-learning classifiers to build a high-accuracy workflow for burned area identification, aiming to deliver regional-scale, high-precision mapping with limited samples and to provide a new technical pathway for fire monitoring in southwestern mountainous regions.
This study develops an FS-SNIC-ML workflow for regional burned forest mapping in a mountainous environment by integrating multi-source optical fusion, semi-automatic sample construction, feature selection, object-based segmentation, and machine-learning classification. The specific objectives are to: (1) construct cloud-free, gap-free, and spectrally harmonised pre- and post-fire reflectance datasets by fusing Sentinel-2 and Landsat 8 imagery using a PIFS-based strategy; (2) implement and evaluate a semi-automatic workflow for generating burned and unburned samples under limited-sample conditions at the regional scale; (3) use LR, RF, and Boruta to identify the most discriminative spectral and index-based variables from the multi-source feature pool, thereby optimising model inputs; (4) integrate SNIC-based GEOBIA with multiple machine-learning classifiers to map burned forest areas in Dali Prefecture and to compare their accuracy and robustness in complex terrain; and (5) derive burn-severity patterns and quantify the influence and interactions of topographic, vegetation, and meteorological factors on wildfire hotspot density using dNBR-based clustering and geographical detector.
5. Conclusions
This study developed the FS-SNIC-ML workflow, which integrates multi-source optical data fusion, semi-automatic sample generation, feature optimisation, and object-based machine learning, providing an accurate, robust, and transferable approach for regional forest burned-area mapping in mountainous environments. The PIFS-based fusion of Sentinel-2 and Landsat 8 enabled the reconstruction of spatially continuous, cloud-free, and gap-free pre- and post-fire imagery under persistent cloud cover and data scarcity, while ensuring spectral consistency and substantially improving the reliability of change detection. The combined SAM–GLCM–PCA–Otsu procedure and county-level stratified sampling ensured representative training data even under limited-sample conditions. dNBR, dNDVI, and dEVI were consistently identified as the most discriminative features, and within the SNIC-supported GEOBIA framework, the RF classifier achieved the best performance, yielding high-accuracy burned-area extraction at the regional scale. GeoDetector results further indicate that wildfire hotspot density is jointly regulated by vegetation conditions, surface moisture, and meteorological factors, among which NDVI, temperature, soil moisture, and their pairwise interactions exhibit the strongest explanatory power with pronounced nonlinear enhancement effects. Overall, the proposed workflow not only enables high-accuracy mapping of forest burned areas but also provides a robust methodological basis for identifying and quantifying the spatial drivers of wildfire occurrence.