2.2. Data Collection
The fire-prone season in the region is characterized by two distinct peaks—spring–summer and summer–autumn—which determine the temporal framework of the field surveys. Field investigations were conducted during the summers of 2023–2025 and included both ground-based and geospatial observations. Unburned reference sites were used as baseline conditions for comparison with areas affected by fires of varying intensity, following standard approaches in disturbance ecology studies [
36,
37].
The establishment of the sampling points followed the Forest Management Instruction (2012) [
35] and was supported by the integration of the Landsat 8 OLI, Sentinel-2 MSI, and NASA Goddard Space Flight Center datasets in combination with the orman.gharysh.kz geospatial platform (
Figure 2). Field surveys were conducted during the same growing seasons as the postfire satellite imagery used in the analysis, ensuring comparability between ground-based and remotely sensed assessments. The use of remote sensing data at the sampling design stage enabled reproducibility of the spatial layout of the sampling plots [
38,
39].
The sampling plots were established in homogeneous forestland sites located at least 30 m from adjacent land use categories. Plots were positioned within stands affected by fires of varying intensity, as well as in clear-cut and burned areas, and in control plots that had not been exposed to fire.
The spatial distribution of the sampling plots ensured the representation of all the burn severity classes while maintaining the comparability of the background environmental conditions, with multiple plots established within each class [
38,
39].
The sampling design included both pine-dominated stands and mixed stands, allowing coverage of the principal types of postfire forest landscapes in the Kostanay region. Of the 22 sampling plots, 16 were established in pine-dominated stands, whereas 6 plots represented mixed or deciduous (birch–aspen) forests.
The control plots included forest areas that had not been affected by fire for an extended period, as well as sites representing postlogging stands and areas that had regenerated following previous fire events. Comparability between control and fire-affected plots was ensured by maintaining similarities in site conditions, topography, soil characteristics, stand composition, and stand age and—where applicable—by accounting for the time elapsed since logging or burning.
The establishment and description of the sampling plots (0.20–0.25 ha in area) were conducted in accordance with the standard methods proposed by V. N. Sukachev and S. V. Zonn [
39]. The classification of fire-affected sites followed the guidelines developed by I. S. Melekhov [
40].
Thus, the identification of fire type and intensity under field conditions is based on standardized indicators of stand damage that are widely used in forest fire research [
40,
41]. The size of each sampling plot was selected to ensure that at least 200 trees of the dominant forest-forming species were represented within its boundaries.
Permanent (fixed) sample plots were not established because the study was designed as a spatially distributed assessment of postfire conditions across multiple sites differing in fire intensity and time since disturbance. The use of temporary sampling plots allowed coverage of a broader range of environmental conditions and fire histories within the study region.
The inclusion of at least 200 trees within each sampling plot ensured a sufficient sample size for reliable estimation of stand structure parameters and reduced the influence of individual tree variability. This threshold is consistent with standard forest inventory practices and provides a statistically robust characterization of stand conditions at the plot level.
The selection of the sampling plots followed a stratified design rather than a purely random approach. Plots were allocated across predefined burn severity classes (reference, low–moderate, and high) to ensure balanced representation of postfire conditions. Within each class, sites were selected to maintain comparable environmental characteristics (soil type, relief, and forest composition).
The spatial distribution of the sampling plots was designed to ensure coverage of the study area while minimizing spatial autocorrelation; the distance between plots typically exceeded 500–1000 m.
The initial classification of burn severity was based on standardized field-based indicators of forest stand conditions, including crown damage, stem char characteristics, and the degree of ground vegetation and litter consumption, prior to subsequent validation using dNBR derived from satellite data.
The burn severity classes were determined using a combination of remote sensing indicators (primarily NBR differencing from Landsat data) and field-based observations of burn severity (including canopy damage, soil surface alteration, and vegetation mortality). This integrated approach ensured consistent classification of burn severity across all the sampling plots.
On the basis of the requirements for plot representativeness, comparability of background environmental conditions, and adequate statistical replication, a total of 22 sampling plots were established and distributed across burn severity classes (low, moderate, and high). Each class was represented by multiple plots (at least 5–7), allowing for statistically robust comparisons. On average, approximately 80 measurement points were collected within each plot.
The established sampling plots represent the main types of forest ecosystems within the study area, including pine-dominated stands and mixed birch–aspen forests distributed across the Kostanay region. The selection of plots was designed to capture the spatial variability of postfire conditions across different fire intensity classes while maintaining comparable site characteristics.
Statistical representativeness was ensured through the balanced distribution of sampling plots among fire intensity classes and the inclusion of multiple plots within each class (at least 5–7 plots per group). This design allowed for robust comparative analysis while minimizing the influence of site-specific anomalies.
The experimental design of the study had a hierarchical structure. The sampling plot (n = 22) was considered the primary experimental unit, whereas individual measurement points within each plot (approximately 80 per plot) were used as subsamples and were not treated as independent observations.
All the variables were aggregated at the level of the sampling plot prior to statistical analysis. Accordingly, statistical analysis was conducted at the plot level, which ensured the avoidance of pseudoreplication and allowed for valid comparisons among plots [
38,
42].
The sampling plots were grouped according to burn severity classes (reference conditions, low–moderate, and high), and all the statistical comparisons were performed among these groups.
Unburned control plots were used as reference conditions and were included in the analysis as a separate group (reference). A total of 13 control plots were identified, which were comparable to the studied sites in terms of species composition, soil conditions, and stand structure.
Control plots were considered part of the overall environmental gradient and were used for statistical comparison with plots affected by different levels of burn severity, which is consistent with established approaches in ecological studies [
43,
44].
The spatial distributions of the sampling plots and their relationships with the fire-affected areas are shown in
Figure 2.
The age of the dominant tree component was determined by counting annual rings in increment cores collected at the root collar or from stumps of felled trees, following standard dendrochronological approaches [
45]. Stem diameter was measured at a height of 1.3 m, and tree height was determined using remote measuring instruments in accordance with standard forest inventory methods [
46]. The physical properties of wood (moisture content and temperature) and soil parameters (bulk density, moisture content, temperature, pH, and humus content) were determined using standard field and laboratory methods.
Soil samples were collected from the upper organomineral horizon at a depth of 0–10 cm, which is the layer most sensitive to fire effects. In each sampling plot, 5–7 sampling points were established and evenly distributed across the area. These subsamples were combined into a single composite sample to provide a representative characterization of soil conditions at the plot level.
To evaluate the success of natural postfire forest regeneration, an integrated set of indicators, including parameters of tree regeneration, stand retention, and soil conditions, was employed. Soil pH, humus content, soil bulk density, and soil moisture are considered interrelated environmental characteristics that collectively shape conditions for postfire forest recovery [
47]. Soil samples for these analyses were collected from the upper organomineral horizon (0–10 cm depth), with 5–7 subsamples taken within each sampling plot and combined into a composite sample. These variables were subsequently used in the analysis to interpret differences in postfire regeneration success among the sites (
Table 1).
The type, form, and intensity of fire were identified on the basis of standardized indicators of forest stand conditions, including the degree of crown damage and dieback; the height of tree trunks; the extent of bark and root collar charring; and the level of ground vegetation, old stump, and downed deadwood combustion.
Field-based fire indicators have been used to validate satellite-derived estimates of burn severity [
48,
49]. The resulting remote sensing metrics of burn severity (NBR and dNBR) were subsequently applied to classify the sampling plots and to support further statistical analyses [
36,
37,
49]. Their effectiveness has been confirmed in recent studies integrating multisensor data, including Landsat, Sentinel-2, and LiDAR observations [
20,
50]. These approaches enable consistent monitoring of vegetation dynamics and structural changes following wildfire disturbances.
The ages of the fire events and interfire intervals were determined via cross-sections from tree trunks bearing fire scars, as well as from stumps of felled dead trees. The time elapsed since the fire was further verified by analyzing the age structure and conditions of the regenerating saplings.
At each sampling point, saplings and naturally regenerated individuals were inventoried within designated subplots, where the number of surviving and newly established postfire saplings was recorded following the methodology proposed by A. I. Buzykin and A. V. Pobedinsky [
51].
Sapling inventories were constructed within fixed-area subplots that were systematically distributed within each sampling plot.
The stand type, species composition, and stand density were determined in accordance with the classification system of M. M. Orlov. Stand age, mean height, and growing stock per hectare and per compartment prior to fire events were established on the basis of data collected at the sampling points.
Saplings were classified as healthy or damaged according to their physiological condition. In accordance with standard forest regeneration classifications, the saplings were further grouped into three height classes: small (≤0.5 m), medium (0.51–1.5 m), and tall (>1.5 m).
Crown closure was classified on an interval scale: 0.3–0.5—open canopy, 0.5–0.7—moderately closed canopy, and >0.7—closed canopy.
Saplings were classified as viable (healthy) when crown integrity was maintained and no significant pyrogenic damage was observed. Damaged saplings included individuals exhibiting partial crown mortality, root collar burns, and pronounced signs of growth suppression.
Field permits and site access: Field observations and sample collection were conducted exclusively on publicly accessible forestland; therefore, no special permits were required for this research. The study did not involve activities within protected areas, endangered species, or privately owned territories. All fieldwork was carried out in accordance with the Forest Management Instruction of the Republic of Kazakhstan (2012) [
52].
The spatial context of the study area and the distribution of forest fires used in the design of the sampling network are illustrated by the burn scar map of the Kostanay Region for the period of 2004–2024 (
Figure 3).
In addition to field data, spectral indices, including the normalized burn ratio (NBR), the normalized difference vegetation index (NDVI), and the normalized difference moisture index (NDMI), which provide a quantitative basis for assessing vegetation burn severity and postfire recovery dynamics [
53], were used in this study.
These indices were calculated from Landsat 7 ETM+ and Landsat 8 OLI satellite imagery processed for surface reflectance via standard radiometric calibration and atmospheric correction procedures, ensuring temporal consistency and intersensor comparability [
53,
54]. Landsat 8 OLI data were specifically applied for wildfire analysis [
55].
To ensure methodological consistency and reproducibility, satellite data from different sources were used with clearly differentiated roles.
Landsat 7 ETM+ and Landsat 8 OLI surface reflectance data (30 m spatial resolution) were used as the primary dataset for quantitative analysis of postfire vegetation dynamics. These data were used to calculate spectral indices (NDVI, NDMI, and NBR) and to assess vegetation recovery at the level of the sampling plots.
All spectral indices were derived exclusively from Landsat imagery to ensure spatial consistency and comparability across the study area. The spatial resolution of the Landsat data is commensurate with the size of the sampling plots (0.20–0.25 ha), which minimizes the mixed-pixel effect and allows reliable integration with field observations.
MODIS data (MCD64A1 BurnDate product; spatial resolution, 500 m) were used solely for spatiotemporal verification of fire events, including the identification of fire occurrence dates and the delineation of burned areas at the regional scale. MODIS data were not used for quantitative analysis of vegetation recovery because of their coarse spatial resolution.
The integration of the Landsat and MODIS datasets was performed conceptually rather than through direct pixel-level combination. Landsat data provided detailed local-scale information on vegetation dynamics, whereas MODIS data ensured the temporal accuracy and completeness of fire event detection.
Such separation of functions between datasets ensures methodological transparency, reduces scale-related uncertainty, and improves the reproducibility of the analysis.
The normalized difference vegetation index (NDVI) reflects vegetation photosynthetic activity, productivity, and overall biomass accumulation [
56,
57]. The NDVI values range from −1 to +1, with positive values indicating the presence and physiological conditions of the vegetation. The NDVI was calculated as follows (Equation (1)) [
58]:
Elevated NDVI values reflect increased phytomass production and the accumulation of combustible vegetation material, which may influence fire spread patterns and fire impact severity in mature forest stands [
58,
59]. Remote sensing enables the assessment of the spatial distribution of fuel loads at the landscape scale (
Table 2) [
58,
59].
The normalized difference moisture index (NDMI) serves as an indicator of the moisture status of the soil–vegetation system and plays a key role in assessing wildfire ignition probability and potential fire intensity [
58]. A higher moisture content in both soils and vegetation reduces the likelihood of ignition and attenuates fire intensity once a fire occurs. Consequently, soil water-holding capacity, evaporation rates, and precipitation regimes—parameters that vary with soil type and local hydroclimatic conditions—are important factors controlling spatial and temporal variability in NDMI values. The NDMI was calculated as follows (Equation (2)) [
58,
60]:
In accordance with standard interpretations of NDMI values, the index ranges from −1 to +1, where positive values (>0) indicate the presence of vegetation and soil moisture, whereas negative values correspond to conditions of pronounced atmospheric or soil drought (
Table 3) [
58,
60].
The NDVI and NDMI were calculated as multiyear mean values separately for each sampling plot, using exclusively postfire observations starting from the year of the most recent fire event affecting each site and extending across subsequent growing seasons. The use of multiyear mean NDVI and NDMI values was intended not to characterize short-term recovery dynamics but to identify stable postfire ecosystem conditions following the completion of the initial regeneration phase. This approach has been widely applied in postfire ecosystem studies that analyze remote sensing time series [
54,
55].
Because the timing of the most recent fire events differed among the sampling plots, the postfire observation periods also varied. To ensure comparability among sites, only postfire observations were included, and multiyear mean values of the NDVI and NDMI were used to represent stabilized postfire conditions rather than specific successional stages.
Accordingly, the analysis focused on comparing relative differences among sites along the burn severity gradient rather than explicitly modelling temporal recovery trajectories.
The time since fire and interfire intervals derived from the tree-ring analysis were used as supporting information for interpreting the results but were not included as independent variables in the statistical models because of the limited sample size and the uneven distribution of plots across fire years [
61,
62].
To minimize the smoothing effect of early successional stages, only postfire years beginning with the year of the most recent fire affecting each sampling plot were included in the analysis. This approach allows comparisons of sites that differ in the timing of the last fire event on the basis of stabilized recovery characteristics and is commonly applied in analyses of spatial gradients of postfire ecosystem recovery [
63,
64].
The interpretation of the vegetation and soil moisture conditions was based on the NDMI, which serves as an indicator of the moisture content in live vegetative fuel and plays a key role in postfire processes [
58]. For an objective and reproducible assessment of burn severity, the normalized burn ratio (NBR) and its differenced form (dNBR) were used, both of which are widely applied in international fire ecology research [
47,
48].
The normalized burn ratio (NBR) is a spectral indicator used to delineate burned areas, detect changes in vegetation cover, and assess the degree of disturbance [
47]. The normalized burn ratio (NBR) was calculated via the standard formulation as the normalized difference between the near-infrared (NIR) and shortwave infrared (SWIR) reflectances (Equation (3)) [
47,
48]:
where
NIR denotes surface reflectance in the near-infrared spectral band, which is sensitive to vegetation structure and physiological conditions, and
SWIR denotes surface reflectance in the shortwave infrared band, which is sensitive to vegetation and soil moisture content as well as to thermal effects associated with fire.
NBR values range from −1 to +1, where higher values indicate healthy, undisturbed vegetation, whereas lower or negative values correspond to areas affected by fire or completely devoid of vegetation.
For the quantitative assessment of burn severity, the difference in the normalized burn ratio (dNBR) was calculated as the difference between the prefire and postfire NBR values (Equation (4)) [
36]:
where
NBRpre denotes the index value calculated for the prefire period and
NBRpost denotes the value obtained for the postfire period.
The resulting dNBR values were classified via standard burn severity threshold values, providing a unified and reproducible quantitative criterion for assigning sampling plots to burn severity classes and eliminating subjective visual interpretation [
36,
48].
The burn severity classes were defined as follows [
36]:
dNBR < 0.10—no or minimal fire impact;
0.10–0.27—low burn severity;
0.27–0.44—moderate burn severity;
0.44—high burn severity.
The ecological interpretation of the defined burn severity classes is presented in
Table 4 [
36].
Each burn severity class was represented by multiple sampling plots, ensuring sample representativeness and enabling robust statistical comparisons among groups.
Thus, the assignment of the sampling plots to burn severity classes was based on clear quantitative criteria rather than subjective visual assessment.
Spatiotemporal verification of fire events
To refine the timing and spatial extent of fire events, the BurnDate layer from the MODIS MCD64A1 product (v6), derived from Terra and Aqua satellite data for the period of 2004–2023, was used [
65] (
Figure 3). The integration of the MODIS and Landsat data was performed at the level of event correspondence rather than through direct pixel-level spatial alignment, thereby avoiding biases associated with differences in spatial resolution.