Next Article in Journal
Eleven-Year Effects of Mechanical Site Preparation on Oaks Planted on Former Agricultural Fields
Previous Article in Journal
Effects of Forest Gaps on Abies faxoniana Rehd. Leaf Litter Mass Loss and Carbon Release along an Elevation Gradient in a Subalpine Forest
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Spatial Predictions of Human and Natural-Caused Wildfire Likelihood across Montana (USA)

by
Adrián Jiménez-Ruano
1,*,†,
William M. Jolly
1,†,
Patrick H. Freeborn
1,†,
Daniel José Vega-Nieva
2,
Norma Angélica Monjarás-Vega
2,
Carlos Iván Briones-Herrera
2 and
Marcos Rodrigues
3
1
Missoula Fire Sciences Laboratory, Rocky Mountain Research Station, USDA Forest Service, 5775 Hwy 10 W, Missoula, MT 59808, USA
2
Facultad de Ciencias Forestales, Universidad Juárez del Estado de Durango, Durango 34000, Mexico
3
Department of Agriculture and Forest Engineering, University of Lleida, Alcalde Rovira Roure 191, 25198 Lleida, Spain
*
Author to whom correspondence should be addressed.
These authors contributed equally to this work.
Forests 2022, 13(8), 1200; https://doi.org/10.3390/f13081200
Submission received: 3 June 2022 / Revised: 6 July 2022 / Accepted: 15 July 2022 / Published: 29 July 2022
(This article belongs to the Section Forest Inventory, Modeling and Remote Sensing)

Abstract

:
Spatial wildfire ignition predictions are needed to ensure efficient and effective wildfire response, and robust methods for modeling new wildfire occurrences are ever-emerging. Here, ignition locations of natural and human-caused wildfires across the state of Montana (USA) from 1992 to 2017 were intersected with static, 30 m resolution spatial data that captured topography, fuel availability, and human transport infrastructure. Once combined, the data were used to train several simple and multiple logistic generalized linear models (GLMs) and generalized additive models (GAMs) to predict the spatial likelihood of natural and human-caused ignitions. Increasingly more complex models that included spatial smoothing terms were better at distinguishing locations with and without natural and human-caused ignitions, achieving area under the receiver operating characteristic curves (AUCs) of 0.84 and 0.89, respectively. Whilst both ignition types were more likely to occur at intermediate fuel loads, as characterized by the local maximum Normalized Difference Vegetation Index (NDVI), naturally-ignited wildfires were more locally influenced by slope, while human-caused wildfires were more locally influenced by distance to roads. Static maps of ignition likelihood were verified by demonstrating that mean annual ignition densities (# yr 1 km 1 ) were higher within areas of higher predicted probabilities. Although the spatial models developed herein only address the static component of wildfire hazard, they provide a foundation upon which dynamic data can be superimposed to forecast and map wildfire ignition probabilities statewide on a timely basis.

1. Introduction

Wildfires are present in practically all vegetated environments worldwide [1,2,3] and they can have profound affects on global ecosystems. Wildfire events can heavily impact the global carbon cycle [4] and can also impact global water supply and quality [5,6]. They are a natural process in many ecosystems [7] but they can pose a threat to life and property in populated areas [8,9]. In many cases, wildfires that would otherwise be beneficial in the long term are immediately suppressed to prevent or limit losses to highly valued resources [10]. However, if a newly ignited wildfire escapes suppression and burns uncontrollably during severe weather conditions, it has the potential to become an extreme wildfire event, causing a disproportionate amount of losses [11]. Although effective wildfire planning requires early information about where new ignitions are likely to start, for example, to increase prevention activities or allocate sparse resources prior to and during the fire season [12], this information is generally not available across large areas at high spatial resolution.
New wildfires start when there is an ignition source, sufficient fuel, and suitable weather conditions. Ignitions can be from either natural or human sources [13], and from 1992–2012, natural ignitions accounted for 16% of all wildfires across the United States [14]. Natural ignitions in the United States are mainly caused by cloud-to-ground lightning strikes and thus depend on the timing and characteristics of the flashes in relation to fuel condition [15,16]. Intentional human ignitions in the United States occur less frequently than those from negligence, such as escaped campfires, discarded cigarettes, or other careless uses of fire [17], and are heavily favored by road networks used for transportation and access to recreational opportunities [18,19]. Therefore, differentiating the spatial distributions of natural and human-caused ignitions is crucial to designing and implementing source-specific prevention and mitigation strategies [20,21].
Natural and human-caused wildfire ignition likelihood may be suitably modeled if proxies are included to represent the fuels, weather, topography, and human presence that control their spatial distribution [22]. While weather is spatially and temporally variant, fuel loadings, topography, and human pressure change slowly over time and could be captured by a static model. Terrain metrics, such as slope, can impact wildfire ignition and propagation potential [23]. Road networks affect fire occurrence and fire spread in two ways: first, they make wildlands accessible to humans, thus promoting new anthropogenic wildfire ignitions [24,25]; and second, roads enable ground-based firefighting resources to respond quickly to new events and can be used as potential containment locations [26]. Finally, although fuel conditions such as moisture content and chemical composition are seasonally dynamic, static fuel factors such as loading and continuity may limit the location of fires where insufficient fuels prevent propagation [27]. Regarding this last variable, in contrast to a more documented effect of human and topographic factors (e.g., [28]), spatially detailed proxies of fuel availability have been less explored in the literature (e.g., [27,29,30]) and deserve further research. Furthermore, the majority of previous predictions of fire occurrence have used coarse spatial resolutions, ranging from 50–100 km (e.g., [2,29]) to 1 km (e.g., [27,30]). Therefore, there is a need to analyze the human, topographic, and fuel drivers of fire activity at finer spatial resolutions, such as those provided by sensors such as Landsat at 30 m. In summary, if ignition sources are differentiated and adequate spatial biophysical and demographic drivers are identified at detailed scales, wildland fire ignition likelihoods could be better modeled, mapped, and predicted across large areas, potentially improving fire management strategic planning.
In this study, we develop and evaluate spatial logistic models of new wildfire occurrences that can be used to map ignition likelihood across the state of Montana, USA. To achieve this, we decouple the static and dynamic components of wildfire occurrence (Figure 1) and focus solely on constructing static ignition models. Simple and multiple logistic generalized linear models (GLMs) and generalized additive models (GAMs) are built by relating records of new wildfire occurrences with several temporally invariant explanatory variables (i.e., the biophysical environment and proximity to roads). We evaluate the efficacy of increasingly more complex models and select the best-performing models to produce statewide static maps of natural and human-caused ignition probabilities.

2. Material and Methods

2.1. Study Area

With a land area of 376,962 square kilometers, the study area covers the state of Montana, USA. Montana was chosen because of its west-to-east gradients in climate, topography, vegetation types, and wildfire causality (Figure 2). Western Montana has a terrain-driven climate whereby maritime air arrives from the Pacific with continental modifications, and topography controls the local precipitation and temperature. Winter precipitation falling as a mix of rain and snow increases with elevation [31], and summer rainfall is dominated by localized, high-intensity thunderstorms [32]. The continental divide is a predominantly north-to-south geographical feature that acts as a barrier, resulting in a precipitation–temperature gradient with warmer and drier conditions and greater cloud-to-ground lightning flash densities in the east [33,34]. Based on data from the LANDFIRE project [35], the main existing vegetation types (EVT) in MT are shrublands, conifers, grasslands, and agricultural lands, covering 30.8%, 24.7%, 19.9%, and 13.9% statewide, respectively; however, these proportions vary across the state, with conifers and shrublands prevailing in the west and shrublands, grasslands, and agricultural lands prevailing in the east. Variations in the biophysical environment coupled with differences in land use result in a gradient of wildfire causes such that human-caused ignitions exceed natural ignitions in the eastern half of the state.

2.2. Data Preparation

Federal, state, and local wildfire records within Montana from 1992–2017 were obtained from the Fire Program Analysis Fire-Occurrence Database (FPA FOD) [36]. Point records in the FPA FOD contain geographic coordinates of the ignition location as well as an ignition cause. After excluding records with an unknown cause, wildfires were partitioned into either natural or human-caused. Of the 12 fire cause codes recorded in the FPA FOD, all natural ignitions consisted of lightning strikes while human ignitions consisted of, among others, campfires, debris burning, equipment use, fireworks, arson, children, smoking, and powerlines. Ignition points (Figure 2b,c) were projected onto the 30 m gridded LANDFIRE domain [35] covering Montana to produce two binary masks (one for natural and one for human-caused ignitions), indicating the locations where at least one new wildfire was reported or not (i.e., presence = 1 and absence = 0).
Several static explanatory variables were explored to predict and map wildfire ignition likelihood. To represent fuel availability, a single 30 m raster of maximum Normalized Difference Vegetation Index (NDVI) was generated in Google Earth Engine [37] using 20 years (1992–2011) of Landsat 5 Thematic Mapper (TM) Collection 1 Tier 1 Annual NDVI Composites available in the Earth Engine public data catalog. Given the relationships between NDVI, primary productivity, and vegetation type [38], maximum NDVI was chosen to capture the peak amount of aboveground live biomass, which is found to be related to fire occurrence in other parts of North America [27]. To coincide with the binary masks of ignition locations, the raster of maximum NDVI was resampled to the 30 m LANDFIRE domain covering Montana using the nearest neighbor method. Topography was represented by slope (degrees), obtained from the LANDFIRE 30 m resolution digital elevation model [35]. Distance from roads (i.e., the straight-line distance in kilometers from the center of each grid cell to the nearest road) was used to represent accessibility, which has been shown to influence the ignition locations of human-caused wildfires across large areas [39]. Primary and secondary roads across Montana were downloaded as a shapefile from the United States Census Bureau [40], and a raster of distance to roads was created at LANDFIRE 30 m resolution. Maps of all potential static explanatory variables are shown in Figure 2d–f.
The datasets used for model building and evaluation were derived from the 30 m presence/absence masks of wildfire ignitions and the co-registered rasters of potential explanatory variables (Figure 3). First, presence datasets were created by using the grid cells containing a new wildfire report to extract the explanatory variables. This produced two presence datasets, one for natural ignitions and one for human-caused ignitions, such that every location with a new wildfire report had an associated value for slope, maximum NDVI, distance to roads, and longitude and latitude. The presence datasets were then split into training (70%) and testing (30%) datasets. Absence datasets were created by randomly selecting grid cells from the binary masks at locations without any recorded wildfire ignitions. To provide balanced datasets, the number of randomly selected absence grid cells was set equal to the number of presence grid cells. After using the absence grid cells to extract the explanatory variables and geographic coordinates, the absence datasets were themselves spit into training (70%) and testing (30%) datasets. Combining the presence and absence datasets for each ignition source yielded four datasets: the natural-caused training and test datasets, and the human-caused training and test datasets.

2.3. Model Construction and Evaluation

A common approach to modeling the probability ( p r ) of a new wildfire ignition is to use a generalized linear model (GLM) to relate the natural logarithm of the odds, otherwise known as l o g i t ( p r ) , to a linear combination of the explanatory variables [41,42]. However, it is likely that some or all relationships between dependent and independent variables may be nonlinear. Generalized additive models (GAMs) overcome the a priori assumption of linearity by replacing the linear terms with smoothed, nonlinear functions to model ignition probabilities [43]. In this work, a suite of logistic GLMs and GAMs were built to model the probability of a new wildfire ignition. In practice, all models were built in R [44] using the ’mgcv’ package [45]. Specifically, the ’bam’ function was called, which is an implementation of GAM fitting designed for use with large datasets [46] and has the advantage of using less memory and can also leverage parallel processing, thus gaining computational speed for both model training and prediction. Both logistic-GLMs and logistic-GAMs were built by specifying a binomial family (the logit link). Whereas the GLMs were fit without specifying any smoothed terms, the GAMs were fit by specifying all smoothed terms. In the case of the GAMs, the fast restricted maximum likelihood (fREML) method was used to optimize the smoothness parameter due to its numerical stability and ability to avoid overfitting [47]. Additionally in the case of the GAMs, the default basis type (thin splines) was used and the basis dimension, k, which controls the maximum number of basis functions and thus the maximum allowable “wigglyness” for the specified basis type, was set to its default value.
To examine the isolated effects, simple logistic-GLMs and simple logistic-GAMs were built using slope, maximum NDVI, and distance to roads as the sole explanatory variables. Then, explanatory variables were procedurally added [48] to create increasingly more complex multiple logistic models with the goal of achieving the best fitting model with the fewest explanatory variables. Additional explanatory variables were retained if they were statistically significant ( p 0.01 ) and if the percent deviance explained increased by more than 0.1% compared to the more basic model. A final multiple logistic-GAM, referred to hereafter as the spatial-GAM, was built by adding the longitude and latitude of the ignition location—and their interaction—as a smoothed explanatory variable. Once added, all basis dimensions, k, in the spatial-GAMs were iteratively increased from their default values to ensure that all smoothed terms had sufficient degrees of freedom without being overfitted or computationally intractable.
After fitting the models with the training datasets, each model was applied to the held-back testing datasets to predict the probabilities of natural and human-caused ignitions. Receiver operating characteristic (ROC) curves [49] were then built by plotting the false positive rate (FPR) versus the true positive rate (TPR) calculated at every predicted probability threshold. The probability along the ROC curve closest to the top-left corner was selected as the optimum threshold, and the area under the ROC curve (AUC) was used to assess the overall ability of each model to differentiate locations with and without natural and human-caused ignitions. Lastly, we assessed the ability of the best-performing models to capture new wildfire occurrences by calculating the annual ignition density ( A I D , # yr 1 km 1 ). The models were applied to the 30 m raster stacks of slope, maximum NDVI, and distance to roads to generate static probability maps of ignition likelihood. The natural (n) and human-caused (h) probability maps were classified into five equal probability intervals (i), and the areas covered by each probability interval ( A n , i and A h , i ) were calculated based on the number of 30 m grid cells in each interval. Locations of all natural and human-caused ignitions reported from 1992–2017 were intersected with their respective static probability maps and sorted into years (y) to yield the annual count of natural ignitions ( c n , y ) and human-caused ignitions ( c h , y ) that occurred in each probability interval ( c n , y , i and c h , y , i ). For each year and probability interval, the A I D was calculated for both ignition sources as follows:
A I D n , y , i = c n , y , i A n , i
A I D h , y , i = c h , y , i A h , i
The 26 values for A I D in each probability interval were then summarized by their mean, median, and interquartile range.

3. Results

From 1992 to 2017, there were 45,488 wildfires reported in Montana, resulting in 3.1 Mha burned area. Of those, 18,986 were natural ignitions (Figure 2b) and 23,255 were human-caused ignitions (Figure 2c). After excluding records with an unknown cause, and after gridding the point records onto the LANDFIRE domain, the number of 30 m grid cells containing at least one newly reported wildfire was nearly evenly split between natural and human-caused ignitions (47% vs. 53%).
Simple logistic-GLMs and simple logistic-GAMs built with a single explanatory variable each illustrate the isolated effects of slope, maximum NDVI, and distance to roads on the probability of a new wildfire ignition across MT (Figure 4). In contrast to the GLMs, which either increase or decrease monotonically across the full range of the explanatory variables, the GAMs, with their greater flexibility, capture nonlinearities in the response, particularly at the lower and upper limits of slope and maximum NDVI. Since all natural and human-caused model fits exhibit monotonic behavior over most of the range (i.e., from the 5th to the 95th percentiles) of slope (Figure 4a,b) and distance to roads (Figure 4e,f), the AUCs are quite similar between the GLMs and GAMs, never differing by more than 0.01. However there is sufficient nonlinearity in the responses between the 5th and 95th percentiles of maximum NDVI such that the GAMs outperform the GLMs, with differences in AUCs ≥ 0.03 (Figure 4c,d).
Table 1 shows the evolution of model construction, starting with the multiple logistic-GLMs, and then including the smoothed terms in the multiple logistic-GAMs, and finishing with the multiple logistic spatial-GAMs. All explanatory variables were statistically significant ( p 0.01 ) and increased the percent deviance explained by more than 0.1% when included in the models of human-caused ignitions. However, for the models of natural-caused ignitions, distance to roads was not significant in the multiple logistic-GLM and did not improve the percent deviance explained by more than 0.1% when included in the spatial-GAM.
After building the initial spatial-GAMs, the default basis dimensions were iteratively increased for all smoothed terms. Increasing the values of k for slope, maximum NDVI, and distance to roads had very little effect on the on the spatial-GAMs and did not increase the percent deviance explained by more than 0.1% at any given value of k selected for the smoothed geographic coordinates. Therefore, the basis dimensions for slope, maximum NDVI, and distance to roads were left unchanged from their default values of k = 10 . In contrast, increasing k for the smoothed geographic coordinates continually increased the percent deviance explained, albeit at a slower rate at higher values of k, indicating less improvement in model performance for incremental changes in k as the models became more complex. Ultimately, for the sake of model generality and computational speed, the basis dimension for the smoothed geographic coordinates was increased until the rate of change in percent deviance explained dropped below 0.1%, at which point k = 42 and k = 50 for the natural and human-caused ignition models, respectively.
Evaluating all models using the held-back testing datasets revealed increases in model efficacy with increasing model complexity (Figure 5 and Figure 6). The multiple logistic-GLMs exhibited the most uniform frequency distributions (and thus the greatest overlap in predicted probabilities) for locations with and without wildfire ignitions. Including the smoothed terms in the multiple logistic-GAMs shifted the frequency distributions towards higher probabilities for the presence locations and towards lower probabilities for the absence locations, indicating better differentiation between locations with and without wildfire ignitions. Ultimately, the spatial-GAMs that included geographic coordinates as smoothed terms (and with optimized k values) yielded the greatest separation between the frequency distributions, with AUC = 0.84 and AUC = 0.89 for the natural and human-caused ignition models, respectively. For the spatial-GAMs, the optimum probability threshold for distinguishing locations with and without natural wildfire ignitions was found at p r ( y n ) = 0.53, with 78% of presence grid cells (i.e., TPR = 0.78) and 24% of absence grid cells (i.e., FPR = 0.24) in the testing datasets lying above this threshold. Similarly, the optimum probability threshold for distinguishing locations with and without human-caused ignitions was found at p r ( y h ) = 0.55, with 81% of presence grid cells (i.e., TPR = 0.81) and 18% of absence grid cells (i.e., FPR = 0.18) in the testing datasets lying above this threshold.
Partial effects plots for the spatial-GAMs show how the most complex models respond to each smoothed explanatory variable with all other covariates held constant (Figure 7 and Figure 8). For both natural and human-caused ignitions, the response of the spatial-GAMs to slope, maximum NDVI, and distance to roads was similar to that of the simple logistic-GAMs. The probability of a new naturally ignited wildfire increased up to a slope of ∼30 and then decreased thereafter with greater uncertainty in the prediction due to fewer training data. Natural and human-caused wildfires were least likely to start at locations having the lowest and highest maximum NDVI values, and were most likely to start at locations having intermediate NDVI values between 0.55 and 0.65. Notably, distance to roads played a prominent role in the spatial-GAM of human-caused ignitions, where half of the 30 m grid cells containing the origin of a human-caused wildfire were located within 2.5 km of a road. Including the geographic coordinates of the origin in the multiple logistic-GAMs essentially imparted spatial smoothers on the models. Whereas locations in the Northern Rocky Mountains shared the highest probability of new natural and human-caused wildfires, differences in the pattern of ignition likelihoods between causes were more evident in the eastern half of Montana.
Applying the spatial-GAMs to the stacked rasters of explanatory variables produced 30 m static probability maps indicating the likelihood of natural and human-caused ignitions (Figure 9). The static probability maps were broadly influenced by the spatial smoothers (Figure 7c and Figure 8d) and were locally modulated by slope, maximum NDVI, and distance to roads. This is most evident in the spatial-GAM of natural ignitions, where probabilities are lower in the valley bottoms and drainages with shallower slopes, and most evident in the spatial-GAM of human-caused ignitions, where probabilities are highest along the road network. Given the sparsity of ignition locations compared to the overall size of MT, both spatial-GAMs reassuringly predicted large areas with low probabilities of a wildfire ignition. Half of all land area is predicted to have less than a 23% and 15% chance of a natural or human-caused ignition, respectively. Conversely, locations with the highest predicted probabilities are rare. By area, less than 1% and 2% of the state had >90% chance of a natural or human-caused ignition, respectively. When using the optimum thresholds of p r ( y n ) = 0.53 and p r ( y h ) = 0.55 to classify the probability maps into binary masks of presence and absence, only 23% and 17% of the area across MT can be considered to have conditions most suitable for natural and human-caused ignitions, respectively.
Intersecting 26 years of all ignition locations with the static maps shown in Figure 9a,c revealed that new wildfires were more often reported inside areas with higher modeled probabilities. Similar to the testing datasets, 78% and 83% of all natural and human-caused wildfires were ignited at locations with modeled probabilities above the optimum thresholds of p r ( y n ) = 0.53 and p r ( y h ) = 0.55, respectively. Given the smaller areas with higher modeled probabilities (Figure 9b,d), mean annual ignition densities for natural and human-caused wildfires were 10× and 21× greater in locations with modeled probabilities above the optimum thresholds compared to below. Although the occurrence of more ignitions inside smaller areas resulted in greater mean annual ignition densities within areas of higher predicted probabilities (Figure 10a,b), there was nevertheless variability in A I D around the mean. Since the area covered by each probability interval in the static maps is constant, the variability in A I D is due to fluctuations in the annual number of statewide ignitions (Figure 10c,d).

4. Discussion

In this study we evaluated the efficacy of logistic-GLMs and logistic-GAMs to predict the spatial distribution of natural and human-caused wildfire ignitions across Montana, USA. Our final models adequately described the areas where each ignition source was likely to occur without being overly complex or dependent on dozens of data sources. Compared to other studies, the list of potential explanatory variables investigated here was not exhaustive. Future work is planned to expand this framework to cover the continental US at 30 m spatial resolution, and including additional explanatory variables may make predictions computationally intractable. Likewise, the basis dimensions, k, for the spatial smoothers were kept at a minimum to minimize computation time. In this regard, the spatial smoothers were used to capture the general patterns of ignition likelihood whilst slope, maximum NDVI, and distance to roads were used to refine and focus in on the localized patterns of ignition likelihood. Nevertheless, with only three variables and information about the historical, spatial distribution of wildfires, we were able to model the static ignition probabilities accurately. Given the simplicity of the final models and the few variables used, achieving AUC values of 0.84 and 0.89 for natural and human-caused ignitions is quite remarkable and it suggests that temporally-invariant components of the fire environment that are proxies for fuel availability, terrain, and accessibility are important factors to include in any fire danger rating system.
Generalized additive models have several useful characteristics for this type of application. First, they do not assume linear relationships. Recent work exploring relationships between biomass and fire activity has shown that there is an optimum biomass where fires occur [27]. In areas with low biomass, fire occurrence can be fuel-limited (e.g., [50,51]). Contrarily, areas with high biomass may indicate closed canopies where understory fuels, generally composed of lighter grass and shrub loads [30], remain sheltered from the sunlight and wind and are wet and unable to burn. We noted the same relationship where the optimum, nonlinear, influence of maximum NDVI on wildfire ignitions occurred at intermediate values, supporting the intermediate productivity hypothesis of fire occurrence (e.g., [52]). Nevertheless, in contrast to previous analyses performed at coarse scales (1–100 km) (e.g., [27,29]), we documented this relationship at finer (30 m) spatial resolutions, which can better serve operational fire management planning. Moreover, we noted a strong, nonlinear increase in the probability of a human-caused ignition with proximity to the transportation network, especially within 5 km from a primary or secondary road (Figure 8c), which is consistent with other findings [22,39,53,54]. Ultimately, having a robust modeling framework that does not assume linear relationships between the response and explanatory variables allows us to develop better wildfire ignition predictive models. Further, the ability to include spatial terms and spatial interactions means that these methods can better characterize regional differences in fire activity that is not captured by the biophysical variables alone. This combination of characteristics was a key factor in the choice to use spatial logistic GAMs for this work.
Our work demonstrates that we can effectively model the spatial variations of wildfire ignition across a large area. However, we know that temporally-variant fire environment drivers are also critical components of assessing dynamic fire hazards (Figure 1). While these static models adequately predict the mean annual ignition densities, interannual variability in both natural and human-caused wildfire ignitions (Figure 10) and these variations are likely a direct result of interannual weather variations that either favor or impede new ignitions. Future work should focus on coupling our static wildfire probability maps with dynamic ignition triggers to effectively model the seasonal variations in both natural and human-caused wildfire probabilities across large areas (e.g., [55,56]). Previous work has shown that outputs of the US National Fire Danger Rating System, such as the Energy Release Component and Burning Index, are closely correlated with the timing of new ignitions as reported in the FPA FOD across the Continental United States [57]. Fortunately, preliminary work on utilizing GAMs to predict both the spatial location and timing of new ignitions throughout the fire season shows promise and will be the subject of future work.
Decoupling the static and dynamic controls of wildfire ignition has some distinct computational benefits. The static variables used in these predictive models can be mapped ahead of time, thus reducing computation time if predictive models are used to forecast new ignition potential. If suitable dynamic ignition probability models can be developed for natural and human-caused ignitions, then a final predictive map over some time interval could be computed as follows:
p r ( N e w W i l d f i r e I g n i t i o n ) = p r ( S t a t i c ) p r ( D y n a m i c )
The final choice to use a generalized additive model mainly involved the evaluation of the final predictive model and its efficacy to predict the spatial locations of wildfires across Montana. However, there are many choices for machine learning models, and several methods have been effectively applied to produce models of wildfire likelihood across large areas. For example, Parisien et al. [58] used the MaxEnt method across the Western US, while others have used random forest [22] and support vector machine (SVM) methods [59] and artificial neural networks [60] for regional studies. Ultimately, more work is needed to evaluate the strengths and weaknesses of various machine learning approaches but AUC values of about 0.75 across various methods suggest that the choice of the exact method may not be as important as selecting good spatial covariates for model training and prediction.
While this simple, static wildfire ignition likelihood model can serve as the foundation of a more robust wildfire ignition potential forecast system, it can also be used independently. For example, wildfire management programs can leverage these probability maps to focus their prevention activities in areas with the highest ignition likelihood. Areas with the highest ignition probability (>0.8) in our final prediction maps were restricted to just 7.3% of the statewide land area for natural ignitions and 3.6% of the area for human-caused ignitions, and this could help focus pre-fire prevention and during-season resource pre-positioning efforts to focus on the places where new fires are mostly likely to start.

5. Conclusions

This study investigated several logistic regression models of increasing complexity for predicting the probability of natural and human-caused ignitions across the state of Montana, USA, by relating 26 years of wildfire locations with biophysical and human infrastructure drivers that can be mapped at high spatial resolution. In general, natural ignitions were more locally influenced by slope while human-caused ignitions were more locally influenced by distance to roads. This study provides the framework whereby the ignition probabilities produced herein can be efficiently modeled with high spatial resolution data to provide 30 m resolution maps. These static wildfire ignition probability maps can build the foundation of a robust fire potential prediction system. These models can easily be updated as new data become available. Overall, this predictive framework facilitates more concrete fire management decisions, including the allocation of firefighting resources and the location of future fuels reduction treatments, to reduce risk of new wildfire ignitions.

Author Contributions

Conceptualization, W.M.J., A.J.-R., P.H.F., D.J.V.-N., N.A.M.-V., C.I.B.-H., and M.R.; methodology, P.H.F., W.M.J., and A.J.-R.; software, A.J.-R., and P.H.F.; validation, A.J.-R., W.M.J., and P.H.F.; formal analysis, A.J.-R., and P.H.F.; investigation, A.J.-R., and W.M.J.; resources, W.M.J., and P.H.F.; data curation, W.M.J., and P.H.F.; writing—original draft preparation, A.J.-R., P.H.F., W.M.J., and M.R.; writing—review and editing, W.M.J., P.H.F., D.J.V.-N., and M.R.; visualization, P.H.F., and A.J.-R.; supervision, W.M.J.; project administration, W.M.J.; funding acquisition, W.M.J., and M.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research funded in part by Project FIREPATHS (PID2020-116556RA-I00), Spanish Ministry of Science and Innovation.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Not applicable.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. Krawchuk, M.A.; Moritz, M.A.; Parisien, M.A.; Van Dorn, J.; Hayhoe, K. Global pyrogeography: The current and future distribution of wildfire. PLoS ONE 2009, 4, e5102. [Google Scholar] [CrossRef]
  2. Krawchuk, M.A.; Moritz, M.A. Constraints on global fire activity vary across a resource gradient. Ecology 2011, 92, 121–132. [Google Scholar] [CrossRef] [PubMed]
  3. Archibald, S.; Lehmann, C.E.R.; Gómez-dans, J.L.; Bradstock, R.A. Defining pyromes and global syndromes of fire regimes. Proc. Natl. Acad. Sci. USA 2013, 110, 6445–6447. [Google Scholar] [CrossRef]
  4. Page, S.E.; Siegert, F.; Rieley, J.O.; Boehm, H.D.V.; Jaya, A.; Limin, S. The amount of carbon released from peat and forest fires in Indonesia during 1997. Nature 2002, 420, 61–65. [Google Scholar] [CrossRef]
  5. Bladon, K.D.; Emelko, M.B.; Silins, U.; Stone, M. Wildfire and the Future of Water Supply. Environ. Sci. Technol. 2014, 48, 8936–8943. [Google Scholar] [CrossRef] [PubMed]
  6. Smith, H.G.; Sheridan, G.J.; Lane, P.N.; Nyman, P.; Haydon, S. Wildfire effects on water quality in forest catchments: A review with implications for water supply. J. Hydrol. 2011, 396, 170–192. [Google Scholar] [CrossRef]
  7. McLauchlan, K.K.; Higuera, P.E.; Miesel, J.; Rogers, B.M.; Schweitzer, J.; Shuman, J.K.; Tepley, A.J.; Varner, J.M.; Veblen, T.T.; Adalsteinsson, S.A.; et al. Fire as a fundamental ecological process: Research advances and frontiers. J. Ecol. 2020, 108, 2047–2069. [Google Scholar] [CrossRef]
  8. Kramer, H.A.; Mockrin, M.H.; Alexandre, P.M.; Radeloff, V.C. High wildfire damage in interface communities in California. Int. J. Wildland Fire 2019, 28, 641–650. [Google Scholar] [CrossRef]
  9. McLennan, J.; Ryan, B.; Bearman, C.; Toh, K. Should we leave now? Behavioral factors in evacuation under wildfire threat. Fire Technol. 2019, 55, 487–516. [Google Scholar] [CrossRef]
  10. Riley, K.L.; Thompson, M.P.; Scott, J.H.; Gilbertson-Day, J.W. A model-based framework to evaluate alternative wildfire suppression strategies. Resources 2018, 7, 4. [Google Scholar] [CrossRef]
  11. Tedim, F.; Leone, V.; Coughlan, M.; Bouillon, C.; Xanthopoulos, G.; Royé, D.; Correia, F.J.; Ferreira, C. Extreme wildfire events: The definition. In Extreme Wildfire Events and Disasters; Elsevier: Amsterdam, The Netherlands, 2020; Chapter 1; pp. 3–29. [Google Scholar] [CrossRef]
  12. Belval, E.J.; Stonesifer, C.S.; Calkin, D.E. Fire suppression resource scarcity: Current metrics and future performance indicators. Forests 2020, 11, 217. [Google Scholar] [CrossRef]
  13. Syphard, A.D.; Keeley, J.E. Location, timing and extent of wildfire vary by cause of ignition. Int. J. Wildland Fire 2015, 24, 37–47. [Google Scholar] [CrossRef]
  14. Balch, J.K.; Bradley, B.A.; Abatzoglou, J.T.; Chelsea Nagy, R.; Fusco, E.J.; Mahood, A.L. Human-started wildfires expand the fire niche across the United States. Proc. Natl. Acad. Sci. USA 2017, 114, 2946–2951. [Google Scholar] [CrossRef] [PubMed]
  15. Mitchener, L.J.; Parker, A.J. Climate, lightning, and wildfire in the national forests of the southeastern United States: 1989–1998. Phys. Geogr. 2005, 26, 147–162. [Google Scholar] [CrossRef]
  16. Schultz, C.J.; Nauslar, N.J.; Wachter, J.B.; Hain, C.R.; Bell, J.R. Spatial, temporal and electrical characteristics of lightning in reported lightning-initiated wildfire events. Fire 2019, 2, 18. [Google Scholar] [CrossRef] [PubMed]
  17. Prestemon, J.; Hawbaker, T.; Bowden, M.; Carpenter, J.; Brooks, M.; Abt, K.; Sutphen, R.; Scranton, S. Wildfire Ignitions: A Review of the Science and Recommendations for Empirical Modeling; USDA-Forest Service, Southern Research Station: Asheville, NC, USA, 2013; p. 20.
  18. Faivre, N.; Jin, Y.; Goulden, M.L.; Randerson, J.T. Controls on the spatial pattern of wildfire ignitions in Southern California. Int. J. Wildland Fire 2014, 23, 799–811. [Google Scholar] [CrossRef]
  19. Benefield, A.; Chen, J. Examining the influence of outdoor recreation on anthropogenic wildfire regime of the southern Rocky Mountains. Nat. Hazards 2022, 111, 523–545. [Google Scholar] [CrossRef]
  20. Keeley, J.E.; Syphard, A.D. Historical patterns of wildfire ignition sources in California ecosystems. Int. J. Wildland Fire 2018, 27, 781–799. [Google Scholar] [CrossRef]
  21. Ganteaume, A.; Camia, A.; Jappiot, M.; San-Miguel-Ayanz, J.; Long-Fournel, M.; Lampin, C. A review of the main driving factors of forest fire ignition over Europe. Environ. Manag. 2013, 51, 651–662. [Google Scholar] [CrossRef]
  22. Barreto, J.S.; Armenteras, D. Open Data and Machine Learning to Model the Occurrence of Fire in the Ecoregion of “Llanos Colombo—Venezolanos”. Remote Sens. 2020, 12, 3921. [Google Scholar] [CrossRef]
  23. Viegas, D.X. Slope and wind effects on fire propagation. Int. J. Wildland Fire 2004, 13, 143–156. [Google Scholar] [CrossRef]
  24. Syphard, A.D.; Radeloff, V.C.; Keeley, J.E.; Hawbaker, T.J.; Clayton, M.K.; Stewart, S.I.; Hammer, R.B. Human influence on California fire regimes. Ecol. Appl. 2007, 17, 1388–1402. [Google Scholar] [CrossRef]
  25. Narayanaraj, G.; Wimberly, M.C. Influences of forest roads on the spatial patterns of human-and lightning-caused wildfire ignitions. Appl. Geogr. 2012, 32, 878–888. [Google Scholar] [CrossRef]
  26. Thompson, M.P.; Gannon, B.M.; Caggiano, M.D. Forest roads and operational wildfire response planning. Forests 2021, 12, 110. [Google Scholar] [CrossRef]
  27. Briones-Herrera, C.I.; Vega-Nieva, D.J.; Monjarás-Vega, N.A.; Flores-Medina, F.; Lopez-Serrano, P.M.; Corral-Rivas, J.J.; Carrillo-Parra, A.; Pulgarin-Gámiz, M.Á.; Alvarado-Celestino, E.; González-Cabán, A.; et al. Modeling and mapping forest fire occurrence from aboveground carbon density in Mexico. Forests 2019, 10, 402. [Google Scholar] [CrossRef]
  28. Costafreda-Aumedes, S.; Comas, C.; Vega-Garcia, C. Human-caused fire occurrence modelling in perspective: A review. Int. J. Wildland Fire 2017, 26, 983–998. [Google Scholar] [CrossRef]
  29. Archibald, S.; Roy, D.P.; van Wilgen, B.W.; Scholes, R.J. What limits fire? An examination of drivers of burnt area in Southern Africa. Glob. Chang. Biol. 2009, 15, 613–630. [Google Scholar] [CrossRef]
  30. Kahiu, M.N.; Hanan, N. Fire in sub-Saharan Africa: The fuel, cure and connectivity hypothesis. Glob. Ecol. Biogeogr. 2018, 27, 946–957. [Google Scholar] [CrossRef]
  31. Silverman, N.L.; Maneta, M.P.; Chen, S.H.; Harper, J.T. Dynamically downscaled winter precipitation over complex terrain of the Central Rockies of Western Montana, USA. Water Resour. Res. 2013, 49, 458–470. [Google Scholar] [CrossRef]
  32. Heim, R. Widespread general rains in Montana during the summer. Natl. Weather Dig. 1986, 11, 21–27. [Google Scholar]
  33. Crawford, C.J.; Griffin, D.; Kipfmueller, K.F. Capturing season-specific precipitation signals in the northern Rocky Mountains, USA, using earlywood and latewood tree rings. J. Geophys. Res. Biogeosci. 2015, 120, 428–440. [Google Scholar] [CrossRef]
  34. Kalashnikov, D.A.; Loikith, P.C.; Catalano, A.J. A 30-Yr Climatology of Meteorological Conditions Associated with Lightning Days in the Interior Western United States. J. Clim. 2020, 33, 3771–3785. [Google Scholar] [CrossRef]
  35. Rollins, M.G. LANDFIRE: A nationally consistent vegetation, wildland fire, and fuel assessment. Int. J. Wildland Fire 2009, 18, 235–249. [Google Scholar] [CrossRef]
  36. Short, K.C. A spatial database of wildfires in the United States, 1992–2011. Earth Syst. Sci. Data 2014, 6, 1–27. [Google Scholar] [CrossRef]
  37. Gorelick, N.; Hancher, M.; Dixon, M.; Ilyushchenko, S.; Thau, D.; Moore, R. Google Earth Engine: Planetary-scale geospatial analysis for everyone. Remote Sens. Environ. 2017, 202, 18–27. [Google Scholar] [CrossRef]
  38. Xu, C.; Li, Y.; Hu, J.; Yang, X.; Sheng, S.; Liu, M. Evaluating the difference between the normalized difference vegetation index and net primary productivity as the indicators of vegetation vigor assessment at landscape scale. Environ. Monit. Assess. 2012, 184, 1275–1286. [Google Scholar] [CrossRef] [PubMed]
  39. Monjarás-Vega, N.A.; Briones-Herrera, C.I.; Vega-Nieva, D.J.; Calleros-Flores, E.; Corral-Rivas, J.J.; López-Serrano, P.M.; Pompa-García, M.; Rodríguez-Trejo, D.A.; Carrillo-Parra, A.; González-Cabán, A.; et al. Predicting forest fire kernel density at multiple scales with geographically weighted regression in Mexico. Sci. Total Environ. 2020, 718, 137313. [Google Scholar] [CrossRef] [PubMed]
  40. US Census Bureau. TIGER/Line Shapefiles. 2015. Available online: https://www.census.gov/geographies/mapping-files/time-series/geo/tiger-line-file.html (accessed on 2 June 2022).
  41. Syphard, A.D.; Radeloff, V.C.; Keuler, N.S.; Taylor, R.S.; Hawbaker, T.J.; Stewart, S.I.; Clayton, M.K. Predicting spatial patterns of fire on a southern California landscape. Int. J. Wildland Fire 2008, 17, 602–613. [Google Scholar] [CrossRef]
  42. Bar Massada, A.; Syphard, A.D.; Stewart, S.I.; Radeloff, V.C. Wildfire ignition-distribution modelling: A comparative study in the Huron–Manistee National Forest, Michigan, USA. Int. J. Wildland Fire 2012, 22, 174–183. [Google Scholar] [CrossRef]
  43. Vilar, L.; Woolford, D.G.; Martell, D.L.; Martín, M.P. A model for predicting human-caused wildfire occurrence in the region of Madrid, Spain. Int. J. Wildland Fire 2010, 19, 325–337. [Google Scholar] [CrossRef]
  44. R Core Team. R: A Language and Environment for Statistical Computing; R Foundation for Statistical Computing: Vienna, Austria, 2021. [Google Scholar]
  45. Wood, S. Generalized Additive Models: An Introduction with R, 2nd ed.; Chapman and Hall/CRC: New York, NY, USA, 2017. [Google Scholar]
  46. Wood, S.N.; Goude, Y.; Shaw, S. Generalized additive models for large data sets. J. R. Stat. Society. Ser. C Appl. Stat. 2015, 64, 139–155. [Google Scholar] [CrossRef]
  47. Wood, S.N. Fast stable restricted maximum likelihood and marginal likelihood estimation of semiparametric generalized linear models. J. R. Stat. Soc. Ser. B Stat. Methodol. 2011, 73, 3–36. [Google Scholar] [CrossRef]
  48. Hogland, J.; Anderson, N.; Affleck, D.L.; St Peter, J. Using Forest Inventory Data with Landsat 8 Imagery to Map Longleaf Pine Forest Characteristics in Georgia, USA. Remote Sens. 2019, 11, 1803. [Google Scholar] [CrossRef]
  49. Bradley, A.P. The Use of the Area Under the ROC Curve in the Evaluation of Machine Learning Algorithms. Pattern Recogn. 1997, 30, 1145–1159. [Google Scholar] [CrossRef]
  50. Bradstock, R.A. A biogeographic model of fire regimes in Australia: Current and future implications. Glob. Ecol. Biogeogr. 2010, 19, 145–158. [Google Scholar] [CrossRef]
  51. Holz, A.; Kitzberger, T.; Paritsis, J.; Veblen, T.T. Ecological and climatic controls of modern wildfire activity patterns across southwestern South America. Ecosphere 2012, 3, 1–25. [Google Scholar] [CrossRef]
  52. Pausas, J.G.; Ribeiro, E. The global fire–productivity relationship. Glob. Ecol. Biogeogr. 2013, 22, 728–736. [Google Scholar] [CrossRef]
  53. De Vasconcelos, M.P.; Silva, S.; Tome, M.; Alvim, M.; Pereira, J.C. Spatial prediction of fire ignition probabilities: Comparing logistic regression and neural networks. Photogramm. Eng. Remote Sens. 2001, 67, 73–81. [Google Scholar]
  54. Grala, K.; Grala, R.K.; Hussain, A.; Cooke, W.H.; Varner, J.M. Impact of human factors on wildfire occurrence in Mississippi, United States. For. Policy Econ. 2017, 81, 38–47. [Google Scholar] [CrossRef]
  55. Juan, P.; Mateu, J.; Saez, M. Pinpointing spatio-temporal interactions in wildfire patterns. Stoch. Environ. Res. Risk Assess. 2012, 26, 1131–1150. [Google Scholar] [CrossRef]
  56. Martín, Y.; Zúñiga-Antón, M.; Rodrigues Mimbrero, M. Modelling temporal variation of fire-occurrence towards the dynamic prediction of human wildfire ignition danger in northeast Spain. Geomat. Nat. Hazards Risk 2019, 10, 385–411. [Google Scholar] [CrossRef]
  57. Jolly, W.M.; Freeborn, P.H.; Page, W.G.; Butler, B.W. Severe Fire Danger Index: A Forecastable Metric to Inform Firefighter and Community Wildfire Risk Management. Fire 2019, 2, 47. [Google Scholar] [CrossRef]
  58. Parisien, M.A.; Snetsinger, S.; Greenberg, J.A.; Nelson, C.R.; Schoennagel, T.; Dobrowski, S.Z.; Moritz, M.A. Spatial variability in wildfire probability across the western United States. Int. J. Wildland Fire 2012, 21, 313–327. [Google Scholar] [CrossRef]
  59. Jaafari, A.; Pourghasemi, H.R. 28 - Factors Influencing Regional-Scale Wildfire Probability in Iran: An Application of Random Forest and Support Vector Machine. In Spatial Modeling in GIS and R for Earth and Environmental Sciences; Pourghasemi, H.R., Gokceoglu, C., Eds.; Elsevier: Amsterdam, The Netherlands, 2019; pp. 607–619. [Google Scholar] [CrossRef]
  60. Elia, M.; D’Este, M.; Ascoli, D.; Giannico, V.; Spano, G.; Ganga, A.; Colangelo, G.; Lafortezza, R.; Sanesi, G. Estimating the probability of wildfire occurrence in Mediterranean landscapes using Artificial Neural Networks. Environ. Impact Assess. Rev. 2020, 85, 106474. [Google Scholar] [CrossRef]
Figure 1. Conceptual scheme of the dual component of wildfire hazard or danger: the static factors and the dynamic factors.
Figure 1. Conceptual scheme of the dual component of wildfire hazard or danger: the static factors and the dynamic factors.
Forests 13 01200 g001
Figure 2. Maps of Montana, USA, shown with primary and secondary roads. (a) LANDFIRE Existing Vegetation Type (EVT) (b) and (c) natural and human-caused ignition locations recorded in the Fire Program Analysis Fire-Occurrence Database (FPA FOD) from 1992–2017. (d) Slope from LANDFIRE. (e) Maximum Normalized Difference Vegetation Index (NDVI) from 1992–2011. (f) Distance to roads.
Figure 2. Maps of Montana, USA, shown with primary and secondary roads. (a) LANDFIRE Existing Vegetation Type (EVT) (b) and (c) natural and human-caused ignition locations recorded in the Fire Program Analysis Fire-Occurrence Database (FPA FOD) from 1992–2017. (d) Slope from LANDFIRE. (e) Maximum Normalized Difference Vegetation Index (NDVI) from 1992–2011. (f) Distance to roads.
Forests 13 01200 g002
Figure 3. General workflow for model building and evaluation including inputs, sampling of wildfire presence and absence locations, extraction of explanatory variables, creation of an attributed point wildfire database, model fitting of new wildfire occurrences, and the final spatial predictions of wildfire ignition probabilities.
Figure 3. General workflow for model building and evaluation including inputs, sampling of wildfire presence and absence locations, extraction of explanatory variables, creation of an attributed point wildfire database, model fitting of new wildfire occurrences, and the final spatial predictions of wildfire ignition probabilities.
Forests 13 01200 g003
Figure 4. Univariate simple logistic general linear models (GLMs) and simple logistic general additive models (GAMs) to predict the probability of natural ignitions, p r ( y n ) (left column), and human-caused ignitions, p r ( y h ) (right column). Models were built using a single explanatory variable each, shown for slope (a,b), maximum NDVI (c,d) and distance to roads (e,f). The dotted vertical lines indicate the 5th and 95th percentiles of the explanatory variables and therefore capture 90% of the training data. The area under the receiver operating characteristic curve (AUC) is based on the testing data.
Figure 4. Univariate simple logistic general linear models (GLMs) and simple logistic general additive models (GAMs) to predict the probability of natural ignitions, p r ( y n ) (left column), and human-caused ignitions, p r ( y h ) (right column). Models were built using a single explanatory variable each, shown for slope (a,b), maximum NDVI (c,d) and distance to roads (e,f). The dotted vertical lines indicate the 5th and 95th percentiles of the explanatory variables and therefore capture 90% of the training data. The area under the receiver operating characteristic curve (AUC) is based on the testing data.
Forests 13 01200 g004
Figure 5. Evaluation of the multiple logistic general linear model (GLM), general additive model (GAM), and spatial-GAM used to model natural wildfire ignition probability. Model definitions are shown in Table 1. Each model was applied to the held-back testing datasets and the frequency distributions shown in (ac) indicate the count of presence (red) and absence (black) grid cells according to their predicted probability. The receiver operating characteristic (ROC) curves and area under the ROC curves (AUC) shown in (df) were also generated from the testing datasets. The optimum probability thresholds (i.e., closest to TPR = 1.0 and FPR = 0.0) are shown as dotted vertical lines in (ac) and solid circles in (df).
Figure 5. Evaluation of the multiple logistic general linear model (GLM), general additive model (GAM), and spatial-GAM used to model natural wildfire ignition probability. Model definitions are shown in Table 1. Each model was applied to the held-back testing datasets and the frequency distributions shown in (ac) indicate the count of presence (red) and absence (black) grid cells according to their predicted probability. The receiver operating characteristic (ROC) curves and area under the ROC curves (AUC) shown in (df) were also generated from the testing datasets. The optimum probability thresholds (i.e., closest to TPR = 1.0 and FPR = 0.0) are shown as dotted vertical lines in (ac) and solid circles in (df).
Forests 13 01200 g005
Figure 6. Evaluation of the multiple logistic general linear model (GLM), general additive model (GAM), and spatial-GAM used to model human-caused wildfire ignition probability. Model definitions are shown in Table 1. Each model was applied to the held-back testing datasets and the frequency distributions shown in (ac) indicate the count of presence (red) and absence (black) grid cells according to their predicted probability. The receiver operating characteristic (ROC) curves and area under the ROC curves (AUC) shown in (df) were also generated from the testing datasets. The optimum probability thresholds (i.e., closest to TPR = 1.0 and FPR = 0.0) are shown as dotted vertical lines in (ac) and solid circles in (df).
Figure 6. Evaluation of the multiple logistic general linear model (GLM), general additive model (GAM), and spatial-GAM used to model human-caused wildfire ignition probability. Model definitions are shown in Table 1. Each model was applied to the held-back testing datasets and the frequency distributions shown in (ac) indicate the count of presence (red) and absence (black) grid cells according to their predicted probability. The receiver operating characteristic (ROC) curves and area under the ROC curves (AUC) shown in (df) were also generated from the testing datasets. The optimum probability thresholds (i.e., closest to TPR = 1.0 and FPR = 0.0) are shown as dotted vertical lines in (ac) and solid circles in (df).
Forests 13 01200 g006
Figure 7. Partial effects plots for slope (a), maximum NDVI (b) and longitude and latitude (c) included in the spatial-GAM built to predict the probability of a naturally ignited wildfire, p r ( y n ) .
Figure 7. Partial effects plots for slope (a), maximum NDVI (b) and longitude and latitude (c) included in the spatial-GAM built to predict the probability of a naturally ignited wildfire, p r ( y n ) .
Forests 13 01200 g007
Figure 8. Partial effects plots for slope (a), maximum NDVI (b), distance to roads (c) and longitude and latitude (d) included in the spatial-GAM built to predict the probability of a human-caused wildfire ignition, p r ( y h ) .
Figure 8. Partial effects plots for slope (a), maximum NDVI (b), distance to roads (c) and longitude and latitude (d) included in the spatial-GAM built to predict the probability of a human-caused wildfire ignition, p r ( y h ) .
Forests 13 01200 g008
Figure 9. Maps of (a) the probability of a naturally ignited wildfire, p r ( y n ) , and (c) the probability of a human-caused wildfire, p r ( y h ) , in Montana obtained from the spatial-GAMs. Distributions of the proportion of land area as functions of p r ( y n ) and p r ( y h ) are shown in (b) and (d), respectively.
Figure 9. Maps of (a) the probability of a naturally ignited wildfire, p r ( y n ) , and (c) the probability of a human-caused wildfire, p r ( y h ) , in Montana obtained from the spatial-GAMs. Distributions of the proportion of land area as functions of p r ( y n ) and p r ( y h ) are shown in (b) and (d), respectively.
Forests 13 01200 g009
Figure 10. Shown in (a,b) are the distributions of annual ignition densities ( A I D , # yr 1 km 1 ) for natural (subscript n) and human-caused (subscript h) wildfires calculated within each probability interval (subscript i). Shown in (c,d) are the frequency distributions of the yearly (subscript y) count of natural and human-caused ignitions along with their mean values.
Figure 10. Shown in (a,b) are the distributions of annual ignition densities ( A I D , # yr 1 km 1 ) for natural (subscript n) and human-caused (subscript h) wildfires calculated within each probability interval (subscript i). Shown in (c,d) are the frequency distributions of the yearly (subscript y) count of natural and human-caused ignitions along with their mean values.
Forests 13 01200 g010
Table 1. Summary of the multiple logistic general linear model (GLM), the general additive model (GAM), and spatial-GAM. Formulas are presented for modeling the probability of a natural ignition, p r ( y n ) , and human-caused ignition, p r ( y h ) , where α is the intercept, v 1 = slope, v 2 = maximum NDVI, v 3 = distance to roads, x = longitude, and y = latitude. The s ( ) functions indicate the smoothed terms in the GAMs. The area under the receiver operating characteristic (ROC) curve (AUC) is based on the testing data, and the ROCs themselves are shown in Figure 5 and Figure 6, respectively.
Table 1. Summary of the multiple logistic general linear model (GLM), the general additive model (GAM), and spatial-GAM. Formulas are presented for modeling the probability of a natural ignition, p r ( y n ) , and human-caused ignition, p r ( y h ) , where α is the intercept, v 1 = slope, v 2 = maximum NDVI, v 3 = distance to roads, x = longitude, and y = latitude. The s ( ) functions indicate the smoothed terms in the GAMs. The area under the receiver operating characteristic (ROC) curve (AUC) is based on the testing data, and the ROCs themselves are shown in Figure 5 and Figure 6, respectively.
VersionCauseFormulaAUC
GLMNatural
Human
p r ( y n ) = α + v 1 + v 2
p r ( y h ) = α + v 1 + v 2 + v 3
0.74
0.73
GAMNatural
Human
p r ( y n ) = α + s ( v 1 ) + s ( v 2 ) + s ( v 3 )
p r ( y h ) = α + s ( v 1 ) + s ( v 2 ) + s ( v 3 )
0.77
0.77
Spatial GAMNatural
Human
p r ( y n ) = α + s ( v 1 ) + s ( v 2 ) + s ( x , y )
p r ( y h ) = α + s ( v 1 ) + s ( v 2 ) + s ( v 3 ) + s ( x , y )
0.84
0.89
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Jiménez-Ruano, A.; Jolly, W.M.; Freeborn, P.H.; Vega-Nieva, D.J.; Monjarás-Vega, N.A.; Briones-Herrera, C.I.; Rodrigues, M. Spatial Predictions of Human and Natural-Caused Wildfire Likelihood across Montana (USA). Forests 2022, 13, 1200. https://doi.org/10.3390/f13081200

AMA Style

Jiménez-Ruano A, Jolly WM, Freeborn PH, Vega-Nieva DJ, Monjarás-Vega NA, Briones-Herrera CI, Rodrigues M. Spatial Predictions of Human and Natural-Caused Wildfire Likelihood across Montana (USA). Forests. 2022; 13(8):1200. https://doi.org/10.3390/f13081200

Chicago/Turabian Style

Jiménez-Ruano, Adrián, William M. Jolly, Patrick H. Freeborn, Daniel José Vega-Nieva, Norma Angélica Monjarás-Vega, Carlos Iván Briones-Herrera, and Marcos Rodrigues. 2022. "Spatial Predictions of Human and Natural-Caused Wildfire Likelihood across Montana (USA)" Forests 13, no. 8: 1200. https://doi.org/10.3390/f13081200

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop