Next Article in Journal
Assessment of Burned Forest Area Severity and Postfire Regrowth in Chapada Diamantina National Park (Bahia, Brazil) Using dNBR and RdNBR Spectral Indices
Next Article in Special Issue
Mineralogical and Chemical Investigations of the Amguid Crater (Algeria): Is there Evidence on an Impact Origin?
Previous Article in Journal
Variation of Ground Temperature along the Stratum Depth in Ice-rich Tundra of Hinggan Mountains Region, NE China
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

A Machine Learning-Based Approach for Wildfire Susceptibility Mapping. The Case Study of the Liguria Region in Italy

1
Institute of Earth Surface Dynamics, Faculty of Geosciences and Environment, University of Lausanne, CH-1015 Lausanne, Switzerland
2
CIMA Research Foundation, 17100 Savona, Italy
*
Author to whom correspondence should be addressed.
Geosciences 2020, 10(3), 105; https://doi.org/10.3390/geosciences10030105
Submission received: 30 January 2020 / Revised: 14 March 2020 / Accepted: 16 March 2020 / Published: 18 March 2020
(This article belongs to the Special Issue 2020: A 10 Years Journey-Advances in Geosciences)

Abstract

:
Wildfire susceptibility maps display the spatial probability of an area to burn in the future, based solely on the intrinsic local proprieties of a site. Current studies in this field often rely on statistical models, often improved by expert knowledge for data retrieving and processing. In the last few years, machine learning algorithms have proven to be successful in this domain, thanks to their capability of learning from data through the modeling of hidden relationships. In the present study, authors introduce an approach based on random forests, allowing elaborating a wildfire susceptibility map for the Liguria region in Italy. This region is highly affected by wildfires due to the dense and heterogeneous vegetation, with more than 70% of its surface covered by forests, and due to the favorable climatic conditions. Susceptibility was assessed by considering the dataset of the mapped fire perimeters, spanning a 21-year period (1997–2017) and different geo-environmental predisposing factors (i.e., land cover, vegetation type, road network, altitude, and derivatives). One main objective was to compare different models in order to evaluate the effect of: (i) including or excluding the neighboring vegetation type as additional predisposing factors and (ii) using an increasing number of folds in the spatial-cross validation procedure. Susceptibility maps for the two fire seasons were finally elaborated and validated. Results highlighted the capacity of the proposed approach to identify areas that could be affected by wildfires in the near future, as well as its goodness in assessing the efficiency of fire-fighting activities.

1. Introduction

Wildfires represent a hazardous and harmful phenomenon to people and the environment, especially in populated areas. In these areas, the primary cause of ignition is related to human activities. In Europe, for example, 95% of wildfire events are estimated to be induced by humans [1].
Among the weather-induced emergencies, wildfires constitute one of the more complex scenarios. Specifically, in the Mediterranean basin, wildfires are responsible for substantial damaging effects [2]. According to the Advance EFFIS Report on Forest Fires 2017 [3], Italy, Greece, Portugal, and Spain have been affected by a total of 43,733 fires, over a total an area of 89,4244 ha. Despite the financial support of the European Commission and the efforts by local wildfire administrations to prevent and fight wildfires, in the European Union (EU), in 2017, wildfires burnt over 1.2 million ha of natural lands, causing the death of 127 people among fire fighters and civilians. Over 25% of the total burnt area was in the different Natura 2000 sites, undermining the efforts in preserving natural habitats and key biodiversity for future generations. The losses caused by these fires amounted to 10 billion euros, as estimated by the European Forest Fire Information System [3].
Mapping current and past wildfire events by identifying their location, burned area, starting date, and duration, provides a precious tool for addressing prevention-planning programs aiming to reduce human and material losses. These inventories are an important source of information for the elaboration of hazard, risk, and susceptibility maps. Namely, hazard maps represent the zonation of the spatio-temporal probability of events, while the expected damages or losses are assessed and represented on risk maps. Susceptibility maps do not account for the temporal dimension and indicate zones with a potential to experience a particular hazard in the future, based solely on the intrinsic local properties of a site, expressed in terms of relative spatial likelihood. This definition relies on the basic assumption that future events are expected to occur under similar geo-environmental conditions to those observed in past events. Although these concepts are well-consolidated in the research area related with the risk assessment, especially for landslides [4,5,6,7], there is a need for elaborating susceptibility and risk maps for other natural hazards and to develop new quantitative and robust methods supporting their production.
In the field of wildfire risk assessment, fire risk has been defined as a quantitative or qualitative indicator of the likelihood that an area would burn in a certain period of time [8,9,10,11]. In this context, modeling fire risks represents a modern tool to support forest protection plans and address fuel management strategies in order to reduce fire consequences [12,13,14]. More generally, risk and susceptibility analyses are of great importance for land use planning, civil protection, and risk reduction programs. A number of techniques were recently developed to monitor and map the spatial distribution of a burned area and to predict the area at risk of wildfires. These often involve the implementation of physically-based models integrated into a Geographic Information System (GIS), relying on expert knowledge or including statistical analyses and modeling to assess the importance of the predisposing factors [15,16,17,18,19,20,21,22,23,24]. Lately, the comparison of deterministic physically-/statistically-based and stochastic approaches highlights the benefit of using data driven methods [25,26,27,28,29,30,31,32,33] that are able to extract knowledge directly from data. Machine learning has proven to be a mature and reliable tool in quantitative wildfire-related studies, as for the estimation of standing crop and fuel moisture content, often outperforming standard statistical approaches [34,35,36]. In comparison with deterministic methods, which, given a set of initial conditions (i.e., predisposing factors), always give the same results, stochastic models assume that results obtained through the combination of independent predisposing factors can be slightly different, as a consequence of the randomness of the process.
Although the terms “risk” and “susceptibility” are often used as synonymous, hereinafter we refer to “susceptibility mapping”, meaning that the probability for an area to burn is assessed with no consideration of the magnitude of the single wildfire event or of its temporal dimension. Indeed, only the spatial probability for an area to burn in the future, assessed by defining a rank from low to high, is evaluated. This quantitative evaluation is performed considering two aspects: where wildfires occurred in the past, in terms of burned area, and which are the geo-environmental and anthropogenic predisposing factors that favor their spread. In this regard, it is worth noting that meteorological factors, such as wind speed and wind direction, temperature, humidity, and rainfall, are considered as triggering and not predisposing factors. The trigger is a local condition that causes a risk to occur if the area is susceptible to that risk, while the susceptibility is assessed based on predisposing factors, stable over time. The proximity to road and pathway networks and to urban and recreation areas are most frequently mentioned as predisposing human factors for wildfire [37,38,39,40,41,42,43,44]. As regards geo-environmental variables, those related to vegetation type and topography are the most significant drivers, especially in Mediterranean-type regions [45,46,47,48].
Italy is particularly affected by wildfires because of the high topographic and vegetation heterogeneity of the territory, as well as the favorable climate conditions that characterize the entire Mediterranean basin. Wildfires are more frequent and larger in the summer (May–October) than in the winter (November–April) season in almost all Mediterranean countries, since it is hotter and drier. The Liguria region in Italy represents an exception, as it is highly affected by wildfires during the entire year and it sees a number of events and burned areas that can be more important in winter than in summer. Nevertheless, the spatial distribution of burned areas differs in the two seasons, probably due to the vegetation phenology at different altitudes in terms of plant senescence. Therefore, it is important to separately assess wildfire susceptibility in Liguria for the summer and winter season.
In this study, we adopted a stochastic approach based on a machine learning (ML) algorithm to elaborate wildfire susceptibility mapping for Liguria. The ML includes a class of algorithms for the analysis, modelling, and visualization of environmental data and performs particularly well to model environmental hazards, which naturally have a complex and non-linear behavior [49,50]. At present, only a few studies exist in literature on the use of this kind of approach for wildfire susceptibility and risk mapping. Moreover, a rigorous procedure to select the predisposing factors and to validate the results needs to be the established. To fill this gap, we developed different models and we evaluated their prediction performance by considering the effect of: (i) including or excluding the neighboring vegetation type as a predisposing factor and (ii) using an increasing number of folds in the spatial cross-validation procedure.

2. Study Area

Liguria has a total area of 5400 km2, found between the Cote d’Azur (France) and Tuscany (Italy) on the northwest coast of the Tyrrhenian Sea. This Mediterranean region is characterized by a complex topography (with a slope that is higher than 40% over 50% of the total area) and dense and heterogeneous vegetation (with more than 70% of its surface covered by forests, see Figure 1).
During the investigated period, 1997–2017, an average of 365 wildfires burnt an area of 55 km2 per year and were a recurrent phenomenon both in the summer and winter. The winter fire regime is mainly due to frequent extremely dry winds from the north in conditions of curing, for most of the herbaceous species, resulting in a number of wildfires and burned areas that can be higher than in summer (Figure 2).
In 1950, the forest cover was limited to 30% of the regional surface, as most of the areas were subject to grazing activities. After World War II, rural communities were engaged in many reforestation programs, using different Mediterranean pine species. Pines and shrub species that are highly flammable spread quickly as agricultural and grazing activities were abandoned. Furthermore, the rural exodus and urbanization that followed the war greatly extended the Wildland–Urban Interface.

3. Materials and Methods

3.1. The Dataset: Burned Area and Predisposing Factors

The dependent variable of our model, allowing for the assessment of the susceptibility of the region to wildfires, was the burned area available as mapped burned areas and spanning a 21-year period (1997–2017). This dataset was acquired and compiled in shapefile format by the regional forestry service, on the base of GPS-surveying and subsequent digitalization over the cadastral map (scale 1:10,000). It is worth observing that fire perimeters were also affected by the capacity of intervention, which has not been considered in the analysis because of a lack of homogenization of the data.
The seasonality of the fire regime was also considered, partitioning the dataset of burned areas in two macro seasons: winter (November–April) and summer (May–October) (Figure 3)
The following independent variables provided detailed knowledge of topography and land cover, allowing for the understanding of the main features involved in wildfire occurrences and their behavior: DEM (digital elevation model) and derivatives (slope, northness, and eastness), distance to anthropogenic features (urban area, road, pathways, crops), protected areas, vegetation type, and neighboring vegetation.
In more detail, northerness and easterness, corresponding respectively to the cosine and to the minus sign of the aspect angle, were considered instead of the pure aspect angle (i.e., the terrain orientation) to avoid the use of a circular variable. The distance values were evaluated by computing, for each pixel, the Euclidean distance to the closest considered element. Protected areas were introduced as a binary variable, computing for each pixel, if it falls inside or outside of these delimited areas. Information on the type of vegetation came from the land use map (scale 1:10,000); furthermore, the class “forest” was reclassified by splitting the corresponding areal units into the sub-classes identified on the regional map of the forest types (scale 1:25,000). The final map included more than one hundred classes, which, for the purposes of the present study, were aggregated into 37 classes based on the flammability of each vegetation type. The non-vegetated areas, which came from the abovementioned land use map (including urban area, industrial and commercial units, road and rail network, port and airport area, bare rocks, and water bodies), were unified into a unique class under the label “non-flammable area”. All these digital layers (Table 1) were provided by the authority of the Liguria region and are available on the official geo-portal (https://geoportal.regione.liguria.it/). The ensemble of the spatial layers, in raster format, were resampled and coregistered (i.e., spatially aligned) to match the same reference image with a spatial resolution of 100 m.
In addition, a set of new variables, taking into account the vegetation cover in the proximity of each pixel, was estimated as the percentage of each classes (including the 37 vegetation types and the non-flammable area) within a 300 by 300 m neighborhood distance. This resulted in 38 additional variables identifying the neighboring vegetation (Table 1). To prove if the neighboring vegetation type enables a rise in the predictive performance of the model, and consequently if this factor needs to be included in this kind of study, two models were tested: the first did not consider the neighboring vegetation (hereafter defined “standard model”) and the second did account for this factor (hereafter defined “neighboring vegetation model”).

3.2. Modeling Procedure: Machine Learning Approach

ML is based on algorithms capable of learning from and making predictions on data, through the modeling of the hidden relationships between a set of input and output variables, representing the predisposing factors (independent variables) and the occurrences of the phenomenon (dependent variable).
A well-established procedure to validate a model is to split the dataset into training, validation, and testing. The “training dataset” is needed to calibrate the parameters of the model, which will be used to get predictions on new data, that is the “validation dataset”. Thus, the ultimate purpose of the validation dataset was to provide an unbiased evaluation of the model’s fitness. Indeed, a good model is the one which gives accurate predictions on new data and avoids overfitting and underfitting.
Lastly, to provide an unbiased evaluation of the final model and to assess its performance (i.e., generalization), results were predicted over unused observations, defined as the “testing dataset”. Model generalization refers to the ability of a model to perform good predictions on new/previously unseen data, drawn from the same distribution as the data used to create the model. This concept is closely related with overfitting: this happens when a model performs well on the training data but is unable to make good predictions on new data. Thus, a model that perfectly fits the training data normally displays a large generalization error.
A binary classification problem, such as the prediction of burned and unburned areas in the present case study, can be solved by counting how many times each observation is classified as “one” (burning) or “zero” (non-burning) and normalizing the result over the total number of predictions. This provided “probabilistic outputs”, the values of which ranged between 0 and 1, which can be used to elaborate susceptibility maps. In these, the value of each pixel represents the probabilistic predicted value of burning in the future. The proposed approach involves the generation of pseudo-absences to ascertain an unburned area. Indeed, to assure a good generalization of the model, avoiding the overestimation of the low classes (i.e., with a probabilistic value close to 0), pseudo-absences need to be generated in all cases where they are not explicitly expressed (e.g., wildfire location is known, normally as mapped burned areas, but the not-burned areas have to be defined). We solved this problem by randomly generating a number of absences equal to the number of presences (i.e., equal to the number of pixels burning).
Analyses were performed using random forest [51], an ensemble ML algorithm based on decision trees. The most important hyperparameters that need to be specified are the number of decision trees (ntree) and the number of variables randomly sampled as candidates at each split (mtry). As a general rule, ntree should be large enough to ensure that every observation (i.e., input rows) gets predicted at least a few times. The standard value for mtry in classification problems is equal to the square root of the number of predictors (i.e., the independent variables), but this value can be optimized. Operationally, the algorithm generated ntree-subsets of the training dataset by bootstrapping (i.e., random sampling with replacement), each subset counting about two-thirds of the observations. A decision tree was then generated for each subset, considering a reduced number of variables (mtry), randomly selected. At each node, the Gini impurity was computed and the variable that minimized this value was selected as the best one for the split. Specifically, Gini impurity represented the probability of classifying incorrectly an observation in the dataset if it were classified randomly, according to the class distribution in the dataset. This process was iterated up to the maximum level and it could stop when each node contained less than a fixed number of data points. For a classification problem, the prediction of new data was finally computed, taking the maximum voting, which can be converted into a probabilistic output. The hyperparameters of the model were optimized by evaluating the prediction-error on those observations that were not used in the training subsets (called “out-of-bag”). In this study, hyperparameters were set to 750 for ntree and to the rounded up square root of the number of predictor factors for mtry, both optimized by applying a trial and error process.

3.3. Model Validation

As mentioned above, a well-established procedure in ML to validate a model is to split the dataset into training, validation, and testing.
When dealing with a spatial environmental phenomenon, if the validation dataset is selected randomly, observations can be located close to the training dataset, leading to an over-estimation of the predictive performance of the model. This circumstance is known as “spatial autocorrelation”, meaning that observations close to each other hold similar characteristics. To overcome this issue, training and validation data have to be selected far enough apart in the geographic space. To this end, in the present study, we adopted the spatial k-fold cross validation using an increasing number of folds. Methodologically, the k-fold cross validation consisted of splitting the dataset into k folds, holding out a fold at a time, training the model on the remaining k-1 folds and finally validating the model using the hold out fold. The process was repeated k-times and the evaluation scores resulting from each folding were finally averaged. In the same way, the final prediction value was computed as the arithmetical mean among all the predictions estimated from each folding. We compared three models, considering 1-, 5-, and 9-folds. The space was organized into spatial blocks of 15 × 15 km, resulting in 50 blocks covering the entire study area. In the case of 1-fold, the validation dataset was defined by randomly selecting 10 blocks only once, which corresponded to the random selection of the 20% block of observations. In the case of 5- and 9-folds (Figure 4), each single fold included, respectively, 10 blocks (corresponding to the 20% of observations) and 5 or 6 blocks (corresponding to about 10% of the observations).
The testing dataset was defined by splitting the original dataset by years: Mapped burned areas observed in the period 1997–2011 were used in the training procedure, while the last 6 years (2012–2017), corresponding to about 25% of the entire dataset, were held out for testing the predictive performance of the model. This was assessed by computing the percentage of an area with a probabilistic predicted value of burning within a certain range, falling inside the burned areas in the testing dataset (Testing BA, Table 2). Finally, the model that predicted better was the one that resulted in a higher percentage for high range values and a lower percentage for low range values. The probabilistic predicted values were ranked into five classes, defined by the following percentile ranking ranges: 25th, 50th, 75th, 90th, and 95th. It is worth pointing out that the 25th percentile represented 25% of the area with the lowest probabilistic predicted values of burning, while the 95th percentile represented the 5% of the area with the highest probabilistic values, and so on for the intermediate ranges. The rank-limits corresponded to different values of the probabilistic output (Prob_value, Table 2) resulting from each model, allowing a flexible interpretation of the results. In addition, keeping in mind the goal of delivering unbiased results by adopting different fitness estimators, the root mean square error (RMSE) was also computed (Table 3). The RMSE is based on the difference between the predicted outputs and the observed true values, where observations can only assume the value “one” (if the area burned) or “zero” (if the area did not burn), while the predicted output is a probabilistic value expressed as a floating number between zero and one.

4. Results

The main results of the present study are the following: (i) the models comparison, including the “standard” vs. the “neighboring vegetation” model and the use of 1- vs. 5- vs. 9-folds for the spatial cross validation; (ii) the probabilistic predicted values as the main output of random forest, allowing to elaborate susceptibility maps of wildfires for the winter and summer seasons.

4.1. Models Comparison

The values indicated in Table 2 allow comparing and evaluating the predictive performance of different models: the “standard” vs. the “neighboring vegetation” model and the use of 1- vs. 5- vs. 9-folds spatial cross validation for the winter (values above) and summer seasons (values below). In addition to the percentile ranking ranges (i.e., “Classes”), the corresponding probabilistic predicted value (i.e., Prob_value) is specified. The field “Total area” indicates the surface predicted by the model to fall within each range, normalized over the entire area. The field “Predicted BA” represents the fraction of the “Total area” falling inside the burned area detected in the testing dataset (i.e., 2012–2017 mapped burned areas). For example, looking at one-fold standard model, the first class (25th percentile) indicates that only 5.07% of the 25% of the area with the lowest probabilistic predicted values of burning (corresponding to <0.13), fell inside the burned area in the testing dataset. On the opposite extreme, the 95th percentile indicates that about 47% of the 5% of the area with the highest probabilistic predicted values (corresponding to >0.91), fell inside the burned area in the testing dataset. Thus, a way to compare the different models is to evaluate which model predicts the largest area with the highest probability of burning falling inside the burned area in the testing dataset (hereafter defined for brevity as “predicted burned area”), and vice-versa for the lower classes. To facilitate this comparison, the last 75th percentile (i.e., 25% of the area with the highest probabilistic predicted value of burning) was considered and discussed in the following. Results show that the neighboring vegetation model performed better than the standard model in both seasons. Indeed, the predicted burned area was higher for both 1- and 5-folds cross-validation when the model included the information concerning the type of vegetation and non-flammable area in the neighboring of each pixel. The increment in this case is of about three percentage points or more. Instead, when comparing 5- with 5-folds cross-validation, the predicted burned area increased y only 0.02% in the winter season and 0.87% in the summer season. So, using 9- instead of 5-folds allows to slightly increase the predictive performance of the model, facing the fact that the training algorithm is much more computationally intensive. On the other hand, the predictive performance of using 5-folds cross-validation compared with the 1-fold increased by about four-percentage points in the winter season, when considering both the standard and the neighboring vegetation model, and was 2.24% and only 0.07% for the equivalent models in the summer season. Despite this last low value, which will be discussed later, overall, using 5-folds cross-validation results in a better predictive performance of the model than selecting a random 20% of the dataset by blocks for validation (corresponding to 1-fold).
The root mean square errors computed for all the models (Table 3) confirm these results. As expected, all RMSEs were lower than 0.5, as, on average, each model performed good prediction of burned and unburned areas. The neighboring vegetation model performed better than the standard model in both seasons. Increasing the number of folds in the cross validation allowed to increase the model’s performance in all the cases, except in the case of the neighboring vegetation model in the summer season, which performed the same, regardless of the number of folds.
All the models performed better in winter than in summer. In the winter, the values of the testing burned area, considering the last 75th percentile, ranged from about 80% (1-fold “standard model”) to 87% (5- and 9-folds “neighboring vegetation model”). In the summer, these values were significantly lower, ranging from about 70% (1-fold “standard model”) to 75% (5- and 9-folds “neighboring vegetation model”). These performances were confirmed by the RMSE, showing lower values in summer than in winter for the same model.
The neighboring vegetation model using 5-folds cross-validation performed better. This last model was finally evaluated by computing the fraction of predicted burning area above the 75th percentile falling inside the testing burned areas for each single year and by season. Values are expressed both as percentage and number of pixels (Table 4). This allowed for the investigation of the influence of the testing dataset on the predictive performances of the model. In the winter season, the model performed well every year, with values ranging from 83.4% in 2013 to 91.7% in 2016. In the summer season, performances were still quite good in the first five testing periods (years 2012–2016), while, in the last year (2017), the assessed value dropped to 45.7%. At the same time, we noticed that the size of the burned area, which highly differed in each testing period, did not compromise the predictive performance of the model. More specifically, this result seems to imply that something during the 2017 wildfire summer season in Liguria did not work as predicted by the model.

4.2. Susceptibility Mapping

Random forest gives as an output a probabilistic value, expressing the probability for each pixel to burn in the future under the assumption of a set of predisposing variables. These values were used to elaborate wildfire susceptibility maps in the winter and summer seasons (Figure 5). To this end, we retained the results that were obtained by the model that performed better, which was the neighboring vegetation model using 5-folds cross-validation. The two seasons displayed different behavior in relation to the predicted susceptibility to wildfires.
Higher classes (above the 90th percentile) characterized the coastal area in summer and the more elevated inland areas in winter. This could be due to the state of the vegetation: In winter, vegetation is more stressed and senescent at a higher altitude due to the lower temperature in altitude. On the contrary, in summer, vegetation is dryer and more burnable at lower elevations because of the high temperature and the dry weather. Finally, even though the two models implemented separately for the winter and the summer seasons used the same independent variables as the input, random forest succeeded in discriminating among the two patterns in the distribution of the wildfire-susceptible areas, thanks to the training procedure.

5. Discussion

In a very recent review paper, Jain et al. [52] found a total of 298 publications, from 1996 to 2019, relevant to the topic of machine learning applications in wildfire science and management, with a very strong increase in the last five years of investigation. Among them, a considerable number of references (71) used various ML algorithms to map wildfire susceptibility. Nevertheless, only a few papers considered time series longer than a few years, allowing to assess the predictive performance of the model in the near future [26,31,53] or separately in summer and winter seasons [54].
In the present paper, we introduced an innovative ML approach, based on random forest, which allowed the elaboration of the wildfire susceptibility map for the Liguria region in Italy. Susceptibility was assessed by evaluating the probability for an area to burn in the future, taking into account the spatial extension of past-burned areas and the geo-environmental factors that favor their occurrence (i.e., DEM and its derivatives, distance to an anthropogenic feature, protected area, vegetation type). An alternative model, including the neighboring vegetation type at each location, was developed and compared with the standard model. For validation purposes, we adopted the spatial k-fold cross-validation (with k = 1, 5, and 9) and then we evaluated the predictive performances of the different models over an independent testing dataset. Finally, we compared the standard vs. the neighboring vegetation model and the use of 1- vs. 5- vs. 9-folds spatial cross-validation, both for the winter and summer seasons. The results show that: (1) the neighboring vegetation model performed better than the standard model in both seasons; (2) 5 represents the optimal number of folds; (3) all the models performed better in the winter season than in the summer season.
The predictive performance of the models implemented in the present study was globally good, as proved by their high capacity of discriminating most of the burned area within the 75th percentile for most of the testing periods (2012 to 2017) and in both the wildfire seasons. This attests the good generalization capabilities of our models. The neighboring vegetation model using 5-folds cross-validation gave the best results in terms of predictive performance and was retained to elaborate the winter and the summer wildfire susceptibility maps. The performance of this model in the summer 2017 testing period manifested the only notable exception to the overall good prediction capability and it will be the object of discussion in the following. Instead of considering this fact a downside of the proposed modeling framework, it shifted the attention towards a specific situation: Fire management in summer 2017. As a matter of fact, it is worth considering that the Italian Forestry Corp, in charge of fire management since 1984, was dismissed at the end of 2016 (DLGS 19/08/2016 n. 177). To counter this, the Liguria region named Italian Fire Fighters (CNVVF) in charge of forest fire management. Before 2017, the role of CNVVF was limited to the management of fire in the Wildland–Urban Interface and was mainly restricted to the safeguard of civilians (protection of houses and infrastructures). In January 2017, a week of severe fire danger caused the burning of about 3000 ha, most of them characterized by high or extreme fire risk. However, the summer season of 2017 was atypical, not only in terms of fire management procedures, but also with respect to the meteorological conditions. This season was in fact characterized by a long drought, but with a remarkable average relative humidity higher than 67%. Only a couple of days were characterized by relative humidity lower than 40%. These meteorological conditions favored the mop up phase, which caused each single fire to be reignited several times, extending the fire propagation for many days.
In the period ranging from 1st May to 31st October 2017, 205 wildfires burned a total area of 927 ha. Only six events burned an area greater than 50 ha each, resulting in 554 ha (corresponding to 60% of the total burned area). These six events were highlighted for a better understanding of the implication that the new management can have on the predictive performance of the model. As is evident in Figure 6, most of the burned area within these six largest wildfires was predicted by the model as belonging to a middle-to-low level of susceptibility. This result reflects that random forest could not take into account the management issues of 2017: The underestimation of the mop up phase. However, it successfully predicted the areas to be burned in the future under unchanged geo-environmental and management conditions, as proven by the good performances of the model in the previous testing periods.

6. Conclusions

The proposed approach proved to be globally effective to deal with large, high dimensional spatial data, which reflects the great flexibility of machine learning in general. In particular, one advantage of random forest is its capability of handling directly categorical variables, such as land use classes or vegetation types.
The importance of considering the vegetation cover in the proximity of each pixel, specifically the type of vegetation, as an additional variable is proved by the results of the present research, and we recommend including this factor in similar studies. Moreover, we recommend the use of k-fold cross-validation to improve the predictive performance of machine learning-based models seeking to assess the susceptibility of hazardous events in general and wildfires in particular.
Ultimately, the results of the present study highlight the importance of taking into account possible changes of surrounding conditions, which can affect the predictive performance of the model in space and in time. Indeed, despite the rapid capacity of intervention of the Italian Fire Fighters, the fire brigade did not consider the mop up phase, which, in summer 2017, caused each single fire to be reignited several times. These considerations put in evidence the capacity of the proposed approach to identify the efficiency of fire-fighting activities, specifically if fire extinction procedures are handled with different modalities compared with the procedures used previously and implicitly valued in the model. Before 2017, the different tactics of fire management could have resulted in a lower extension of the burned areas, as identified by the models and results from the susceptibility map.

Author Contributions

Data curation, G.B. and S.D.E.; Formal analysis, M.T. and M.D.; Funding acquisition, M.T. and P.F.; Investigation, M.T. and M.D.; Methodology, M.T.; Project administration, P.F.; Software, M.D.; Supervision, P.F. and M.T.; Validation, M.T. and M.D.; Visualization, G.B.; Writing—original draft, M.T.; Writing—review & editing, M.T., A.T. and P.F. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded Swiss National Science Foundation (FNS): IZSEZ0_186483.

Acknowledgments

The authors acknowledge the Italian Civil Protection Department - Presidency of the Council of Ministers, who founded this research through the convention with the CIMA Research Foundation, for the development of knowledge, methodologies, technologies, and training, useful for the implementation of national wildfire systems of monitoring, prevention, and surveillance. A visiting period of Mirko D’Andrea, from CIMA, at the Institute of Earth Surface Dynamics (UNIL), has been possible thanks to a scientific exchange grant funded by the Swiss National Science Foundation (IZSEZ0_186483). The authors are grateful to Regione Liguria and the Liguria’s regional direction of the Italian National Fire Corps for data providing and for the stimulating discussion about the summer fire season 2017. Thanks to the anonymous reviewers for helping to improve the manuscript.

Conflicts of Interest

The authors declare no conflict of interest.

References

  1. San-Miguel-Ayanz, J.; Schulte, E.; Schmuck, G.; Camia, A.; Strobl, P.; Liberta, G.; Giovando, C.; Boca, R.; Sedano, F.; Kempeneers, P.; et al. Comprehensive monitoring of wildfires in Europe: The European Forest Fire Information System (EFFIS). In Approaches to Managing Disaster—Assessing Hazards, Emergencies and Disaster Impacts; Tiefenbacher, J., Ed.; InTech: London, UK, 2012; ISBN 978-953-51-0294-6. [Google Scholar]
  2. Amraoui, M.; Pereira, M.G.; DaCamara, C.C.; Calado, T.J. Atmospheric conditions associated with extreme fire activity in the Western Mediterranean region. Sci. Total Environ. 2015, 524–525, 32–39. [Google Scholar] [CrossRef]
  3. San-Miguel-Ayanz, J.; Durrant, T.; Boca, R.; Libertà, G.; Branco, A.; de Rigo, D.; Ferrari, D.; Maianti, P.; Vivancos, T.A.; Schulte, E.; et al. Forest Fires in Europe, Middle East and North Africa 2017. Available online: https://www.driver-project.eu/forest-fires-in-europe-middle-east-and-north-africa-2017-the-new-jrc-report/ (accessed on 17 March 2020).
  4. Mihalić Arbanas, S. Landslide hazard, risk assessment and prediction: Landslide inventories and susceptibility, hazard mapping methods, damage potential—Part 2. In Advancing Culture of Living with Landslides; Mikos, M., Tiwari, B., Yin, Y., Sassa, K., Eds.; Springer International Publishing: Cham, Switzerland, 2017; pp. 695–698. ISBN 978-3-319-53497-8. [Google Scholar]
  5. Hervás, J.; Bobrowsky, P. Mapping: Inventories, susceptibility, hazard and risk. In Landslides—Disaster Risk Reduction; Sassa, K., Canuti, P., Eds.; Springer Berlin Heidelberg: Berlin/Heidelberg, Germany, 2009; pp. 321–349. ISBN 978-3-540-69966-8. [Google Scholar]
  6. Van Den Eeckhaut, M.; Hervás, J. State of the art of national landslide databases in Europe and their potential for assessing landslide susceptibility, hazard and risk. Geomorphology 2012, 139–140, 545–558. [Google Scholar] [CrossRef]
  7. Fell, R.; Corominas, J.; Bonnard, C.; Cascini, L.; Leroi, E.; Savage, W.Z. Guidelines for landslide susceptibility, hazard and risk zoning for land-use planning. Eng. Geol. 2008, 102, 99–111. [Google Scholar] [CrossRef] [Green Version]
  8. Finney, M.A. The challenge of quantitative risk analysis for wildland fire. For. Ecol. Manag. 2005, 211, 97–108. [Google Scholar] [CrossRef]
  9. Hardy, C.C. Wildland fire hazard and risk: Problems, definitions, and context. For. Ecol. Manag. 2005, 211, 73–82. [Google Scholar] [CrossRef]
  10. Watts, J.M.; Hall, J.R. Introduction to fire risk analysis. In SFPE Handbook of Fire Protection Engineering; Hurley, M.J., Gottuk, D., Hall, J.R., Harada, K., Kuligowski, E., Puchovsky, M., Torero, J., Watts, J.M., Wieczorek, C., Eds.; Springer: New York, NY, USA, 2016; pp. 2817–2826. ISBN 978-1-4939-2564-3. [Google Scholar]
  11. Meacham, B.J.; Charters, D.; Johnson, P.; Salisbury, M. Building fire risk analysis. In SFPE Handbook of Fire Protection Engineering; Hurley, M.J., Gottuk, D., Hall, J.R., Harada, K., Kuligowski, E., Puchovsky, M., Torero, J., Watts, J.M., Wieczorek, C., Eds.; Springer: New York, NY, USA, 2016; pp. 2941–2991. ISBN 978-1-4939-2564-3. [Google Scholar]
  12. Brillinger, D.R.; Preisler, H.K.; John, W. Benoit risk assessment: A forest fire example. Lect. Notes-Monogr. Ser. 2003, 40, 177–196. [Google Scholar]
  13. Catry, F.X.; Rego, F.C.; Bação, F.L.; Moreira, F. Modeling and mapping wildfire ignition risk in Portugal. Int. J. Wildland Fire 2010, 18, 921–931. [Google Scholar] [CrossRef] [Green Version]
  14. Ager, A.A.; Finney, M.A.; Kerns, B.K.; Maffei, H. Modeling wildfire risk to northern spotted owl (Strix occidentalis caurina) habitat in Central Oregon, USA. For. Ecol. Manag. 2007, 246, 45–56. [Google Scholar] [CrossRef]
  15. Eugenio, F.C.; dos Santos, A.R.; Fiedler, N.C.; Ribeiro, G.A.; da Silva, A.G.; dos Santos, Á.B.; Paneto, G.G.; Schettino, V.R. Applying GIS to develop a model for forest fire risk: A case study in Espírito Santo, Brazil. J. Environ. Manag. 2016, 173, 65–71. [Google Scholar] [CrossRef] [PubMed]
  16. Teodoro, A.C.; Duarte, L. Forest fire risk maps: A GIS open source application—A case study in Norwest of Portugal. Int. J. Geogr. Inf. Sci. 2013, 27, 699–720. [Google Scholar] [CrossRef]
  17. Gai, C.; Weng, W.; Yuan, H. GIS-based forest fire risk assessment and mapping. In Proceedings of the 2011 Fourth International Joint Conference on Computational Sciences and Optimization, Kunming and Lijiang City, China, 15–19 April 2011; pp. 1240–1244. [Google Scholar]
  18. Pourghasemi, H.R. GIS-based forest fire susceptibility mapping in Iran: A comparison between evidential belief function and binary logistic regression models. Scand. J. For. Res. 2016, 31, 80–98. [Google Scholar] [CrossRef]
  19. Mohammadi, F.; Bavaghar, M.P.; Shabanian, N. Forest fire risk zone modeling using logistic regression and GIS: An Iranian case study. Small-Scale For. 2014, 13, 117–125. [Google Scholar] [CrossRef]
  20. Vadrevu, K.P.; Eaturu, A.; Badarinath, K.V.S. Fire risk evaluation using multicriteria analysis—A case study. Environ. Monit. Assess. 2010, 166, 223–239. [Google Scholar] [CrossRef]
  21. Kant Sharma, L.; Kanga, S.; Singh Nathawat, M.; Sinha, S.; Chandra Pandey, P. Fuzzy AHP for forest fire risk modeling. Disaster Prev. Manag. Int. J. 2012, 21, 160–171. [Google Scholar] [CrossRef]
  22. Carmel, Y.; Paz, S.; Jahashan, F.; Shoshany, M. Assessing fire risk using Monte Carlo simulations of fire spread. For. Ecol. Manag. 2009, 257, 370–377. [Google Scholar] [CrossRef]
  23. Chuvieco, E.; Aguado, I.; Yebra, M.; Nieto, H.; Salas, J.; Martín, M.P.; Vilar, L.; Martínez, J.; Martín, S.; Ibarra, P.; et al. Development of a framework for fire risk assessment using remote sensing and geographic information system technologies. Ecol. Model. 2010, 221, 46–58. [Google Scholar] [CrossRef]
  24. Chuvieco, E.; Salas, J. Mapping the spatial distribution of forest fire danger using GIS. Int. J. Geogr. Inf. Syst. 1996, 10, 333–345. [Google Scholar] [CrossRef]
  25. Pourtaghi, Z.S.; Pourghasemi, H.R.; Aretano, R.; Semeraro, T. Investigation of general indicators influencing on forest fire and its susceptibility modeling using different data mining techniques. Ecol. Indic. 2016, 64, 72–84. [Google Scholar] [CrossRef]
  26. Leuenberger, M.; Parente, J.; Tonini, M.; Pereira, M.G.; Kanevski, M. Wildfire susceptibility mapping: Deterministic vs. stochastic approaches. Environ. Model. Softw. 2018, 101, 194–203. [Google Scholar] [CrossRef]
  27. Arpaci, A.; Malowerschnig, B.; Sass, O.; Vacik, H. Using multi variate data mining techniques for estimating fire susceptibility of Tyrolean forests. Appl. Geogr. 2014, 53, 258–270. [Google Scholar] [CrossRef]
  28. Oliveira, S.; Oehler, F.; San-Miguel-Ayanz, J.; Camia, A.; Pereira, J.M.C. Modeling spatial patterns of fire occurrence in Mediterranean Europe using Multiple Regression and Random Forest. For. Ecol. Manag. 2012, 275, 117–129. [Google Scholar] [CrossRef]
  29. Ngoc Thach, N.; Bao-Toan Ngo, D.; Xuan-Canh, P.; Hong-Thi, N.; Hang Thi, B.; Nhat-Duc, H.; Dieu, T.B. Spatial pattern assessment of tropical forest fire danger at Thuan Chau area (Vietnam) using GIS-based advanced machine learning algorithms: A comparative study. Ecol. Inform. 2018, 46, 74–85. [Google Scholar] [CrossRef]
  30. Satir, O.; Berberoglu, S.; Donmez, C. Mapping regional forest fire probability using artificial neural network model in a Mediterranean forest ecosystem. Geomat. Nat. Hazards Risk 2016, 7, 1645–1658. [Google Scholar] [CrossRef] [Green Version]
  31. Tehrany, M.S.; Jones, S.; Shabani, F.; Martínez-Álvarez, F.; Tien Bui, D. A novel ensemble modeling approach for the spatial prediction of tropical forest fire susceptibility using LogitBoost machine learning classifier and multi-source geospatial data. Theor. Appl. Climatol. 2019, 137, 637–653. [Google Scholar] [CrossRef]
  32. Rodrigues, M.; de la Riva, J. An insight into machine-learning algorithms to model human-caused wildfire occurrence. Environ. Model. Softw. 2014, 57, 192–201. [Google Scholar] [CrossRef]
  33. Ghorbanzadeh, O.; Blaschke, T.; Gholamnia, K.; Aryal, J. Forest fire susceptibility and risk mapping using social/infrastructural vulnerability and environmental variables. Fire 2019, 2, 50. [Google Scholar] [CrossRef] [Green Version]
  34. Sharma, S.; Ochsner, T.E.; Twidwell, D.; Carlson, J.D.; Krueger, E.S.; Engle, D.M.; Fuhlendorf, S.D. Nondestructive estimation of standing crop and fuel moisture content in Tallgrass Prairie. Rangel. Ecol. Manag. 2018, 71, 356–362. [Google Scholar] [CrossRef]
  35. Sayad, Y.O.; Mousannif, H.; Al Moatassime, H. Predictive modeling of wildfires: A new dataset and machine learning approach. Fire Saf. J. 2019, 104, 130–146. [Google Scholar] [CrossRef]
  36. Féret, J.-B.; le Maire, G.; Jay, S.; Berveiller, D.; Bendoula, R.; Hmimina, G.; Cheraiet, A.; Oliveira, J.C.; Ponzoni, F.J.; Solanki, T.; et al. Estimating leaf mass per area and equivalent water thickness based on leaf optical properties: Potential and limitations of physical modeling and machine learning. Remote Sens. Environ. 2019, 231, 110959. [Google Scholar] [CrossRef]
  37. Ganteaume, A.; Camia, A.; Jappiot, M.; San-Miguel-Ayanz, J.; Long-Fournel, M.; Lampin, C. A review of the main driving factors of forest fire ignition over Europe. Environ. Manag. 2013, 51, 651–662. [Google Scholar] [CrossRef] [Green Version]
  38. Tonini, M.; Parente, J.; Pereira, M.G. Global assessment of rural–urban interface in Portugal related to land cover changes. Nat. Hazards Earth Syst. Sci. 2018, 18, 1647–1664. [Google Scholar] [CrossRef] [Green Version]
  39. Conedera, M.; Tonini, M.; Oleggini, L.; Vega Orozco, C.; Leuenberger, M.; Pezzatti, G.B. Geospatial approach for defining the wildland-urban interface in the alpine environment. Comput. Environ. Urban Syst. 2015, 52, 10–20. [Google Scholar] [CrossRef]
  40. Zumbrunnen, T.; Menéndez, P.; Bugmann, H.; Conedera, M.; Gimmi, U.; Bürgi, M. Human impacts on fire occurrence: A case study of hundred years of forest fires in a dry alpine valley in Switzerland. Reg. Environ. Chang. 2012, 12, 935–949. [Google Scholar] [CrossRef]
  41. Badia, A.; Serra, P.; Modugno, S. Identifying dynamics of fire ignition probabilities in two representative Mediterranean wildland-urban interface areas. Appl. Geogr. 2011, 31, 930–940. [Google Scholar] [CrossRef]
  42. Romero-Calcerrada, R.; Barrio-Parra, F.; Millington, J.D.A.; Novillo, C.J. Spatial modelling of socioeconomic data to understand patterns of human-caused wildfire ignition risk in the SW of Madrid (central Spain). Ecol. Model. 2010, 221, 34–45. [Google Scholar] [CrossRef]
  43. Castillo Soto, M.E. The identification and assessment of areas at risk of forest fire using fuzzy methodology. Appl. Geogr. 2012, 35, 199–207. [Google Scholar] [CrossRef]
  44. Vilar, L.; Woolford, D.G.; Martell, D.L.; Martín, M.P. A model for predicting human-caused wildfire occurrence in the region of Madrid, Spain. Int. J. Wildland Fire 2010, 19, 325–337. [Google Scholar] [CrossRef]
  45. Mermoz, M.; Kitzberger, T.; Veblen, T.T. Landscape influences on occurrence and spread of wildfires in Patagonian forests and shrublands. Ecology 2005, 86, 2705–2715. [Google Scholar] [CrossRef] [Green Version]
  46. Carmo, M.; Moreira, F.; Casimiro, P.; Vaz, P. Land use and topography influences on wildfire occurrence in northern Portugal. Landsc. Urban Plan. 2011, 100, 169–176. [Google Scholar] [CrossRef] [Green Version]
  47. Harris, L.; Taylor, A.H. Previous burns and topography limit and reinforce fire severity in a large wildfire. Ecosphere 2017, 8, e02019. [Google Scholar] [CrossRef]
  48. Moreira, F.; Vaz, P.; Catry, F.; Silva, J.S. Regional variations in wildfire susceptibility of land-cover types in Portugal: Implications for landscape management to minimize fire hazard. Int. J. Wildland Fire 2009, 18, 563–574. [Google Scholar] [CrossRef]
  49. Kanevski, M.; Pozdnoukhov, A.; Timonin, V. Machine Learning for Spatial Environmental Data: Theory, Applications and Software; Collection: environmental engineering; Epfel: Lausanne, Switzerland, 2009; ISBN 978-0-8493-8237-6. [Google Scholar]
  50. Lee, S.; Jung, H.-S. Machine Learning Techniques Applied to Geoscience Information System and Remote Sensing; MDPI: Basel, Switzerland, 2019; ISBN 978-3-03921-216-3. [Google Scholar]
  51. Breiman, L. Random forest. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
  52. Jain, P.; Coogan, S.C.P.; Subramanian, S.G.; Crowley, M.; Taylor, S.; Flannigan, M.D. A review of machine learning applications in wildfire science and management. arXiv 2020, arXiv:2003.00646. [Google Scholar]
  53. Martín, Y.; Zúñiga-Antón, M.; Rodrigues Mimbrero, M. Modelling temporal variation of fire-occurrence towards the dynamic prediction of human wildfire ignition danger in northeast Spain. Geomat. Nat. Hazards Risk 2019, 10, 385–411. [Google Scholar] [CrossRef]
  54. Vacchiano, G.; Foderi, C.; Berretti, R.; Marchi, E.; Motta, R. Modeling anthropogenic and natural fire ignitions in an inner-alpine valley. Nat. Hazards Earth Syst. Sci. 2018, 18, 935–948. [Google Scholar] [CrossRef] [Green Version]
Figure 1. Study areas with the location map. Land use results from the aggregation of the original classes represented on the regional map of forest types, provided by the Liguria region.
Figure 1. Study areas with the location map. Land use results from the aggregation of the original classes represented on the regional map of forest types, provided by the Liguria region.
Geosciences 10 00105 g001
Figure 2. Yearly frequency of wildfires (on the top) and burned area (on the bottom) in the Liguria region (Italy) in the last two decades (1997–2017) during the summer and the winter season.
Figure 2. Yearly frequency of wildfires (on the top) and burned area (on the bottom) in the Liguria region (Italy) in the last two decades (1997–2017) during the summer and the winter season.
Geosciences 10 00105 g002
Figure 3. Spatial distribution of burned areas during the winter (on the top) and summer season (on the bottom) in the Liguria region (Italy) during the period 1997–2017.
Figure 3. Spatial distribution of burned areas during the winter (on the top) and summer season (on the bottom) in the Liguria region (Italy) during the period 1997–2017.
Geosciences 10 00105 g003
Figure 4. Spatial arrangement of the blocks for the 5- and 9-folds adopted in the work (each color corresponds to a single fold).
Figure 4. Spatial arrangement of the blocks for the 5- and 9-folds adopted in the work (each color corresponds to a single fold).
Geosciences 10 00105 g004
Figure 5. Wildfire susceptibility map in the winter (on the top) and summer seasons (on the bottom) for the Liguria region (Italy).
Figure 5. Wildfire susceptibility map in the winter (on the top) and summer seasons (on the bottom) for the Liguria region (Italy).
Geosciences 10 00105 g005
Figure 6. The six largest wildfire occurrences of summer 2017, along with the fire susceptibility map.
Figure 6. The six largest wildfire occurrences of summer 2017, along with the fire susceptibility map.
Geosciences 10 00105 g006
Table 1. Independent variables and their characteristics. (*) The neighboring vegetation was only included in the second model for comparison reasons.
Table 1. Independent variables and their characteristics. (*) The neighboring vegetation was only included in the second model for comparison reasons.
Independent VariablesAcquisition ScaleVariable TypeRange# of Variables
DEM 1:5000Numerical (meters)0–21321
Slope -Numerical (degree)0–601
Northness and Eastness-Numerical [−1–+1]2
Distance to anthropogenic features 1:10,000Numerical (meters)0–90004
Protected area1:25,000Binary0 or 11
Vegetation type1:25,000Categorical37 classes1
Non-flammable area1:25,000Categorical1 class1
Neighboring vegetation *-Numerical (percentage)[0,100]38
Table 2. Validation of the models (see Section 3.2 for more details).
Table 2. Validation of the models (see Section 3.2 for more details).
Winter Season1-Fold Cross Validation5-Fold Cross Validation9-Fold Cross Validation
Standard ModelNeighboring VegetationStandard ModelNeighboring VegetationNeighboring Vegetation
ClassesTotal Area (%)Testing BA (%)Prob_ValueTesting BA (%)Prob_ValueTesting BA (%)Prob_ValueTesting BA (%)Prob_ValueTesting BA (%)Prob_Value
25%255.070.133.850.094.420.113.50.073.360.07
50%254.950.253.40.223.440.233.270.183.170.17
75%2510.520.488.80.478.90.436.220.396.440.39
90%1517.640.7814.980.7415.770.713.050.6811.910.69
95%514.380.9115.260.8715.670.8517.70.8316.430.85
100%547.26152.86151.78156.26158.691
>75% 79.28 83.1 83.22 87.01 87.03
Summer Season1-Fold Cross Validation5-Fold Cross Validation9-Fold Cross Validation
Standard ModelNeighboring VegetationStandard ModelNeighboring VegetationNeighboring Vegetation
ClassesTotal Area (%)Testing BA (%)Prob_ValueTesting BA (%)Prob_ValueTesting BA (%)Prob_ValueTesting BA (%)Prob_ValueTesting BA (%)Prob_Value
25%254.710.081.040.044.040.060.80.040.80.04
50%257.520.234.640.179.390.195.080.145.540.14
75%2517.940.5118.270.4115.160.4419.770.3518.440.35
90%1524.450.7826.190.723.310.6921.510.6522.110.66
95%514.060.9114.60.8714.60.8315.060.8314.730.85
100%530.66133.43133.5137.71138.311
>75% 69.17 74.22 71.41 74.28 75.15
Table 3. Root mean square error based on the difference between the predicted and observed value.
Table 3. Root mean square error based on the difference between the predicted and observed value.
Winter Season1-Fold5-Folds9-Folds
Standard model0.4070.380-
Neighboring vegetation0.3770.3540.351
Summer season1-fold5-folds9-folds
Standard model0.4370.428-
Neighboring vegetation0.4110.4110.411
Table 4. Model validation evaluated by computing the percentage of the predicted burning area above the 75th percentile falling inside the testing burned areas for each single year (BA >75%). The number of pixels above and below this value for each season are also shown.
Table 4. Model validation evaluated by computing the percentage of the predicted burning area above the 75th percentile falling inside the testing burned areas for each single year (BA >75%). The number of pixels above and below this value for each season are also shown.
YearWinter SeasonSummer SeasonTot_Year
BA > 75 %BA > 75%Tot_WinterBA > 75 %BA > 75%Tot_Summer
(%) (# pixels) (# pixels)(%) (# pixels) (# pixels) (# pixels)
201284.1844100377.83374331436
201383.412114586.9140161306
201486.011713692.8103111247
201591.246551084.95356301140
201691.722024092.793610101250
201786.43144364045.74499834623
Tot 5674 33289002

Share and Cite

MDPI and ACS Style

Tonini, M.; D’Andrea, M.; Biondi, G.; Degli Esposti, S.; Trucchia, A.; Fiorucci, P. A Machine Learning-Based Approach for Wildfire Susceptibility Mapping. The Case Study of the Liguria Region in Italy. Geosciences 2020, 10, 105. https://doi.org/10.3390/geosciences10030105

AMA Style

Tonini M, D’Andrea M, Biondi G, Degli Esposti S, Trucchia A, Fiorucci P. A Machine Learning-Based Approach for Wildfire Susceptibility Mapping. The Case Study of the Liguria Region in Italy. Geosciences. 2020; 10(3):105. https://doi.org/10.3390/geosciences10030105

Chicago/Turabian Style

Tonini, Marj, Mirko D’Andrea, Guido Biondi, Silvia Degli Esposti, Andrea Trucchia, and Paolo Fiorucci. 2020. "A Machine Learning-Based Approach for Wildfire Susceptibility Mapping. The Case Study of the Liguria Region in Italy" Geosciences 10, no. 3: 105. https://doi.org/10.3390/geosciences10030105

APA Style

Tonini, M., D’Andrea, M., Biondi, G., Degli Esposti, S., Trucchia, A., & Fiorucci, P. (2020). A Machine Learning-Based Approach for Wildfire Susceptibility Mapping. The Case Study of the Liguria Region in Italy. Geosciences, 10(3), 105. https://doi.org/10.3390/geosciences10030105

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop