Classifying Large Wildﬁres in the United States by Land Cover

: Fire is an ecological process that also has socio-economic e ﬀ ects. To learn more about ﬁre occurrence, I examined relationships between land classes and about 12,000 spatially delineated large wildﬁres (deﬁned here as uncontrolled ﬁres ≥ 200 ha, although deﬁnitions vary) during 1999 to 2017 in the conterminous United States. Using random forests, extreme gradient boosting, and c5.0 classiﬁers, I modeled all ﬁres, ﬁrst years (1999 to 2002), last years (2014 to 2017), the eastern, central, and western United States and seven ecoregions. The three classiﬁers performed well (true positive rates 0.82 to 0.94) at modeling all ﬁres and ﬁres by year, region, and ecoregion. The random forests classiﬁer did not predict to other time intervals or regions as well as other classiﬁers and models were not constant in time and space. For example, the eastern region overpredicted ﬁres in the western region and models for the western region underpredicted ﬁres in the eastern region. Overall, greater abundance of herbaceous grasslands, or herbaceous wetlands in the eastern region, and evergreen forest and low abundance of crops and pasture characterized most large ﬁres, even with regional di ﬀ erences. The 14 states in the northeastern United States with no or few large ﬁres contained limited herbaceous area and abundant crops or developed lands. Herbaceous vegetation was the most important variable for ﬁre occurrences in the western region. Lack of crops was most important for ﬁres in the central region and a lack of pasture, crops, and developed open space was most important for ﬁres in the eastern region. A combination of wildlands vegetation was most inﬂuential for most ecoregions, although herbaceous vegetation alone and lack of pasture, crops, and developed open space also were inﬂuential. Despite departure from historical ﬁre regimes, these models demonstrated that herbaceous vegetation remains necessary for ﬁres and that evergreen forests in particular are ﬁre-prone, while reduction of vegetation surrounding housing developments will help provide a bu ﬀ er to reduce large ﬁres.


Introduction
Wildfire, or generally defined as an uncontrolled or unplanned fire that typically occurs in wildlands (e.g., https://www.nwcg.gov/glossary/a-z),regimes have altered over time.Most reports suggest that number, area, and severity of fires are increasing in regions throughout the world [1].Nonetheless, data may not be available yet to support these trends in some regions and agricultural expansion and intensification may be reducing overall fire activity globally [1,2].fires appear to have increased in number, area, severity, and season in some parts of the United States during recent decades [3][4][5].For example, during 1984-2010, fires ≥405 ha (i.e., 1000 acres) covered 24 million ha, or 10.6% of the total burnable area, of the western United States, 3 million ha of which were high severity, or 12.3% of the area burned and 1.3% of the total burnable area, although fires, like other natural disturbances, are patchy [3,6] Fire rotations therefore were about 250 years for all fires and 2100 years for high severity fires [3].Nonetheless, current fire number and area remain a fraction of historical fires before fire suppression, which generally became institutionalized during the early 1900s in the United States.Many historical ecosystems had fire regimes that burned every 2 to 25 years [7,8].In historical ecosystems, abundance of herbaceous vegetation, with either no or limited tree density, meant that fire severity was low, in the form of surface fires with short flame lengths rather than high severity fires in connected tree crowns [8,9].
Land cover, vegetation type and structure, vegetation amount and continuity, fuel properties of vegetation, expansion of native and non-native species that have different fuel properties, and fuel breaks are some factors that influence fires.Recent tree and shrub increase in both eastern and western forests and in central grasslands of the United States conceptually indicate that fires may be less frequent but may be more severe when they do occur [8,10].Many land uses, such as urbanization, grazing, and agriculture, and some land covers, such as deserts and water, may stop fire spread because they do not contain enough continuous vegetation to carry fires.Furthermore, non-native species have altered fire regime characteristics of spread, ignitability, severity, frequency, and likelihood of fire through providing different abundance and arrangement of fuels for different time intervals [11,12].Cheatgrass (Bromus tectorum) in shrublands of the western United States and eastern redcedar (Juniperus virginiana) in central grasslands have increased fine or coarse fuel loads, respectively [9,13].Risk of fires can be reduced by lessening stand density and fuel loads through preventative thinning or prescribed burning, letting fires burn in wildlands, reducing home ignition susceptibility, and treatment of non-native species [14].
Other factors influence fire at a variety of scales, including topography, temperature, precipitation, wind, humidity, seasonal and annual timing of wet periods that increase biomass and dry periods that cure vegetation to ignitable fuels, climate change, annual number of lightning strikes, and number of ignitions.For example, human-caused ignitions have expanded the fire season, to triple the length of lightning-caused fire season, and now account for 84% of all fires and 44% of total area burned [4].Wildland fires that are not quickly suppressed often involve extremely high winds, prolonged dry conditions, and accumulated fuels [15].Fire suppression has become a world-wide response to fire, but fires will occur when weather is severe and deferring fire likely will increase fire severity [1,14].
Models of burn probabilities generated through simulations are valuable tools that account for multiple factors; however, greater understanding of how land cover interacts with fire can be extracted from empirical databases of remotely sensed fires in combination with remotely sensed land cover.Recently, Donovan et al. [9] determined that fires in grasslands of the central United States were more likely to occur when woody vegetation was ≥20% of the landscape, with fires unlikely to occur in crops, pasture, and developed areas.Fire datasets continue to accumulate fire occurrences [16] and a revised version of the National Land Cover Database (NLCD) was published during 2020 to update all previously released versions of land cover, which is not directly comparable to previous products [17].Additionally, statistical modeling with classifiers has advanced rapidly during the past five to ten years to model big datasets that have complex spatiotemporal relationships [18], including refinements of classifiers, such as extreme gradient boosting, which reduces overfitting of models by other boosting classifiers, and has rarely been applied to ecological datasets [19].
Here, I classified the relationship between about 12,000 large wildfires (≥200 ha that are not known to be prescribed burns, although definitions of both wildfires and large wildfires may vary; hereafter 'fire' or 'fires') and newly revised land cover layers to determine land classes for where fires did and did not occur and importance of land class variables for fires.I analyzed different spatiotemporal scales, including the conterminous United States and by eastern, central, and western regions and seven ecoregions for all years between 1999 and 2017 and subdivided into early, middle, and late time intervals.I also assessed performance of the random forests, extreme gradient boosting, and c5.0 classifiers and whether models were consistent in time and space, by hindcasting and forecasting models and translocating models across regions and adjacent ecoregions.Generally, I expected fires to be most frequent in wildlands, particularly herbaceous vegetation, and less abundant in different land uses, such as crops.More formally, I expected to generate useful models that varied spatiotemporally and a valuable evaluation of the classifiers to answer questions about what are the relationships between fire and land classes, how do these relationships vary over space and time, and how do the classifiers differ in modeling the relationships, particularly when predicting across time and space.Excepting Donovan et al.'s [9] study examining land cover and fires in the central United States, to my knowledge, comparable research is not available.
Remote Sens. 2020, 12, x FOR PEER REVIEW 3 of 17 what are the relationships between fire and land classes, how do these relationships vary over space and time, and how do the classifiers differ in modeling the relationships, particularly when predicting across time and space.Excepting Donovan et al.'s [9] study examining land cover and fires in the central United States, to my knowledge, comparable research is not available.

Materials and Methods
First, I divided the conterminous United States into eastern, central, and western regions for modeling by regions (Figure 1).The regions generally followed ecological divisions with some refinements, for example, moving the forested Cross Timbers area from the central region, which is primarily grasslands, to eastern forests [20].Seven ecological divisions or "ecoregions" had >800 fires and I modeled these ecoregions and excluded ones with <300 fires.I removed 14 states, primarily in the eastern region where <15 fires were recorded during 1999 to 2017 in the fire databases ((Connecticut (0 fires), Illinois (0 fires), New Hampshire (0 fires), Rhode Island (0 fires), Vermont (0 fires), Delaware (1 fire), Iowa (1 fire), Massachusetts (1 fire), Indiana (2 fires), Maine (2 fires), Ohio (2 fires), Wisconsin (6 fires), New York (7 fires), and Pennsylvania (14 fires)).I used census blocks from the Silvis WUI layer [22] that did not intersect with fires from all years of the MTBS and GeoMAC databases or fire point locations from the Fire Program Analysis fire  I used census blocks from the Silvis WUI layer [22] that did not intersect with fires from all years of the MTBS and GeoMAC databases or fire point locations from the Fire Program Analysis fire occurrence database (about 2 million geo-referenced wildfire records during 1992-2015; [23]) for pseudoabsences, or spatial units without known fires [24].I randomly selected 20,000 of the census blocks without known fire with housing densities <400/km 2 , area ≥200 ha, and excluding states listed above with few large fires to focus the modeling efforts.To spatially match the regional ratios, the randomly selected pseudoabsences contained 10,500 blocks from the western region, 6000 from the eastern region, and 4000 from the central region.I then populated each of these blocks with land class information for 2001,2004,2006,2008,2011,2013, and 2016 (see below; Table 1), which increased the sample size, and then I randomly selected based on matching yearly proportions.
The National Land Cover Database recently was revised to update all previously released versions of landcover, which is not directly comparable to previous products [17].Updated years were 2001,2004,2006,2008,2011,2013, and 2016 and for impervious surface, 2001, 2006, 2011, and 2016 and can be visualized with an interactive viewer (https://www.mrlc.gov/viewer/).Model variables were percent impervious surface and 20 land cover variables including land class combinations of wildlands (forests, herbaceous, wetlands, and shrubs), and herbaceous mixtures (Table 1).I calculated mean values of percent area of variables from the NLCD for samples with and without recorded fires, and also background rates by region, ecoregion (excluding states with few fires), the 14 states with few fires, and additionally nine of the fourteen states in the eastern ecoregion.
Supervised classification uses known classes (e.g., fire occurrence) and explanatory variables to develop a relationship to assign unknown observations into classes.For this modeling, I applied c5.0, random forests, and extreme gradient boosting classifiers in the caret package [25,26], which trained the model with 10-fold cross-validation and then predicted for the testing sets.Validation occurs on separate testing data (25% for this modeling), with withheld known classes, to determine how well the classifier assigned classes using explanatory variables.The true positive rate is a measure of the number of predicted observations that were accurately assigned to a class, while the true negative rate is a measure of the number of predicted observations where the response variable of interest did not occur.To some extent, absences are never known and pseudoabsences are proxies for true absences [24].For this modeling, prevalence or the number of present samples to pseudoabsent samples was equal (e.g., 11,918 known fires and 11,918 census blocks where fire was not recorded).Prevalence, or ratio of present observations to pseudoabsent observations, may not improve the modeling of known instances when more known instances are provided [27].However, prevalence will affect prediction probabilities, and thus interact with thresholds for prediction of occurrence [27].
I modeled all fires (1999 to 2017) for the conterminous United States, the beginning of 2000 (1999 to 2002), the last years (2014 to 2017), and bracketing 2010 (2009 to 2011).I modeled the eastern, central, and western United States and seven ecoregions.I then examined variable importance.I also assessed how well the models translated in space and time by predicting the years 2014 to 2017 with the 1999 to 2002 model and predicting the years 1999 to 2002 with the 2014 to 2017 model.I additionally used each regional model to make predictions for the other two models and each ecoregional model to predict the most similar adjacent ecoregion.

Results
According to differences in mean NLCD values for samples with and without fire presence, fires did not occur in crops, which had values greater by about a factor of 25 for where fires were not known to occur compared to where fires did occur (Table 1).Percent area for the land classes of developed open spaces (lawns, parks, and golf courses), developed low intensity (single-family housing units), impervious surface, and pasture also were greater where fire did not occur.This was true for firebreaks (open water, rock, and soil), aside from the eastern region and the eastern and southeastern ecoregions.Mixed forest had greater values where fire did not occur, except for some ecoregions.
Herbaceous vegetation had greater percent area where fire did occur for all fires, by year, region, and ecoregions.Evergreen forests had greater percent area where fire did occur for all fires, by year, region, and ecoregions, with the exception of the prairie and Mediterranean ecoregions.Wetlands, shrub, and deciduous forest varied regionally, and thus, also by ecoregions.Wetlands and shrub had greater values for where fire did occur in the eastern region.Shrubs had greater percent area where fire did not occur in the subtropical steppe and desert ecoregions particularly.Deciduous forest had greater area where fire did occur in the eastern and central regions but not in all ecoregions.
The 14 U.S. states with no or few fire occurrences, which were excluded from modeling, contained a great percentage of crop, primarily in states on the eastern edge of the central region (Table 2).The 14 U.S. states also had limited area of herbaceous wetlands and herbaceous vegetation; although, not much less area than the eastern ecoregion.Equally, percent area of developed low intensity land was a similar but greater than the eastern region.Otherwise, most values were in between the minimum and maximum values of ecoregion land classes.A subset of nine northeastern states with few fires, where crop area was lower and in the eastern ecoregion, showed reductions in crop area but also even more limited area of herbaceous wetlands and herbaceous vegetation and greater percent area of developed lands.
The three classifiers performed well (true positive rates 0.85 to 0.92) at modeling all fires and fires by year and region (Table 3).The 2014 to 2017 models predicted 1999 to 2002 fires well (true positive rates 0.81 to 0.84), but 1999 to 2002 models were less successful (true positive rates 0.71 to 0.75) at forecasting 2014 to 2017 fires (Table 4).Regionally, models for the central region generally translocated well with both the eastern and western regional models (true positive rate mean =0.77), but models for the eastern region overpredicted fires in the western region and likewise, models for the western region underpredicted fires in the eastern region.
The three classifiers also performed well (true positive rates 0.82 to 0.94) at modeling fires by ecoregion (Table 5).The eastern and prairie ecoregions had greatest true positive rates, while the desert ecoregion, which encompasses both shrublands and forests, had the least true positive rates.Ecoregion models generally translocated well to adjacent ecoregions, although with some under-and overpredicting (the eastern and southeastern ecoregional pairs and temperate steppe to prairie).
A combination of forests, herbaceous vegetation, and wetlands land classes was the most influential variables for all fires and most years (Table 6; supplementary figures have partial dependence plots).Herbaceous vegetation was the most important variable for 2014-2017 fires and in the western region.Crops were most important in the central region and a combination of pasture, crops, and developed, open space were important in the eastern region; these variables had much greater values where areas where fire was not recorded (Table 1).
By ecoregion, the three different wildlands combinations, herbaceous vegetation, and the herbaceous land use combination of pasture, crops, and developed open area were most important, which produced overlap in ecoregions, even across regions such as the southeastern and temperate steppe ecoregions (Table 7).Except for herbaceous vegetation, single land classes overall were not influential.For example, even though deciduous forest had greater area where fire did occur in the eastern ecoregion, the classifiers relied on wildlands variables that included deciduous forest.

Key Findings
Greater abundance of herbaceous vegetation, forests, and wetlands and low abundance of crops, pasture, and developed open space (lawns, parks, and golf courses) generally explained large fire occurrence (wildfires ≥200 ha), with pronounced regional differences and less distinctive ecoregional differences.More specifically, herbaceous vegetation and herbaceous wetlands and evergreen forest classes were the primary wildland classes that carried fire.These results may help explain the lack of few large fires in 14 states, which contained limited herbaceous vegetation, either as grasslands or wetlands, and an abundance of crops or developed areas, although related factors such as fragmentation from small land parcels likely are influential in preventing fires.Herbaceous vegetation was important for fires in all regions, but in the eastern region, herbaceous vegetation was present in the form of emergent herbaceous wetlands instead of upland grasslands (Table 1).Although wetlands may be a firebreak when water is present, the herbaceous component of emergent herbaceous wetlands in the eastern region appears to be fire-prone during dry events.Wetlands were less likely to have fires in the central and western regions.Forests overall were more likely to have fires in all regions due to evergreen forests and not mixed or deciduous forests, which were less likely to have fires where these forest types were more influential variables; that is, in the eastern region and southeastern and desert ecoregions.Evergreen forests with low tree densities and an herbaceous layer, which typically result in low severity fires, occur as widespread ponderosa pine (Pinus ponderosa) in the western region and now relatively rare longleaf and shortleaf pine (Pinus palustris and echinata) forests and commercial pine plantations in the eastern region [8].More densely spaced evergreen species such as spruces, firs, and eastern redcedar, which also have low branches that provide ladder fuels to carry fire vertically to crowns, typically generate higher severity crown fires [9].
Crops and pasture were the primary land classes where fire did not occur, with developed open space as a less abundant class that also appeared to prevent fire spread.These land classes can be difficult to separate from herbaceous vegetation using remote sensing methods, but differentiation was very clear in where fires did and did not occur (Table 1).Although these land classes can have fires, fires often are prescribed and small to remove vegetation residue, with limited fuels available to spread fire or escalate fire severity [2,28,29].Crops, pasture, and developed open space are areas of reduced vegetation, in the gradient between wildlands and developed urban areas with little vegetation.Crops and pasture are harvested or grazed before plants dry and lose enough moisture to ignite easily.Lawns, parks, and golf courses similarly are kept green through watering and low in height through mowing, which limits fuels, and are intersected with impervious surfaces that fragment fuel continuity and prevent fire spread.
Land classes such as crops, pasture, and developed open space with negative influences on fire were as important for where fire did not occur as herbaceous vegetation, forests, and wetlands were greater in abundance where fire did occur, particularly in the eastern and central regions and the southeastern and temperate steppe ecoregions of these regions.In the eastern region, the most influential model variable was a combination of crops, pasture, and developed open space.These land classes covered 32% of the region, 6% of the area where fires occurred, and 38% of the area from samples where fires did not occur.Herbaceous wetlands were among the most important model variables.Herbaceous wetlands were 2% of the region and area from samples where fires did not occur, and 15% of the area where fires occurred.Likewise, in the central region, crops were the most important model variable.Crops were 33% of the region, 2% of the area where fires occurred, and 50% of the area from samples where fires did not occur.Herbaceous vegetation was among the most important model variables, albeit still minor.Herbaceous vegetation was 32% of the area, 72% of the area where fires occurred, and 23% of the area from samples where fires did not occur.
Furthermore, in the central region, forests or evergreen forests, which notably would include eastern redcedar, were not among most influential model variables, unlike grassland herbaceous vegetation that also was more disproportionately represented where fires occurred (2.2 times greater) compared to background land class rates than other wildlands vegetation types (e.g., evergreen forests were greater by a factor of 1.5).By ecoregion, the prairie in particular showed little association of evergreen forests with fire.However, the temperate steppe did contain greater percent area of evergreen forests (5.5%)where fire occurred than background rates (1.5%), with percent area of forests with fires about 2.8-fold greater than background rates.Conversely, herbaceous vegetation was 77% of area where fire occurred compared to 48% of the region, which is a grassland with a great herbaceous vegetation background rate.Nonetheless, the overall wildlands (herbaceous and woody vegetation) and land use (crop and pasture) combinations most influenced fire.These results contrasted with Donovan et al. [9], who found that woody vegetation burned disproportionately more than other land-use types in the Great Plains.However, Donovan et al. [9], expected to find that large fires occurred in grasslands, which are more fire-prone than woodlands.
The shrub land class was minimally influential on fire, but perhaps that will change due to recent fires in the shrub chaparral of California.Fire-dependent shrub ecosystems also occur in shrubs of the Lake Wales Ridge in Florida and pine barrens of the mid-Atlantic eastern US, and albeit small remnants, these types of ecosystems are examples to explain greater abundance of shrubs where fires occurred in the eastern region.Additionally, increased fire activity occurs because of invasion by primarily non-native cheatgrass in shrublands of the Great Basin in the desert ecoregion and a complex of non-native grass invasions in shrublands of the subtropical ecoregion; non-native grasses provide horizontal fuel continuity in fuel-limited shrub systems [13].However, fire then occurs in predominantly herbaceous vegetation rather than shrubs.
Fire relationships were not consistent in space, although the central region shared overlap with both the eastern and western regions.Models for the eastern region overpredicted fires in the western region and likewise, models for the western region underpredicted fires in the eastern region.That is, vegetation that would produce a fire in the eastern region was not sufficient for fire in the western region and vegetation that would not produce a fire in the western region was sufficient for fire in the eastern region.These differences in part may be due to more land classes associated with fires in the eastern region, including wetlands and deciduous forest, than in the western region.These results also seem to reflect that the western region typically has a drier climate, which can result in fuel limitations in vegetation but also greater probability of vegetation being cured and able to ignite for a longer time interval and extreme weather conditions suitable for fire [3,30].Nonetheless, despite greater current rates of fires in the west, historically, the longleaf pine "grasslands" of the Coastal Plain in the southeastern U.S. had fire regimes of 2 to 5 years, making it one of the most frequently burned regions in the past, despite a humid, wet climate [8,9].To a lesser magnitude, predictions among nearby ecoregions had the same problem, in that eastern ecoregional models overpredicted fires for the southeastern ecoregion and the southeastern models underpredicted fire for the eastern ecoregion and the temperate steppe also underpredicted fire for the prairie region.
Fire relationships were not consistent in time.The 1999 to 2002 models were less successful at forecasting 2014 to 2017 fires than 2014 to 2017 models predicted 1999 to 2002 fires.Number of fires greatly increased in the central region from 310 fires during 1999 to 2002 to 1569 fires during 2014 to 2017, while number of fires in the other regions stayed relatively stable.Compared to 1999 to 2002 models, importance of herbaceous and crop land covers increased for the 2014 to 2017 models, which still remained successful at hindcasting 1999 to 2002 fires.
The three classifiers performed near equally well at predicting based on models of the same extent or time interval, or similar ecoregions, despite a variety of potential errors arising from for example, the fire perimeters, land class misclassification, and areas that could have burned yet did not.However, in general, land classes were well-differentiated between where fire occurred and samples of where fire was not known to occur (Table 1).Predictions across time or space in different regions showed that random forests did not perform as well as the other two classifiers.Mean value of true positive rate for extreme gradient boosting was 0.790 compared 0.761 for random forests when predicting across time intervals.Mean value of true positive rate for extreme gradient boosting was 0.708 compared 0.655 for random forests when predicting across regions, albeit if the random forests prediction for the central region based on the eastern region is excluded, the mean value increases to 0.691.These two examples demonstrate that extreme gradient boosting remained more generalized with less overfitting to specific datasets than random forests, although the sample size is too small to state that extreme gradient boosting overall is preferable to random forests when models are in development or need flexibility.Moreover, random forests performed equally well in predicting fires in adjacent ecoregions, which had fewer differences than across regions.
Analysis with multiple classifiers allows identification of better performing classifiers and selection of desirable features.Random forests is a well-established classifier with reliably accurate performance, whereas C5.0 decision trees generate explicit rulesets, which can be help explain models when there are few variables, but also produce many similar and competing models.Fernandez-Delgado et al. [18] comprehensively assessed 179 classifiers using 121 data sets and the best classifier was random forests, with C5.0 decision trees among the best classifiers.Extreme gradient boosting is newer classifier, intended to reduce overfitting that other boosting classifiers are prone to producing [19].
Further research may explore some unexpected results from these relationships.For example, future studies can examine the interaction of fire with mixed and deciduous forests, which overall were not associated with fires.These forest types were less abundant where fires occurred when these forests were among the most influential model variables; for example, mixed forests in the eastern region and deciduous forests in the southeastern and desert ecoregions.It may be that current species in mixed and deciduous forests are not fire-tolerant, and thus suppress rather than support fires, reducing fire frequency but increasing fire risk when fire does occur [31,32].Furthermore, presence of water in mixed and deciduous forests will act as a firebreak.Deciduous species are rare in the western region and often occur along riparian networks, and likewise, deciduous and mixed forests in the southeastern U.S. often form along wetlands.Similarly, the importance of herbaceous wetlands, as a substitute for herbaceous vegetation, for fires in the eastern region also may be assessed to explore the interaction between the fire-spreading herbaceous component and the fire-breaking wetlands component.
Although this study provides summaries that clearly show fires are more prevalent in some land classes than others, varying be region, along with accurate predictions of models at least within modeling extents, limitations include errors in data and also that land cover is just one of many factors that influence fire.fires vary in severity and include unburned areas within perimeters, fires may not be detected, land classes may be misidentified, and most areas that are flammable did not burn during the time interval of this study [5].fires also are influenced by time since last fire, vegetation properties in terms of acting as fuels, weather and climate, number of ignitions, and topography.Additionally, spatial arrangement of land classes also is an important component to whether fire will be able to spread across flammable wildlands vegetation or be limited by interspersed firebreaks that break fuel continuity.Furthermore, fire protection services in conjunction with wildlands fragmentation and lack of extreme weather may suppress fires while they are small.For example, some of the 14 states with few large fires, particularly New York, indeed have many ignitions, resulting in numerous fires <200 ha [23,33].Land management through prescribed burns also will reduce fuels and reduce the risk of fire.

Implications
Fire is a fundamental ecological process that maintains ecosystems and associated species by filtering species that can survive fire or reproduce in response to fire.Fire favors herbaceous vegetation, which rapidly captures aboveground growing space, over woody vegetation, and thereby supports a diverse and unique assemblage of species, including insects that provide many critical ecosystem services [34][35][36][37].Hominins have applied fire as a labor-saving land management tool for hundreds of thousands of years [38].However, as human densities have increased, fire has become a risk to human structures and lives.Fire had been suppressed for about a century in the United States, resulting in great departure from historical fire regimes.Nonetheless, these models demonstrated that fires still follow historical ecology that herbaceous vegetation and pine forests are fire-prone, with traits that promote fire [8].
Although fire suppression appears to be a logical response to fire, fire ecology produces a paradox that fires become more catastrophic because of, not despite, fire suppression [14,39,40].Fuels accumulate, increasing in quantity and continuity, when fuels do not burn due to suppression of unintended fires, exclusion of prescribed fires for land management, along with passive suppression from land classes that act as fire breaks (described in this study).Changing characteristics of fires often culminates in disastrous fires that cannot be suppressed [14].Fire suppression produces an escalating cycle of need for fire suppression and increased fire suppression costs [14].This aspect of fire ecology has been understood for over a century.For example, prescribed burns were established practice during the late 1800s in California forests to prevent high severity crown fires [41].Show and Kotok (1924, [41]) wrote, The idea that fires could be excluded entirely from millions of acres was generally regarded as preposterous and the most gloomy pictures were drawn of any such attempt.It was claimed that the uncontrollable crown fire was to be expected as the inevitable consequence of allowing ground cover and litter to accumulate.Thus, in the early years of protection of the national forests, the forests were still open as a result of the repeated fires of the past . . . .As fire protection became an accomplished fact and the young growth began to fill up the open forest, the amount of inflammable material in the forests increased greatly.
Compounding altered fire regimes, housing developments and other infrastructure have expanded into wildlands area, increasing the area of wildland-urban interface (WUI) and intermix and concurrently, exposure of assets to fire throughout the world and resulting in structural losses to fire events [14].Fire severity in wildlands may be relatively independent of fire in the home ignition zone; that is, homeowner management of the area immediately surrounding the determines home losses from wildland fire [14].Thus, in addition to untreated fuel accumulation in wildlands and increased exposure and ignition rates due to development in wildlands, many residents in wildlands underinvest in mitigation actions, relying on neighboring public or private land owners for fuel treatment, government emergency services to suppress fire, and insurance or federal disaster assistance to cover any property losses [42,43].Housing materials, such as wood shakes, can be very flammable and burning homes generate hot embers, which increase the spread of a fire.Lofted embers may cause more residential losses than direct flames or radiant heat [14].
Strategies for risk mitigation to coexist with fire include identification of communities at risk, disclosure of fire risk, reduction of vegetation (i.e., fuel for fire), outreach to homeowners to reduce both structure flammability and vegetation surrounding structures, and planning and zoning to reduce sprawl into wildlands [33].For an example of outreach and risk disclosure, counties can provide advisory letters informing citizens about risks and homeowner responsibilities, similar to the following, "The physical characteristics of your property can be positive and negative.Trees are a wonderful environmental amenity, but can also involve your home in a forest fire.Building at the top of a forested draw should be considered as dangerous as building in a flash flood area.'Defensible perimeters' are very helpful in protecting buildings from forest fire and inversely can protect the forest from igniting if your house catches on fire."(country commissioners in Stevens County, MT, USA, as cited by [44]).Reduction of intermixing between wildlands and housing, by concentration of housing at greater densities, will reduce fire exposure.These models show that creative planning and zoning may allow placement of firebreak buffer zones of crops, grazed pastures, and open spaces that are managed by mowing and watering between wildlands and developed land classes with housing.

Conclusions
Information can be extracted from statistical relationship among remotely sensed imagery using many new classifier methods.While extreme gradient boosting overall did not perform better than the more established methods of random forests and c5.0 in predicting to withheld samples from models, extreme gradient boosting had improved predictions across time and space compared to random forests and produced different models that may be preferable in some cases.Based on these results, with agreement among the classifiers, the primary wildlands classes that carried fire were herbaceous grasslands and herbaceous wetlands and evergreen forest, while crops and pasture were the primary land classes that acted as firebreaks.Research needs include greater examination of the interaction between fires and mixed and deciduous forests.Despite departure from historical fire regimes, these models confirmed that herbaceous vegetation remains necessary for fires and that evergreen forests in particular are fire-prone.Reduction of vegetation surrounding housing developments, perhaps through low intensity grazing, will help provide a buffer to reduce large fires.

Supplementary Materials:
The following are available online at http://www.mdpi.com/2072-4292/12/18/2966/s1, Figure S1: Partial dependence plots for c5.0, random forests, and extreme gradient boosting models of all fires in the conterminous United States, Figure S2: Partial dependence plots for c5.0, random forests, and extreme gradient boosting models of the eastern United States, Figure S3: Partial dependence plots for c5.0, random forests, and extreme gradient boosting models of the central United States, and Figure S4: Partial dependence plots for c5.0, random forests, and extreme gradient boosting models of the western United States.

Figure 1 .
Figure 1.Three regions and seven ecoregions of the conterminous United States with all fires during 1999 to 2017.Below, fires during 1999 to 2002 (A) and fires during 2014 to 2017 (B).

Fire
occurrences during 1999 to 2017 were spatially delineated fires of ≥200 ha.From the Monitoring Trends in Burn Severity database (MTBS; https://www.mtbs.gov/;[21]), I excluded prescribed burns, and I added fires of 200 to 400 ha from the Geospatial Multi-Agency Coordination database (GeoMAC; https://www.geomac.gov/GeoMACTransition.shtml;https://datanifc.opendata.arcgis.com/search?tags=HistoricWildfires_OpenData;1999 not available) that did not overlap with the MTBS fires; thereby, removing potential duplicates.A total of 11,918 fires occurred: 5895 in the western region, 2767 in the eastern region, and 3256 in the central region.Sample size by years, to match with the National Land Cover Database years, was 2301 fires during 1999-2002 for land cover during 2001, 794 fires during 2003-2004 for land cover during 2004, 1638 fires during 2005-2006 for land cover during 2006, 1235 fires during 2007-2008 for land cover during 2008, 1631 fires during 2009-2011 for land cover during 2011, 1068 fires during 2012-2013 for land cover during 2013, and 3251 fires during 2014-2017 for land cover during 2016.

Figure 1 .
Figure 1.Three regions and seven ecoregions of the conterminous United States with all fires during 1999 to 2017.Below, fires during 1999 to 2002 (A) and fires during 2014 to 2017 (B).

Fire
occurrences during 1999 to 2017 were spatially delineated fires of ≥200 ha.From the Monitoring Trends in Burn Severity database (MTBS; https://www.mtbs.gov/;[21]), I excluded prescribed burns, and I added fires of 200 to 400 ha from the Geospatial Multi-Agency Coordination database (GeoMAC; https://www.geomac.gov/GeoMACTransition.shtml;https://data-nifc.opendata.arcgis.com/search?tags=HistoricWildfires_OpenData;1999 not available) that did not overlap with the MTBS fires; thereby, removing potential duplicates.A total of 11,918 fires occurred: 5895 in the western region, 2767 in the eastern region, and 3256 in the central region.Sample size by years, to match with the National Land Cover Database years, was 2301 fires during 1999-2002 for land cover during 2001, 794 fires during 2003-2004 for land cover during 2004, 1638 fires during 2005-2006 for land cover during 2006, 1235 fires during 2007-2008 for land cover during 2008, 1631 fires during 2009-2011 for land cover during 2011, 1068 fires during 2012-2013 for land cover during 2013, and 3251 fires during 2014-2017 for land cover during 2016.

Table 1 .
Variables and values (% area) from the National Land Cover Database for recorded fires and samples with no known fires in the conterminous United States by years, region, and ecoregion (years 1999-2017 for each region and ecoregion).

Table 2 .
Variables and values (% area) from the 2008 National Land Cover Database for 14 states with few fires, northeastern states with few fires in the eastern ecoregion, and ecoregions (years 1999-2017; ecoregions exclude 14 states with few fires).

Table 3 .
True positive rate and true negative rate of predictions for three different classifiers (C5.0; rf = random forests, xgbTree = extreme gradient boosting with trees) for test set (25% of observations) based on modeled training data of all fire occurrences during 1999 to 2017, 1999 to 2002, 2009 to 2011, 2014 to 2017, and three regions.

Table 4 .
True positive rate and true negative rate of predictions for three different classifiers (C5.0; rf = random forests, xgbTree = extreme gradient boosting with trees) of modeled training data for different years and regions.

Table 5 .
True positive rate and true negative rate of predictions for three different classifiers (C5.0; rf = random forests, xgbTree = extreme gradient boosting with trees) for test set (25% of observations) and for adjacent ecoregions based on modeled training data of all fire occurrences during 1999-2017 for seven ecoregions.