Up-Scaling Fuel Hazard Metrics Derived from Terrestrial Laser Scanning Using a Machine Learning Model

: The characterisation of fuel distribution across heterogeneous landscapes is important for wildﬁre mitigation, validating fuel models, and evaluating fuel treatment outcomes. However, efﬁcient fuel mapping at a landscape scale is challenging. Fuel hazard metrics were obtained using Terrestrial Laser Scanning (TLS) and the current operational approach (visual fuel assessment) for seven sites across south-eastern Australia. These point-based metrics were then up-scaled to a continuous fuel map, an area relevant to ﬁre management using random forest modelling, with predictor variables derived from Airborne Laser Scanning (ALS), Sentinel 2A images


Introduction
An increase in the frequency and extent of wildfires across the world is of growing concern [1-3] and has been attributed to climate change in combination with other factors [4]. Recent large and severe wildfires, for example, the 2018 California wildfires [5], the 2019 Amazon forest fires [6,7], and the 2020 Australia wildfires [8], have become more destructive. As fires become more frequent and severe, it is increasingly important to be able to accurately assess the fuel hazard risk associated with wildfires.
Recent advancements in fire behaviour modelling have occurred in both empirical and computational fluid dynamics (CFDs) models, providing fire researchers and scientists with estimates of the rate of spread and resultant fire interactions in heterogeneous environments [9][10][11][12]. These models are used to characterise fire behaviour under specific fuel, weather, and atmospheric conditions and predict fire spread during a fire event [13]. Fuel is the only element of a landscape that can be modified to influence the behaviour of future fires [14]. Land managers, therefore, undertake substantial effort to treat fuel with the aim of reducing risks [15,16]. To guide these efforts, fire behaviour models are used in conjunction with fuel maps to determine the expected change in wildfire behaviour from implementing a given fuel treatment. However, for the impact of these efforts to be understood, accurate and high resolution maps of the current fuel structure and hazard at no study has attempted to estimate fuel metrics derived from 3D TLS datasets using the machine learning technique. In order to fill this gap, we aim to develop a machine-learningbased workflow to map the spatial pattern of fuel hazard metrics derived from TLS point clouds and visual assessments in south-eastern Australia. The predictor variables for both models were derived from satellite multi-spectral imagery, ALS, climate, and soil datasets. The cover and height of near-surface, elevated, and canopy layers were chosen as a candidate metrics owing to their importance across many applications, including wildfire behaviour models [9,10,13]. The specific objectives of the study were as follows: • To estimate cover and height of near-surface, elevated, and canopy fuel layers derived from TLS point clouds; • To evaluate the performance of a random forest model trained with TLS observation and compared to random forest models trained with visual assessment, where appropriate; • To map fuel metrics derived from TLS point clouds across an area relevant to operational fire management decisions and compared to a map of fuel metrics obtained from visual assessments, where appropriate.

Study Area
In this study, seven sites (within burn units provided by the local land management agency, where a burn unit is a discrete unit of land that will be treated by prescribed fire) consisting of dry, lowland, and wet Eucalypt forests distributed across Victoria, Australia were included (Figure 1). Sites were selected to be accessible, contain a range of fuel hazard and vegetation conditions, and coincide with the recent (within the last 10 years) capture of ALS data (Table 1).

Fuel Data
The fuel data used in this study were collected at each plot using two approaches-one using TLS and another using visual assessments following OFHAG, which is the current operational approach used in Victoria, Australia. These in situ approaches both aimed to characterise fuel within 10 m radius plots. The number of plots at each site is provided in Table 1, with plot locations selected such that plots were situated at least 100 m and no more than 500 m away from the road edge within each site.

Terrestrial Laser Scanning
A Faro Focus M70 laser scanner was used to collect scans across all seven sites. Multiple plots were captured (4)(5)(6)(7)(8)(9)(10)(11)(12)(13)(14) with plots distributed based on existing land management approaches. The number of plots collected for each site is mentioned in Table 1. At each plot, five scans were collected in a cross-pattern with a central scan and four scans 10 m from that scan at 90 deg intervals. All together, 340 TLS scans were collected across seven sites. To characterise fuel in a particular plot, the average of the fuel metric values for each layer collected by the five scans was taken. This reduced the number of sample size to 68 for all seven sites. Each scan was captured at a scan resolution of 1/2 and scan quality of 2×. This resulted in a 3.1 mm point spacing at a distance of 10 m. Scans were taken at a height between 1.6 and 1.8 m from the ground surface to maximise near-field (<4 m from the scanner) observation of fuel features.

Visual Assessments
At each plot, a visual assessment following the OFHAG method outlined by Hines et al. [23] was undertaken. Assessments were carried out by personnel from the Department of Environment Land Water and Planning (DELWP) and RMIT University. The number of years of experience for assessors in assessing fuel hazard visually varied from less than one year to 10 years. This process requires an assessment of three different vegetation strata (surface, near-surface, and elevated layers), and an assessment of the bark hazard. Bark hazard is associated with the loose bark on tree trunks and branches, and can create extreme fuel hazards. When assessing each strata, characteristics including the proportion of dead material (near-surface and elevated), the depth (surface), height (where height differentiates fuel layers for separation from a visual standpoint), cover, and arrangement of fuel are visually compared in the field against a set of reference images and descriptions. Estimates of surface and bark fuel attributes made by this method were not considered further in this study, as currently no reliable method exists to observe the characteristics of these layers from TLS data.

Method
This study follows the workflow shown in Figure 2. The processing of the TLS data described below produces descriptions of fuel height and cover for the near-surface and elevated layers. In comparison, OFHAG does not explicitly measure near-surface and elevated height [23]. As such, the random forest-based workflow is used to produce landscape-level maps of near-surface and elevated layers cover using both TLS and OFHAG plot observations for training and validation, as well as height using only the TLS observations. Furthermore, the height and cover of the canopy layer is also predicted at landscape level using TLS observations for use in the properties prediction of lower vegetation strata. This section describes the steps taken to pre-process the TLS data and then up-scale response variables for both sets of plot-level data.

TLS Point Cloud to Fuel Metrics
The workflow outlined in [37,41] was adapted to derive the cover and height of fuel strata from each TLS scan. The main steps of this workflow, including any variations to the parameter used from that of Hillman et al. [37], Wallace et al. [41], are described in this section.

Voxelisation
Each individual scan was first clipped to only include points within a 4 m horizontal radius from the scanner's origin to avoid overlap between scans. Then, to provide a discrete representation of the environment, point clouds were transformed into a 0.02 m resolution voxel space, where any voxel (or 3D pixel) containing 1 or more points was considered filled. A voxel model for these purposes is a three-dimensional grid domain with the size of the grid cells determining the resolution of the 3D grid [55].

Noise Detection and Removal
Noise was detected within this voxel space using a kNN outlier detection approach [56]. This approach relies on two parameters, the number of nearest neighbours (nn) and a stan-dard deviation threshold (ST) where noise points are found by determining the nearest neighbour distance between each point and other points in the point cloud. Points are then considered noise if their nearest neighbour distance is greater than the mean distance plus the standard deviation multiplied by a threshold (ST). Due to the properties of the scanner/environment interaction, it was found that the nn and ST performed better if the (3D) distance to the scanner was considered [41]. The parameters determined in Wallace et al. [41] within a similar environment and using the same scanner were used for this study. These parameters are as follows: for points <2 m (nn = 18, ST = 1.8), 2-4 m (nn = 15, ST = 2.6), 4-6 m (nn = 12, ST = 2.6), >6 m (nn = 9, ST = 2.6).

Normalisation
Points originating from the ground were detected using the Cloth Simulation Filter (CSF) [57]. In this case, the parameters for the CSF algorithm were set as follows-a cloth resolution of 0.18, a rigidness of 3, a class threshold of 0.03, and a time step of 0.4. These parameters were determined based on the comparison of CSF outcomes on single scans in comparison to manually edited ground points set from a dense multi-scan data collection (reported in [41]). The DTM was constructed by triangular irregular network (TIN) interpolation between the identified ground points. The TIN was then used to normalise the points within each individual scan.

Fuel Strata Classification
Filled voxels were attributed to fuel layers using the layer pouring method outlined in [37]. In brief, the method utilises the vertical layers of the voxel space to determine clusters of filled space. These clusters are determined in the top-most layer first. Where any cluster intersects with a cluster from the layer directly above it, the two clusters are considered to be from the same vegetation element. This then creates a connection of elements through to the ground layer. Once a connection is determined, a cluster is attributed to a fuel strata layer based on the height of the upper-most voxel within the strata. This allows for objects to be allocated to a fuel strata. Although there are no fixed height thresholds defined in the literature for describing fuel strata, we applied an approach consistent with fuel studies in eucalyptus forests. This divides fuel into three classes: the near-surface (<0.6 m), elevated (0.6 m to 3 m), and canopy (>3 m) fuel layers [23,37,41,58]. This process was applied on all point clouds to separate those points, which describe the stem and crown elements from the elevated and near-surface and surface elements of the fuel strata.

Fuel Metric Derivation
The cover and height of the near-surface and elevated fuel layers are extracted from the TLS data and used in the landscape level prediction. Canopy height and cover were also determined from the TLS data and predicted at a landscape level.
Mean height was calculated as the mean height of the surface of each fuel layer. The top surface was determined as the highest point within each voxel column. Mean cover was calculated as the number of voxel columns containing at least one voxel from each layer divided by the number of voxel columns containing any filled voxels. In the case of the near-surface layer, the number of columns containing a voxel from this strata was divided by the number of columns containing any voxels from the ground, near-surface, and elevated layers. Once fuel properties were determined for each individual scan, the mean value of the five scans was used to characterise each plot.

Random Forest Model Configuration
In order to predict fuel properties across the landscape, random forest regression as implemented in Scikit-learn [59] was used. Random forest was chosen as it has been shown to provide high accuracy and allows for our results to be compared against similar outputs from within the literature [15,27].
For optimal performance of the random forest, two hyper-parameters were tunedmtry (the number of variables used in each replicate run) and ntree (the total number of independent trees). Predictions were made for the training dataset with ntree values from 200 to 1000. We found that the ntree parameter did not differ significantly with increasing ntree values > 500, the default in many random forest regression modelling studies. Thus, 500 independent trees were used. We considered five possible values for mtry: default (square root of the number of input features), 3, 4, 6, and 9. The default value yielded the lowest out-of-bag error and was used for subsequent modelling.
Separate random forest models were produced for each fuel structural variable derived from the TLS point clouds and the visual assessments. As the understorey vegetation conditions are likely dependent on the canopy properties, a sequential approach was used to predict fuel properties using the TLS data [60]. In this approach, the canopy properties (height and cover) were predicted with the output used as parameters in the prediction of near-surface and elevated fuel properties. This approach was not followed using the OFHAG assessments as no canopy observations were recorded.
The four key drivers of vegetation distribution: topography [61], climate, soil [62], and vegetation indices from Sentinel-2 [63] (detailed below and described in Appendix A) were used as predictor variables. These predictor variables were chosen based upon a review of the literature and were considered to have an explanatory relationship with vegetation structure and composition in the study sites [15,27,61,[63][64][65][66]. The final model was produced with a 20 m resolution to be commensurate with the plot area (10 m radius) and the resolution of landscape predictor variables.

Topographic Variables from ALS
ALS point clouds were used to produce a digital terrain model (DTM) at a spatial resolution of 20 m using pre-classified ground points and blast2dem tool within lastools [61]. These point clouds were obtained from the Victorian State Government (DELWP). At each site, data were collected using a range of ALS sensors, with capture properties also differing including the target pulse densities ( Table 2). QGIS software was then used to derive eleven topographic attributes (Table A1) from the DTM at a spatial resolution of 20 m.  [27], three soil variables (bulk density, clay content, pH CaCl 2 ) were used as predictor variables. These were obtained from CSIRO Data Access Portal at a resolution of 3 × 3 arcseconds ≈90 × 90 m, (www. clw.csiro.au/aclep/soilandlandscapegrid/index.html (accessed on 16 March 2022)) [67]. The raw soil data were provided for three depths (0-5, 5-15, 15-30 cm). However, only the top-most layer for each soil variable was included due to high correlations among the depths [62]. Soil variables were then resampled at a spatial resolution of 20 m using cubic convolution approximation resampling approach.

Climate Variables from Climate Data
Climate data were downloaded from Worldclim at a resolution of 1 km 2 (http:// worldclim.org/version2) (accessed on 16 March 2022) [68]. These were summarised into annual mean temperature, maximum temperature of warmest month, and precipitation of warmest quarter following McColl-Gausden et al. [27] and resampled using a cubic convolution approximation resampling approach to a spatial resolution of 20 m.

Vegetative Indices from Sentinel MSI-2
Sentinel 2A multi-spectral imagery captured within 60 days of the plot data acquisition was downloaded from Google Earth Engine. Cloud-free composites of Level-2A imagery from acquisitions between November 2021 and January 2022 were created. From this imagery, seven individual bands (B1-B7) were obtained. Following [63,65], we then calculated the chlorophyll red-edge index, shortwave infrared to near infrared ratio, tasseled cap greenness, and tasseled cap wetness specific to the Sentinel-2 data [63] at a spatial resolution of 20 m.

Model Evaluation
For each fuel property, values were extracted from the raster dataset at the location of each plot using the nearest neighbour approach. In each case, these sample points were divided into a training dataset (70%) and a validation dataset (30%). This was repeated 100 times to remove any bias from the selection of these groups [69][70][71]. For final performance, average results were presented.
The resultant models were used to predict each fuel metric derived from TLS data at a landscape level. As cover estimates for visual assessments were available only for nearsurface and elevated layers, the following comparisons between visual and TLS estimates were only based on these layers.
The random forest regression models for each fuel metrics were parameterised using the out-of-bag (OOB) score (R 2 ). OOB accuracy assessments provided by random forest are claimed to be reliable [72]. Random forest based on the out-of-bag concept does not need a separate testing set to evaluate the model [73], while the success of the model against the held-out validation data was assessed via root mean square error (RMSE) and Pearson correlation coefficient (r) [74,75]. The mean value and standard deviation were recorded for each set of training and validation data. For final performance statistics, average results were presented.

Plot-Level Fuel Estimates
Elevated fuel cover estimates from TLS and the visual assessment approaches showed low correlation (r = 0.14) ( Figure 3B). The cover estimates observed using TLS point cloud ranged from 24.2 to 93.7% (mean= 69.6, median = 73.8). However, visually assessed cover estimates ranged from 0 to 90% (mean = 32.4, median = 30). For the near-surface layer, a negative correlation (r = −0.71) was observed between two estimates ( Figure 3A). In most cases, the TLS estimates of cover (mean = 26%) were lower than the visual estimates (mean = 59%) ( Figure 3A). The cover estimates observed using TLS point cloud ranged from 2 to 65%, whereas visually assessed cover estimates ranged from 10 to 90%.

Random Forest Model Predictions for Near-Surface and Elevated Fuel Cover
The Random forest model trained and validated with the TLS estimates was found to have higher predictive power than the model trained with visually assessed estimates ( Figure 4). The OOB score was found to be positive for both layers when predicted using TLS estimates and negative (R 2 = −0.1) for estimates predicted using visual assessments ( Table 3). As such, the predictions of cover for both layers from the random forest models trained with TLS estimates were found to have higher correlation as well as lower RMSE in comparison to models based on visual estimates (Table 3).  The models for each variable predicted a restricted range of cover estimates in comparison to the field observations ( Figure 4). In case of the TLS model, cover predictions ranged from 14.24% to 42.58% for near-surface ( Figure 4A) and 52.01% to 81.84% for elevated layers ( Figure 4C). However, field observation of cover using TLS ranged from 1.98% to 64.8% for near-surface and 24.21% to 93.74% for elevated layers.
A similarly restricted range of cover was predicted from the model trained with visual assessments in comparison to the in-field visually assessed observations. Cover predictions from the model trained with visual assessments ranged from 44.93% to 68.61% in comparison to in-field visual assessment (10% to 90%) ( Figure 4B) for near-surface layers and 19.88% to 47.13% in comparison to the in-field visual assessment (0% to 90%) for elevated layers ( Figure 4D).

Fuel Metrics Predicted Using TLS Measurements Only
Canopy cover predicted by the random forest models trained with TLS observations achieved low OOB (R 2 = 0.10), an RMSE of 12%, and a correlation of 0.38 (Table 3). Canopy height predictions on the other hand had higher predictive power (R 2 = 0.25) and exhibited a moderate correlation (r = 0.6) with the validation data; however, low predictions in taller forests resulted in a relatively high RMSE (4.41 m) ( Figure 5C). Both elevated and near-surface height models provided low-to-moderate predictive power. The mean predictions of elevated height from the random forest models trained with TLS observations were found to have low correlation (r = 0.12) with elevated height observed using TLS ( Figure 5B). The mean predictions of near-surface height from the ran-dom forest models trained with TLS observations was found to have moderate correlation (r = 0.47) with near-surface height observed using TLS ( Figure 5A).

Near-Surface and Elevated Cover
The TLS and visual assessment produced different magnitudes of cover within the near-surface and elevated layers, consistent with the differences seen in the comparison of infield data. Where the near-surface cover was higher and elevated cover was lower within the model predictions based on visual assessments, the TLS-based model predicted lower near-surface and higher elevated fuel cover ( Figure 6). For example, at Clonbinane the mean near-surface cover predicted by the random forest model trained with TLS estimates was 34% in comparison to 55% from the random forest model trained with visual assessments (Figure 6). Despite being of differing magnitudes, similar landscape patterns of near-surface and elevated cover were predicted using the two training data inputs, as shown in Figures 7A,B and 8A,B.  The standard deviation between the 100 predictions of cover for both the elevated and near-surface layer was lower across all seven sites when the TLS data were used to train the model ( Figures 7C,D and 8C,D) in comparison with when visual assessments were used to train the model. For example, the mean standard deviation of cover for the near-surface layer ranged from 2% (at Tostaree) to 3% (at Three Bridges) for TLS in comparison to 4% (at Tostaree) to 8% (at Three Bridges) for visual assessments. Patterns of standard deviation were similar between the predictions of the training data with higher variability in cover estimates occurring along the top of the slopes and the bottom of valleys. This is evident in Figures 7 and 8 for Clonbinane and at all other sites (Appendix B). In both cases, the sites with a smaller number of plots in the area that was up-scaled produced higher variance in model estimates, suggesting that the conditions were not fully captured by the model. For example, where only four plots were used to up-scale an area of 482.7 ha (Three Bridges), a near-surface cover variance of 3.23% was produced, and where 14 plots were used to up-scale an area of 341.3 ha (Tostaree), a near-surface cover variance of 2% was produced ( Figure 6).

Fuel Height
Landscape-level predictions of fuel height were also generated based on the TLS data ( Figure 9 and figures in Appendix B). Similar features across the landscapes were observed in the height maps for both elevated and near-surface layer ( Figure 9C,E). Patterns of standard deviation were also observed to be similar for both elevated and near-surface fuel height. This is evident in (Figure 9D,F) and at all other sites (figures related to height of fuel layers in Appendix B). The mean variation was observed to be low across the up-scaled area for both the near-surface and elevated layer for all seven sites ( Figure 10).

Discussion
The current visual approach to assess fuel hazard in southeast Australian forests provides a rapid means of assessing fuel and an indication of wildfire intensity. However, this approach has been shown to be subjective and does not provide a quantified estimate of fuel properties [30,31]. Nevertheless, using these data Jenkins et al. [15], McColl-Gausden et al. [27] demonstrated that environmental variables provide a driver of fuel hazard conditions across Victoria, Australia. In this study, the visually assessed fuel parameter estimates were captured followed Hines et al. [23] by the same set of observers over a short period of time. Despite this approach, removing some of the subjectivity associated with fuel hazard assessments, the random forest regression based on similar predictor variables to [15,27] failed to provide a prediction significantly improved over what could be achieved with random data. In comparison, moderate model performance was found when using TLS-derived properties for all metrics except for elevated fuel height. As such, in this study we demonstrated that the fuel properties derived from TLS data provide an improved source of plot-level data over visually assessed metrics for up-scaling fuel strata height and cover across the Victorian forests.
Previous attempts to describe landscape fuel conditions such as [15,27] based on plotlevel visual assessments have often aimed to classify the landscape into fuel hazard classes (for instance, low, moderate, high, or extreme fuel hazard). By predicting classes, there is an inability to identify how certain classes were achieved. The workflow presented in this study provides an avenue for land managers to spatially interpolate metrics describing fuel properties by utilising observations from TLS. By predicting fuel properties instead of hazard, land managers are able to make informed decisions on how these properties may vary across the landscape without any loss of detail. To use these data to inform a fuel hazard rating, land managers may combine these products with live and dead fuel moisture readings as well observations of surface fuel hazard and bark hazard. Further research should be conducted to link non-morphological fuel characteristics with the structural metrics able to be derived from TLS to form a complete picture of the factors influencing fuel hazard.
The method used to randomly sample plot locations within the burn units in this study is currently implemented by Forest Fire Management Victoria staff and was developed to balance the capture of useful information and efficiency of data capture [76]. As such, plots were situated at least 100 m and no more than 500 m away from the road edge (to reduce edge effect whilst balancing human safety factors) with the number of plots scaled to the size of the burn unit. The determination of site placement did not consider changes in the landscape factors that are used to assist in the scaling-up of TLS data points. This likely caused the random forest predictions to fall within a narrower band than the overall fuel conditions at each location. Therefore, it is suggested that the method of sampling is refined, whereby further variables are added into the determination of the plot locations that are pertinent to the landscape measures able to be extracted from climate, soil, and satellitederived datasets (e.g., the structurally guided sampling method [77]). By collecting data at sampling locations that are guided by environmental differences across the landscape (such as those highlighted in McColl-Gausden et al. [27]), it is likely that the location of plots spanning a greater range of fuel conditions will be obtained and in turn model accuracy estimates will likely improve.
In tandem with an improved sampling method, it is suggested that further variables to assist in the prediction of structural variables across the landscape are also considered. As highlighted in [15,26,27], the inclusion of time since fire and fire severity information from previous prescribed burning or wildfire is likely to assist in the prediction of structural metrics across an area. In this study, the plots captured did not show variability in time since fire, and hence this attribute was not used in the models. Satellite-based products have been shown to provide valuable information describing both the effect and recovery of Australian forests from disturbances [78][79][80][81]. The integration of such information to a modelling framework utilising plot-based TLS fuel properties will likely create greater understanding of fuel properties across a landscape as well as help in describing the dynamics of those properties.
Whilst the approach to process TLS point clouds shown here provides a more precise estimate of fuel properties than visual assessments, the produced results still require for decisions to be made for a number of metrics and thresholds. Using height thresholds to determine the attribution of vegetation elements to strata, for example, moves away from the physical description of fuel provided in Gould et al. [82], where fuel orientation (near-surface fuel is orientated horizontally, while elevated fuel is vertically orientated) should provide guidance as to the layer definition. By using defined height thresholds as in this research, negative correlation was seen between visual assessment and TLS methods. For example, where a dominant fuel element in a plot could be considered by an observer to be near-surface, due to its low height and orientation, the defined height threshold could result in this element being placed in the elevated layer during TLS processing. The consistent mapping of TLS-derived points (or point clusters) to physically relevant fuel strata remains a research challenge. Nevertheless, recent gains in areas such as in 3D machine learning point to potential improvements in the future processing of similar data [83]. However, further work is required to translate this into useful descriptions of fuel hazard.
It is vital that land and fire management staff further validate the products that are achieved using the method presented in this manuscript or similar approaches such as the one presented by Jenkins et al. [15], McColl-Gausden et al. [27]. Unlike the sole use of Airborne Laser Scanning data where land and fire managers can be completely remote from the production of fuel hazard metrics, sampling techniques such as terrestrial or mobile laser (MLS) scanning require on-ground sampling. Of these two technologies, it is believed that TLS is a preferred operational technique as it still provides a robust and repeatable measurement technique and requires the operator to remain in place for 3-5 min whilst each scan is undertaken [41,84]. This time provides the opportunity for the assessor to consider the plot area and capture ancillary information, such as photography of the site, and bark and surface fuel hazard assessments which are not currently able to be extracted from 3D point clouds. It is noted in a broader concept that landscape-specific sampling should be undertaken whereby if the landscape does not feature a dense forested canopy, ALS data may be sufficient to capture the fuel information required for a particular area [85,86].

Conclusions
This study presents a machine-learning-based method for estimating fuel metrics derived from TLS point clouds and visual assessment across an area relevant to operational fire management decisions. This method was applied across seven different sites distributed across Victoria, Australia. Plot-level data were collected by both TLS and visual assessments. The cover and height for three fuel layers such as near-surface, elevated, and canopy were derived from TLS point clouds. However, visual assessments were only available for cover of near-surface and elevated layers. Fuel metrics at landscape level were estimated by using a two-stage approach, firstly random forest models for each fuel metric were trained separately using TLS observations and visual assessments. Fuel metrics were then up-scaled to the large area by integrating topographic variables derived from ALS data, soil, climate variables, and vegetative indices derived from Sentinel MSI-2 imagery. The performance of a random forest model trained with TLS observation was compared with random forest models trained with visual assessment, where appropriate.
Our results show that the model trained and validated with TLS observations was found to have higher predictive power than the model trained with visual assessments for the cover of both fuel layers (near-surface and elevated). The results demonstrate the use of in-field TLS data coupled with machine learning approaches may provide a means to accurately generate fuel maps across the landscape and service operational decision-making as an alternative to visual-based assessments. The models utilising TLS data produced in this study could be utilised by fire managers to assess fuel treatment effectiveness and validate existing fuel maps. These results can be used to support a more robust determination of fuel metrics at the time of assessment and monitoring over time.
The under-sampling of ranges and conditions was still found to be an issue with the TLS dataset as higher variation was observed in the cover and height for each fuel layer for those sites where a smaller number of plots were used for up-scaling larger areas. It is recommended that the sampling strategy should ultimately be driven according to the environmental differences across the landscape so that the location of plots cover a greater range of fuel conditions which in turn can improve the accuracy of the model. Author

Contributions:
All authors contributed to the study conception and design. Material preparation, methodology, and analysis were performed by R.T., L.W., and S.H. Validation was performed by R.T. and L.W. Supervision for this study was performed by L.W., K.R., J.H., and S.J. Data for this study were collected by B.H., L.W., S.H., and R.T. The first draft of this manuscript was written by R.T. and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Conflicts of Interest:
The authors declare no conflict of interest.

Abbreviations
The following abbreviations are used in this manuscript: Appendix A Table A1. Topographic attributes derived from DTM using ALS data at a spatial resolution of 20 m for each site.

Topographic Variables Description
Slope Calculated by fitting a plane to the eight neighbouring cells [87].

Aspect
The orientation of the cell relative to the north [87].

Catchment area
The upstream area of each cell [88].
Profile curvature The rate of change of slope in a down-slope direction: a proxy for acceleration and deceleration of water over the terrain [89].

Plan curvature
The curvature of a contour at the central pixel. It can be used as a proxy for convergence and divergence of water [89].
Potential solar radiation ratio The ratio of the potential solar radiation on a sloping surface to that on a horizontal surface [90].

Topographic Position Index
Classifying terrain such that the altitude of each data point is evaluated against its neighbourhood to verify whether any particular data point forms part of a positive (e.g., crest) or negative (e.g., trough) feature of the surrounding terrain [91].

Terrain Ruggedness Index
The sum change in elevation between a grid cell and its eight neighbouring grid cells [92].
Stream Power Index A measure of the erosive power of flowing water [93].
Topographic Wetness Index A measure of soil moisture potential that combines contextual and site information and is used to identify potential locations of ephemeral gullies [94].

Convergence Index
The average bias of the slope directions of the adjacent cell from the direction of the central cell minus 90 degrees [88]. Table A2. Climate attributes obtained from Worldclim at a spatial resolution of 1 km 2 and re-sampled at a resolution of 20m for each site.

Climate Variables Description
Annual mean temperature (bio1) The annual mean temperature approximates the total energy inputs for an ecosystem. Calculated by taking the average over twelve months of average temperature for each month [95].
Max temperature of warmest month (bio5) Calculated by selecting the maximum temperature value across all months within a given year [95].

Precipitation of warmest quarter (bio18)
Calculated by first identifying the warmest quarter of the year and then summing up the precipitation values for that quarter [95]. Table A3. Soil variables obtained from CSIRO Data Access Portal at a resolution of 3 × 3 arcseconds and re-sampled at resolution of 20 m for each site.

Soil Variables Description
Soil bulk density (BDW) Bulk Density of the whole soil (including coarse fragments) in mass per unit volume by a method equivalent to the core method [67].
Soil clay content (CLY) <2 µm mass fraction of the <2 mm soil material determined using the pipette method [67].

Vegetative Indices Derived from Sentinel-2 Description
Normalized Difference Vegetation Index (NDVI) Describes the difference between visible and near-infrared reflectance of vegetation cover and can be used to estimate the density of green on an area of land [96].
Normalized Burn Ratio (NBR) Identify burned areas and provide a measure of burn severity.

Tasseled cap transformation
Technique generally used in land cover mapping or other classification projects [97]. It takes the linear combination of satellite imagery bands and a specialised coefficient matrix to create an n-band image with the first three bands containing the majority of the useful information. The first three bands created represents brightness, greenness, and wetness, which are used as predictors here [98].
Chlorophyll red-edge index Estimate the chlorophyll content of leaves, using the ratio of reflectivity in the near-infrared (NIR) and red-edge bands [98,99].
Shortwave infrared to near infrared ratio Provides an indication of leaf chlorophyll content [99].

Appendix B
Appendix B.1. Site: Lakes Entrance