3.1. Fire Spots in Brazilian Biomes and Municipalities
A fire spot, also known as a heat spot, is a point with a temperature above 47 °C that may indicate fires or burns. Moreover, flames can originate from one or more locations. A spot is detected by monitoring satellites at approximately 700 to 900 km altitude.
In this study, a hierarchical time series with three levels was considered. Level 0 represents Brazil’s total number of fire spots from January 2011 to December 2022. Level 1 consists of fire spots disaggregated by Brazilian biome: Amazon, Caatinga, Cerrado, Atlantic Forest, Pampa, and Pantanal. The lowest level, or level 2, represents biomes disaggregated by municipality in Brazil, totaling 5570 municipalities. Thus, a total of 5577 time series are included in the hierarchy.
Table 1 presents the descriptive measures of the number of fire spots in Brazil, its biomes, and municipalities. The data show high variability in monthly fire spots across all hierarchy levels, particularly in the municipalities. The Amazon and Cerrado biomes exhibit the highest numbers of fire spots, with significant variability between months, while the Pampa biome has the lowest number of fire spots and the least variability in the data. When grouped by their predominant biome, municipalities generally show a prevalence of months with few or no fire spots; 75% of the total observations from municipalities where the Caatinga, Atlantic Forest, and Pampa are the predominant biomes reported zero fire spots.
Figure 4 displays the time series of fire spots in Brazil and its biomes, representing the series at levels 0 and 1 of the hierarchy. In 2017, Brazil experienced the month with the highest number of fire spots between 2011 and 2022, with over 60 thousand fire spots. The Amazon had the highest number of fire spots among the six biomes, while the Pampa had the lowest. The sudden increase in the number of spots in the Pantanal in 2020 is noteworthy, a year in which the biome lost approximately 30% of its vegetation [
1]. The seasonal component in the series is visible in
Figure 4, being less explicit for the Pantanal due to the peak in spots in 2020 and the Pampa, which does not exhibit a well-defined pattern.
Due to the large number of municipalities (5570), it is unfeasible to visualize the series at the last level of the hierarchy. To better understand the distribution of fire spots and consider the discrepancy in the number of spots between municipalities, analyses were conducted regarding the number of spots per area (km
) from 2011 to 2022 in each municipality (
Figure 5,
Figure 6,
Figure 7 and
Figure 8). Most municipalities showed a low number of spots per area. There is no clear trend of increase or decrease in spots over the years (
Figure 5 and
Figure 6). In the Pantanal, the peak of fires in 2020 is again visible in this graph, and only two municipalities were responsible for most spots in the biome that year (
Figure 6).
Figure 6 shows that fire spots are not evenly distributed across the country; some municipalities are responsible for most spots in Brazil. From 2011 to 2022, the country’s southern region had few fire spots per km
, and the state of Santa Catarina, belonging to this region, had no spots in some years. Municipalities in the states of Maranhão and Tocantins had a high number of spots per km
throughout all years. These states have the Cerrado biome as their predominant biome.
The analysis of the distribution of fire spots per area in each municipality throughout the months of the year indicates the presence of seasonality in the data (
Figure 7 and
Figure 8). At the beginning of the year, the number of fire spots per area in municipalities is low or zero, and this number begins to increase in July (
Figure 7 and
Figure 8). The peak of fire spots occurs between September and October, depending on the municipality’s biome. Only in cities where the Pampa is the predominant biome is a pattern less clear to identify (
Figure 7). In September, the highest number of fire spots per km
occurs throughout the country, and by December, this number decreases considerably (
Figure 8). Fires are more frequent in the winter and spring months, from the end of June to the end of December, due to low rainfall levels [
30].
3.2. Hierarchical Time Series Forecasting
The routines for calculating fire spot forecasts for the twelve months of 2022 were implemented using R software, version 4.3.0. The results in this section pertain to reconciled forecasts using the square root transformation of fire spots plus 0.5 and the logarithmic transformation of fire spots plus 1. Both transformations used the ARIMA model for base forecasts. The ETS model was also used for base forecasts with the square root transformation of fire spots plus 1. Reconciled forecasts using ETS and the logarithmic transformation of fire spots encountered convergence errors, and the adjusted values and forecasts were unrealistic, so they were not presented.
Table 2 shows the root mean squared error (RMSE) for the level 0 series, related to the overall number of fire spots throughout Brazil, and the average RMSEs for the level 1 and 2 series. Values in bold indicate the lowest RMSEs at each hierarchical level. The results of the average accuracy for different reconciliation approaches using the square root plus 0.5 transformation in the data and the ARIMA model for base forecasts are presented in the first part of the table. With this setup, at level 0, the OLS approach exhibited the lowest RMSE. At level 1, the WLS(s) approach yielded better results, followed by the MinT(s) approach. At level 2, the MinT(s) and WLS(s) outperformed the other approaches. When considering the ARIMA model for base forecasts and a logarithmic transformation of the number of fire spots plus 1, at level 0, the MinT(s) approach had the lowest RMSE. At levels 1 and 2, BU, MinT(s), and WLS(v) outperformed the others (
Table 2). The logarithmic transformation results outperformed those based on the square root transformation when the ARIMA model obtained the base forecasts, especially for BU, MinT(s), and WLS(v). When considering the square root transformation and the ETS model to obtain the base forecasts, BU showed better results at level 0, and MinT(s) at the other two levels. The average RMSE values were generally higher than those obtained using the ARIMA model to obtain the base forecasts (
Table 2). Reconciled forecasts using ETS and the logarithmic transformation encountered convergence errors, and the adjusted values and forecasts were unrealistic, so they were not presented. When considering the Prophet to obtain the base forecasts, the overall best performance was obtained by BU, followed by MinT(s) and WLS(v) for both transformations (
Table 2).
Table 3 is similar to
Table 2, but uses MASE as the accuracy measure. The ARIMA model with logarithmic transformation presented the lowest average MASE values, particularly with the MinT(s) approach for level 0 and WLS(v) for levels 1 and 2. Comparing the average MASEs by hierarchical level, level 0, representing Brazil, showed the best results in terms of accuracy, followed by level 1. The MASE values for municipalities were, on average, greater than 1 for all models and approaches.
Table 4 presents the results for the RMSSE, which are similar to those of RMSE and MASE (
Table 2 and
Table 3) in terms of the best model and approach. Unlike for MASE (
Table 3), at level 2, the average RMSSE was less than one for some approaches. The base forecasts without considering any hierarchical structure, especially at levels 0 and 1, resulted in worse outcomes when compared to some reconciliation approaches (
Table 2,
Table 3 and
Table 4).
In addition to calculating the RMSE, MASE, and RMSSE, it is important to know which approaches produce better forecasts than the baseline. If they are not better, hierarchical time series forecasting loses relevance.
Table A1,
Table A2 and
Table A3 of
Appendix A show the AveRelRMSE, AveRelMASE, and AveRelRMSSE, respectively, for each level of the hierarchy considering the ARIMA, ETS, and Prophet base forecast models and the two transformations. A value of the AvgRelRMSE, AveRelMASE, and AveRelRMSSE equal to one indicates that the reconciled forecast is equal to the base forecast at that level. An example is the bottom-up approach, which uses base forecasts from municipalities, level 2, and sums them to obtain forecasts for the above levels. A value of AveRelRMSE, AveRelMASE, and AveRelRMSSE lower than one indicates that the reconciled forecasts showed improvements compared to the base forecasts, with bold values being the lowest in
Table A1,
Table A2 and
Table A3.
Considering the ARIMA model and the square root transformation of fire spots plus 0.5, at level 0, only the OLS and MO approaches had a value of less than one, showing an improvement of 11.1% and 5.0%, respectively, in RMSE compared to the base forecasts. At level 1, WLS(s) was the one that showed a higher improvement, with a value of 5.8%. At level 2, TDFP showed an improvement of 5.3%. Considering the ARIMA model and the logarithmic transformation of fire spots plus 1, at level 0, only the forecasts from the BU and WLS(v) approaches did not show improvements in the RMSE compared to the base forecasts. At this level, MinT(s) obtained the best result, with an improvement of 22.2% in the RMSE value. At level 1, out of the nine approaches, four of them showed improvements in the RMSE up to 20.0%. In the last level, no method showed improvement compared to the base forecasts (
Table A1). When considering the Prophet model for the base forecasts, the overall best reconciliation method was the BU, but without an RMSE improvement in level two. Overall, the MinT(s) and TDFP approaches outperformed the remaining when considering the logarithm transformation and the ARIMA model to obtain the base forecasts.
The AvgRelRMSE of the approaches using ETS for base forecasts and the square root transformation of fire spots plus 0.5 is also presented in
Table A1. At level 0, excluding the top-down approaches that use base forecasts at this level, all approaches showed improvements in RMSE. BU showed an improvement of 76.3% at this level. At level 1, only WLS(s) and TDPA showed improvements in RMSE, and six approaches showed improvements in RMSE at level 2, with TDPA having the highest improvement. It is also noted that the MO and TDPA approaches showed improvements at all levels.
When considering the Prophet model for the base forecasts, the most effective reconciliation method overall was the BU, except for level 0 in the case of the square root transformation, where the TDPA showed superior performance.
Despite the percentage improvements in RMSE, it is important to consider that the reconciled forecasts using each base model and transformation have different magnitudes. For example, the ETS model with the square root transformation had the overall highest RMSE values (
Table 2) compared to the prophet with the logarithm transformation. Thus, reconciled forecasts using the ETS model show significant improvements over base forecasts, but they are not better than those using the Prophet model.
Figure 9 and
Figure 10 show the boxplots for the RMSE for the nine forecasting reconciliation approaches and the three models and two transformations under consideration for level one (biomes) and level two (municipalities), respectively. For level one (
Figure 9), the WLS(s) seems to be the better-performing reconciliation approach, and the ETS model with the square root transformation is the best to obtain the base forecasts. For level two (
Figure 10), when not considering the “outliers”, a good overall performance was obtained for the TDFP reconciliation approach, and the ETS with the square root transformation seems to be the overall winner in performance.
Figure A1 and
Figure A2 show the boxplots for the MASE for the nine forecasting reconciliation approaches and the three models and two transformations under consideration for level one (biomes) and level two (municipalities), respectively.
Figure A3 and
Figure A4 show the boxplots for the RMSSE for the nine forecasting reconciliation approaches and the three models and two transformations under consideration for level one (biomes) and level two (municipalities), respectively. The overall conclusions from
Table A1,
Table A2 and
Table A3 are in the same line as those for the RMSE (
Figure 9 and
Figure 10).