Hierarchical Time Series Forecasting of Fire Spots in Brazil: A Comprehensive Approach

Ana Caroline Pinheiro; Paulo Canas Rodrigues

doi:10.3390/stats7030039

and

¹

Department of Statistics, Federal University of Bahia, Salvador 40170-115, BA, Brazil

²

Statistical Learning Laboratory (SaLLy), Federal University of Bahia, Salvador 40170-115, BA, Brazil

^*

Author to whom correspondence should be addressed.

Stats2024, 7(3), 647-670;https://doi.org/10.3390/stats7030039

This article belongs to the Special Issue Modern Time Series Analysis II

Version Notes

Order Reprints

Review Reports

Abstract

This study compares reconciliation techniques and base forecast methods to forecast a hierarchical time series of the number of fire spots in Brazil between 2011 and 2022. A three-level hierarchical time series was considered, comprising fire spots in Brazil, disaggregated by biome, and further disaggregated by the municipality. The autoregressive integrated moving average (ARIMA), the exponential smoothing (ETS), and the Prophet models were tested for baseline forecasts, and nine reconciliation approaches, including top-down, bottom-up, middle-out, and optimal combination methods, were considered to ensure coherence in the forecasts. Due to the need for transformation to ensure positive forecasts, two data transformations were considered: the logarithm of the number of fire spots plus one and the square root of the number of fire spots plus 0.5. To assess forecast accuracy, the data were split into training data for estimating model parameters and test data for evaluating forecast accuracy. The results show that the ARIMA model with the logarithmic transformation provides overall better forecast accuracy. The BU, MinT(s), and WLS(v) yielded the best results among the reconciliation techniques.

Keywords:

hierarchical time series forecasting; forecast reconciliation; Brazilian wildfires

1. Introduction

Forest fires are a recurring problem in Brazil, with 222,798 fire spots recorded nationwide in 2020 [1]. These wildfires significantly impact the ecosystem, infrastructure, and population. The Brazilian biomes host a rich biodiversity that can be partially lost due to these fires. Amazon forest fires substantially affect air quality and public health [2]. The quality and quantity of water can also be affected if the fires reach the vegetation around watersheds. Fires can impact crops and, in the long term, compromise food production if soil fertility is affected by the flames [1]. Producing statistics regarding fire spots is crucial for implementing more efficient control measures. Forecasting forest fires is pivotal in designing effective policies for preventing and controlling these phenomena [3].

The number of fire spots can be expressed as a time series, whether at the municipal, biome, state, or national level. Several studies have analyzed and forecast time series related to the number of fire spots in Brazil. For example, ref. [4] used multiple linear regression and non-seasonal ARIMA techniques associated with meteorological variables to model and forecast forest fires in the Pantanal of Mato Grosso do Sul in Corumbá. The logarithm of the number of fire spots was introduced as the dependent variable, and maximum temperature, relative humidity, and solar radiation were introduced as independent variables. The results showed that the ARIMA model was more suitable for modeling and predicting forest fires [4]. Another study, conducted in the same location, used artificial neural networks (ANNs) to forecast the number of fire spots and showed that it was possible to predict the number of fire spots with a predictive power of 84.8% using a set of meteorological variables as predictors, and 99.4% using only the time series of the number of fire spots as a predictor [5]. However, existing studies on fire spot forecasting have been conducted for specific biomes and/or regions, with no study considering making forecasts for all municipalities and biomes in Brazil.

Moreover, the number of fire spots per municipality, biome, and Brazilian territory can be organized hierarchically to form a hierarchical time series (HTS), which refers to a time series that can be aggregated at different levels based on products, geography, or other characteristics [6,7,8]. An example of a hierarchical time series is the sales of a particular item in a country, where total sales can be disaggregated by state, and each state’s sales can further be disaggregated by municipality. Forecasting hierarchical time series is typically done for all levels, and they should be “coherent”, meaning that aggregate forecasts should be equal to the sum of the corresponding disaggregated forecasts [9]. For instance, the sales forecast for a product in municipalities should result in state-level sales, and the sum of state-level forecasts should equal the total sales forecast for the country.

The first methods for forecasting reconciliation in hierarchical time series (HTS) were the top-down, the bottom-up, and the middle-out. These involve generating forecasts for one aggregation level and then using them to create coherent forecasts for the other levels. The bottom-up approach requires forecasts for all series at the lowest level, which are then aggregated to obtain higher-level forecasts. The top-down method requires a forecast only for the total series, which will be disaggregated to obtain forecasts for the other series at lower levels. The middle-out approach combines the bottom-up and top-down approaches to make forecasts. There is no consensus in the literature on which method is more efficient, as it may depend on the number of series in the hierarchy and the data quality [10].

More recently, methods have emerged that use all aggregation levels to generate coherent forecasts. These methods combine forecasts for all levels to provide the best aggregate forecasts [6] and are known as optimal combination or optimal reconciliation [10,11]. This approach requires estimating the variance and covariance matrix of forecast errors, leading to five variations of this method [9].

To generate coherent forecasts, individual forecasts of the series in the hierarchy are needed without considering any constraints; these are called incoherent forecasts or base forecasts. This paper considers three widely used methods for generating the base forecasts. Exponential smoothing, known as ETS [12,13,14], forecasts the time series as a weighted average of past observations, with weights exponentially decreasing as observations move away from the end of the series. The autoregressive integrated moving average (ARIMA) model generates forecasts based on the autocorrelation of the data. The Prophet, created by Facebook, generates forecasts considering a trend component, seasonality, a component capturing holiday effects, and an error term [15].

Therefore, this study aims to forecast the number of fire spots for each Brazilian municipality, biome, and whole territory by considering different reconciliation techniques and base forecast methods in a three-level hierarchical time series structure. The number of fire spots between January 2011 and December 2022 was obtained from the Brazilian National Institute for Space Research (INPE) and organized by [16]. The autoregressive integrated moving average (ARIMA), the exponential smoothing (ETS), and the Prophet models were tested for baseline forecasts, and nine reconciliation approaches, including top-down, bottom-up, middle-out, and optimal combination methods, were considered to ensure coherence in the forecasts.

The rest of this paper is organized as follows. Section 2 describes the data, the methodology behind hierarchical time series and reconciliation techniques, the models used to obtain the base forecasts, and the accuracy measures used to compare the competing models. Section 3 presents a descriptive and exploratory analysis of the data, followed by the results of the accuracy measures for each model. Finally, Section 4 presents the concluding remarks of the study.

2. Materials and Methods

2.1. The Data

The data used in this study were provided by the QUEIMADAS program of the Brazilian National Institute for Space Research (INPE). The dataset contains the number of fire spots per month in each municipality in Brazil from January 2011 to December 2022 and was organized and made available by [16]. The Brazilian territory is divided into six biomes: (i) Amazônia, encompassing approximately 60% of the world’s largest rainforest, rich in mineral reserves, and providing 20% of the world’s water supply; (ii) Caatinga, featuring a semi-arid climate with remarkable biological diversity and unique species; (iii) Cerrado, acknowledged as the world’s richest savanna in terms of biodiversity, which remained largely unchanged until the 1950s when the federal capital was moved to Brasília; (iv) Mata Atlântica, situated along the Brazilian coast and considered the most endangered biome in the country, with only 27% of the original forest cover still preserved; (v) Pampas, characterized by a rainy climate without a dry period and negative temperatures during winter; and (vi) Pantanal, recognized as the planet’s most extensive continuous floodplain.

The Brazilian municipalities composed of more than one biome were associated with the predominant biome based on the results of a Brazilian Agricultural Research Corporation (Embrapa) project [17]. Figure 1 shows the locations of each of the six biomes in the Brazilian territory.

Figure 1. Geographic localization of each of the six Brazilian biomes.

2.2. Hierarchical Time Series

A hierarchical time series is a time series that can be disaggregated into various attributes and different levels. An example of a hierarchical time series is the sales of a specific item in a country, where total sales can be disaggregated by state, and municipalities can further disaggregate each state’s sales. Figure 2 illustrates a hierarchical time series with three levels: Total, which is disaggregated into two time series, A and B, and these are further disaggregated into two (AA and AB) and three (BA, BB, and BC) time series, respectively.

Figure 2. A three-level hierarchical tree diagram.

In a hierarchical time series, obtaining observations at time t is possible by summing the observations from the level below. Considering the series in Figure 2, we have the following equations,

y_{t} = y_{A, t} + y_{B, t},

(1)

y_{A, t} = y_{A A, t} + y_{A B, t} and y_{B, t} = y_{B A, t} + y_{B B, t} + y_{B C, t},

(2)

Substituting (2) into (1) we get

y_{t} = y_{A A, t} + y_{A B, t} + y_{B A, t} + y_{B B, t} + y_{B C, t},

where

y_{t}

is the total of the time series in the time point t,

t = 1, 2, \dots T

.

The equations above can be written in matrix form as follows:

[\begin{matrix} y_{t} \\ y_{A, t} \\ y_{B, t} \\ y_{A A, t} \\ y_{A B, t} \\ y_{B A, t} \\ y_{B B, t} \\ y_{B C, t} \end{matrix}] = [\begin{matrix} 1 & 1 & 1 & 1 & 1 \\ 1 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 1 & 1 \\ 1 & 0 & 0 & 0 & 0 \\ 0 & 1 & 0 & 0 & 0 \\ 0 & 0 & 1 & 0 & 0 \\ 0 & 0 & 0 & 1 & 0 \\ 0 & 0 & 0 & 0 & 1 \end{matrix}] [\begin{matrix} y_{A A, t} \\ y_{A B, t} \\ y_{B A, t} \\ y_{B B, t} \\ y_{B C, t} \end{matrix}],

that simplifies as:

y_{t} = {Sb}_{t},

where

y

is an n-dimensional vector of all observations at time t,

S

is the

n \times m

sum matrix, and

b_{t}

is an m-dimensional vector of all observations at the lower level of the series at time t.

2.3. Forecasting Approaches for Hierarchical Time Series

It is possible to represent all forecasting methods using a single matrix notation:

{\tilde{y}}_{h} = SG {\hat{y}}_{h},

(3)

where

{\tilde{y}}_{h}

represents the coherent forecasts,

S

is the sum matrix,

{\hat{y}}_{h}

is the base forecasts for all time series in the hierarchy, and

G

is the matrix mapping the base forecasts, varying in each approach. The following sections briefly describe the different approaches to obtaining coherent forecasts for a hierarchical time series that will be considered in this paper.

2.3.1. Bottom-Up (BU) Approach

The bottom-up (BU) approach involves making forecasts for each time series at the lowest level and then aggregating/summing them to obtain forecasts for the higher levels. The advantage of this method lies in utilizing all available information because the forecasts are made at the lowest level. However, data at the lowest level can be quite noisy, leading to inaccurate forecasts. We can represent this approach using the general form of Equation (4), as follows:

G = [0_{m \times (n - m)} | I_{m}],

(4)

where

0_{m \times (n - m)}

is a null matrix and

I_{m}

is the identity matrix. The matrix

G

, mapping the base forecasts, is an identity matrix, as the individual forecasts are directly used as coherent forecasts for the higher levels. The bottom-up approach is useful when the lower levels of the hierarchy significantly contribute to the higher levels or when the data quality at the lower levels is more reliable.

2.3.2. Top-Down (TD) Approach

The top-down (TD) method, or top-down forecasting, involves forecasting the total series and then disaggregating it to the lower levels. For this purpose, proportions

p_{1}, p_{2}, \dots, p_{m}

are used to determine how these forecasts will be distributed to obtain the forecast for the lower series. This approach has the advantage of simplicity, as it uses only one series to generate forecasts, which also results in significant information loss [10]. In this approach, the matrix

G

in Equation (3) takes the following form:

G = [p | 0_{m \times (n - 1)}],

where

p = [p_{1}, p_{2}, \dots, p_{m}]

is a set of proportions. There are different approaches to obtaining these proportions; ref. [18] propose two: average historical proportions and proportions of the historical averages. The top-down (TD) approach in forecasting hierarchical time series is useful when there is insufficient or unreliable data at the lower levels of the hierarchy, making it challenging to model individual series accurately or when straightforward and computationally efficient forecasting methods are preferred.

The average historical proportions are computed as follows.

p_{j} = \frac{1}{T} \sum_{t = 1}^{T} \frac{y_{j, t}}{y_{t}} j = 1, \dots, m,

where

p_{j}

represents the historical average proportions of the lower series

y_{j, t}

in relation to

y_{t}

. In the results of this paper, this approach is denoted as TDAP.

The proportions of the historical averages are obtained as follows.

p_{j} = \sum_{t = 1}^{T} \frac{y_{j, t}}{T} / \sum_{t = 1}^{T} \frac{y_{t}}{T} j = 1, \dots, m,

where

p_{j}

represents the historical average value of the lower-level series

y_{j, t}

in relation to the mean of

y_{t}

. In the results of this paper, this approach is denoted as TDPA.

The methods described by Gross and Sohl take into account historical proportions but do not consider how these proportions may change over time, which can impact the accuracy of forecasts for lower levels. To address this, ref. [19] proposed the “Forecast Proportions”, where forecasts for the lower levels are used to calculate the proportions to be applied. The proportions are calculated as follows:

p_{j} = \prod_{𝓁 = 0}^{K - 1} \frac{{\hat{y}}_{j, h}^{(𝓁)}}{{\hat{S}}_{j, h}^{(𝓁 + 1)}}

where

\hat{y} {j, h}^{(𝓁)}

is the h-step ahead forecast of the series corresponding to the node that is ℓ levels above j, and

\hat{S} {j, h}^{(𝓁 + 1)}

is the sum of the base forecast h steps ahead below the node that is ℓ levels above j, which is directly connected to that node. This approach was denoted as TDFP.

2.3.3. Middle-Out (MO) Approach

In the middle-out approach, or intermediate approach, an intermediate level is chosen, and forecasts are generated for all series at that level. Using these forecasts, the bottom-up approach is applied to the levels above the intermediate level, and the top-down approach is applied to the levels below the intermediate level. This strategy aims to strike a balance between the detailed insights provided by the bottom-up method and the high-level overview offered by the top-down method. The choice of the intermediate level is crucial and may vary depending on the specific characteristics of the time series data and the hierarchy structure.

2.3.4. Minimum Trace (MinT) Optimal Reconciliation Approach

The minimum trace (MinT) optimal reconciliation approach optimally combines base forecasts to produce coherent forecasts. Unlike other approaches, it utilizes all information in the hierarchy to generate unbiased forecasts at all levels of the hierarchy [19]. It involves finding the matrix

G

that minimizes the variances of errors in coherent forecasts

{\tilde{y}}_{h}

, as demonstrated by [9]. The matrix of variance and covariance of errors in coherent forecasts h steps ahead is given by:

V_{h} = V a r [y_{T + h} - {\tilde{y}}_{h}] = {SGW}_{h} G^{'} S^{'},

where

W h = V a r [y T + h - {\hat{y}}_{h}]

is the variance–covariance matrix of base forecast errors. Minimizing the variances of errors in coherent forecasts is equivalent to minimizing the trace of the matrix

V_{h}

, hence the name minimum trace. The matrix

G

that minimizes the trace of the matrix

V_{h}

is

G = {(S^{'} W_{h}^{- 1} S)}^{- 1} S^{'} W_{h}^{- 1},

provided that

SGS = S

is satisfied. Therefore,

{\tilde{y}}_{h} = S {(S^{'} W_{h}^{- 1} S)}^{- 1} S^{'} W_{h}^{- 1} {\hat{y}}_{h} .

To obtain

{\tilde{y}}_{h}

, it is necessary to estimate the matrix

W_{h}

. Several approximations have been proposed, giving rise to various approaches. The five main ones are listed below:

1.: $W_{h} = k_{h} I$ for $k_{h} > 0$ , transforms MinT into an ordinary least squares (OLS) estimator and assumes that the matrix $G$ is independent of the data, facilitating calculations [6]. However, it assumes independence, leading to information loss.
2.: $W_{h} = k_{h} diag ({\hat{W}}_{1})$ for all h, where $k_{h} > 0$ and

${\hat{W}}_{1} = \frac{1}{T} \sum_{t = 1}^{T} e_{t} e_{t}^{'},$

where $e_{t}$ is an n-dimensional vector of residuals from the model generating base forecasts. This approach scales base forecasts using the variance of residuals. In this case, $W_{h}$ can be described as the weighted least squares (WLS) estimator [20]. In the results, this approach is denoted as WLS(v).
3.: $W_{h} = k_{h} Λ$ for all h, where $k_{h} > 0$ , $Λ = diag (S 1)$ , and $1$ is a unit vector of dimension m. This implies that lower-level base forecast errors have variance $k_{h}$ and are uncorrelated between nodes. This estimator depends only on the aggregation structure and not on the data, hence termed structural scaling [21]. It is denoted by WLS(s) in the results.
4.: $W_{h} = k_{h} {\hat{W}}_{1}$ for all h, where $k_{h} > 0$ . In this approach, error covariance matrices are considered proportional to each other, and the covariance matrix of a one-step-ahead $W_{1}$ is estimated using sample covariance. Despite being simple to obtain, it may not be a good estimate when $m > t$ [9]. This approach is denoted by MinT(Cov).
5.: $W_{h} = k_{h} {\hat{W}}_{1, D}^{*}$ for all h, where $k_{h} > 0$ , ${\hat{W}}_{1, D}^{*} = λ_{D} {\hat{W}}_{1, D} + (1 - λ_{D}) {\hat{W}}_{1}$ , with ${\hat{W}}_{1, D}$ being a diagonal matrix composed of the diagonal of ${\hat{W}}_{1}$ , and $λ_{D}$ is the shrinkage intensity parameter. This estimator aims to reduce the sample covariance to a diagonal matrix. $λ_{D}$ is estimated through sample correlation [22], given by:

${\hat{λ}}_{D} = \frac{\sum_{i \neq j} \hat{V a r ({\hat{r}}_{i j})}}{\sum_{i \neq j} {\hat{r}}_{i j}^{2}},$

where ${\hat{r}}_{i j}$ is the $i j$ -th element of ${\hat{R}}_{1}$ , the sample correlation matrix one step ahead. This approach is denoted by MinT(s).

The minimum trace (MinT) optimal reconciliation approach aims to minimize the discrepancy between aggregated and coherent forecasts at different hierarchical levels. It achieves this by determining adjustment proportions that minimize the trace of the residual covariance matrix. This approach seeks to optimize the consistency of forecasts throughout the hierarchy, enhancing their accuracy and alignment. In practice, MinT is used as a reconciliation technique to adjust forecasts obtained from different methods, ensuring coherence and adherence to the hierarchical structure. This technique aims to improve the quality of forecasts, especially when distinct methods are applied at different levels of the time series hierarchy. The MinT approach uses all the available information in the hierarchy, but its disadvantage lies in the difficulty of estimating the covariance matrix

W_{h}

, as it must be positive definite to ensure that its inverse exists, which is not always the case.

2.4. Univariate Time Series Forecasting

Individual or base forecasts of the series are required to generate coherent forecasts. This study will consider three models: autoregressive integrated moving average (ARIMA), the ETS class of exponential smoothing models, and Prophet. A brief explanation of each model follows.

The ARIMA model is a combination of three components: autoregressive (AR), moving average (MA), and integrated (I). The autoregressive component involves regressing the variable of interest using its past values. The moving average component for a stationary time series describes the variable of interest as a linear combination of its past forecast errors. The integrated part refers to the difference between consecutive observations of a variable, aiming to make a time series stationary [23]. The seasonal ARIMA model can be written as follows:

ϕ_{p} (B) Φ_{P} (B^{s}) \nabla^{d} \nabla_{s}^{D} z_{t} = θ_{q} (B) Θ_{Q} (B^{s}) a_{t}

(5)

and is denoted by

A R I M A (p, d, q) {(P, D, Q)}_{s}

, where

B is the backward shift operator, defined by $B z_{t} = z_{t - 1}$ , e.g., $B^{s} z_{t} = z_{t - s}$ ;
$ϕ_{p} (B) = 1 - ϕ_{1} B - ϕ_{2} B^{2} - \dots - ϕ_{p} B^{p}$ is the autoregressive operator of order p;
$θ_{q} (B) = 1 - θ_{1} B - θ_{2} B^{2} - \dots - θ_{q} B^{q}$ is the mean average operator of order q;
$Φ_{P} (B^{s}) = 1 - Φ_{1} B^{s} - Φ_{2} B^{2 s} - \dots - Φ_{P} B^{P s}$ is the seasonal autoregressive operator of order P;
$Θ_{Q} (B^{s}) = 1 - Θ_{1} B^{s} - Θ_{2} B^{2 s} - \dots - Θ_{Q} B^{Q s}$ the seasonal mean average operator of order Q;
$\nabla^{d} = {(1 - B)}^{d}$ is the referencing operator, with d the number of differences to make the time series stationary;
$\nabla_{s}^{D} = {(1 - B^{s})}^{D}$ is the seasonal differences operator;
$a_{t}$ is the white noise.

The selection of the ARIMA model can be done automatically in the R software, version 4.4.0, using the ARIMA function from the Fable package, which uses a variation of the Hyndman-Khandakar algorithm [24]. For model selection, the unit root test, minimization of the corrected Akaike information criterion (AICc), bias-corrected version of the AIC for a small sample [25], and maximum likelihood estimation (MLE) are used.

The ARIMA model is a combination of three components: autoregressive (AR), moving average (MA), and integrated (I). The autoregressive component involves regressing the variable of interest using its past values. The moving average component for a stationary time series describes the variable of interest as a linear combination of its past forecast errors. The integrated part refers to the difference between consecutive observations of a variable, aiming to make a time series stationary [23]. The ARIMA model can be written as follows:

y_{t}^{'} = c + ϕ_{1} y_{t - 1}^{'} + \dots + ϕ_{p} y_{t - p}^{'} + θ_{1} ε_{t - 1} + \dots + θ_{q} ε_{t - q} + ε_{t},

(6)

where

y_{t}^{'}

is the differenced time series. This model is called the ARIMA(p,d,q) model, where p is the order of the auto-regressive part, d is the degree of differencing, and q is the order of the moving average part. For data with seasonality, seasonal terms are included in Equation (6), and the model would be of the form

A R I M A (p, d, q) {(P, D, Q)}_{m}

, where m is the seasonal period. Uppercase letters represent the model’s seasonal part, and lowercase letters represent the non-seasonal part. The seasonal term is similar to the non-seasonal term but involves lags of the seasonal period. In the model, the seasonal terms are multiplied by the non-seasonal terms.

The selection of the ARIMA model can be done automatically in the R software using the ARIMA function from the Fable package, which uses a variation of the Hyndman-Khandakar algorithm [24]. For model selection, the unit root test, minimization of the Akaike information criterion (AIC), and maximum likelihood estimation (MLE) are used.

In exponential smoothing models, the time series forecasts are a weighted average of past observations, where the weights decay exponentially as the observations move away from the end of the series. The simplest exponential smoothing method is called simple exponential smoothing for data without trend and seasonality. In this method, the forecasts are a weighted average of the data, and the weights decrease exponentially as follows:

{\hat{y}}_{T + 1 | T} = α y_{T} + α (1 - α) y_{T - 1} + α {(1 - α)}^{2} y_{T - 2} + \dots,

where

0 \leq α \leq 1

is the smoothing parameter. An extension of simple exponential smoothing that allows forecasting data with a trend was created by [13], called the Holt linear model, which adds a trend parameter. Another version of this model is the damped trend, which also includes a parameter that dampens the trend to a flat line in the future [26]. A version of the linear Holt model for data with seasonality was proposed by [12,13], and it is known as the Holt–Winters model. This model has two variations: the additive and multiplicative, depending on the nature of the seasonal component.

These models were put into state space form by [27]. Each exponential smoothing model consists of an equation describing the observed data and state equations describing how the level (

𝓁_{t}

), trend (

b_{t}

), and seasonality (

s_{t}

) components change over time. Each method described above has two models, one with additive errors and the other with multiplicative errors. Thus, each state space model is denoted as ETS(

\cdot, \cdot, \cdot

) for (error, trend, seasonality). The selection of the ETS model is done automatically in the R software using the ETS function from the Fable package.

The Prophet model is a recent model created by Facebook [15]. The model is ideal for data with strong seasonality and large series. It is based on an additive model composed of four components, which are combined as follows:

y (t) = g (t) + s (t) + h (t) + ϵ_{t},

where

g (t)

is a trend function,

s (t)

describes various seasonal patterns,

h (t)

captures the effects of holidays, and

ϵ_{t}

is the error term. The model can be selected automatically in the R software using the prophet function from the fable.prophet package.

2.5. Accuracy Measures

To assess the accuracy of the forecasts, the data were divided into two sets: training data to estimate the model parameters and test data to evaluate its accuracy. The years 2011 to 2021 were used for training, and 2022 was used for testing. This way, 12 forecasts will be obtained for each series to test the model. Two metrics using forecasting errors were employed to measure accuracy. The first is the root mean squared error (RMSE). This scale-dependent measure cannot be used on series with different units. RMSE is defined as follows:

R M S E = \sqrt{\frac{\sum_{i = 1}^{h} {(y_{t} - {\hat{y}}_{t})}^{2}}{h}},

where h is the number of forecasts,

y_{t}

is the actual series values, and

{\hat{y}}_{t}

is the corresponding forecast. The mean absolute scaled error (MASE) and the root mean squared scaled error (RMSSE) were also used, which are useful for comparing series with different units, as they do not depend on the scale of the data [28]. The MASE and RMSSE are given by:

M A S E = \frac{1}{n} \sum_{t = 1}^{n} \frac{| y_{t} - \hat{y_{t}} |}{\frac{1}{n - m} \sum_{t = m + 1}^{n} | y_{t} - y_{t - m} |}

(7)

R M S S E = \frac{1}{n} \sum_{t = 1}^{n} \frac{{(y_{t} - \hat{y_{t}})}^{2}}{\frac{1}{n - m} \sum_{t = m + 1}^{n} {(y_{t} - y_{t - m})}^{2}}

(8)

where n is the length of the time series and m the seasonal period.

M A S E < 1

and

R M S S E < 1

indicate that the proposed method, on average, presents smaller errors than those of a single step of the seasonal naive method.

It is also important to analyze whether the forecasts using different reconciliation approaches yield better results than the base forecasts. The relative root mean squared error (AvgRelRMSE) recommended by [29] can be defined through the geometric mean of the ratios of the mean absolute errors. The AvgRelRMSE is calculated as follows:

A v g R e l R M S E = {(\prod_{t = 1}^{m} r_{i})}^{1 / m}; r_{i} = \frac{R M S E_{i}^{R e c}}{R M S E_{i}^{B a s e}},

where

R M S E_{i}^{B a s e}

is the RMSE of the base forecasts for series i, and

R M S E_{i}^{R e c}

is the RMSE obtained for the same series i after reconciliation, and m is the number of time series. One advantage of this metric is that the calculation of

(1 - AveRelRMSE) \times 100 %

represents the percentage improvement of each reconciliation approach compared to the base forecasts. In an analogous manner to the AveRelRMSE, the AveRelMASE and AveRelRMSSE can also be calculated.

2.6. Data Transformation

To ensure that the base forecasts are positive, data transformation is necessary. This work studied two transformations:

l o g (y + 1)

and a square root transformation applied to

y + 0.5

, where y represents the time series data. In this case, it is necessary to reverse the transformation to obtain the forecasts in the original scale.

In the database used in this study, 140 municipalities had no fire spots, 96 municipalities had only one fire spot, and 13 municipalities had the same distribution of fire spots as another municipality. This makes it impossible to calculate the inverse of the

W_{h}

matrix of variance and covariance of the base forecast errors for the MinT(s), MinT(c), and WLS(s) approaches, as seen in Section 2.3.4. To solve this problem, random Gaussian noise with zero mean and a standard deviation of 0.001 was added to these municipalities’ fire spot time series. After the inclusion of the noise, it was still not possible to calculate the forecasts using the MinT(c) approach.

2.7. Flowchart of Methodology

To better understand the methodology used in this study, a flowchart is presented in Figure 3.

Figure 3. Flowchart of the methodology used in the paper.

3. Empirical Study

3.1. Fire Spots in Brazilian Biomes and Municipalities

A fire spot, also known as a heat spot, is a point with a temperature above 47 °C that may indicate fires or burns. Moreover, flames can originate from one or more locations. A spot is detected by monitoring satellites at approximately 700 to 900 km altitude.

In this study, a hierarchical time series with three levels was considered. Level 0 represents Brazil’s total number of fire spots from January 2011 to December 2022. Level 1 consists of fire spots disaggregated by Brazilian biome: Amazon, Caatinga, Cerrado, Atlantic Forest, Pampa, and Pantanal. The lowest level, or level 2, represents biomes disaggregated by municipality in Brazil, totaling 5570 municipalities. Thus, a total of 5577 time series are included in the hierarchy.

Table 1 presents the descriptive measures of the number of fire spots in Brazil, its biomes, and municipalities. The data show high variability in monthly fire spots across all hierarchy levels, particularly in the municipalities. The Amazon and Cerrado biomes exhibit the highest numbers of fire spots, with significant variability between months, while the Pampa biome has the lowest number of fire spots and the least variability in the data. When grouped by their predominant biome, municipalities generally show a prevalence of months with few or no fire spots; 75% of the total observations from municipalities where the Caatinga, Atlantic Forest, and Pampa are the predominant biomes reported zero fire spots.

Table 1. Descriptive measures of the number of fire spots in Brazil, its biomes, and municipalities.

Figure 4 displays the time series of fire spots in Brazil and its biomes, representing the series at levels 0 and 1 of the hierarchy. In 2017, Brazil experienced the month with the highest number of fire spots between 2011 and 2022, with over 60 thousand fire spots. The Amazon had the highest number of fire spots among the six biomes, while the Pampa had the lowest. The sudden increase in the number of spots in the Pantanal in 2020 is noteworthy, a year in which the biome lost approximately 30% of its vegetation [1]. The seasonal component in the series is visible in Figure 4, being less explicit for the Pantanal due to the peak in spots in 2020 and the Pampa, which does not exhibit a well-defined pattern.

Figure 4. Time series of the monthly number of forest fire spots in Brazil and each of the six Brazilian biomes.

Due to the large number of municipalities (5570), it is unfeasible to visualize the series at the last level of the hierarchy. To better understand the distribution of fire spots and consider the discrepancy in the number of spots between municipalities, analyses were conducted regarding the number of spots per area (km

^{2}

) from 2011 to 2022 in each municipality (Figure 5, Figure 6, Figure 7 and Figure 8). Most municipalities showed a low number of spots per area. There is no clear trend of increase or decrease in spots over the years (Figure 5 and Figure 6). In the Pantanal, the peak of fires in 2020 is again visible in this graph, and only two municipalities were responsible for most spots in the biome that year (Figure 6). Figure 6 shows that fire spots are not evenly distributed across the country; some municipalities are responsible for most spots in Brazil. From 2011 to 2022, the country’s southern region had few fire spots per km

^{2}

, and the state of Santa Catarina, belonging to this region, had no spots in some years. Municipalities in the states of Maranhão and Tocantins had a high number of spots per km

^{2}

throughout all years. These states have the Cerrado biome as their predominant biome.

Figure 5. Boxplots for the annual number of fire spots in Brazilian municipalities divided by their area and grouped by the municipality’s predominant biome.

Figure 6. Maps of the total fire spots in Brazilian municipalities divided by their area from 2011 to 2022.

Figure 7. Boxplots for the monthly number of fire spots in Brazilian municipalities divided by their area and grouped by the municipality’s predominant biome, considering the years from 2011 to 2022.

Figure 8. Maps of the total forest fire spots in Brazilian municipalities divided by their area per month, considering the years from 2011 to 2022.

The analysis of the distribution of fire spots per area in each municipality throughout the months of the year indicates the presence of seasonality in the data (Figure 7 and Figure 8). At the beginning of the year, the number of fire spots per area in municipalities is low or zero, and this number begins to increase in July (Figure 7 and Figure 8). The peak of fire spots occurs between September and October, depending on the municipality’s biome. Only in cities where the Pampa is the predominant biome is a pattern less clear to identify (Figure 7). In September, the highest number of fire spots per km

^{2}

occurs throughout the country, and by December, this number decreases considerably (Figure 8). Fires are more frequent in the winter and spring months, from the end of June to the end of December, due to low rainfall levels [30].

3.2. Hierarchical Time Series Forecasting

The routines for calculating fire spot forecasts for the twelve months of 2022 were implemented using R software, version 4.3.0. The results in this section pertain to reconciled forecasts using the square root transformation of fire spots plus 0.5 and the logarithmic transformation of fire spots plus 1. Both transformations used the ARIMA model for base forecasts. The ETS model was also used for base forecasts with the square root transformation of fire spots plus 1. Reconciled forecasts using ETS and the logarithmic transformation of fire spots encountered convergence errors, and the adjusted values and forecasts were unrealistic, so they were not presented.

Table 2 shows the root mean squared error (RMSE) for the level 0 series, related to the overall number of fire spots throughout Brazil, and the average RMSEs for the level 1 and 2 series. Values in bold indicate the lowest RMSEs at each hierarchical level. The results of the average accuracy for different reconciliation approaches using the square root plus 0.5 transformation in the data and the ARIMA model for base forecasts are presented in the first part of the table. With this setup, at level 0, the OLS approach exhibited the lowest RMSE. At level 1, the WLS(s) approach yielded better results, followed by the MinT(s) approach. At level 2, the MinT(s) and WLS(s) outperformed the other approaches. When considering the ARIMA model for base forecasts and a logarithmic transformation of the number of fire spots plus 1, at level 0, the MinT(s) approach had the lowest RMSE. At levels 1 and 2, BU, MinT(s), and WLS(v) outperformed the others (Table 2). The logarithmic transformation results outperformed those based on the square root transformation when the ARIMA model obtained the base forecasts, especially for BU, MinT(s), and WLS(v). When considering the square root transformation and the ETS model to obtain the base forecasts, BU showed better results at level 0, and MinT(s) at the other two levels. The average RMSE values were generally higher than those obtained using the ARIMA model to obtain the base forecasts (Table 2). Reconciled forecasts using ETS and the logarithmic transformation encountered convergence errors, and the adjusted values and forecasts were unrealistic, so they were not presented. When considering the Prophet to obtain the base forecasts, the overall best performance was obtained by BU, followed by MinT(s) and WLS(v) for both transformations (Table 2).

Table 2. Average RMSE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

Table 3 is similar to Table 2, but uses MASE as the accuracy measure. The ARIMA model with logarithmic transformation presented the lowest average MASE values, particularly with the MinT(s) approach for level 0 and WLS(v) for levels 1 and 2. Comparing the average MASEs by hierarchical level, level 0, representing Brazil, showed the best results in terms of accuracy, followed by level 1. The MASE values for municipalities were, on average, greater than 1 for all models and approaches.

Table 3. Average MASE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

Table 4 presents the results for the RMSSE, which are similar to those of RMSE and MASE (Table 2 and Table 3) in terms of the best model and approach. Unlike for MASE (Table 3), at level 2, the average RMSSE was less than one for some approaches. The base forecasts without considering any hierarchical structure, especially at levels 0 and 1, resulted in worse outcomes when compared to some reconciliation approaches (Table 2, Table 3 and Table 4).

Table 4. Average RMSSE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

In addition to calculating the RMSE, MASE, and RMSSE, it is important to know which approaches produce better forecasts than the baseline. If they are not better, hierarchical time series forecasting loses relevance. Table A1, Table A2 and Table A3 of Appendix A show the AveRelRMSE, AveRelMASE, and AveRelRMSSE, respectively, for each level of the hierarchy considering the ARIMA, ETS, and Prophet base forecast models and the two transformations. A value of the AvgRelRMSE, AveRelMASE, and AveRelRMSSE equal to one indicates that the reconciled forecast is equal to the base forecast at that level. An example is the bottom-up approach, which uses base forecasts from municipalities, level 2, and sums them to obtain forecasts for the above levels. A value of AveRelRMSE, AveRelMASE, and AveRelRMSSE lower than one indicates that the reconciled forecasts showed improvements compared to the base forecasts, with bold values being the lowest in Table A1, Table A2 and Table A3.

Here, we summarize the results for the AveRelRMSE (Table A1 of the Appendix A). The interpretation for the AveRelMASE (Table A2 of the Appendix A) and AveRelRMSSE (Table A3 of the Appendix A) are done similarly.

Considering the ARIMA model and the square root transformation of fire spots plus 0.5, at level 0, only the OLS and MO approaches had a value of less than one, showing an improvement of 11.1% and 5.0%, respectively, in RMSE compared to the base forecasts. At level 1, WLS(s) was the one that showed a higher improvement, with a value of 5.8%. At level 2, TDFP showed an improvement of 5.3%. Considering the ARIMA model and the logarithmic transformation of fire spots plus 1, at level 0, only the forecasts from the BU and WLS(v) approaches did not show improvements in the RMSE compared to the base forecasts. At this level, MinT(s) obtained the best result, with an improvement of 22.2% in the RMSE value. At level 1, out of the nine approaches, four of them showed improvements in the RMSE up to 20.0%. In the last level, no method showed improvement compared to the base forecasts (Table A1). When considering the Prophet model for the base forecasts, the overall best reconciliation method was the BU, but without an RMSE improvement in level two. Overall, the MinT(s) and TDFP approaches outperformed the remaining when considering the logarithm transformation and the ARIMA model to obtain the base forecasts.

The AvgRelRMSE of the approaches using ETS for base forecasts and the square root transformation of fire spots plus 0.5 is also presented in Table A1. At level 0, excluding the top-down approaches that use base forecasts at this level, all approaches showed improvements in RMSE. BU showed an improvement of 76.3% at this level. At level 1, only WLS(s) and TDPA showed improvements in RMSE, and six approaches showed improvements in RMSE at level 2, with TDPA having the highest improvement. It is also noted that the MO and TDPA approaches showed improvements at all levels.

When considering the Prophet model for the base forecasts, the most effective reconciliation method overall was the BU, except for level 0 in the case of the square root transformation, where the TDPA showed superior performance.

Despite the percentage improvements in RMSE, it is important to consider that the reconciled forecasts using each base model and transformation have different magnitudes. For example, the ETS model with the square root transformation had the overall highest RMSE values (Table 2) compared to the prophet with the logarithm transformation. Thus, reconciled forecasts using the ETS model show significant improvements over base forecasts, but they are not better than those using the Prophet model.

Figure 9 and Figure 10 show the boxplots for the RMSE for the nine forecasting reconciliation approaches and the three models and two transformations under consideration for level one (biomes) and level two (municipalities), respectively. For level one (Figure 9), the WLS(s) seems to be the better-performing reconciliation approach, and the ETS model with the square root transformation is the best to obtain the base forecasts. For level two (Figure 10), when not considering the “outliers”, a good overall performance was obtained for the TDFP reconciliation approach, and the ETS with the square root transformation seems to be the overall winner in performance.

Figure 9. Boxplots for the root mean squared error for 12 ahead forecast of the number of fire spots for all six Brazilian biomes considering the nine forecasting reconciliation approaches, using (i) the square root transformation and the ARIMA model for base forecasts; and (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts; red line: mean of the medians.

Figure 10. Boxplots for the logarithm of the root mean squared error plus one for 12 ahead forecast of the number of fire spots for all 5570 Brazilian municipalities considering the nine forecasting reconciliation approaches, using (i) the square root transformation and the ARIMA model for base forecasts; and (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts red line: mean of the medians.

Figure A1 and Figure A2 show the boxplots for the MASE for the nine forecasting reconciliation approaches and the three models and two transformations under consideration for level one (biomes) and level two (municipalities), respectively. Figure A3 and Figure A4 show the boxplots for the RMSSE for the nine forecasting reconciliation approaches and the three models and two transformations under consideration for level one (biomes) and level two (municipalities), respectively. The overall conclusions from Table A1, Table A2 and Table A3 are in the same line as those for the RMSE (Figure 9 and Figure 10).

4. Concluding Remarks

In this study, we comprehensively analyzed various reconciliation approaches to forecasting the number of fire spots, considering different base forecast models and data transformations. The results offer valuable insights into the effectiveness of these approaches at different hierarchical levels, ranging from biomes to municipalities.

Our findings indicate that the choice of the reconciliation method is crucial and depends on factors such as the base forecasting model and the transformation applied to the data. At level 0, the bottom-up (BU) approach demonstrated stronger performance, particularly when using the Prophet model and the logarithm transformation. However, alternative methods, such as WLS, BU, and MinT, exhibited competitive results at other levels.

Moreover, the influence of data transformations, including square root and logarithmic transformations, was evident in the performance of reconciliation approaches. Notably, when combined with the ARIMA or the Prophet model, the logarithmic transformation showcased superior results for certain approaches at various levels.

The AveRelRMSE, AveRelMASE, and AveRelRMSSE metrics provided a nuanced understanding of the improvements achieved by each reconciliation approach compared to baseline forecasts. This information is essential for practical decision-making, emphasizing the importance of not only minimizing forecast errors but also ensuring improvements over the baseline.

It is worth noting that the reconciliation methods exhibited diverse performance across different levels of the hierarchical structure, suggesting the need for adaptive strategies based on the specific forecasting context.

In summary, this study contributes valuable insights into applying forecasting reconciliation in fire spot forecasts. The results presented here guide practitioners and decision-makers in choosing reconciliation methods tailored to their specific forecasting scenarios. As the field continues to evolve, future research may explore additional factors influencing reconciliation effectiveness and expand the applicability of these findings to diverse domains.

Author Contributions

Conceptualization, P.C.R.; methodology, A.C.P. and P.C.R.; software, A.C.P.; validation, A.C.P. and P.C.R.; formal analysis, A.C.P. and P.C.R.; investigation, A.C.P. and P.C.R.; writing—original draft preparation, A.C.P. and P.C.R.; writing—review and editing, A.C.P. and P.C.R.; visualization, A.C.P.; supervision, P.C.R. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

The data are available in https://github.com/SaLLy-laboratory/HTS-Brazilian-firesopts (accessed on 26 June 2024).

Conflicts of Interest

The authors declare no conflicts of interest.

Appendix A

Table A1. AveRelRMSE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

	BU	MinT(s)	MO	OLS	TDAP	TDFP	TDPA	WLS(s)	WLS(v)
ARIMA–Square root transformation
Level 0	1.830	1.300	0.950	0.890	1.000	1.000	1.000	1.016	1.749
Level 1	1.003	0.975	1.000	1.226	1.771	0.981	1.364	0.942	0.990
Level 2	1.000	1.001	0.975	1.913	1.586	0.947	1.208	1.642	0.999
ARIMA–Logarithm transformation
Level 0	1.167	0.778	0.997	0.989	1.000	1.000	1.000	1.002	1.173
Level 1	0.804	0.859	1.000	1.066	1.645	1.019	1.266	0.892	0.800
Level 2	1.000	1.071	1.066	2.264	1.825	1.055	1.390	1.805	1.001
ETS–Square root transformation
Level 0	0.237	0.301	0.881	0.981	1.000	1.000	1.000	0.666	0.272
Level 1	1.178	1.050	1.000	1.087	1.238	1.030	0.983	0.835	1.095
Level 2	1.000	0.994	0.914	1.676	0.952	0.876	0.757	1.711	0.999
Prophet–Square root transformation
Level 0	1.406	1.292	1.139	1.014	1.019	1.011	0.986	1.165	1.376
Level 1	0.925	0.940	1.000	1.127	1.542	1.024	1.193	0.991	0.928
Level 2	1.000	1.007	1.031	1.593	1.378	1.039	1.053	1.450	1.000
Prophet–Logarithm transformation
Level 0	0.576	0.867	0.983	0.958	1.005	1.002	0.987	0.804	0.596
Level 1	0.671	0.709	1.000	0.991	1.160	0.930	0.898	0.699	0.673
Level 2	1.000	1.035	1.132	1.857	1.476	1.101	1.126	1.397	1.001

Table A2. AveRelMASE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

	BU	MinT(s)	MO	OLS	TDAP	TDFP	TDPA	WLS(s)	WLS(v)
ARIMA–Square root transformation
Level 0	1.858	1.305	1.078	0.922	1.000	1.000	1.000	1.013	1.769
Level 1	1.181	1.111	1.000	1.154	1.833	1.010	1.456	0.975	1.162
Level 2	1.000	0.965	0.886	1.746	1.351	0.842	1.036	1.497	0.999
ARIMA–Logarithm transformation
Level 0	1.181	0.832	0.936	0.968	1.000	1.000	1.000	0.956	1.177
Level 1	0.870	0.906	1.000	1.057	1.706	1.007	1.357	0.896	0.866
Level 2	1.000	1.041	1.003	2.040	1.616	0.987	1.239	1.653	1.001
ETS–Square root transformation
Level 0	0.271	0.304	0.855	0.977	1.000	1.000	1.000	0.658	0.294
Level 1	1.175	1.049	1.000	1.017	1.237	0.967	1.028	0.864	1.093
Level 2	1.000	0.992	0.891	1.768	0.961	0.844	0.757	1.798	0.999
Prophet–Square root transformation
Level 0	1.348	1.255	1.066	1.002	1.019	1.003	0.990	1.120	1.326
Level 1	0.968	0.972	1.000	1.108	1.513	1.011	1.200	0.994	0.970
Level 2	1.000	1.004	1.000	1.601	1.366	1.005	1.049	1.454	1.000
Prophet–Logarithm transformation
Level 0	0.646	1.043	1.056	0.971	0.998	0.988	0.982	0.865	0.698
Level 1	0.688	0.734	1.002	1.002	1.210	0.943	0.969	0.724	0.680
Level 2	1.000	1.048	1.116	1.842	1.480	1.084	1.136	1.415	1.002

Table A3. AveRelRMSSE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

	BU	MinT(s)	MO	OLS	TDAP	TDFP	TDPA	WLS(s)	WLS(v)
ARIMA–Square root transformation
Level 0	1.830	1.300	0.950	0.890	1.000	1.000	1.000	1.016	1.749
Level 1	1.003	0.975	1.000	1.226	1.771	0.981	1.364	0.942	0.990
Level 2	1.000	1.001	0.975	1.913	1.586	0.947	1.208	1.642	0.999
ARIMA–Logarithm transformation
Level 0	1.167	0.778	0.997	0.989	1.000	1.000	1.000	1.002	1.173
Level 1	0.804	0.859	1.000	1.066	1.645	1.019	1.266	0.892	0.800
Level 2	1.000	1.071	1.066	2.264	1.825	1.055	1.390	1.805	1.001
ETS–Square root transformation
Level 0	0.237	0.301	0.881	0.981	1.000	1.000	1.000	0.666	0.272
Level 1	1.178	1.050	1.000	1.087	1.238	1.030	0.983	0.835	1.095
Level 2	1.000	0.994	0.914	1.676	0.952	0.876	0.757	1.711	0.999
Prophet–Square root transformation
Level 0	1.406	1.292	1.139	1.014	1.019	1.011	0.986	1.165	1.376
Level 1	0.925	0.940	1.000	1.127	1.542	1.024	1.193	0.991	0.928
Level 2	1.000	1.007	1.031	1.593	1.378	1.039	1.053	1.450	1.000
Prophet–Logarithm transformation
Level 0	0.576	0.867	0.983	0.958	1.005	1.002	0.987	0.804	0.596
Level 1	0.671	0.709	1.000	0.991	1.160	0.930	0.898	0.699	0.673
Level 2	1.000	1.035	1.132	1.857	1.476	1.101	1.126	1.397	1.001

Appendix B

Figure A1. Boxplots for the mean absolute scaled error for 12 ahead forecast of the number of fire spots for all six Brazilian biomes considering the nine forecasting reconciliation approaches, using (i) the square root transformation and the ARIMA model for base forecasts; and (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts; red line: mean of the medians.

Figure A2. Boxplots for the mean absolute scaled error for 12 ahead forecast of the number of fire spots for all 5570 Brazilian municipalities considering the nine forecasting reconciliation approaches, using (i) the square root transformation and the ARIMA model for base forecasts; and (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts; red line: mean of the medians.

Figure A3. Boxplots for the root mean squared scaled error for 12 ahead forecast of the number of fire spots for all six Brazilian biomes considering the nine forecasting reconciliation approaches, using (i) the square root transformation and the ARIMA model for base forecasts; and (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts; red line: mean of the medians.

Figure A4. Boxplots for the root mean squared scaled error for 12 ahead forecast of the number of fire spots for all 5570 Brazilian municipalities considering the nine forecasting reconciliation approaches, using (i) the square root transformation and the ARIMA model for base forecasts; and (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts; red line: mean of the medians.

References

Pivello, V.R.; Vieira, I.; Christianini, A.V.; Ribeiro, D.B.; da Silva Menezes, L.; Berlinck, C.N.; Melo, F.P.; Marengo, J.A.; Tornquist, C.G.; Tomas, W.M.; et al. Understanding Brazil’s catastrophic fires: Causes, consequences and policy needed to prevent future tragedies. Perspect. Ecol. Conserv. 2021, 19, 233–255. [Google Scholar] [CrossRef]
Butt, E.W.; Conibear, L.; Knote, C.; Spracklen, D.V. Large air quality and public health impacts due to Amazonian deforestation fires in 2019. GeoHealth 2021, 5, e2021GH000429. [Google Scholar] [CrossRef] [PubMed]
Morello, T.F.; Ramos, R.M.; Anderson, L.O.; Owen, N.; Rosan, T.M.; Steil, L. Predicting fires for policy making: Improving accuracy of fire brigade allocation in the Brazilian Amazon. Ecol. Econ. 2020, 169, 106501. [Google Scholar] [CrossRef]
Viganó, H.H.D.G.; Souza, C.C.D.; Reis Neto, J.F.; Cristaldo, M.F.; Jesus, L.D. Prediction and modeling of forest fires in the Pantanal. Rev. Bras. Meteorol. 2018, 33, 306–316. [Google Scholar] [CrossRef]
Gama Viganó, H.H.; de Souza, C.C.; Cristaldo, M.F.; de Jesus, L. Redes neurais artificiais na previsão de queimadas e incêndios no Pantanal. Rev. Bras. Geogr. Física 2017, 10, 1355–1367. [Google Scholar] [CrossRef][Green Version]
Hyndman, R.J.; Ahmed, R.A.; Athanasopoulos, G.; Shang, H.L. Optimal combination forecasts for hierarchical time series. Comput. Stat. Data Anal. 2011, 55, 2579–2589. [Google Scholar] [CrossRef]
Lila, M.F.; Meira, E.; Oliveira, F.L.C. Forecasting unemployment in Brazil: A robust reconciliation approach using hierarchical data. Socio-Econ. Plan. Sci. 2022, 82, 101298. [Google Scholar] [CrossRef]
Gaweł, B.; Paliński, A. Global and Local Approaches for Forecasting of Long-Term Natural Gas Consumption in Poland Based on Hierarchical Short Time Series. Energies 2024, 17, 347. [Google Scholar] [CrossRef]
Wickramasuriya, S.L.; Athanasopoulos, G.; Hyndman, R.J. Optimal forecast reconciliation for hierarchical and grouped time series through trace minimization. J. Am. Stat. Assoc. 2019, 114, 804–819. [Google Scholar] [CrossRef]
Athanasopoulos, G.; Hyndman, R.J.; Kourentzes, N.; Panagiotelis, A. Forecast reconciliation: A review. Int. J. Forecast. 2023, 40, 430–456. [Google Scholar] [CrossRef]
Hollyman, R.; Petropoulos, F.; Tipping, M.E. Understanding forecast reconciliation. Eur. J. Oper. Res. 2021, 294, 149–160. [Google Scholar] [CrossRef]
Winters, P.R. Forecasting sales by exponentially weighted moving averages. Manag. Sci. 1960, 6, 324–342. [Google Scholar] [CrossRef]
Holt, C. Forecasting Seasonals and Trends by Exponentially Weighted Averages (ONR Memorandum No. 52); Carnegie Institute of Technology: Pittsburgh, PA, USA, 1957; Volume 10. [Google Scholar]
Sulandari, W.; Suhartono; Subanar; Rodrigues, P. C. Exponential smoothing on modeling and forecasting multiple seasonal time series: An overview. Fluct. Noise Lett. 2021, 20, 2130003. [Google Scholar] [CrossRef]
Taylor, S.J.; Letham, B. Forecasting at scale. Am. Stat. 2018, 72, 37–45. [Google Scholar] [CrossRef]
Pimentel, J.; Bulhões, R.; Rodrigues, P.C. Spatio-temporal modeling of the Brazilian wildfires: The influence of human and meteorological variables. In Proceedings of the 64th ISI World Statistics Congress, Ottawa, ON, Canada, 16–20 July 2023. [Google Scholar]
da Silva, G.; Fasiaben, M.; Nogueira, S.; Grego, C.; Moraes, A.; Almeida, M.; de Oliveira, O.; Eusebio, G.; Lopes, W. Método Para Determinar o Bioma Predominante nos Municípios Brasileiros; Embrapa Agricultura Digital: Campinas, Brazil, 2022. [Google Scholar]
Gross, C.W.; Sohl, J.E. Disaggregation methods to expedite product line forecasting. J. Forecast. 1990, 9, 233–254. [Google Scholar] [CrossRef]
Athanasopoulos, G.; Ahmed, R.A.; Hyndman, R.J. Hierarchical forecasts for Australian domestic tourism. Int. J. Forecast. 2009, 25, 146–166. [Google Scholar] [CrossRef]
Hyndman, R.J.; Lee, A.J.; Wang, E. Fast computation of reconciled forecasts for hierarchical and grouped time series. Comput. Stat. Data Anal. 2016, 97, 16–32. [Google Scholar] [CrossRef]
Athanasopoulos, G.; Hyndman, R.J.; Kourentzes, N.; Petropoulos, F. Forecasting with temporal hierarchies. Eur. J. Oper. Res. 2017, 262, 60–74. [Google Scholar] [CrossRef]
Schäfer, J.; Strimmer, K. A shrinkage approach to large-scale covariance matrix estimation and implications for functional genomics. Stat. Appl. Genet. Mol. Biol. 2005, 4, 1–32. [Google Scholar] [CrossRef] [PubMed]
Box, G.E.; Jenkins, G.M.; Reinsel, G.C.; Ljung, G.M. Time Series Analysis: Forecasting and Control; John Wiley & Sons: Hoboken, NJ, USA, 2015. [Google Scholar]
Hyndman, R.J.; Khandakar, Y. Automatic time series forecasting: The forecast package for R. J. Stat. Softw. 2008, 27, 1–22. [Google Scholar] [CrossRef]
Sugiura, N. Further analysis of the data by Akaike’s information criterion and the finite corrections. Commun. Stat.-Theory Methods 1978, 7, 13–26. [Google Scholar] [CrossRef]
Gardner, E.S., Jr.; McKenzie, E. Forecasting trends in time series. Manag. Sci. 1985, 31, 1237–1246. [Google Scholar] [CrossRef]
Hyndman, R.J.; Koehler, A.B.; Snyder, R.D.; Grose, S. A state space framework for automatic forecasting using exponential smoothing methods. Int. J. Forecast. 2002, 18, 439–454. [Google Scholar] [CrossRef]
Hyndman, R.J.; Koehler, A.B. Another look at measures of forecast accuracy. Int. J. Forecast. 2006, 22, 679–688. [Google Scholar] [CrossRef]
Davydenko, A.; Fildes, R. Measuring forecasting accuracy: The case of judgmental adjustments to SKU-level demand forecasts. Int. J. Forecast. 2013, 29, 510–522. [Google Scholar] [CrossRef]
Pezzopane, J.E.M.; Neto, S.N.D.O.; Vilela, M.D.F. Risco de incêndios em função da característica do clima, relevo e cobertura do solo. Floresta Ambiente 2012, 8, 161–166. [Google Scholar]

Figure 1. Geographic localization of each of the six Brazilian biomes.

Figure 2. A three-level hierarchical tree diagram.

Figure 3. Flowchart of the methodology used in the paper.

Figure 4. Time series of the monthly number of forest fire spots in Brazil and each of the six Brazilian biomes.

Figure 5. Boxplots for the annual number of fire spots in Brazilian municipalities divided by their area and grouped by the municipality’s predominant biome.

Figure 6. Maps of the total fire spots in Brazilian municipalities divided by their area from 2011 to 2022.

Figure 7. Boxplots for the monthly number of fire spots in Brazilian municipalities divided by their area and grouped by the municipality’s predominant biome, considering the years from 2011 to 2022.

Figure 8. Maps of the total forest fire spots in Brazilian municipalities divided by their area per month, considering the years from 2011 to 2022.

Figure 9. Boxplots for the root mean squared error for 12 ahead forecast of the number of fire spots for all six Brazilian biomes considering the nine forecasting reconciliation approaches, using (i) the square root transformation and the ARIMA model for base forecasts; and (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts; red line: mean of the medians.

Figure 10. Boxplots for the logarithm of the root mean squared error plus one for 12 ahead forecast of the number of fire spots for all 5570 Brazilian municipalities considering the nine forecasting reconciliation approaches, using (i) the square root transformation and the ARIMA model for base forecasts; and (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts red line: mean of the medians.

Table 1. Descriptive measures of the number of fire spots in Brazil, its biomes, and municipalities.

	Min	1st Quartile	Median	Mean	3rd Quartile	Max	SD	CV
Brazil	1126	2702.50	7080.5	15,351.06	20,630.50	72,895	17,381.45	113.23
Biome
Amazônia	273	922.75	2517.0	7277.49	11,143.50	41,481	8962.50	123.15
Caatinga	25	123.75	306.5	1191.62	1711.75	7271	1579.92	132.59
Cerrado	233	748.50	2301.0	5030.30	7298.00	28,546	6099.56	121.26
Mata Atlântica	193	377.75	573.5	1190.56	1541.50	6438	1270.43	106.71
Pampa	12	36.00	58.0	88.20	91.75	485	86.64	98.23
Pantanal	4	60.50	159.0	572.89	559.50	8497	1113.68	194.40
Municipality
Amazônia	0	0	1	14.47	7	3577	70.02	483.98
Caatinga	0	0	0	1.09	0	1298	7.65	703.67
Cerrado	0	0	0	4.73	2	1540	21.17	447.36
Mata Atlântica	0	0	0	0.43	0	329	2.54	583.90
Pampa	0	0	0	0.55	0	43	1.92	347.88
Pantanal	0	1	6	63.65	31	2523	215.56	338.64

Table 2. Average RMSE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

	Base	BU	MinT(s)	MO	OLS	TDAP	TDFP	TDPA	WLS(s)	WLS(v)
ARIMA–Square root transformation
Level 0	1945.01	3559.09	2527.78	1848.32	1731.79	1945.01	1945.01	1945.01	1976.47	3401.77
Level 1	1506.60	1541.12	1493.22	1506.60	1516.88	2238.07	1542.04	1934.35	1475.37	1519.82
Level 2	3.63	3.63	3.61	3.66	3.81	5.59	3.66	4.56	3.69	3.61
ARIMA–Logarithm transformation
Level 0	1681.92	1963.26	1307.86	1676.90	1662.92	1681.92	1681.92	1681.92	1684.68	1973.06
Level 1	1683.26	1329.78	1355.90	1683.26	1686.89	2235.39	1698.42	1927.02	1482.96	1325.33
Level 2	3.51	3.51	3.53	3.73	3.73	5.61	3.73	4.57	3.59	3.51
ETS–Square root transformation
Level 0	11,492.09	2727.34	3459.96	10,122.96	11,270.64	11,492.09	11,492.09	11,492.09	7652.48	3127.44
Level 1	2481.41	1929.94	1831.77	2481.41	2511.18	2630.49	2586.16	2479.50	2056.69	1860.50
Level 2	4.15	4.15	4.05	4.46	4.88	5.23	4.50	4.64	4.44	4.07
Prophet–Square root transformation
Level 0	3116.49	4381.06	4027.54	3549.05	3161.25	3175.72	3150.09	3074.12	3631.56	4287.71
Level 1	1816.60	1685.20	1693.75	1819.54	1853.53	2255.26	1821.14	1967.02	1768.38	1689.48
Level 2	3.69	3.69	3.70	3.80	3.84	5.54	3.80	4.55	3.76	3.70
Prophet–Logarithm transformation
Level 0	2590.24	1491.70	2245.59	2545.24	2481.83	2603.39	2596.49	2557.64	2083.34	1544.51
Level 1	2124.87	1376.00	1481.08	2133.44	2051.94	2258.14	2087.70	1961.76	1643.81	1412.05
Level 2	3.61	3.61	3.70	4.14	4.11	5.58	4.09	4.56	3.82	3.66

Table 3. Average MASE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

	Base	BU	MinT(s)	MO	OLS	TDAP	TDFP	TDPA	WLS(s)	WLS(v)
ARIMA–Square root transformation
Level 0	0.287	0.533	0.375	0.310	0.265	0.287	0.287	0.287	0.291	0.508
Level 1	0.883	1.014	0.956	0.883	1.150	1.677	0.868	1.226	0.854	0.997
Level 2	1.141	1.141	1.110	1.052	15.889	1.617	1.017	1.159	10.613	1.140
ARIMA–Logarithm transformation
Level 0	0.245	0.289	0.204	0.229	0.237	0.245	0.245	0.245	0.234	0.289
Level 1	0.940	0.803	0.838	0.940	0.994	1.670	0.938	1.223	0.830	0.799
Level 2	1.043	1.043	1.057	1.053	11.846	1.614	1.042	1.157	7.250	1.043
ETS–Square root transformation
Level 0	1.395	0.377	0.425	1.192	1.363	1.395	1.395	1.395	0.917	0.410
Level 1	1.087	1.194	1.047	1.087	1.207	1.284	1.051	1.065	0.974	1.075
Level 2	1.220	1.220	1.212	1.117	12.149	1.235	1.075	0.979	14.916	1.219
Prophet–Square root transformation
Level 0	0.439	0.592	0.551	0.468	0.440	0.448	0.440	0.435	0.492	0.582
Level 1	1.064	1.010	1.020	1.065	1.208	1.660	1.073	1.212	1.055	1.016
Level 2	1.155	1.155	1.158	1.152	8.767	1.628	1.156	1.167	6.481	1.155
Prophet–Logarithm transformation
Level 0	0.338	0.218	0.353	0.357	0.328	0.337	0.334	0.332	0.292	0.236
Level 1	1.339	0.919	1.011	1.341	1.380	1.662	1.265	1.219	1.000	0.934
Level 2	1.070	1.070	1.104	1.168	14.069	1.624	1.141	1.163	6.783	1.071

Table 4. Average RMSSE per hierarchical level using (i) the square root transformation and the ARIMA model for base forecasts; (ii) the logarithmic transformation and the ARIMA model for base forecasts; (iii) the square root transformation and the ETS model for base forecasts; (iv) the square root transformation and the Prophet model for base forecasts; and (v) the logarithmic transformation and the Prophet model for base forecasts.

	Base	BU	MinT(s)	MO	OLS	TDAP	TDFP	TDPA	WLS(s)	WLS(v)
ARIMA–Square root transformation
Level 0	0.213	0.390	0.277	0.203	0.190	0.213	0.213	0.213	0.217	0.373
Level 1	0.800	0.807	0.781	0.800	1.178	1.549	0.774	1.054	0.751	0.794
Level 2	0.945	0.945	0.942	0.950	13.663	1.318	0.942	1.025	8.692	0.944
ARIMA–Logarithm transformation
Level 0	0.184	0.215	0.143	0.184	0.182	0.184	0.184	0.184	0.185	0.216
Level 1	0.892	0.710	0.739	0.892	0.941	1.561	0.897	1.061	0.774	0.701
Level 2	0.926	0.926	0.933	0.960	12.049	1.323	0.957	1.028	6.776	0.926
ETS–Square root transformation
Level 0	1.260	0.299	0.379	1.110	1.235	1.260	1.260	1.260	0.839	0.343
Level 1	1.009	1.044	0.926	1.009	1.194	1.184	1.002	0.954	0.860	0.951
Level 2	1.019	1.019	1.016	0.992	12.094	1.105	0.980	0.962	15.203	1.019
Prophet–Square root transformation
Level 0	0.342	0.480	0.441	0.389	0.347	0.348	0.345	0.337	0.398	0.470
Level 1	0.901	0.834	0.846	0.902	1.045	1.505	0.917	1.030	0.897	0.838
Level 2	0.957	0.957	0.959	0.970	8.697	1.299	0.973	1.017	6.115	0.957
Prophet–Logarithm transformation
Level 0	0.284	0.164	0.246	0.279	0.272	0.285	0.285	0.280	0.228	0.169
Level 1	1.212	0.818	0.887	1.213	1.236	1.539	1.147	1.052	0.891	0.837
Level 2	0.940	0.940	0.950	0.996	15.869	1.313	0.983	1.023	7.280	0.941

Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.

© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).

Hierarchical Time Series Forecasting of Fire Spots in Brazil: A Comprehensive Approach

Abstract

1. Introduction

2. Materials and Methods

2.1. The Data

2.2. Hierarchical Time Series

2.3. Forecasting Approaches for Hierarchical Time Series

2.3.1. Bottom-Up (BU) Approach

2.3.2. Top-Down (TD) Approach

2.3.3. Middle-Out (MO) Approach

2.3.4. Minimum Trace (MinT) Optimal Reconciliation Approach

2.4. Univariate Time Series Forecasting

2.5. Accuracy Measures

2.6. Data Transformation

2.7. Flowchart of Methodology

3. Empirical Study

3.1. Fire Spots in Brazilian Biomes and Municipalities

3.2. Hierarchical Time Series Forecasting

4. Concluding Remarks

Author Contributions

Funding

Institutional Review Board Statement

Informed Consent Statement

Data Availability Statement

Conflicts of Interest

Appendix A

Appendix B

References

Article Metrics

Citations

Article Access Statistics