Article

Non-Negative Forecast Reconciliation: Optimal Methods and Operational Solutions

by
Daniele Girolimetto
Department of Statistical Sciences, University of Padova, 35121 Padova, Italy
Forecasting 2025, 7(4), 64; https://doi.org/10.3390/forecast7040064
Submission received: 10 October 2025 / Revised: 22 October 2025 / Accepted: 24 October 2025 / Published: 26 October 2025
(This article belongs to the Special Issue Feature Papers of Forecasting 2025)

Abstract

In many different applications such as retail, energy, and tourism, forecasts for a set of related time series must satisfy both linear and non-negativity constraints, as negative values are meaningless in practice. Standard regression-based reconciliation approaches achieve coherence with linear constraints, but may generate negative forecasts, reducing interpretability and usability. This paper develops and evaluates three alternative strategies for non-negative forecast reconciliation. First, reconciliation is formulated as a non-negative least squares problem and solved with the operator splitting quadratic program, allowing flexible inclusion of additional constraints. Second, we propose an iterative non-negative reconciliation with immutable forecasts, offering a practical optimization-based alternative. Third, we investigate a family of set-negative-to-zero heuristics that achieve efficiency and interpretability at minimal computational cost. Using the Australian Tourism Demand dataset, we compare these approaches in terms of forecast accuracy and computation time. The results show that non-negativity constraints consistently improve accuracy compared to base forecasts. Overall, the set-negative-to-zero heuristics achieve near-optimal performance with negligible computation time, the block principal pivoting algorithm provides a good accuracy–efficiency compromise, and the operator splitting quadratic program offers flexibility for incorporating additional constraints in large-scale applications.

1. Introduction

Forecasting plays a central role in supporting decision-making across business, policy, healthcare and scientific domains [1]. In many applications, forecasts are required not only for individual time series but also for collections of related series that are connected through linear relationships. For example, forecasts of product demand must sum consistently across categories, forecasts of regional electricity consumption must aggregate to the national total, and forecasts of monthly tourism flows should align with quarterly or annual aggregates. In these contexts, base forecasts are typically produced, either independently for each series or through multivariate models, with a focus only on improving forecast accuracy and without considering the underlying linear constraints among the series [2]. As a consequence, the base forecasts are generally incoherent, meaning that they do not satisfy the required linear constraints that link the different levels or components [3]. For this reason, a post-forecasting process—called forecast reconciliation [4]—aimed at adjusting a set of incoherent “base” forecasts of a multivariate time series so that they satisfy the constraints should be applied.
In hierarchical and grouped time series, two well-known and commonly used approaches [5] are bottom-up and top-down. In the bottom-up approach, forecasts are first generated for the most disaggregated series and then aggregated to produce predictions for higher-level series [6,7]. The top-down approach begins with forecasts at the fully aggregated level, which are subsequently allocated to the lower-level series using predetermined proportions [8,9,10]. It is worth noting that traditional reconciliation strategies fail to exploit all available information. Hyndman et al. [3] introduced an optimal reconciliation approach, using a regression model to optimally combine the base forecasts for all series so that the resulting forecasts are coherent.
Since its introduction, the linear regression reconciliation framework has been applied and developed in different contexts. Methodological contributions include the cross-sectional framework, where forecasts are linked through contemporaneous structures [11,12,13,14,15], the temporal framework, where the same variable is available for different frequencies [16,17], and the cross-temporal framework, which integrates both dimensions within a unified setting [18,19,20,21,22]. In addition, forecast reconciliation is applied in various disciplines: retail [23,24,25], energy [24,26,27,28,29,30,31], tourism [10,21,32,33] and economics [19,34], among others. Athanasopoulos et al. [4] provide a comprehensive review that synthesizes and organizes the expanding literature on forecast reconciliation.
Many forecasting applications involve variables that can only take non-negative values, such as sales revenues, product demand, or counts of individuals. In these contexts, it is essential that the reconciliation process preserves non-negativity, since negative forecasts lack practical meaning and may lead to wrong managerial decisions. To address this issue, Wickramasuriya et al. [13] reformulated reconciliation as a non-negative least squares (NNLS) problem [35], including non-negativity directly within the optimal reconciliation framework using the Block Principal Pivoting algorithm [36] and establishing formal properties, such as conditions for existence, uniqueness, and optimality of NNLS reconciliation. Although theoretically rigorous, this approach can be computationally demanding in large-scale applications [4,21].
Building on this theoretical foundation, we focus on a more applied perspective. Our objective is to examine and compare alternative procedures, both optimization-based and heuristic, for enforcing non-negativity in reconciled forecasts, with particular attention to computational feasibility and suitability for different frameworks (cross-sectional, temporal and cross-temporal). In addition, we focus on assessing the empirical performance of different approaches, highlighting the trade-offs between forecast accuracy, robustness, and computational efficiency.
Beyond optimization-based reconciliation, alternative heuristic procedures have also been proposed, designed to achieve non-negativity with reduced computational burden. Kourentzes and Athanasopoulos [37] proposed an iterative correction algorithm to eliminate negative reconciled forecasts in the context of intermittent demand series. Another heuristic, proposed by Di Fonzo and Girolimetto [20], is the set-negative-to-zero procedure, which replaces negative reconciled forecasts with zeros while preserving coherence through bottom-up. This strategy provides a useful baseline against which more elaborate procedures can be compared.
In this paper, we contribute to the development of non-negative forecast reconciliation methods by proposing and evaluating three distinct approaches. First, we formulate the reconciliation problem as an NNLS program and solve it using the operator splitting quadratic program introduced by Stellato et al. [38]. This approach is flexible and allows the incorporation of additional types of constraints, such as immutable forecasts [39]. Second, we propose an iterative non-negative reconciliation procedure with immutable forecasts, offering an alternative optimization-based strategy inspired by the procedure of Kourentzes and Athanasopoulos [37]. Third, we investigate a family of heuristics, known as set-negative-to-zero, originally introduced by Di Fonzo and Girolimetto [20], which provide accurate reconciled forecasts at minimal computational cost. Finally, we evaluate these approaches in a large-scale empirical study using the Australian Tourism Demand dataset [12,13]. Our analysis considers both forecasting accuracy and computational efficiency, seeking a balance between theoretical properties and practical feasibility. In doing so, we provide methodological insights for researchers and practitioners facing the dual requirements of coherence and non-negativity in forecasting. All procedures examined in this study have been implemented in the R package FoReco [40].
The remainder of the paper is structured as follows. Section 2.1 reviews the general framework for forecast reconciliation. Section 3 introduces and develops the non-negative reconciliation procedures. Section 4 presents the empirical application and reports the results, focusing on accuracy and computational performance across methods. Finally, Section 5 provides the conclusion. The code and data for reproducing the results are available at https://github.com/danigiro/vn525nn, accessed on 23 October 2025.

2. Forecast Reconciliation

2.1. Zero-Constrained and Structural Representation

Let $\mathbf{y}_t = [y_{1,t}, \ldots, y_{i,t}, \ldots, y_{n,t}]'$ be an $n$-variate linearly constrained time series observed at the most temporally disaggregated level, with a seasonality of period $m$ (e.g., $m = 12$ for monthly data, $m = 4$ for quarterly data). The cross-sectional constraints [12,19] can be expressed as
$$\mathbf{C}_{cs}\,\mathbf{y}_t = \mathbf{0}_{(n_a \times 1)}, \qquad t = 1, \ldots, T,$$
where $\mathbf{C}_{cs}$ is an $(n_a \times n)$ zero-constraints cross-sectional matrix. Following Girolimetto and Di Fonzo [41], $\mathbf{y}_t$ can be organized into two sets of variables such that $\mathbf{y}_t = [\mathbf{u}_t' \;\; \mathbf{b}_t']'$, where the $n_a$ “constrained” series $\mathbf{u}_t$ are linked to the $n_b$ “free” series $\mathbf{b}_t$ through the cross-sectional linear combination matrix $\mathbf{A}_{cs}$ via $\mathbf{u}_t = \mathbf{A}_{cs}\mathbf{b}_t$. In general, Girolimetto and Di Fonzo [41] show that $\mathbf{C}_{cs} = [\mathbf{I}_{n_a} \;\; -\mathbf{A}_{cs}]$. A different, equivalent way of expressing the constraints (1) is the structural representation
$$\mathbf{y}_t = \mathbf{S}_{cs}\mathbf{b}_t,$$
where the cross-sectional structural matrix
$$\mathbf{S}_{cs} = \begin{bmatrix} \mathbf{A}_{cs} \\ \mathbf{I}_{n_b} \end{bmatrix}$$
characterizes the entire system by representing $\mathbf{y}_t$ as a linear combination of the free components $\mathbf{b}_t$.
When dealing with a genuine hierarchical/grouped time series [3], these two sets are naturally formed by the upper- and bottom-level time series, respectively. For example, in a simple two-level hierarchy where $y_{T,t} = y_{X,t} + y_{Y,t}$, we have $\mathbf{A}_{cs} = [1 \;\; 1]$, $\mathbf{S}_{cs} = \begin{bmatrix} 1 & 1 \\ 1 & 0 \\ 0 & 1 \end{bmatrix}$, and $\mathbf{C}_{cs} = [1 \;\; -1 \;\; -1]$.
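For concreteness, these matrices can be built and checked in a few lines of base R (a minimal sketch for the two-level example above; the object names simply mirror the notation in the text and are not FoReco objects):

# Two-level hierarchy y_T = y_X + y_Y: n = 3, n_a = 1, n_b = 2
A_cs <- matrix(c(1, 1), nrow = 1)       # cross-sectional linear combination matrix
S_cs <- rbind(A_cs, diag(2))            # structural matrix [A_cs; I_{n_b}]
C_cs <- cbind(diag(1), -A_cs)           # zero-constraints matrix [I_{n_a}  -A_cs]

b_t <- c(3, 5)                          # free (bottom-level) series at time t
y_t <- S_cs %*% b_t                     # coherent vector [u_t; b_t] = c(8, 3, 5)
C_cs %*% y_t                            # 0: the cross-sectional constraint holds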
Turning to the temporal framework [16], let $\mathcal{K} = \{k_p, k_{p-1}, \ldots, k_2, k_1\}$ be the set of the $p$ factors of $m$, in descending order, with $k_1 = 1$ and $k_p = m$ (e.g., for quarterly series, $m = 4$, $p = 3$, and $\mathcal{K} = \{4, 2, 1\}$). Then, $x_{i,j}^{[k]} = \sum_{t=(j-1)k+1}^{jk} y_{i,t}$ is the temporally aggregated value of series $i = 1, \ldots, n$ for a factor $k$ at time $j = 1, \ldots, T_k$, with $T_k = T/k$. For a fixed temporal aggregation order $k \in \mathcal{K}$, we stack the observations into a column vector $\mathbf{x}_{i,\tau}^{[k]}$, and the complete vector for all temporal aggregation orders of a single series $i$ is $\mathbf{x}_{i,\tau} = \big[\mathbf{x}_{i,\tau}^{[k_p]\prime} \;\; \mathbf{x}_{i,\tau}^{[k_{p-1}]\prime} \;\; \cdots \;\; \mathbf{x}_{i,\tau}^{[1]\prime}\big]'$. Given
$$\mathbf{A}_{te} = \begin{bmatrix} \mathbf{1}_{k_p}' \\ \mathbf{I}_{m/k_{p-1}} \otimes \mathbf{1}_{k_{p-1}}' \\ \vdots \\ \mathbf{I}_{m/k_2} \otimes \mathbf{1}_{k_2}' \end{bmatrix},$$
we can construct the temporal structural matrix $\mathbf{S}_{te} = \big[\mathbf{A}_{te}' \;\; \mathbf{I}_m\big]'$, which relates $\mathbf{x}_{i,\tau}$ to the most disaggregated series $\mathbf{x}_{i,\tau}^{[1]}$ through $\mathbf{x}_{i,\tau} = \mathbf{S}_{te}\mathbf{x}_{i,\tau}^{[1]}$, and the $[k^* \times (m + k^*)]$ zero-constraints temporal matrix $\mathbf{C}_{te} = \big[\mathbf{I}_{k^*} \;\; -\mathbf{A}_{te}\big]$, such that $\mathbf{C}_{te}\mathbf{x}_{i,\tau} = \mathbf{0}_{(k^* \times 1)}$, where $k^* = \sum_{k \in \mathcal{K}\setminus\{1\}} m/k$ is the number of upper time series in the temporal hierarchy [21].
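The same check can be done for a small temporal hierarchy. The sketch below assumes quarterly data ($m = 4$, $\mathcal{K} = \{4, 2, 1\}$) and uses base R Kronecker products; again, the names are ours, not FoReco's:

m  <- 4                                 # high-frequency observations per cycle
ks <- c(4, 2)                           # temporal aggregation orders k > 1
A_te <- do.call(rbind, lapply(ks, function(k)
  kronecker(diag(m / k), t(rep(1, k)))))        # stack I_{m/k} (x) 1'_k
S_te <- rbind(A_te, diag(m))            # temporal structural matrix [A_te; I_m]
C_te <- cbind(diag(nrow(A_te)), -A_te)  # zero-constraints matrix [I_{k*}  -A_te]

x1 <- c(10, 12, 9, 11)                  # one cycle of quarterly observations
x  <- S_te %*% x1                       # annual, semi-annual and quarterly values
max(abs(C_te %*% x))                    # 0: the temporal constraints hold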
To unify both frameworks, the series are stacked into an $[n \times (m + k^*)]$ matrix $\mathbf{X}_\tau = [\mathbf{x}_{1,\tau} \;\; \cdots \;\; \mathbf{x}_{n,\tau}]'$ for $\tau = 1, \ldots, N$, where rows represent the cross-sectional dimension and columns the temporal one. The cross-temporal zero-constrained representation for the complete set of observations $\mathbf{x}_\tau = \mathrm{vec}(\mathbf{X}_\tau')$ is given by $\mathbf{C}_{ct}\mathbf{x}_\tau = \mathbf{0}_{[(n_a m + n k^*) \times 1]}$, where $\mathbf{C}_{ct} = \big[\mathbf{C}^{*\prime} \;\; (\mathbf{I}_n \otimes \mathbf{C}_{te})'\big]'$ is the full-rank zero-constraints cross-temporal matrix, with $\mathbf{C}^* = \big[\mathbf{0}_{(n_a m \times n k^*)} \;\; \mathbf{I}_m \otimes \mathbf{C}_{cs}\big]\mathbf{P}$ [19], $\mathbf{P}$ the commutation matrix [42] such that $\mathbf{P}\,\mathrm{vec}(\mathbf{X}_\tau') = \mathrm{vec}(\mathbf{X}_\tau)$, and $\mathrm{vec}(\cdot)$ the operator that converts a matrix into a vector. Alternatively, the structural representation is $\mathbf{x}_\tau = \mathbf{S}_{ct}\mathbf{b}_\tau^{[1]}$, where $\mathbf{S}_{ct} = \mathbf{S}_{cs} \otimes \mathbf{S}_{te}$ is the cross-temporal summation matrix, and $\mathbf{b}_\tau^{[1]}$ stacks the most temporally disaggregated ($k_1 = 1$) observations of the $n_b$ free (bottom-level) series.
For each forecast horizon $h = 1, \ldots, H$ of the most temporally aggregated level, let $\widehat{\mathbf{X}}_h$ be the $h$-step-ahead base forecasts matrix, where rows represent the cross-sectional dimension and columns the temporal one. These base forecasts are generally incoherent, meaning that $\mathbf{C}_{ct}\widehat{\mathbf{x}}_h \neq \mathbf{0}$ with $\widehat{\mathbf{x}}_h = \mathrm{vec}(\widehat{\mathbf{X}}_h')$. Forecast reconciliation adjusts these base forecasts to obtain reconciled forecasts $\widetilde{\mathbf{x}}_h$ that satisfy the linear constraints $\mathbf{C}_{ct}\widetilde{\mathbf{x}}_h = \mathbf{0}$. To summarise, $\mathbf{X}_h$ is the matrix of true coherent values, $\widehat{\mathbf{X}}_h$ the matrix of incoherent base forecasts, and $\widetilde{\mathbf{X}}_h$ the matrix of reconciled forecasts.

2.2. Regression-Based Reconciliation

In this section, we address regression-based reconciliation in a general setting, without reference to a specific cross-sectional, temporal, or cross-temporal framework. In addition, Table 1 summarizes the main symbols used throughout this section, along with their descriptions and the corresponding notation in the different forecasting frameworks: cross-sectional (cs), temporal (te), and cross-temporal (ct). The table provides a unified reference for the vectors of true values, base forecasts, reconciled forecasts, as well as the structural and constraint matrices. Dimensions and indexing conventions are also specified to clarify how the general notation maps to each specific framework.
Starting from a zero-constrained representation, the regression-based approach assumes that the base forecasts $\widehat{\mathbf{x}}$ are related to the true (but unobserved) coherent values $\mathbf{x}$ by a linear model [12,19,43]
$$\widehat{\mathbf{x}} = \mathbf{x} + \boldsymbol{\varepsilon},$$
where $\widehat{\mathbf{x}}$ is an $(n^* \times 1)$ vector of base forecasts, $\mathbf{x}$ is an $(n^* \times 1)$ vector of target (true) forecasts, and $\boldsymbol{\varepsilon}$ is an $(n^* \times 1)$ vector of zero-mean errors with known positive definite covariance matrix $\boldsymbol{\Omega} = \mathrm{E}[\boldsymbol{\varepsilon}\boldsymbol{\varepsilon}']$. The target forecasts $\mathbf{x}$ must satisfy a system of linearly independent constraints:
$$\mathbf{C}\mathbf{x} = \mathbf{0}_{(n_a^* \times 1)}.$$
The reconciliation process involves finding reconciled forecasts x ˜ that are “as close as possible” to the base forecasts x ^ according to a pre-specified metric, while simultaneously satisfying the constraints. This is achieved by minimizing a linearly constrained generalized least squares (GLS) objective function:
$$\widetilde{\mathbf{x}} = \arg\min_{\mathbf{x}} \; (\mathbf{x} - \widehat{\mathbf{x}})'\boldsymbol{\Omega}^{-1}(\mathbf{x} - \widehat{\mathbf{x}}) \quad \text{s.t.} \quad \mathbf{C}\mathbf{x} = \mathbf{0}_{(n_a^* \times 1)}.$$
The closed-form solution [43,44] for the reconciled forecasts x ˜ is given by
$$\widetilde{\mathbf{x}} = \widehat{\mathbf{x}} - \boldsymbol{\Omega}\mathbf{C}'\big(\mathbf{C}\boldsymbol{\Omega}\mathbf{C}'\big)^{-1}\mathbf{C}\widehat{\mathbf{x}} = \mathbf{M}\widehat{\mathbf{x}},$$
where the reconciliation matrix $\mathbf{M} = \mathbf{I}_{n^*} - \boldsymbol{\Omega}\mathbf{C}'\big(\mathbf{C}\boldsymbol{\Omega}\mathbf{C}'\big)^{-1}\mathbf{C}$ is a projection matrix. This formula essentially adjusts the base forecasts $\widehat{\mathbf{x}}$ by a linear combination of their coherency errors, $\mathbf{C}\widehat{\mathbf{x}}$.
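As a minimal illustration of the projection formula (the production implementations are in FoReco; this base-R sketch uses an assumed diagonal $\boldsymbol{\Omega}$ and a toy two-level hierarchy):

# x_tilde = x_hat - Omega C' (C Omega C')^{-1} C x_hat
reconcile_proj <- function(x_hat, C, Omega) {
  adj <- Omega %*% t(C) %*% solve(C %*% Omega %*% t(C), C %*% x_hat)
  as.numeric(x_hat - adj)
}

C     <- matrix(c(1, -1, -1), nrow = 1)   # constraint: total - bottom1 - bottom2 = 0
Omega <- diag(c(2, 1, 1))                 # assumed base-forecast error covariance
x_hat <- c(10, 4, 5)                      # incoherent base forecasts (10 != 4 + 5)
x_til <- reconcile_proj(x_hat, C, Omega)  # c(9.5, 4.25, 5.25)
C %*% x_til                               # 0: reconciled forecasts are coherent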
Equivalently, using the structural representation of a linear constrained time series [3,10,41], the reconciled forecasts can be derived through the linear model
$$\widehat{\mathbf{x}} = \mathbf{S}\boldsymbol{\beta} + \boldsymbol{\varepsilon},$$
where $\widehat{\mathbf{x}}$ is an $(n^* \times 1)$ vector of base forecasts, $\boldsymbol{\beta}$ is an $(n_b^* \times 1)$ vector of target forecasts for the free components, and $\boldsymbol{\varepsilon}$ is an $(n^* \times 1)$ vector of zero-mean errors with known positive definite covariance matrix $\boldsymbol{\Omega} = \mathrm{E}[\boldsymbol{\varepsilon}\boldsymbol{\varepsilon}']$. Minimizing the GLS objective function
$$(\widehat{\mathbf{x}} - \mathbf{S}\boldsymbol{\beta})'\boldsymbol{\Omega}^{-1}(\widehat{\mathbf{x}} - \mathbf{S}\boldsymbol{\beta})$$
results in $\widetilde{\boldsymbol{\beta}} = \big(\mathbf{S}'\boldsymbol{\Omega}^{-1}\mathbf{S}\big)^{-1}\mathbf{S}'\boldsymbol{\Omega}^{-1}\widehat{\mathbf{x}}$, from which the whole reconciled vector can be computed as [3,19]
$$\widetilde{\mathbf{x}} = \mathbf{S}\widetilde{\boldsymbol{\beta}} = \mathbf{S}\big(\mathbf{S}'\boldsymbol{\Omega}^{-1}\mathbf{S}\big)^{-1}\mathbf{S}'\boldsymbol{\Omega}^{-1}\widehat{\mathbf{x}} = \mathbf{S}\widetilde{\mathbf{G}}\widehat{\mathbf{x}},$$
where $\widetilde{\mathbf{G}} = \big(\mathbf{S}'\boldsymbol{\Omega}^{-1}\mathbf{S}\big)^{-1}\mathbf{S}'\boldsymbol{\Omega}^{-1}$ and $\mathbf{M} = \mathbf{S}\widetilde{\mathbf{G}}$. If the base forecasts $\widehat{\mathbf{x}}$ are unbiased, the reconciled forecasts $\widetilde{\mathbf{x}}$ are also unbiased and achieve minimum variance, since the weight matrix $\widetilde{\mathbf{G}}$ minimizes the trace of the reconciled forecasts' covariance matrix (MinT, [12]):
$$\widetilde{\mathbf{G}} = \arg\min_{\mathbf{G}} \; \mathrm{tr}\big(\mathbf{S}\mathbf{G}\boldsymbol{\Omega}\mathbf{G}'\mathbf{S}'\big) \quad \text{s.t.} \quad \mathbf{G}\mathbf{S} = \mathbf{I}_{n_b^*},$$
where $\mathrm{tr}(\cdot)$ denotes the trace of a square matrix.
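Under the same assumed toy setting used in the previous sketch, the structural (MinT-type) formula can be coded analogously and returns the same reconciled vector as the projection form:

# x_tilde = S (S' Omega^{-1} S)^{-1} S' Omega^{-1} x_hat
reconcile_struc <- function(x_hat, S, Omega) {
  Oi <- solve(Omega)
  G  <- solve(t(S) %*% Oi %*% S, t(S) %*% Oi)   # G_tilde
  as.numeric(S %*% G %*% x_hat)
}

S <- rbind(c(1, 1), diag(2))                      # structural matrix of the toy hierarchy
reconcile_struc(c(10, 4, 5), S, diag(c(2, 1, 1))) # c(9.5, 4.25, 5.25), as before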
In practice, the covariance matrix $\boldsymbol{\Omega}$ is generally unknown, and its accurate estimation is crucial for effective reconciliation. For the cross-sectional case, several approximations are commonly used. Simple choices include the identity matrix [3], assuming uncorrelated errors with equal variance, and variance scaling [11], which accounts for heterogeneous error variances but still overlooks correlations. Moreover, a shrinkage covariance estimator [12], combining a structured target with the sample covariance, can capture the full error dependence and yields a robust, non-singular estimator.
For the temporal case, modelling error dependencies across forecast horizons is still an important issue [21]. Simple approaches assume independence or constant variance over time [16], while other methods estimate autocovariances from forecast residuals and impose some structure across the different forecast horizons [17,45].
For the cross-temporal case, both cross-sectional and temporal dependencies must be captured at once, which greatly increases dimensionality. As the number of series and forecast horizons grows, the resulting covariance matrix can become extremely large, making direct estimation from limited data unstable or even infeasible. To overcome this, reconciliation methods [19,21] typically use structured forms, shrinkage, or dimension-reduction techniques that exploit the underlying hierarchy and temporal dependence to obtain tractable and reliable covariance estimates.

2.3. Iterative Cross-Temporal Reconciliation

The iterative cross-temporal reconciliation proposed by Di Fonzo and Girolimetto [19] provides a heuristic alternative to the cross-temporal optimal combination method. This method produces reconciled forecasts by alternating reconciliation steps along one dimension (either cross-sectional or temporal) in a cyclic fashion until convergence. In detail, iteration $j \geq 1$ can be described as follows:
Step 1 
compute the temporally reconciled forecasts $\widetilde{\mathbf{X}}_{te}^{(j)}$ for each variable $i \in \{1, \ldots, n\}$ of $\widetilde{\mathbf{X}}_{cs}^{(j-1)}$;
Step 2 
compute the time-by-time cross-sectionally reconciled forecasts $\widetilde{\mathbf{X}}_{cs}^{(j)}$ for all the temporal aggregation levels of $\widetilde{\mathbf{X}}_{te}^{(j)}$.
These two steps are performed iteratively until a convergence criterion is met. Typically, convergence is achieved when the remaining temporal discrepancies $\mathbf{D}_{te} = \mathbf{C}_{te}\widetilde{\mathbf{X}}_{cs}^{(j)\prime}$ fall below a predefined positive tolerance value $\delta$ (e.g., $\delta = 10^{-6}$). The matrix $\widetilde{\mathbf{X}}_{cs}^{(j)}$ from the final iteration contains the cross-temporal reconciled forecasts $\widetilde{\mathbf{X}}$. At $j = 0$, the starting values are given by $\widetilde{\mathbf{X}}_{cs}^{(0)} = \widehat{\mathbf{X}}$ (see Section 2.1).
The temporal-then-cross-sectional sequence is presented here, but one may also begin with the cross-sectional step by inverting the order of Step 1 and Step 2. Note, however, that the final reconciled values may depend on the chosen order.
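A self-contained sketch of the alternating scheme on a deliberately tiny cross-temporal system (three series, semi-annual data); this is an illustration with identity weight matrices, not the FoReco routine:

proj <- function(x, C) as.numeric(x - t(C) %*% solve(C %*% t(C), C %*% x))

C_cs <- matrix(c(1, -1, -1), nrow = 1)  # Total - A - B = 0
C_te <- matrix(c(1, -1, -1), nrow = 1)  # annual - half1 - half2 = 0
X <- rbind(Total = c(21, 12, 10),       # columns: annual, first half, second half
           A     = c(12,  6,  5),
           B     = c( 8,  5,  4))       # incoherent base forecasts
for (j in 1:100) {
  X <- t(apply(X, 1, proj, C = C_te))   # Step 1: temporal reconciliation, series by series
  X <- apply(X, 2, proj, C = C_cs)      # Step 2: cross-sectional reconciliation, time by time
  if (max(abs(C_te %*% t(X))) < 1e-6) break  # remaining temporal discrepancy
}
round(X, 3)                             # cross-temporally coherent forecasts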
Other heuristic procedures have also been proposed, such as Kourentzes and Athanasopoulos [18] and Yagli et al. [46]. However, these are not discussed in detail here, as they fall outside the scope of this work. In particular, the KA heuristic relies on projection matrices, which are not available for most of the solutions presented in Section 3, while [46], whenever it achieves cross-temporally coherent results, can be considered as a special case of the iterative procedure described above [20].

3. Non-Negative Reconciliation

When applying a “free” reconciliation procedure (i.e., without imposing non-negativity), the resulting forecasts are not guaranteed to remain non-negative. Sometimes, this can be problematic in practice, since negative values are meaningless for variables such as demand, revenue, or counts, and can undermine both interpretability and credibility of the forecasts. A simple remedy is to replace negative reconciled values with zero [47,48]. However, this naive adjustment generally breaks the aggregation constraints, producing forecasts that are no longer coherent with the underlying hierarchical or temporal structure.
To overcome this issue, non-negativity constraints can be explicitly incorporated into the optimization formulations presented in Section 2.2. In particular, the projection (4) and the structural (7) problem can be re-formulated, respectively, as
$$\widetilde{\mathbf{x}}_0 = \arg\min_{\mathbf{x}} \; (\mathbf{x} - \widehat{\mathbf{x}})'\boldsymbol{\Omega}^{-1}(\mathbf{x} - \widehat{\mathbf{x}}) \quad \text{s.t.} \quad \mathbf{C}\mathbf{x} = \mathbf{0}_{(n_a^* \times 1)}, \;\; \mathbf{x} \geq \mathbf{0},$$
and
$$\widetilde{\boldsymbol{\beta}}_0 = \arg\min_{\boldsymbol{\beta}} \; (\widehat{\mathbf{x}} - \mathbf{S}\boldsymbol{\beta})'\boldsymbol{\Omega}^{-1}(\widehat{\mathbf{x}} - \mathbf{S}\boldsymbol{\beta}) \quad \text{s.t.} \quad \boldsymbol{\beta} \geq \mathbf{0}.$$
Both formulations are non-negative least squares (NNLS) problems [35,49], first studied in the context of forecast reconciliation by Wickramasuriya et al. [13].
Several algorithms have been proposed for solving NNLS problems. Wickramasuriya et al. [13] show that the Block Principal Pivoting (bpv) algorithm [36] achieves strong performance in terms of both solution accuracy and computational efficiency. This solver is available in the FoReco package [40], which also provides an alternative implementation based on the Operator Splitting Quadratic Program (osqp) [38,50], a flexible framework that can incorporate additional types of constraints, such as immutable forecasts [39].
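To make the structural NNLS formulation concrete, the small sketch below pre-whitens by $\boldsymbol{\Omega}^{-1/2}$ and calls the Lawson–Hanson solver from the nnls R package; this is not the bpv/osqp implementation used in FoReco, just an assumed stand-in for a tiny example:

# install.packages("nnls")              # Lawson-Hanson NNLS solver
library(nnls)

S     <- rbind(c(1, 1), diag(2))        # structural matrix of a two-level hierarchy
Omega <- diag(c(2, 1, 1))               # assumed base-forecast error covariance
x_hat <- c(10, 6, -2)                   # base forecasts with a negative bottom value

# min (x_hat - S b)' Omega^{-1} (x_hat - S b)  s.t.  b >= 0
# becomes an ordinary NNLS problem after pre-whitening by Omega^{-1/2}
W_half <- diag(1 / sqrt(diag(Omega)))   # Omega^{-1/2} (diagonal case)
fit    <- nnls(W_half %*% S, as.numeric(W_half %*% x_hat))
b_til  <- fit$x                         # non-negative bottom-level forecasts
x_til  <- as.numeric(S %*% b_til)       # coherent and non-negative reconciled vector
x_til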
In addition to optimization-based solvers, we consider three heuristic approaches that combine ease of use, interpretability, and accuracy:
  • The Negative Forecasts Correction Algorithm (nfca), an iterative correction procedure proposed by Kourentzes and Athanasopoulos [37].
  • The iterative Non-Negative reconciliation with Immutable Constraints (nnic), where non-negativity is progressively enforced by treating corrected values as immutable in successive reconciliations [39,40].
  • The bottom-up and top-down variants of the Set-Negative-To-Zero (sntz) procedure [20], which are very simple, competitive in accuracy, and computationally faster than the more intensive nonlinear optimization methods.

3.1. Block Principal Pivoting (bpv)

Since Lawson and Hanson [35], a wide range of approaches for addressing non-negative least squares problems has been proposed. The idea behind Block Principal Pivoting [36] is to transform an inequality-constrained least squares problem into a sequence of equality-constrained sub-problems. bpv incorporates a subset selection of variables to exchange and a backup rule to guarantee convergence in a finite number of iterations. Further details may be found in Wickramasuriya et al. [13].

3.2. Operator Splitting Quadratic Program (osqp)

The Operator Splitting Quadratic Program solver [38,50] is based on the Alternating Direction Method of Multipliers (ADMM) [51,52] and provides high-accuracy solutions by solving a quasi-definite linear system with a largely constant coefficient matrix across iterations. osqp is highly robust, requiring neither positive definiteness of the objective function nor linear independence of the constraints. Further details may be found in Stellato et al. [38].

3.3. Negative Forecasts Correction Algorithm (nfca)

An alternative approach for ensuring non-negative reconciled forecasts is the Negative Forecasts Correction Algorithm [37]. Unlike optimization-based algorithms, which impose non-negativity constraints directly within the reconciliation problem, this method applies an iterative correction procedure. The central idea of the algorithm is straightforward: rather than discarding coherence by simply truncating negatives to zero, it progressively eliminates them while redistributing the necessary adjustments throughout the hierarchy. Formally, let $\widetilde{\mathbf{x}}^{(0)}$ denote the free reconciled forecasts. At each iteration $r \geq 1$, a correction vector is constructed as follows. First, consider the $(n^* \times 1)$ vector $\boldsymbol{\delta}^{(r)}$, with components
$$\delta_i^{(r)} = \begin{cases} -\widetilde{x}_i^{(r-1)} & \text{if } \widetilde{x}_i^{(r-1)} < 0, \\ 0 & \text{otherwise}, \end{cases} \qquad i = 1, \ldots, n^*.$$
This auxiliary vector is linearly transformed into a coherent correction vector using the reconciliation matrix $\mathbf{M}$: $\widetilde{\boldsymbol{\delta}}^{(r)} = \mathbf{M}\boldsymbol{\delta}^{(r)}$, and the reconciled forecasts are updated as $\widetilde{\mathbf{x}}^{(r)} = \widetilde{\mathbf{x}}^{(r-1)} + \widetilde{\boldsymbol{\delta}}^{(r)}$. The procedure is repeated until all components of $\widetilde{\mathbf{x}}^{(r)}$ are non-negative, or until the changes fall below a predefined tolerance.
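A compact base-R sketch of this correction loop (our own illustrative implementation, not the authors' code); M is a projection reconciliation matrix and x_til a free-reconciled vector:

nfca <- function(x_til, M, tol = 1e-8, max_iter = 100) {
  for (r in seq_len(max_iter)) {
    delta <- ifelse(x_til < 0, -x_til, 0)     # amounts needed to lift negatives to zero
    if (max(delta) <= tol) break              # stop when (almost) no negatives remain
    x_til <- x_til + as.numeric(M %*% delta)  # add the coherent correction M delta
  }
  x_til
}

# Toy usage: projection matrix of a two-level hierarchy with identity Omega
C <- matrix(c(1, -1, -1), nrow = 1)
M <- diag(3) - t(C) %*% solve(C %*% t(C), C)  # M = I - C'(CC')^{-1} C
nfca(as.numeric(M %*% c(10, 11, -4)), M)      # approx. c(10.5, 10.5, 0)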
It is worth noting that the nfca algorithm is not an exact solution to the non-negative least squares problem. Rather, it should be considered as a heuristic: it enforces non-negativity by repeated coherent corrections rather than by solving a constrained optimization problem. As a result, while it is simple to code and often works well in small systems where negatives are rare, it may encounter difficulties when applied to larger or more complex hierarchies. In such settings, the number of required iterations may increase substantially, and convergence is not guaranteed. Furthermore, when many forecasts turn negative at once, numerical stability may be an issue.

3.4. Iterative Non-Negative Reconciliation with Immutable Constraints (NNIC)

Building on [53], and inspired by the nfca heuristic [37], the iterative Non-Negative reconciliation with Immutable Constraints provides an effective procedure to enforce non-negativity in reconciled forecasts. This heuristic can be viewed as an analogue of the Block Principal Pivoting algorithm for non-negative least squares. Rather than solving the constrained optimization problem in a single step, nnic adopts an iterative pivoting strategy: whenever a forecast is found to be negative, it is fixed at zero and incorporated into the set of immutable constraints, while the remaining elements are left free to adjust.
Formally, assume that the free reconciled forecasts' vector $\widetilde{\mathbf{x}}$ has $l^{(0)} \geq 1$ negative entries. Denote $\widetilde{\mathbf{x}}^{(0)} = \widetilde{\mathbf{x}}$, and let $\mathbf{C}^{(0)} = \mathbf{C}$ be the constraints matrix in expression (3). At each iteration $r \geq 1$, the constraint set is updated as
$$\mathbf{C}^{(r)} = \begin{bmatrix} \mathbf{C}^{(r-1)} \\ \mathbf{E}^{(r)} \end{bmatrix},$$
where $\mathbf{E}^{(r)}$ is an $[l^{(r-1)} \times n^*]$ matrix whose rows are zero everywhere except for a single unit entry in the columns corresponding to the negative elements of $\widetilde{\mathbf{x}}^{(r-1)}$. The next iterate of the reconciled forecasts, $\widetilde{\mathbf{x}}^{(r)}$, is obtained by applying the projection reconciliation formula (5) (or the structural formulation (8), following the immutable reconciliation framework of [39]) with the updated constraints matrix $\mathbf{C}^{(r)}$. The procedure terminates once all components are non-negative or a pre-specified iteration limit is reached.
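The pivoting logic can be sketched in a few lines of base R (an illustration under an assumed diagonal $\boldsymbol{\Omega}$; it ignores possible rank issues in the augmented constraint matrix and is not the FoReco implementation):

nnic <- function(x_hat, C, Omega, max_iter = 50) {
  proj <- function(x, C, W)
    as.numeric(x - W %*% t(C) %*% solve(C %*% W %*% t(C), C %*% x))
  x_til <- proj(x_hat, C, Omega)              # free reconciliation
  for (r in seq_len(max_iter)) {
    neg <- which(x_til < 0)
    if (length(neg) == 0) break
    E <- diag(length(x_hat))[neg, , drop = FALSE]  # unit rows for the negative entries
    C <- rbind(C, E)                          # fix those components at zero (immutable)
    x_til <- proj(x_hat, C, Omega)
  }
  x_til
}

# Toy usage: two-level hierarchy with a strongly negative bottom base forecast
nnic(c(10, 11, -4), matrix(c(1, -1, -1), nrow = 1), diag(3))  # c(10.5, 10.5, 0)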
Although effective in many applications, nnic has some limitations. Its iterative nature can become computationally demanding when a large number of negatives are present, and convergence is not theoretically guaranteed in all settings. Nonetheless, empirical evidence indicates that nnic frequently coincides with the solutions obtained by exact NNLS solvers, while being considerably simpler to implement.

3.5. Set-Negative-to-Zero (sntz): Bottom-Up and Top-Down Variants

In this section, we consider a simple heuristic strategy without any computationally intensive numerical optimization. For simplicity, we focus on a cross-sectional hierarchy with upper- and bottom-level series, assuming that at least one bottom-level free-reconciled forecast is negative.
The original sntz procedure (sntz-bu), as described by Di Fonzo and Girolimetto [20], operates by first setting to zero any negative bottom-level free-reconciled forecasts (corresponding, in the temporal and cross-temporal frameworks, to the high-frequency series and to the high-frequency bottom-level series, respectively), and then reconstructing the full set of reconciled forecasts through bottom-up aggregation. In this way, the non-negative bottom-level forecasts are left unchanged, and the upper-level forecasts are adjusted to restore coherence. Consequently, the reconciled top-level forecast may differ from its free-reconciled counterpart. This bottom-focused approach seems appropriate when the forecaster has greater confidence in the bottom-level base forecasts and prefers minimal adjustments at this level, accepting potential changes at the upper levels.
However, in several practical settings (e.g., intermittent time series), the bottom-level series are more difficult to forecast, while upper-level aggregates may be more stable and accurate. In such cases, it may be more appropriate to preserve the upper-level forecasts. Motivated by this consideration, we propose a top-down variant, denoted sntz-td, which preserves the upper-level forecasts. In this approach, negative bottom-level forecasts are first replaced with zero, which typically creates a discrepancy between the adjusted bottom-level forecasts and the upper-level totals. This discrepancy is then redistributed across the bottom-level series according to a set of distributional weights. Finally, the adjusted bottom-level forecasts are aggregated upward, ensuring that the hierarchy remains coherent and all series are non-negative, while the upper-level forecasts are left unchanged.
In summary, using sntz-bu means that the forecaster is confident in the bottom-level free-reconciled forecasts, which are modified as little as possible, and is willing to accept (hopefully small) changes in the forecasts at the upper levels of the hierarchy. On the other hand, sntz-td should be preferred when the free-reconciled forecasts of the upper-level (aggregated) series are deemed to be relatively more accurate than those of the bottom-level (disaggregated) variables, and the practitioner would prefer to retain coherence and non-negativity without further adjustment of the free-reconciled forecasts of the upper-level variables in the hierarchy.
  • A numerical illustration
To illustrate the two variants, consider a simple hierarchy with one top-level variable $a$ and three bottom-level components $b_1$, $b_2$, and $b_3$, such that
$$a = b_1 + b_2 + b_3.$$
Suppose the free-reconciled forecasts are $\widetilde{a} = 40$, $\widetilde{b}_1 = 35$, $\widetilde{b}_2 = -5$, and $\widetilde{b}_3 = 10$, which are coherent by construction. Under sntz-bu, the negative bottom-level forecast is set to zero, i.e., $\widetilde{b}_{2,0} = 0$, while $\widetilde{b}_{1,0} = \widetilde{b}_1 = 35$ and $\widetilde{b}_{3,0} = \widetilde{b}_3 = 10$; the adjusted top-level forecast is then computed as $\widetilde{a}_0 = \widetilde{b}_{1,0} + \widetilde{b}_{3,0} = 45$.
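The bottom-up computation above amounts to a couple of lines of base R (a minimal sketch):

S     <- rbind(c(1, 1, 1), diag(3))        # structural matrix: a = b1 + b2 + b3
b_til <- c(35, -5, 10)                     # free-reconciled bottom-level forecasts
x_0   <- as.numeric(S %*% pmax(b_til, 0))  # sntz-bu: c(45, 35, 0, 10)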
  • General formulation of sntz-td
Let $\widetilde{a} > 0$ denote the free reconciled forecast of the top-level variable, and $\{\widetilde{b}_i\}_{i=1,\ldots,n_b}$ the corresponding reconciled forecasts of the $n_b$ bottom-level series (in our example, $n_b = 3$), such that $\widetilde{a} = \sum_{i=1}^{n_b} \widetilde{b}_i$. Suppose that at least one $\widetilde{b}_i$ is negative. Define the index sets
$$\mathcal{I}_+ = \{i : \widetilde{b}_i > 0\}, \qquad \mathcal{I}_- = \{i : \widetilde{b}_i \leq 0\},$$
and compute the discrepancy
$$d = \widetilde{a} - \sum_{i \in \mathcal{I}_+} \widetilde{b}_i = \sum_{i \in \mathcal{I}_-} \widetilde{b}_i.$$
As the set $\mathcal{I}_-$ is not empty, the discrepancy $d$ is negative, since $\widetilde{a} = \sum_{i=1}^{n_b} \widetilde{b}_i = \underbrace{\textstyle\sum_{i \in \mathcal{I}_+} \widetilde{b}_i}_{>0} + \underbrace{\textstyle\sum_{i \in \mathcal{I}_-} \widetilde{b}_i}_{<0}$. Let $\{w_i\}_{i=1}^{n_b}$ be non-negative distribution coefficients satisfying $w_i = 0$ for $i \in \mathcal{I}_-$, $w_i > 0$ for $i \in \mathcal{I}_+$, and $\sum_{i \in \mathcal{I}_+} w_i = 1$. We consider three alternative specifications of $w_i$, $i \in \mathcal{I}_+$:
  • Proportional distribution, sntz-td-p: $w_i = \dfrac{\widetilde{b}_i}{\sum_{j \in \mathcal{I}_+} \widetilde{b}_j}$, $i \in \mathcal{I}_+$;
  • Squared proportional distribution, sntz-td-sp: $w_i = \dfrac{\widetilde{b}_i^2}{\sum_{j \in \mathcal{I}_+} \widetilde{b}_j^2}$, $i \in \mathcal{I}_+$;
  • Variance-weighted distribution, sntz-td-vw: $w_i = \dfrac{\widehat{\sigma}_i^2}{\sum_{j \in \mathcal{I}_+} \widehat{\sigma}_j^2}$, $i \in \mathcal{I}_+$.
The choice of weights depends on the structure of the series and the desired trade-off between simplicity and robustness. When the series have comparable scale and uncertainty, the proportional variant (sntz-td-p) provides a transparent and balanced allocation of the discrepancy. Under sparsity, when many small or zero components are present, or when it is desirable to protect small positive forecasts from being reduced below zero, the squared-proportional variant (sntz-td-sp) is preferable, as it concentrates the adjustment on the larger components. In the presence of heteroskedasticity or markedly unequal forecast uncertainty, the variance-weighted scheme (sntz-td-vw) is recommended, since it allocates larger reductions to noisier series while preserving the more reliable ones.
Finally, according to the top-down set-negative-to-zero heuristic, we derive the non-negative reconciled forecasts as
$$\widetilde{a}_0 = \widetilde{a}, \qquad \widetilde{b}_{i,0} = \begin{cases} 0 & \text{if } i \in \mathcal{I}_-, \\ \widetilde{b}_i + w_i d & \text{if } i \in \mathcal{I}_+, \end{cases}$$
and, more generally, $\widetilde{\mathbf{x}}_0 = \mathbf{S}\widetilde{\mathbf{b}}_0$ with, in this case, $\widetilde{\mathbf{x}}_0 = [\widetilde{a}_0 \;\; \widetilde{\mathbf{b}}_0']'$ and $\widetilde{\mathbf{b}}_0 = [\widetilde{b}_{1,0} \;\; \cdots \;\; \widetilde{b}_{n_b,0}]'$. By construction, the adjusted bottom-level forecasts are non-negative and coherent:
$$\sum_{i=1}^{n_b} \widetilde{b}_{i,0} = \sum_{i \in \mathcal{I}_+} \widetilde{b}_{i,0} + \sum_{i \in \mathcal{I}_-} \widetilde{b}_{i,0} = \sum_{i \in \mathcal{I}_+} \widetilde{b}_i + \sum_{i \in \mathcal{I}_+} w_i \Big(\widetilde{a} - \sum_{i \in \mathcal{I}_+} \widetilde{b}_i\Big) = \widetilde{a} = \widetilde{a}_0,$$
where, by definition, $\sum_{i=1}^{n_b} w_i = \sum_{i \in \mathcal{I}_+} w_i = 1$.
The proposed procedure always decreases the value of the positive freely reconciled forecasts $\widetilde{b}_i$, $i \in \mathcal{I}_+$. For some $i$ it may happen that $\widetilde{b}_i < |w_i d|$, which would result in a negative reconciled forecast $\widetilde{b}_{i,0}$. This issue may be easily overcome by iterating the procedure, setting to zero the negative forecasts generated in the previous step and distributing the new discrepancy as shown before. In practice, this simple iteration converges in finitely many steps and guarantees non-negativity for all components. A negative value can reappear only when $w_i > \widetilde{b}_i / |d|$, a situation that was empirically rare: in our experiments, no case required more than two passes (see Figure 1).
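A minimal base-R sketch of the top-down variant, with proportional weights as the default and the simple re-iteration just described (illustrative only, not the FoReco implementation):

sntz_td <- function(b_til, w_fun = function(b) b / sum(b), max_iter = 10) {
  a_til <- sum(b_til)                   # top-level free-reconciled forecast (kept fixed)
  for (r in seq_len(max_iter)) {
    if (!any(b_til < 0)) break
    neg <- b_til <= 0
    d <- sum(b_til[neg])                # (negative) discrepancy to redistribute
    w <- rep(0, length(b_til))
    w[!neg] <- w_fun(b_til[!neg])       # weights on the positive components only
    b_til <- ifelse(neg, 0, b_til + w * d)  # zero out negatives, spread d over the rest
  }
  c(a = a_til, b = b_til)
}

sntz_td(c(35, -5, 10))                                  # proportional weights
sntz_td(c(35, -5, 10), function(b) b^2 / sum(b^2))      # squared proportional weights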
In conclusion, Table 2 illustrates the application of these variants to the toy example above, assuming $\widehat{\sigma}_1^2 = 64$ and $\widehat{\sigma}_3^2 = 16$. The table reports the free reconciled forecasts and the corresponding non-negative reconciled values under sntz-bu, sntz-td-p, sntz-td-sp, and sntz-td-vw, respectively.

4. Australian Tourism Demand Dataset

The empirical analysis builds on the Australian tourism demand dataset originally studied in Wickramasuriya et al. [12,13], extending it to the context of cross-temporal forecast reconciliation under non-negativity constraints [21]. The dataset consists of monthly measures of tourist flows, expressed in visitor nights (VN), collected through the Australian Government’s National Visitor Survey. The data cover 228 monthly observations, from January 1998 to December 2016, capturing both arrivals and the number of nights spent in tourist facilities.
The dataset has a cross-sectional grouped structure, which arises from combining a geographic hierarchy with a classification by purpose of travel. The geographic hierarchy disaggregates the country into seven states, further subdivided into 27 zones and 76 regions, with 111 geographic divisions in total. However, six zones consist of only a single region (South Coast NSW, ACT NSW, West Coast VIC, North WA, South WA, South TAS), which leads to 105 non-redundant nodes rather than the theoretical 111. In this sense, the geographic hierarchy can be considered “unbalanced” [33] (see Figure 2, Table A1 in Appendix B and [21]).
In addition, tourism demand is disaggregated by purpose of travel (PoT) into four categories: holiday (Hol), visiting friends and relatives (Vis), business (Bus), and other (Oth). This classification generates 24 additional nodes (six single-region zones, each crossed with four PoT categories) that are duplications and therefore excluded. Accounting for these adjustments, we have a total of 525 non-redundant nodes [21,33], rather than the theoretical 555 [12,13]. Within this structure, the most disaggregated level (the “bottom” series) comprises 304 variables. These combine to produce 221 additional aggregate series, yielding 525 distinct time series overall (see Table 3). This update refines the structure originally presented in Table 7 of Wickramasuriya et al. [12].
From a temporal perspective, the dataset consists of monthly observations ($m = 12$), which can be aggregated into 2-, 3-, 4-, 6-, and 12-monthly series, giving $\mathcal{K} = \{1, 2, 3, 4, 6, 12\}$.
An important feature of this dataset is that for a large number of time series at least one of the values observed is zero, as shown in Table 4. Approximately 46% (239 out of 525) of the monthly series contain at least one zero. This proportion decreases as the temporal aggregation order increases, but even among the annual series, 16 (around 3% of the total) contain at least one zero. At the regional level, 13 of the 76 monthly series also display at least one zero. In summary, Table 4 provides an overview of the sparsity pattern within the Australian Tourism Demand dataset, revealing the extent of zero-inflation across both temporal and hierarchical dimensions. Such information is crucial for interpreting the empirical results, as the presence of structural zeros introduces additional challenges for both model estimation and reconciliation. In particular, it highlights the practical relevance of enforcing non-negativity: when many series contain zeros or near-zero values, unconstrained reconciliation may easily generate negative forecasts, undermining interpretability and coherence. Consequently, this feature of the dataset further motivates the use of non-negative reconciliation procedures capable of handling sparse (or intermittent) time series effectively.

4.1. Forecasting Experiment

The forecasting experiment adopts a recursive evaluation design with an expanding training window. The initial training sample spans January 1998 to December 2008 (11 years, 132 months), and forecasts are generated for the subsequent year (2009). The training window is then progressively expanded by one month (e.g., January 1998–January 2009 to forecast February 2009–January 2010), continuing until the final training sample, January 1998–December 2015. This results in 85 distinct forecast origins and, correspondingly, 85 replications of the forecasting experiment.
Forecast horizons vary depending on the temporal aggregation level: up to twelve steps ahead for the monthly series, six for bimonthly, four for quarterly, three for four-monthly, two for semiannual, and one for annual aggregates. Base forecasts are generated using ARIMA and ETS [54] models, estimated by minimizing the corrected Akaike Information Criterion (AICc). Both modeling approaches employ the default implementations of the R package forecast [55], following the procedures described in Hyndman and Khandakar [56]. In addition, models are fitted to log-transformed series, with forecasts back-transformed as in Wickramasuriya et al. [13].
Among the four sets of base forecasts (arima, arima+log, ets, ets+log), only the ets+log forecasts were carried forward to reconciliation. This choice is justified by two considerations: first, ets+log forecasts are guaranteed to be non-negative; and second, they consistently achieve the lowest average relative mean squared error (AvgRelMSE; see Section 4.2) across most cases (see Figure A1 in Appendix B). Base forecasts were subsequently reconciled using the R package FoReco [40] with seven different reconciliation procedures:
  • cs(shr): optimal cross-sectional approach with shrinkage covariance matrix [12];
  • te(str): optimal temporal approach with diagonal covariance matrix based on the temporal structural matrix $\mathbf{S}_{te}$ [16];
  • te(wlsv): optimal temporal approach with diagonal covariance matrix based on in-sample residuals [16];
  • ite(wlsv, shr): iterative approach with temporal diagonal covariance matrix [16] and cross-sectional shrinkage covariance matrix [12];
  • ct(str): optimal cross-temporal approach with diagonal covariance matrix based on the cross-temporal structural matrix $\mathbf{S}_{ct}$ [16];
  • ct(wlsv): optimal cross-temporal approach with diagonal covariance matrix based on in-sample residuals [19];
  • ct(bdshr): optimal cross-temporal approach with block-diagonal covariance matrix based on in-sample residuals [19].
The first three procedures generate reconciled forecasts that are coherent along only one dimension (either cross-sectional or temporal).

4.2. Performance Measures for Multiple Comparisons

Forecast accuracy is evaluated using the mean squared error (MSE) and its relative counterpart. Let $\widehat{e}_{i,j,t}^{[k],h}$ denote the forecast error for series $i = 1, \ldots, n$, method $j = 0, \ldots, J$, temporal aggregation level $k \in \mathcal{K}$, forecast origin $t = 1, \ldots, q$, and forecast horizon $h = 1, \ldots, h_k$:
$$\widehat{e}_{i,j,t}^{[k],h} = y_{i,t+h}^{[k]} - \widehat{y}_{i,j,t}^{[k],h}.$$
The MSE is defined as the average of squared errors across forecast origins:
$$\mathrm{MSE}_{i,j}^{[k],h} = \frac{1}{q}\sum_{t=1}^{q} \Big(\widehat{e}_{i,j,t}^{[k],h}\Big)^2.$$
Relative mean squared error (rMSE) compares each method j against a benchmark (method 0):
$$\mathrm{rMSE}_{i,j}^{[k],h} = \frac{\mathrm{MSE}_{i,j}^{[k],h}}{\mathrm{MSE}_{i,0}^{[k],h}}.$$
The average relative MSE (AvgRelMSE) is then defined as the geometric mean of rMSE values across series, temporal aggregation levels, and horizons:
$$\mathrm{AvgRelMSE}_j = \Bigg(\prod_{i=1}^{n}\prod_{k \in \mathcal{K}}\prod_{h=1}^{h_k} \mathrm{rMSE}_{i,j}^{[k],h}\Bigg)^{\frac{1}{n\sum_{k \in \mathcal{K}} h_k}}.$$
This index can be reported for specific groups of variables, across multiple forecast horizons, and at different levels of temporal aggregation. To assess whether performance differences are statistically significant, we employ the non-parametric Friedman test and the post hoc Multiple Comparison with the Best (MCB) Nemenyi procedure [18,57,58,59].
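For reference, the AvgRelMSE can be computed from an array of forecast errors with a few lines of base R; the example below uses simulated errors and an assumed dimension ordering (series, method, origin, level-horizon), so it is only a sketch of the calculation:

set.seed(1)
# 3 series, 2 methods (method 1 = benchmark base forecasts), 5 origins, 4 horizons
err <- array(rnorm(3 * 2 * 5 * 4), dim = c(3, 2, 5, 4))

mse  <- apply(err^2, c(1, 2, 4), mean)                 # MSE over the forecast origins
rmse <- sweep(mse, c(1, 3), mse[, 1, ], "/")           # relative to the benchmark method
avg_rel_mse <- apply(rmse, 2, function(r) exp(mean(log(r))))  # geometric mean
avg_rel_mse                                            # one value per method (benchmark = 1)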

4.3. Non-Negative Reconciled Forecasts

Although the ets+log forecasts are non-negative by construction, reconciliation does not guarantee that this property is preserved. Indeed, negative reconciled forecasts occur under all reconciliation techniques considered (see Table 5). To address this, we evaluate eight alternative strategies for producing non-negative reconciled forecasts (Section 3): block principal pivoting (bpv) [13], operator splitting quadratic programming (osqp) [38], the negative forecasts correction algorithm (nfca) [60], iterative non-negative reconciliation with immutable constraints (nnic), and four variants of the set-negative-to-zero heuristic: the original bottom-up (sntz-bu) [20], top-down proportional (sntz-td-p), top-down squared proportional (sntz-td-sp), and top-down variance-weighted (sntz-td-vw).
In Section 4, we evaluate the performance of alternative procedures for reconciling forecasts that may produce negative values, focusing on two key dimensions of assessment: forecasting accuracy and computational efficiency. By comparing both optimization-based and heuristic approaches, we want to show that computationally lighter procedures can achieve performance comparable to, or even superior to, that of more demanding optimal procedures. In doing so, we provide implementation insights on how non-negativity constraints can be applied in a large-scale setting without compromising either coherence or accuracy.

4.4. Results

Table 6 summarizes the Average Relative MSE (AvgRelMSE) across temporal aggregation levels, reconciliation approaches, and non-negative procedures. Overall, the results indicate improvements in forecast performance for all series and temporal aggregation levels when using non-negative reconciliation strategies, compared to both the base forecasts (all AvgRelMSE values < 1) and the free reconciliations. The Multiple Comparison with the Best (MCB) tests (Figure A2 and Figure A3 in Appendix B) show that all non-negative approaches consistently outperform the base forecasts, while improvements over the free reconciliation are not always statistically significant.
When considering only the non-negative procedures, forecast accuracy does not provide a conclusive ranking. In this application, osqp and bpv converge to the same results, as expected. It is worth noting that the iterative non-negative reconciliation approach nnic also converges to the same outcomes, although this consistency may not generalize (see Appendix A). Relative to the optimal results guaranteed by osqp and bpv, the heuristic approaches perform competitively and, in some cases, even outperform the optimal methods. Notably, the sntz-bu and sntz-td-sp heuristics show particularly strong performance across the different temporal levels.
Analyzing the results separately for high-frequency bottom-level series and the remaining time series provides additional insights (Figure A2 and Figure A3 in Appendix B). For high-frequency bottom-level series, imposing non-negativity constraints significantly improves on both the free reconciliations and the base forecasts, as indicated by the MCB tests in Figure A2. However, for the temporal approaches (te(str) and te(wlsv)), the differences are not statistically significant, because the free reconciliation produces few negative values (see Table 5). While improvements for sntz-bu are expected, almost all the top-down sntz heuristics also provide statistically significant improvements for bottom-level series. Considering all the remaining time series, including both bottom-level series at temporal aggregation levels $k > 1$ and upper-level series at any temporal aggregation level (Figure A3), the free reconciliation is often competitive with several reconciliation approaches, such as ct(bdshr). However, heuristics like sntz-bu and sntz-td-sp are consistently statistically superior, or at least not worse than, the free reconciliation approach.
In addition to forecast accuracy, we evaluate the computational performance of the different non-negative procedures. Figure 3 presents boxplots of the reconciliation times across the replications of the forecasting experiment, while a more detailed analysis, including comparisons between the structural and projection formulations, is provided in Figure A4 of Appendix B. The boxplots in Figure 3 show that the numerical optimization procedures (nfca, nnic, bpv, and osqp) generally require considerably more computation time than set-negative-to-zero. In particular, osqp is highly sensitive to user-defined convergence parameters, resulting in longer computation times for larger structures, such as the cross-temporal cases (ct(str) and ct(wlsv)), while for smaller hierarchies (e.g., the temporal te(str) and te(wlsv)) its time is similar to sntz. Further information on the solver hyperparameter settings and computational environment used in the experiments is provided in Appendix C.
Among the optimization-based approaches, bpv offers a good balance between efficiency and accuracy, while nnic performs well with diagonal covariance structures but loses efficiency when the free reconciliation times increase, as observed for ct(bdshr). A further consideration concerns nfca, which is the most computationally demanding procedure, typically requiring more resources than both the optimization-based and heuristic alternatives. Achieving convergence was also more challenging: whereas bpv and osqp converged with tolerances of $10^{-6}$ (and nnic with $10^{-5}$), for nfca we had to relax the tolerance to $10^{-3}$. These factors substantially limit the practicality of this procedure in large-scale or time-sensitive settings. At the same time, all set-negative-to-zero heuristics showed computation times very close to the free reconciliation, confirming them as the fastest non-negative options.
In summary, the results show that non-negative reconciliation strategies consistently improve forecast accuracy while preserving coherence. Among these, the set-negative-to-zero heuristics, particularly sntz-bu and sntz-td-sp, stand out for achieving near-optimal accuracy at computational costs comparable to the free reconciliation. Figure 4, which plots median computation time against AvgRelMSE, confirms that these heuristics lie close to the efficiency frontier, whereas optimization-based methods deliver limited or no additional accuracy gains at substantially higher cost. Given that accuracy differences among methods are relatively small on average, computational efficiency becomes the decisive criterion for practical deployment. In this regard, sntz-bu and sntz-td-sp offer the best balance between accuracy and scalability, making them particularly well suited for large-scale, online, or real-time forecasting applications where fast reconciliation is essential.
The online appendix, provided as Supplementary Material, reports results using the mean absolute error as the accuracy metric, as well as the complete set of results for all alternative base forecasts (arima, ets, and arima+log). These additional analyses are consistent with the findings reported for ets+log.

5. Conclusions

In this paper, we examined non-negative reconciliation methods for multiple time series subject to linear aggregation constraints. Our results show that imposing non-negativity consistently improves the accuracy and interpretability of forecasts compared to both base forecasts and free reconciliations, while preserving coherence across all levels of aggregation. These gains are particularly evident for disaggregated series, where negative forecasts are most likely to arise, but we also observed improvements at more aggregated levels, indicating that the benefits propagate throughout the system.
A key contribution of our study is the systematic comparison of optimization-based and heuristic reconciliation procedures. While algorithms such as block principal pivoting and operator splitting quadratic programming deliver optimal reconciled solutions, they require substantial computation time and can be sensitive to parameter settings. Moreover, the iterative method nnic shows competitive results but with reduced efficiency in some settings, while the heuristic nfca is even more computationally demanding and particularly sensitive to convergence settings compared to the other procedures. Instead, we found that simple set-negative-to-zero heuristics, and especially the bottom-up and squared-proportional top-down variants, achieve near-optimal forecast accuracy at almost no additional computational cost. In several cases, these heuristics even outperformed the optimization-based methods, demonstrating their robustness, scalability, and suitability for large-scale forecasting problems.
In conclusion, we show that addressing non-negativity in reconciled forecasts is not only a methodological refinement, but a practical necessity for high-dimensional, real-time, and resource-constrained forecasting environments. Our results suggest that practitioners can confidently adopt heuristic reconciliation procedures to obtain accurate and coherent forecasts while avoiding the computational burden of more complex optimization methods. The empirical evidence presented in this study further indicates that the choice among non-negative reconciliation approaches should be guided by the intended application and computational constraints. The set-negative-to-zero (sntz) heuristics deliver near-optimal accuracy at negligible computational cost, making them particularly suitable for large-scale, online, or real-time forecasting settings. The block principal pivoting (bpv) algorithm provides a balanced trade-off between accuracy and computational efficiency, representing a robust default among optimization-based methods. Finally, the operator splitting quadratic program (osqp) offers the highest degree of flexibility, allowing for the inclusion of additional constraints, with acceptable tuning time but some sensitivity to tolerance settings in large cross-temporal hierarchies. Future research could extend this analysis by considering alternative loss functions or accuracy metrics that explicitly account for non-negativity constraints, thereby better reflecting the practical implications of sign violations in strictly non-negative forecasting applications.

Supplementary Materials

The following supporting information can be downloaded at: https://www.mdpi.com/article/10.3390/forecast7040064/s1, including extended tables and figures related to the Australian Tourism Demand case study for different base forecasts and loss functions.

Funding

This research received no external funding.

Data Availability Statement

Code and data for reproducing the results are available at https://github.com/danigiro/vn525nn, accessed on 23 October 2025.

Conflicts of Interest

The author declares no conflicts of interest.

Appendix A. NNIC vs. NNLS Algorithms

While NNLS algorithms (e.g., block principal pivoting) are active-set methods that may both add and release constraints until the optimal active set is identified, the proposed nnic is a monotone fix-at-zero scheme: once a component is found negative, it is treated as immutable in subsequent reconciliations. Hence, nnic coincides with NNLS whenever the NNLS-optimal active set can be obtained by a monotone expansion from the negatives of the free reconciliation. Conversely, if optimality requires releasing some index, nnic may converge to a different feasible non-negative solution.
Consider a cross-sectional system with two aggregates and three bottom series,
$$a_1 = b_1 + b_2, \qquad a_2 = b_2 + b_3,$$
and let $\mathbf{y} = [a_1 \;\; a_2 \;\; b_1 \;\; b_2 \;\; b_3]'$. The cross-sectional zero-constraints matrix and a diagonal weight matrix are, respectively,
$$\mathbf{C}_{cs} = \begin{bmatrix} 1 & 0 & -1 & -1 & 0 \\ 0 & 1 & 0 & -1 & -1 \end{bmatrix}, \qquad \boldsymbol{\Omega} = \mathrm{diag}(1, 1, 0.5, 1, 0.5),$$
with base forecasts
$$\widehat{\mathbf{y}} = [-1.5330 \;\; 0.7408 \;\; {-0.8774} \;\; 1.5604 \;\; {-0.1223}]'.$$
The reconciled vectors obtained with the unconstrained projection (free), the exact non-negative least squares (NNLS, solved via osqp/bpv), and the proposed nnic procedure are
$$\begin{aligned} \widetilde{\mathbf{y}}_{free} &= [-0.6106, \; 0.6508, \; -1.3386, \; 0.7280, \; -0.0773]', \\ \widetilde{\mathbf{y}}_{bpv/osqp} &= [0.2261, \; 0.3161, \; 0, \; 0.2261, \; 0.0901]', \\ \widetilde{\mathbf{y}}_{nnic} &= [0.2561, \; 0.2561, \; 0, \; 0.2561, \; 0]'. \end{aligned}$$
All reconciled vectors satisfy the aggregation constraints. Both $\widetilde{\mathbf{y}}_{bpv/osqp}$ and $\widetilde{\mathbf{y}}_{nnic}$ are non-negative, but they differ because the immutable zero set imposed by nnic does not coincide with the optimal active set of the NNLS solution. In this example, the NNLS optimum requires releasing one previously fixed component, which nnic, by construction, cannot do.

Appendix B. Australian Tourism Demand: Tables and Figures

In this Appendix, we provide supplementary material to the empirical analysis presented in Section 4. It includes a detailed description of the dataset and additional tables and figures illustrating forecast performance. Specifically, we present:
  • the geographic divisions of the Australian Tourism Demand dataset (Table A1);
  • the average relative MSE (AvgRelMSE) for the automatic ARIMA and ETS base forecasts on both the original and the log-transformed data (Figure A1);
  • additional results from the Multiple Comparison with the Best (MCB) tests, highlighting statistically significant differences among reconciliation strategies for both bottom-level (Figure A2) and upper-level (Figure A3) series;
  • boxplots of computational times for the different reconciliation methods across replications, together with comparisons between the structural and projection approaches (Figure A4 and Figure A5).
Table A1. Geographical divisions of Australia in States, Zones and Regions. Zones formed by a single region are marked with an asterisk (*).
Series | Name | Label
Total
1 | Australia | Total
States
2 | New South Wales (NSW) | A
3 | Victoria (VIC) | B
4 | Queensland (QLD) | C
5 | South Australia (SA) | D
6 | Western Australia (WA) | E
7 | Tasmania (TAS) | F
8 | Northern Territory (NT) | G
Zones
9 | Metro NSW | AA
10 | Nth Coast NSW | AB
– | Sth Coast NSW * | AC
11 | Sth NSW | AD
12 | Nth NSW | AE
– | ACT * | AF
13 | Metro VIC | BA
– | West Coast VIC * | BB
14 | East Coast VIC | BC
15 | Nth East VIC | BD
16 | Nth West VIC | BE
17 | Metro QLD | CA
18 | Central Coast QLD | CB
19 | Nth Coast QLD | CC
20 | Inland QLD | CD
21 | Metro SA | DA
22 | Sth Coast SA | DB
23 | Inland SA | DC
24 | West Coast SA | DD
25 | West Coast WA | EA
– | Nth WA * | EB
– | Sth WA * | EC
– | Sth TAS * | FA
26 | Nth East TAS | FB
27 | Nth West TAS | FC
28 | Nth Coast NT | GA
29 | Central NT | GB
Regions
30 | Sydney | AAA
31 | Central Coast | AAB
32 | Hunter | ABA
33 | North Coast NSW | ABB
34 | South Coast | ACA
35 | Snowy Mountains | ADA
36 | Capital Country | ADB
37 | The Murray | ADC
38 | Riverina | ADD
39 | Central NSW | AEA
40 | New England North West | AEB
41 | Outback NSW | AEC
42 | Blue Mountains | AED
43 | Canberra | AFA
44 | Melbourne | BAA
45 | Peninsula | BAB
46 | Geelong | BAC
47 | Western | BBA
48 | Lakes | BCA
49 | Gippsland | BCB
50 | Phillip Island | BCC
51 | Central Murray | BDA
52 | Goulburn | BDB
53 | High Country | BDC
54 | Melbourne East | BDD
55 | Upper Yarra | BDE
56 | Murray East | BDF
57 | Mallee | BEA
58 | Wimmera | BEB
59 | Western Grampians | BEC
60 | Bendigo Loddon | BED
61 | Macedon | BEE
62 | Spa Country | BEF
63 | Ballarat | BEG
64 | Central Highlands | BEG
65 | Gold Coast | CAA
66 | Brisbane | CAB
67 | Sunshine Coast | CAC
68 | Central Queensland | CBA
69 | Bundaberg | CBB
70 | Fraser Coast | CBC
71 | Mackay | CBD
72 | Whitsundays | CCA
73 | Northern | CCB
74 | Tropical North Queensland | CCC
75 | Darling Downs | CDA
76 | Outback | CDB
77 | Adelaide | DAA
78 | Barossa | DAB
79 | Adelaide Hills | DAC
80 | Limestone Coast | DBA
81 | Fleurieu Peninsula | DBB
82 | Kangaroo Island | DBC
83 | Murraylands | DCA
84 | Riverland | DCB
85 | Clare Valley | DCC
86 | Flinders Range and Outback | DCD
87 | Eyre Peninsula | DDA
88 | Yorke Peninsula | DDB
89 | Australia's Coral Coast | EAA
90 | Experience Perth | EAB
91 | Australia's South West | EAC
92 | Australia's North West | EBA
93 | Australia's Golden Outback | ECA
94 | Hobart and the South | FAA
95 | East Coast | FBA
96 | Launceston, Tamar and the North | FBB
97 | North West | FCA
98 | Wilderness West | FCB
99 | Darwin | GAA
100 | Kakadu Arnhem | GAB
101 | Katherine Daly | GAC
102 | Barkly | GBA
103 | Lasseter | GBB
104 | Alice Springs | GBC
105 | MacDonnell | GBD
Source: [12,21].
Figure A1. Average relative MSE (AvgRelMSE) across different temporal aggregation levels (monthly 1, two-monthly 2, quarterly 3, four-monthly 4, semi-annual 6, annual 12 and all) for the automatic ARIMA and ETS base forecasts on both levels and log-transformed data: arima/ets stand for automatic ARIMA/ETS on the original data; arima+log/ets+log stand for automatic ARIMA/ETS on the log-transformed data.
Figure A2. Results of the MCB Nemenyi test across the high-frequency bottom time series evaluated over all forecast horizons (h = 1, …, 12) grouped by reconciliation approach (cs(shr), te(str), te(wlsv), ite(wlsv, shr), ct(str), ct(wlsv), ct(bdshr)). In each panel, the p-value of the Friedman test is reported in the lower-right corner, while the mean rank of each method is displayed to the right of its label. Statistically significant differences in forecasting performance are identified based on the overlap of confidence intervals (indicated in blue): approaches whose intervals do not intersect (indicated by red circles; blue triangles otherwise) are considered significantly different. In particular, any approach whose interval does not overlap with that of the best-performing (green-highlighted) method is considered significantly worse.
Figure A3. MCB Nemenyi test across the bottom-level series at temporal aggregation levels k > 1 and upper-level series at any temporal aggregation level, evaluated over all forecast horizons (h = 1, …, H_k) grouped by reconciliation approach (cs(shr), te(str), te(wlsv), ite(wlsv, shr), ct(str), ct(wlsv), ct(bdshr)). In each panel, the p-value of the Friedman test is reported in the lower-right corner, while the mean rank of each method is displayed to the right of its label. Statistically significant differences in forecasting performance are identified based on the overlap of confidence intervals (indicated in blue): approaches whose intervals do not intersect (indicated by red circles; blue triangles otherwise) are considered significantly different. In particular, any approach whose interval does not overlap with that of the best-performing (green-highlighted) method is considered significantly worse.
Figure A4. Boxplot of computational times, computed over the replications of the forecast experiments, for the different reconciliation approaches (cs(shr), te(str), te(wlsv), ite(wlsv, shr), ct(str), ct(wlsv), ct(bdshr)) and non-negativity procedures (bpv, osqp, nfca, nnic, sntz bu, sntz tdp, sntz tdsp, sntz tdvw); free indicates the unrestricted reconciled forecasts, which may include negative values. Results are reported using both the projection (5) and the structural (8) equations.
Figure A5. Computational scaling of reconciliation algorithms. Boxplots report the distribution of computation times (in seconds, log scale) for different hierarchical structures and reconciliation schemes. The horizontal axis represents the number of nodes in the hierarchy (temporal, cross-sectional, and cross-temporal systems with 28, 525, and 525 × 28 series, respectively).

Appendix C. Solver Hyperparameters and Computational Environment

All non-negativity algorithms were executed using consistent numerical tolerances and iteration settings across reconciliation structures. For clarity, we summarize the configurations by reconciliation dimension and indicate the default settings provided by the FoReco package [40].
  osqp (cross-sectional reconciliation):
    max_iter = 10000, check_termination = 25, eps_abs = 1e-5, eps_rel = 0,
    eps_dual_inf = 1e-7, polish = TRUE, polish_refine_iter = 500.
  osqp (temporal reconciliation):
    max_iter = 10000, check_termination = 20, eps_abs = 1e-5, eps_rel = 1e-6,
    eps_dual_inf = 1e-7, polish = TRUE, polish_refine_iter = 100.
  osqp (cross-temporal reconciliation):
    max_iter = 1000000, check_termination = 20, eps_abs = 1e-6, eps_rel = 0,
    polish = TRUE, polish_refine_iter = 500.
  nnic:
    tol = 1e-5, itmax = 100.
  nfca:
    tol = 1e-3, itmax = 100.
  bpv:
    ptype = "fixed", par = 10, tol = gtol = sqrt(.Machine$double.eps), itmax = 100.
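As an illustration of how these values enter the solver, the sketch below builds the cross-sectional osqp configuration listed above through osqpSettings() from the osqp R package [50]; the object name cs_settings is arbitrary and all remaining settings keep the package defaults.

    library(osqp)

    # Cross-sectional osqp configuration reported above; unspecified
    # settings are left at the package defaults.
    cs_settings <- osqpSettings(
      max_iter = 10000L, check_termination = 25L,
      eps_abs = 1e-5, eps_rel = 0, eps_dual_inf = 1e-7,
      polish = TRUE, polish_refine_iter = 500L
    )

The temporal and cross-temporal configurations are obtained in the same way by substituting the corresponding values.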
All experiments were performed on a Windows 10 (build 19045) workstation equipped with an Intel® Core™ i7-10700 CPU (8 cores, 16 threads, 2.90 GHz) and 64 GB RAM. Analyses were conducted in R 4.4.0 (2024-04-24, ucrt).

References

  1. Petropoulos, F.; Apiletti, D.; Assimakopoulos, V.; Babai, M.; Barrow, D.; Bergmeir, C.; Bessa, R.; Boylan, J.; Browell, J.; Carnevale, C.; et al. Forecasting: Theory and practice. Int. J. Forecast. 2022, 38, 705–871.
  2. Kourentzes, N. Toward a one-number forecast: Cross-temporal hierarchies. Foresight Int. J. Appl. Forecast. 2022, 67, 32–40.
  3. Hyndman, R.J.; Ahmed, R.A.; Athanasopoulos, G.; Shang, H.L. Optimal combination forecasts for hierarchical time series. Comput. Stat. Data Anal. 2011, 55, 2579–2589.
  4. Athanasopoulos, G.; Hyndman, R.J.; Kourentzes, N.; Panagiotelis, A. Forecast reconciliation: A review. Int. J. Forecast. 2024, 40, 430–456.
  5. Dangerfield, B.; Morris, J. Top-down or bottom-up: Aggregate versus disaggregate extrapolations. Int. J. Forecast. 1992, 6, 233–241.
  6. Orcutt, G.H.; Watts, H.W.; Edwards, J.B. Data aggregation and information loss. Am. Econ. Rev. 1968, 58, 773–787. Available online: https://www.jstor.org/stable/1815532 (accessed on 23 October 2025).
  7. Dunn, D.M.; Williams, W.H.; Dechaine, T.L. Aggregate versus subaggregate models in local area forecasting. J. Am. Stat. Assoc. 1976, 71, 68–71.
  8. Gross, C.W.; Sohl, J.E. Disaggregation methods to expedite product line forecasting. J. Forecast. 1990, 9, 233–254.
  9. Fliedner, G. Hierarchical forecasting: Issues and use guidelines. Ind. Manag. Data Syst. 2001, 101, 5–12.
  10. Athanasopoulos, G.; Ahmed, R.A.; Hyndman, R.J. Hierarchical forecasts for Australian domestic tourism. Int. J. Forecast. 2009, 25, 146–166.
  11. Hyndman, R.J.; Lee, A.J.; Wang, E. Fast computation of reconciled forecasts for hierarchical and grouped time series. Comput. Stat. Data Anal. 2016, 97, 16–32.
  12. Wickramasuriya, S.L.; Athanasopoulos, G.; Hyndman, R.J. Optimal Forecast Reconciliation for Hierarchical and Grouped Time Series Through Trace Minimization. J. Am. Stat. Assoc. 2019, 114, 804–819.
  13. Wickramasuriya, S.L.; Turlach, B.A.; Hyndman, R.J. Optimal non-negative forecast reconciliation. Stat. Comput. 2020, 30, 1167–1182.
  14. Panagiotelis, A.; Gamakumara, P.; Athanasopoulos, G.; Hyndman, R.J. Probabilistic forecast reconciliation: Properties, evaluation and score optimisation. Eur. J. Oper. Res. 2023, 306, 693–706.
  15. Panagiotelis, A.; Athanasopoulos, G.; Gamakumara, P.; Hyndman, R.J. Forecast reconciliation: A geometric view with new insights on bias correction. Int. J. Forecast. 2021, 37, 343–359.
  16. Athanasopoulos, G.; Hyndman, R.J.; Kourentzes, N.; Petropoulos, F. Forecasting with temporal hierarchies. Eur. J. Oper. Res. 2017, 262, 60–74.
  17. Nystrup, P.; Lindström, E.; Pinson, P.; Madsen, H. Temporal hierarchies with autocorrelation for load forecasting. Eur. J. Oper. Res. 2020, 280, 876–888.
  18. Kourentzes, N.; Athanasopoulos, G. Cross-temporal coherent forecasts for Australian tourism. Ann. Tour. Res. 2019, 75, 393–409.
  19. Di Fonzo, T.; Girolimetto, D. Cross-temporal forecast reconciliation: Optimal combination method and heuristic alternatives. Int. J. Forecast. 2023, 39, 39–57.
  20. Di Fonzo, T.; Girolimetto, D. Spatio-temporal reconciliation of solar forecasts. Sol. Energy 2023, 251, 13–29.
  21. Girolimetto, D.; Athanasopoulos, G.; Di Fonzo, T.; Hyndman, R.J. Cross-temporal probabilistic forecast reconciliation: Methodological and practical issues. Int. J. Forecast. 2024, 40, 1134–1151.
  22. Rombouts, J.; Ternes, M.; Wilms, I. Cross-temporal forecast reconciliation at digital platforms with machine learning. Int. J. Forecast. 2025, 41, 321–344.
  23. Pennings, C.L.P.; van Dalen, J. Integrated hierarchical forecasting. Eur. J. Oper. Res. 2017, 263, 412–418.
  24. Karmy, J.P.; Maldonado, S. Hierarchical time series forecasting via support vector regression in the European travel retail industry. Expert Syst. Appl. 2019, 137, 59–73.
  25. Punia, S.; Singh, S.P.; Madaan, J.K. A cross-temporal hierarchical framework and deep learning for supply chain forecasting. Comput. Ind. Eng. 2020, 149, 106796.
  26. Yang, D.; Quan, H.; Disfani, V.R.; Liu, L. Reconciling solar forecasts: Geographical hierarchy. Sol. Energy 2017, 146, 276–286.
  27. Yang, D.; Quan, H.; Disfani, V.R.; Rodríguez-Gallegos, C.D. Reconciling solar forecasts: Temporal hierarchy. Sol. Energy 2017, 158, 332–346.
  28. Yang, D. A guideline to solar forecasting research practice: Reproducible, operational, probabilistic or physically-based, ensemble, and skill (ROPES). J. Renew. Sustain. Energy 2019, 11, 022701.
  29. Ben Taieb, S.; Taylor, J.W.; Hyndman, R.J. Hierarchical probabilistic forecasting of electricity demand with smart meter data. J. Am. Stat. Assoc. 2021, 116, 27–43.
  30. Hansen, M.E.; Peter, N.; Møller, J.K.; Henrik, M. Reconciliation of wind power forecasts in spatial hierarchies. Wind Energy 2023, 26, 615–632.
  31. Abolghasemi, M.; Girolimetto, D.; Di Fonzo, T. Improving cross-temporal forecasts reconciliation accuracy and utility in energy market. Appl. Energy 2025, 394, 126053.
  32. Hollyman, R.; Petropoulos, F.; Tipping, M.E. Understanding forecast reconciliation. Eur. J. Oper. Res. 2021, 294, 149–160.
  33. Di Fonzo, T.; Girolimetto, D. Forecast combination-based forecast reconciliation: Insights and extensions. Int. J. Forecast. 2024, 40, 490–514.
  34. Athanasopoulos, G.; Gamakumara, P.; Panagiotelis, A.; Hyndman, R.J.; Affan, M. Hierarchical Forecasting. In Macroeconomic Forecasting in the Era of Big Data; Fuleky, P., Ed.; Springer International Publishing: Cham, Switzerland, 2020; Volume 52, pp. 689–719.
  35. Lawson, C.L.; Hanson, R.J. Solving Least Squares Problems; Prentice Hall: Hoboken, NJ, USA, 1974.
  36. Júdice, J.J.; Pires, F.M. A Block Principal Pivoting Algorithm for Large-Scale Strictly Monotone Linear Complementarity Problems. Comput. Oper. Res. 1994, 21, 587–596.
  37. Kourentzes, N.; Athanasopoulos, G. Elucidate structure in intermittent demand series. Eur. J. Oper. Res. 2021, 288, 141–152.
  38. Stellato, B.; Banjac, G.; Goulart, P.; Bemporad, A.; Boyd, S. OSQP: An operator splitting solver for quadratic programs. Math. Program. Comput. 2020, 12, 637–672.
  39. Zhang, B.; Kang, Y.; Panagiotelis, A.; Li, F. Optimal reconciliation with immutable forecasts. Eur. J. Oper. Res. 2023, 308, 650–660.
  40. Girolimetto, D.; Di Fonzo, T. FoReco: Forecast Reconciliation, Version 1.1.0; CRAN: Vienna, Austria, 2025.
  41. Girolimetto, D.; Di Fonzo, T. Point and probabilistic forecast reconciliation for general linearly constrained multiple time series. Stat. Methods Appl. 2024, 33, 581–607.
  42. Magnus, J.R.; Neudecker, H. Matrix Differential Calculus with Applications in Statistics and Econometrics; Wiley: Hoboken, NJ, USA, 2019.
  43. Stone, R.; Champernowne, D.G.; Meade, J.E. The precision of national income estimates. Rev. Econ. Stud. 1942, 9, 111–125.
  44. Byron, R.P. The Estimation of Large Social Account Matrices. J. R. Stat. Soc. Ser. A 1978, 141, 359–367, Erratum in J. R. Stat. Soc. Ser. A 1979, 142, 405. https://doi.org/10.2307/2982515.
  45. Nystrup, P.; Lindström, E.; Møller, J.K.; Madsen, H. Dimensionality reduction in forecasting with temporal hierarchies. Int. J. Forecast. 2021, 37, 1127–1146.
  46. Yagli, G.M.; Yang, D.; Srinivasan, D. Reconciling solar forecasts: Sequential reconciliation. Sol. Energy 2019, 179, 391–397.
  47. Karjalainen, E.J.; Karjalainen, U.P. Component reconstruction in the primary space of spectra and concentrations. Alternating regression and related direct methods. Anal. Chim. Acta 1991, 250, 169–179.
  48. Berry, M.W.; Browne, M.; Langville, A.N.; Pauca, V.P.; Plemmons, R.J. Algorithms and applications for approximate nonnegative matrix factorization. Comput. Stat. Data Anal. 2007, 52, 155–173.
  49. Chen, D.; Plemmons, R.J. Nonnegativity constraints in numerical analysis. In The Birth of Numerical Analysis; World Scientific: Singapore, 2009; pp. 109–139.
  50. Stellato, B.; Banjac, G.; Goulart, P.; Boyd, S.; Anderson, E. Package OSQP: Quadratic Programming Solver Using the ‘OSQP’ Library, Version 0.6.3.3; CRAN: Vienna, Austria, 2014.
  51. Boyd, S.; Parikh, N.; Chu, E.; Peleato, B.; Eckstein, J. Distributed optimization and statistical learning via the alternating direction method of multipliers. Found. Trends® Mach. Learn. 2011, 3, 1–122.
  52. Gabay, D.; Mercier, B. A dual algorithm for the solution of nonlinear variational problems via finite element approximation. Comput. Math. Appl. 1976, 2, 17–40.
  53. Van Benthem, M.H.; Keenan, M.R. Fast algorithm for the solution of large-scale non-negativity-constrained least squares problems. J. Chemom. 2004, 18, 441–450.
  54. Hyndman, R.J.; Koehler, A.B.; Ord, J.K.; Snyder, R.D. Forecasting with Exponential Smoothing. The State Space Approach; Cambridge University Press: Cambridge, UK, 2008.
  55. Hyndman, R.; Athanasopoulos, G.; Bergmeir, C.; Caceres, G.; Chhay, L.; O’Hara-Wild, M.; Petropoulos, F.; Razbash, S.; Wang, E.; Yasmeen, F. Forecast: Forecasting Functions for Time Series and Linear Models, Version 8.24.0; CRAN: Vienna, Austria, 2025.
  56. Hyndman, R.J.; Khandakar, Y. Automatic time series forecasting: The forecast package for R. J. Stat. Softw. 2008, 26, 1–22.
  57. Koning, A.J.; Franses, P.H.; Hibon, M.; Stekler, H.O. The M3 competition: Statistical tests of the results. Int. J. Forecast. 2005, 21, 397–409.
  58. Makridakis, S.; Spiliotis, E.; Assimakopoulos, V. The M5 Accuracy competition: Results, findings and conclusions. Int. J. Forecast. 2022, 38, 1346–1364.
  59. Kourentzes, N.; Svetunkov, I.; Schaer, O. Tsutils: Time Series Exploration, Modelling and Forecasting, Version 0.9.4; CRAN: Vienna, Austria, 2023.
  60. Athanasopoulos, G.; Kourentzes, N. On the evaluation of hierarchical forecasts. Int. J. Forecast. 2022, 39, 1502–1511.
Figure 1. Empirical convergence stability of the top-down sntz procedure. The horizontal axis reports the number of iterations, and the vertical axis shows the percentage of reconciled vectors that achieved convergence after the corresponding number of iterations. Most cases converged after the first iteration, a small fraction required a second pass, and no instance required more than two iterations.
Figure 2. Geographical divisions of Australia into States, Zones and Regions. See Table A1 for the meaning of the labels.
Figure 3. Boxplot of computational times, computed over the replications of the forecast experiments, for the different reconciliation approaches (cs(shr), te(str), te(wlsv), ite(wlsv, shr), ct(str), ct(wlsv), ct(bdshr)) and non-negativity procedures (bpv, osqp, nfca, nnic, sntz bu, sntz tdp, sntz tdsp, sntz tdvw); free indicates the unrestricted reconciled forecasts, which may include negative values.
Figure 4. Median computational time versus AvgRelMSE for all the variables at any temporal aggregation order, for the different reconciliation approaches (cs(shr), te(str), te(wlsv), ite(wlsv, shr), ct(str), ct(wlsv), ct(bdshr)) and non-negativity procedures (bpv, osqp, nfca, nnic, sntz bu, sntz tdp, sntz tdsp, sntz tdvw).
Table 1. Summary of the main symbols used in Section 2.2, including their descriptions and equivalent notation in the cross-sectional (cs), temporal (te), and cross-temporal (ct) forecasting frameworks. Dimensions and indexing conventions are provided to clarify how the general notation maps to each specific framework. Note that, starting from the matrices X, X̂ and X̃ described in Section 2.1, the cross-sectional framework corresponds to working on a single column j (X{·,j}, X̂{·,j}, X̃{·,j}), the temporal framework on a single row i (X{i,·}, X̂{i,·}, X̃{i,·}), and the cross-temporal framework on the vectorised form (vec(X), vec(X̂), vec(X̃)). In addition, we omit the subscript h to simplify notation.
Symbol | Description | cs | te | ct
x | (n* × 1) vector of true (unknown) coherent values | X{·,j} | X{i,·} | vec(X)
x̂ | (n* × 1) vector of incoherent base forecasts | X̂{·,j} | X̂{i,·} | vec(X̂)
x̃ | (n* × 1) vector of reconciled forecasts | X̃{·,j} | X̃{i,·} | vec(X̃)
C | (n_a* × n*) zero-constraints matrix such that Cx = 0 | C_cs | C_te | C_ct
S | (n* × n_b*) structural matrix | S_cs | S_te | S_ct
n* | total number of elements in the vectors | n | m + k* | n(m + k*)
n_a*, n_b* | numbers of rows and columns in the zero-constraints matrix C and the structural matrix S, respectively | n_a, n_b | k*, m | nm + n_b k*, n_b m
β | (n_b* × 1) vector of target forecasts corresponding to the free components (cross-sectional) / high-frequency univariate series (temporal) / high-frequency free series (cross-temporal) | | |
Table 2. Reconciled forecasts for the toy example introduced in Section 2. The column free reports the unrestricted reconciled forecasts, which may include negative values, while the columns sntz bu, sntz tdp, sntz tdsp, and sntz tdvw show the results obtained with alternative non-negative reconciliation schemes, which enforce coherence and eliminate negative components.
Type of Forecast
Variable | free | sntz bu | sntz tdp | sntz tdsp | sntz tdvw
ã1 | 40 | 45 | 40 | 40 | 40
b̃1 | 35 | 35 | 31.1 | 30.4 | 31
b̃2 | −5 | 0 | 0 | 0 | 0
b̃3 | 10 | 10 | 8.9 | 9.6 | 9
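Note: the sntz tdp column is consistent with zeroing the negative bottom forecast and redistributing the free reconciled total in proportion to the remaining bottom values, i.e., b̃1 = 40 × 35/(35 + 0 + 10) ≈ 31.1, b̃2 = 0 and b̃3 = 40 × 10/45 ≈ 8.9, so that the bottom forecasts again add up to the total of 40.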
Table 3. Grouped time series for Australian tourism flows.
Number of Series
 | Geographical Division (GD) | Purpose of Travel (PoT) | Total
Australia | 1 | 4 | 5
States | 7 | 28 | 35
Zones * | 21 | 84 | 105
Regions | 76 | 304 | 380
Total | 105 | 420 | 525
* 6 Zones with only one Region are included in the Regions.
Table 4. Number (#) and percentage of time series with values equal to 0, divided by temporal aggregation order (k) and cross-sectional level (L0 Australia, L1 States, L2 Zones, L3 Regions, L4 PoT, L5 = L1 × L4, L6 = L2 × L4, L7 = L3 × L4).
k | L0 | L1 | L2 | L3 | L4 | L5 | L6 | L7 | Tot
# Series | 1 | 7 | 21 | 76 | 4 | 28 | 84 | 304 | 525
1 | | | | 13 (17%) | | 1 (4%) | 25 (30%) | 200 (66%) | 239 (46%)
2 | | | | 1 (1%) | | 1 (4%) | 13 (15%) | 131 (43%) | 146 (28%)
3 | | | | | | | 6 (7%) | 98 (32%) | 104 (20%)
4 | | | | | | | 3 (4%) | 76 (25%) | 79 (15%)
6 | | | | | | | | 54 (18%) | 54 (10%)
12 | | | | | | | | 16 (5%) | 16 (3%)
Table 5. Summary statistics of the negative reconciled forecasts using ETS base forecasts (with the log transformation): number of replications with at least one negative value (# rep); minimum and maximum number of series with at least one negative value in a single replication (# series); minimum and maximum of the negative values across all replications (values).
Label# rep# seriesValues# rep# seriesValues
minmaxminmax minmaxminmax
Monthly forecasts ( k = 1 ) Two-monthly forecasts ( k = 2 )
c s ( s h r ) 85212−58.35−0.000318518−83.01−0.00064
t e ( s t r ) 4114−4.81−0.01459711−1.27−0.06806
t e ( w l s v ) 4315−5.29−0.00514511−1.13−0.18561
i t e ( w l s v , s h r ) 8528−18.89−0.001257314−21.21−0.00102
c t ( s t r ) 851032−45.33−0.0000485117−71.76−0.00510
c t ( w l s v ) 8516−29.97−0.005586014−30.81−0.01798
c t ( b d s h r ) 85213−21.12−0.000648418−20.15−0.00558
Quarterly forecasts ( k = 3 ) Four-monthly forecasts ( k = 4 )
c s ( s h r ) 7916−71.64−0.000516914−53.33−0.00108
t e ( s t r ) 1 –
t e ( w l s v ) 1 –
i t e ( w l s v , s h r ) 4913−20.60−0.000644213−23.73−0.00851
c t ( s t r ) 84110−84.84−0.007657918−100.25−0.00223
c t ( w l s v ) 3112−19.00−0.012252513−21.76−0.06677
c t ( b d s h r ) 8216−26.81−0.002317115−30.29−0.00017
Semi-annual forecasts ( k = 6 ) Annual forecasts ( k = 12 )
c s ( s h r ) 4212−26.78−0.00734511−23.60−0.60212
t e ( s t r )
t e ( w l s v )
i t e ( w l s v , s h r ) 2113−23.07−0.01458511−16.51−1.89363
c t ( s t r ) 6917−130.78−0.050214312−131.82−0.19975
c t ( w l s v ) 1112−19.76−0.39761411−12.83−1.74552
c t ( b d s h r ) 5613−35.96−0.01885911−33.97−1.08399
Table 6. Average relative MSE (AvgRelMSE) across different temporal aggregation levels (monthly 1, two-monthly 2, quarterly 3, four-monthly 4, semi-annual 6, annual 12 and all), reconciliation approaches (cs(shr), te(str), te(wlsv), ite(wlsv, shr), ct(str), ct(wlsv), ct(bdshr)) and non-negativity procedures (bpv, osqp, nfca, nnic, sntz bu, sntz tdp, sntz tdsp, sntz tdvw); free indicates the unrestricted reconciled forecasts, which may include negative values. In the original typeset table, bold values indicate the best performing approach within each block (reconciliation approach), while blue underlined and green italic values denote the overall best and the second-best performance, respectively.
Temporal Aggregation Level
Label | 1 | 2 | 3 | 4 | 6 | 12 | all | 1 | 2 | 3 | 4 | 6 | 12 | all
cs(shr) | te(str)
free | 0.979279 | 0.977149 | 0.968516 | 0.959679 | 0.950190 | 0.942428 | 0.971733 | 0.978197 | 0.981807 | 0.979275 | 0.970618 | 0.955839 | 0.948815 | 0.975631
bpv / osqp / nnic | 0.977990 | 0.976276 | 0.968187 | 0.959392 | 0.950029 | 0.942396 | 0.970907 | 0.978193 | 0.981805 | 0.979276 | 0.970617 | 0.955841 | 0.948814 | 0.975628
nfca | 0.977953 | 0.976012 | 0.968163 | 0.959401 | 0.950000 | 0.942398 | 0.970831 | 0.978193 | 0.981805 | 0.979276 | 0.970617 | 0.955841 | 0.948814 | 0.975628
sntz bu | 0.978009 | 0.976293 | 0.968147 | 0.959379 | 0.949981 | 0.942395 | 0.970908 | 0.978194 | 0.981804 | 0.979276 | 0.970616 | 0.955843 | 0.948813 | 0.975629
sntz tdp | 0.978054 | 0.976361 | 0.968155 | 0.959399 | 0.949989 | 0.942387 | 0.970946 | 0.978192 | 0.981803 | 0.979274 | 0.970615 | 0.955841 | 0.948815 | 0.975627
sntz tdsp | 0.978035 | 0.976378 | 0.968159 | 0.959398 | 0.949988 | 0.942387 | 0.970942 | 0.978192 | 0.981803 | 0.979274 | 0.970615 | 0.955840 | 0.948815 | 0.975627
sntz tdvw | 0.978067 | 0.976383 | 0.968155 | 0.959401 | 0.949988 | 0.942385 | 0.970956 | 0.978194 | 0.981804 | 0.979276 | 0.970617 | 0.955843 | 0.948815 | 0.975628
te(wlsv) | ite(wlsv, shr)
free | 0.977797 | 0.981215 | 0.978636 | 0.969898 | 0.955040 | 0.948397 | 0.975091 | 0.965780 | 0.965329 | 0.958496 | 0.946306 | 0.925542 | 0.906115 | 0.957432
bpv / osqp / nnic | 0.977791 | 0.981213 | 0.978637 | 0.969897 | 0.955041 | 0.948396 | 0.975088 | 0.965324 | 0.965132 | 0.958274 | 0.946106 | 0.925353 | 0.905947 | 0.957122
nfca | 0.977791 | 0.981213 | 0.978637 | 0.969897 | 0.955041 | 0.948396 | 0.975088 | 0.965304 | 0.965104 | 0.958249 | 0.946084 | 0.925330 | 0.905920 | 0.957099
sntz bu | 0.977793 | 0.981213 | 0.978638 | 0.969896 | 0.955044 | 0.948396 | 0.975089 | 0.965339 | 0.965113 | 0.958252 | 0.946079 | 0.925340 | 0.905960 | 0.957118
sntz tdp | 0.977791 | 0.981211 | 0.978636 | 0.969895 | 0.955041 | 0.948397 | 0.975087 | 0.965258 | 0.965062 | 0.958206 | 0.946044 | 0.925314 | 0.905983 | 0.957062
sntz tdsp | 0.977789 | 0.981211 | 0.978635 | 0.969895 | 0.955040 | 0.948397 | 0.975087 | 0.965227 | 0.965033 | 0.958178 | 0.946018 | 0.925291 | 0.905981 | 0.957034
sntz tdvw | 0.977793 | 0.981213 | 0.978638 | 0.969896 | 0.955044 | 0.948397 | 0.975089 | 0.965305 | 0.965105 | 0.958246 | 0.946075 | 0.925341 | 0.905980 | 0.957102
ct(str) | ct(wlsv)
free | 0.984083 | 0.986868 | 0.981204 | 0.970437 | 0.949994 | 0.933418 | 0.978475 | 0.971016 | 0.972262 | 0.967195 | 0.956613 | 0.938178 | 0.922365 | 0.965031
bpv / osqp / nnic | 0.981578 | 0.984560 | 0.978940 | 0.968185 | 0.948245 | 0.931837 | 0.976164 | 0.970660 | 0.972115 | 0.967051 | 0.956487 | 0.938074 | 0.922239 | 0.964802
nfca | 0.981578 | 0.984570 | 0.978937 | 0.968178 | 0.948243 | 0.931810 | 0.976163 | 0.970659 | 0.972109 | 0.967043 | 0.956480 | 0.938067 | 0.922232 | 0.964797
sntz bu | 0.981914 | 0.984635 | 0.978871 | 0.968004 | 0.947865 | 0.930293 | 0.976208 | 0.970723 | 0.972069 | 0.966998 | 0.956421 | 0.938029 | 0.922214 | 0.964800
sntz tdp | 0.982056 | 0.984944 | 0.979322 | 0.968553 | 0.948586 | 0.931386 | 0.976551 | 0.970735 | 0.972092 | 0.967031 | 0.956461 | 0.938081 | 0.922292 | 0.964825
sntz tdsp | 0.981960 | 0.984816 | 0.979192 | 0.968394 | 0.948371 | 0.931052 | 0.976419 | 0.970731 | 0.972086 | 0.967025 | 0.956454 | 0.938069 | 0.922274 | 0.964819
sntz tdvw | 0.982074 | 0.984986 | 0.979396 | 0.968688 | 0.948834 | 0.931990 | 0.976634 | 0.970738 | 0.972093 | 0.967030 | 0.956458 | 0.938076 | 0.922280 | 0.964826
ct(bdshr)
free | 0.964932 | 0.964332 | 0.957661 | 0.945062 | 0.924328 | 0.906373 | 0.956526
bpv / osqp / nnic | 0.964320 | 0.963973 | 0.957260 | 0.944642 | 0.923871 | 0.905896 | 0.956035
nfca | 0.964284 | 0.963898 | 0.957185 | 0.944566 | 0.923809 | 0.905858 | 0.955979
sntz bu | 0.964377 | 0.963854 | 0.957090 | 0.944448 | 0.923693 | 0.905891 | 0.955975
sntz tdp | 0.964389 | 0.963877 | 0.957124 | 0.944490 | 0.923749 | 0.905977 | 0.956002
sntz tdsp | 0.964387 | 0.963874 | 0.957123 | 0.944489 | 0.923742 | 0.905964 | 0.955999
sntz tdvw | 0.964392 | 0.963880 | 0.957123 | 0.944486 | 0.923739 | 0.905952 | 0.956002
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content.
