Abstract
This paper explores the hypothesis that the returns of asset classes can be predicted using common, systematic risk factors represented by the level, slope, and curvature of the US interest rate term structure. These are extracted using the Nelson–Siegel model, which effectively captures the three dimensions of the yield curve. To forecast the factors, we applied autoregressive (AR) and vector autoregressive (VAR) models. Using their forecasts, we predict the returns of government and corporate bonds, equities, REITs, and commodity futures. Our predictions were compared against two benchmarks: the historical mean, and an AR(1) model based on past returns. We employed the Diebold–Mariano test and the Model Confidence Set procedure to assess the comparative forecast accuracy. We found that Nelson–Siegel factors had significant predictive power for one-month-ahead returns of bonds, equities, and REITs, but not for commodity futures. However, for 6-month and 12-month-ahead forecasts, neither the AR(1) nor VAR(1) models based on Nelson–Siegel factors outperformed the benchmarks. These results suggest that the Nelson–Siegel factors affect the aggregate stochastic discount factor for pricing all assets traded in the US economy.
Keywords:
Nelson–Siegel model; forecasting; asset return prediction; yield curve dynamics; systematic risk factors; Diebold–Mariano test; model confidence set JEL Classification:
G10; G12; C53; E43; E44
1. Introduction
Modeling and forecasting the term structure of interest rates is crucial for policymakers and market participants, as the yield curve reflects market expectations about future monetary policy, output growth, and inflation across a range of time horizons. Additionally, used as a risk-free benchmark, the AA(A)-rated government bond yield curve often represents the floor for borrowing costs in the economy. This paper aims to extend the literature by examining the informational content of the US yield curve beyond the US government bond market. Specifically, we test the out-of-sample explanatory power of the yield curve’s shape—captured through the classical Nelson and Siegel (1987) level, slope, and curvature factors—in forecasting the returns of a range of financial assets besides bonds that, and riskless and not, encompass the equity, real estate, and commodity domains. We accomplish this task by deploying the insights derived from both simple, pairwise Diebold–Mariano (one-sided) tests of equal predictive accuracy and from more systematic model confidence set tests.
There are two primary approaches to fitting the default risk-free yield curve: nonparametric spline-based methods and parametric models. Spline-based approaches excel at providing an accurate in-sample fit, due to their flexibility; however, parsimonious parametric models typically offer superior forecasting performance, achieving an effective balance between model simplicity and predictive accuracy (Nymand-Andersen, 2018). Given that the primary focus of this paper is to test the out-of-sample predictive power of the yield curve’s information content, rather than merely fitting the curve in-sample, we adopt a parsimonious parametric approach. Specifically, we employ the Nelson–Siegel (NS) model, which offers simplicity and interpretability by requiring the estimation of only three (or four, see below for a discussion) parameters to characterize the entire yield curve. According to the specification in Diebold and Li (2006), these parameters can be interpreted as capturing the level, slope, and curvature of the yield curve.
Using data on the US yield curve from January 1990 to December 2020, we assess the forecasting performance of several models during the pseudo out-of-sample period January 2011–December 2020. Our analysis focuses on AR(1) and VAR(1) models applied to forecast the NS model factors at a certain horizon h. Such predicted factors are then used to predict asset returns at the same horizon h. To forecast asset returns, we implement a two-step approach. The predictive accuracy of these models is evaluated against benchmarks, including a historical average and an AR(1) model applied to asset returns. Additionally, we investigate whether the VAR(1) model applied to the NS outputs, and which accounts for interactions among the NS factors, can provide enhanced forecasting performance.
Contrary to Diebold and Li (2006) and the literature on forecasting yields using NS factors that has followed (see, e.g., Christensen et al. (2011); Favero et al. (2012); Fernandes and Vieira (2019); Guidolin and Pedio (2019); Toczydlowska and Peters (2018); Xiang and Zhu (2013); Yu and Zivot (2011) and the references in Diebold and Rudebusch (2013); Duffee (2013)) on more recent data, we find that NS factors do not always consistently outperform a set of benchmarks in the prediction of interest rates, even though the (dynamic) NS approach remains a competitive one. We then investigate the ability of these factors to explain returns across various asset classes, such as equities, corporate bonds, commodities, and REITs. This exploration is motivated by the potential economic significance of the Nelson–Siegel factors, which reflect bond investors’ expectations regarding the economic outlook and may reflect information about macroeconomic conditions. Given their ability to capture such crucial economic insights, these factors may also serve as systematic drivers of the stochastic discount factor (henceforth, SDF) that prices all financial assets.1 Because of these properties, the NS factors may themselves represent predictors of the SDF, particularly in the context of time-varying risk premia influenced by macroeconomic shifts and investor behavior throughout business cycles (Drobetz et al., 2002). In other words, while a naive null hypothesis would imply that the dynamic factors extracted from the yields (i.e., ex ante returns over a holding period) in one market should not help forecast yields or ex post, realized returns in other asset markets, the integration of NS factors into asset pricing models may help reconcile observed return patterns with macroeconomic fundamentals, refining our understanding of the stochastic discount factor and creating cross-asset common predictability linkages that we explore and document in this paper.
Our key empirical result suggests that the Nelson–Siegel factors exhibit a predictive power extending beyond the government bond market, though their performance varies significantly across forecast horizons and asset classes. Notably, these factors outperformed standard benchmarks in forecasting equity and REIT returns, while their predictive ability for commodity futures was more limited. As expected, direct regression on the Nelson–Siegel factors proved effective for forecasting Treasury securities returns. However, the comparison between VAR(1) and AR(1) models for the Nelson–Siegel factors yielded mixed results, with their relative performance depending on the specific forecast horizon and asset class considered.
Although we are not aware of any similar paper in the literature, in some respects our empirical goal is related to Hillebrand et al. (2018), who interpreted the yield curve’s NS factors as special, economically motivated substitutes of standard principal components that efficiently combine the information contained in the raw data, the yield curve. However, the standard NS factors were not “supervised” for a specific forecast target, in that they were constructed using only the predictors and not using a particular forecast target. Therefore, they proposed computing NS factors not of the original data, but of the many alternative forecasts that a range of models may offer, with each of the many forecasts being computed using one predictor at a time. These combined forecasts are shown to outperform classical benchmarks in the out-of-sample (henceforth, OOS) forecasting of monthly US output growth and inflation, especially at longer forecast horizons.
2. Methodology
This section presents the methodological framework informing the analysis. Section 2.1 introduces the parsimonious parametric model used to fit the US yield curve. Section 2.2 describes the models employed to forecast the yield curve. In Section 2.3, the focus shifts to the models used to predict the financial returns of non-government bond assets. Lastly, Section 2.4 briefly overviews the two statistical tests applied to evaluate the differential forecasting accuracy in this paper.
2.1. Fitting the Nelson–Siegel Model to the US Yield Curve
Our analysis begins by fitting a standard NS model—a parsimonious parametric framework—to the U.S. yield curve. The goal is to extract the level, slope, and curvature factors that capture the key dynamics of the term structure. Using data spanning a January 1990–December 2020 sample, these factors were computed to test whether they contain useful information for forecasting the returns of other assets, hence establishing a potential connection to the SDF(s). The functional form of the NS model for the yield y at time t, with maturity m is
To mitigate multicollinearity and facilitate an intuitive interpretation of the factors, we adopt the following specification (Diebold & Li, 2006):
The two specifications are identical for , when and at a given time t. The three potentially time-varying coefficients , , and determine the behavior of the yield curve over the long, short, and medium terms, respectively. In the presence of time variation, if we interpret , , and as factors, it is easy to see that the loading on is constant and equal to one, not decaying to zero with increasing maturity, which makes the factor governing the curve level in the long term. The loading on equals one at zero maturity and decays exponentially as the maturity increases, making the factor associated with short-term dynamics. In contrast, the loading on is zero at a zero maturity, increases initially, and then decays to zero as the maturity grows, making the factor capturing the medium-term behavior of the yield curve.
Interestingly, the three factors , , and can also be interpreted as linear transformations of the classical, latent level, slope, and curvature features of the riskless bond yield curve. Specifically, if we define the level of the yield curve as , the slope as , and the curvature as , we observe a close correspondence with , , and and , as illustrated in Figure 1, Figure 2 and Figure 3.
Figure 1.
US yield curve level and Nelson–Siegel long-term factor. This figure illustrates the long-term level factor () derived from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
Figure 2.
US yield curve slope and Nelson–Siegel short-term factor. This figure illustrates the short-term slope factor () derived from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
Figure 3.
US yield curve curvature and Nelson–Siegel medium-term factor. This figure illustrates the medium-term curvature factor () from the Nelson–Siegel model. The data span the sample January 1990–December 2020.
The estimated correlations between the level, slope, and curvature extracted from the observed yield curves and the fitted factors are nearly perfect, with values of 0.973, 0.995, and 0.998 for , , and , respectively. The sample time-series cross-factor correlations are estimated as , , and for , , and , respectively.
The coefficient governs the speed of the exponential decay of the loadings of the factors. While this may be estimated along with the factors , , and at any time, we fix it at , as suggested in Diebold and Li (2006). This value maximizes the loading on the medium-term, or curvature, factor at the 30-month maturity.2
At each point in time t, using monthly data, we recursively estimate the NS model in (2) and extract the factors best fitting the cross-section of U.S. Treasury constant maturity rates. These rates, which are available for maturities of 3 months; 6 months; and 1-, 2-, 3-, 5-, 7-, and 10 years, provide a reliable representation of the overall yield curve. By employing a recursive estimation procedure, we thus obtain a estimated matrix of factors.
2.2. Yield Curve Forecasting
The out-of-sample forecast period used in this paper spans January 2011–December 2020. An expanding window approach is employed, whereby the model estimation at each time t benefits from an additional data point relative to the previous estimation. The forecasting performance of the models is assessed at horizons of 1-, 6-, and 12-month steps. This is done with reference to maturities of 3 months, 1 year, 5 years, and 10 years. Forecasts for the 6 and 12 steps ahead are computed by iterating on the dynamic conditional mean models for the NS factors specified below. Specifically, for each period from t to , the model uses the predicted values as inputs for computing the time t + h forecast, rather than the actual observed values.
The factor prediction models are listed below.
2.2.1. AR(1) Model
We start with an AR(1) process applied to each of the time series of the three NS factors , , and recursively estimated previously for a given maturity and horizon:
where the model parameters are predicted from
and , are estimated by OLS using data up to time t on the factors obtained in the first step.3
Equation (3) is derived directly from Equation (2), maintaining the refinement proposed by Diebold and Li (2006) to improve estimation stability and interpretability. The key distinction lies in the factor representation, whereby the medium-term component is defined with an alternative decay structure that ensures a better empirical fit without affecting the fundamental properties of the model. The factor , often referred to as the “curvature factor”, exhibits an initial increase before decaying to zero as the maturity grows. While its decay property differs from a strict second-derivative curvature measure, it retains its role in capturing medium-term fluctuations in the yield curve.
2.2.2. Joint VAR(1) Model
We consider a VAR(1) process applied to each of the three NS model factors , , and as estimated previously:
where and are estimated by OLS using data up to time t on the vector of factors estimated in the earlier stage.
2.2.3. Random Walk Model for Yields
We also use as a benchmark a simple random walk model for yields:
2.2.4. AR(1) Model for Yields
As a parametric generalization of (6), we study an AR(1) process for yields:
where and are estimated by OLS using data up to time t.
2.2.5. Slope Regression Model
Following, for instance, Steeley (2014) and Chen and Niu (2014), we estimate a slope regression model specified as follows:
where the forecast is generated using a regression of yield changes over the prediction horizon h on a constant and the difference between the yield at maturity m and the 3-month yield. This approach captures the slope of the yield curve across different maturities. However, it is important to note that this model does not allow the prediction of the 3-month yield.
2.2.6. VAR(1) Model for Yields
We extend the AR(1) model by incorporating a VAR(1) process applied to all available yield maturities:
where the estimation of the matrices of coefficients is straightforward but horizon h-specific.4
2.2.7. VAR(1) Model for Yield Changes
Because the unit root nature of US riskless yields has been debated in the traditional finance literature (see, e.g., Pesando (1979)), we also analyze a VAR(1) model for yield changes:
which removes by differentiation any stochastic trends but does not test or exploit the existence of cointegration among interest rates.5
2.2.8. AR(1) Model for Principal Components
In addition, following Diebold and Li (2006), we decompose the time t sample covariance matrix C of the constant maturity rates into using principal component analysis. As is well known, the eigenvalues and eigenvectors of C are represented by the diagonal matrix W and columns of Q, respectively. We consider the three largest eigenvalues and their corresponding eigenvectors . Each of the eigenvectors contains a number of elements equal to the number of the available maturities, indexed by m. We then employ univariate AR(1) models to separately forecast each of the principal components h steps ahead:
where the principal components are defined as
and and are recursively estimated by OLS. The predicted yields h-steps ahead are then obtained from
where is the element of the eigenvector corresponding to maturity m.
2.3. Returns Forecasting
In this section, we introduce simple methods to evaluate the predictive power of the NS factors , , and for returns across various asset classes. In contrast to the riskless bond yield forecasting framework outlined in Section 2.2, this analysis requires us to use a distinct set of models. For corporate and Treasury bonds, equity, REIT, and commodity returns, we evaluate the forecasting performance of the three predictive models. First, we forecast returns using direct, h-step-ahead regression on the NS model factors estimated earlier:6
In this context, the coefficients , , , and are estimated through recursive OLS. The forecasts for the factors , , and , which are used as inputs for return predictions, are generated using either an AR(1) or VAR(1) process, as outlined in Section 2.2. The regression model in Equation (14) represents our workhorse model and provides direct information on whether, how, and for which assets/portfolios the NS factors may be structural features of the SDF(s), to the point of allowing us to obtain accurate predictions of the returns of other assets.
As additional benchmarks that fail to depend on the NS fixed income factors, we use a simple AR(1) model applied directly to returns:
where the coefficients are estimated according to the usual logic. A third and final benchmark is obtained by imposing the restriction for all portfolios and times, which features a classical simple historical average (HA) model, as featured in recent work on the predictability of asset returns (see, e.g., Rapach et al. (2010), Guidolin et al. (2013)), and the review in Rapach and Zhou (2013)).7
2.4. Measuring and Testing (Equal) Predictive Accuracy
To evaluate the predictive accuracy of alternative models, we apply two statistical tests: Diebold–Mariano (henceforth, DM) tests of pairwise equal predictive accuracy (Diebold & Mariano, 2002) and the Model Confidence Set (henceforth, MCS) procedure to perform tests and a multiple-model assessment (Hansen et al., 2011). Detailed formulas of these tests/algorithms are reported in Appendix A for completeness.
The DM test requires the computation of forecast errors for each model and an assumption on the functional form of the loss measuring the loss of utility to investors derived from the forecast errors. For yields with maturity m or asset returns indexed by i, the forecast error at time t and horizon h is defined as the difference between the forecast and the observed value. The loss functions are then assumed to be of the squared type. Although rather different loss functions might be assumed, the classical squared loss makes sense within the empirical finance literature because of the well-known connections between squared loss and measures of risk-adjusted performance, such as the Sharpe ratio (see Campbell and Thompson (2008)). At this point, the loss differentials between competing models considered in pairs are tested for statistical significance, see Appendix A for details.
The MCS procedure enables the comparison of all competing models within a multivariate framework. Starting with an initial set of models M, the procedure identifies a subset that, at a predetermined confidence level, contains the best forecasting models, in the sense that these are not dominated by any other model in the set outside . Dominance is conceptually identified with rejection of the null of equal predictive accuracy (EPA) in favor of a model, in a pairwise comparison. Depending on the relative performance of the models, may consist of a single model outperforming all others or include a rich set of models, in principle even all of M, when no dominance relationships can be established. Depending on the relative performance of the different models, the procedure sequentially eliminates models from the superior set until the EPA hypothesis is no longer rejected. Appendix A provides details.
2.5. Combination Forecasts With and Without NS Factors
To further evaluate the predictive power of the NS factors, we conduct a robustness analysis by implementing combination forecasts based on individual predictive regression models. Following Rapach et al. (2010), we construct forecasts using a broad set of macro-financial predictors originally introduced by Welch and Goyal (2008). The predictive regression model follows a standard formulation. Our forecasts are generated using an expanding window estimation approach with a six-month horizon forecast.
The predictive variables included in our analysis consist of valuation ratios, market risk indicators, interest rate measures, and credit risk variables. Specifically, we use the following GW predictors:8
- Dividend–Price Ratio (log), D/P: The logarithmic difference between total dividends paid on the S&P 500 index and the index’s price level, where dividends are aggregated over a rolling one-year period.
- Dividend Yield (log), D/Y: The log of dividends minus the log of lagged S&P 500 prices.
- Earnings–Price Ratio (log), E/P: The difference between the logarithm of S&P 500 earnings (measured over a trailing 12-month period) and the log of stock prices.
- Dividend–Earnings Ratio (log), D/E: Logarithmic gap between dividend payments and earnings on the S&P 500 index.
- Stock Variance, SVAR: The cumulative sum of squared daily returns on the S&P 500 index over a given month.
- Book-to-Market Ratio, B/M: The ratio between book value and market capitalization of the Dow Jones Industrial Average firms.
- Net Equity Issuance, NTIS: The ratio of 12-month cumulative net stock issuance by NYSE-listed firms to their total year-end market value.
- Treasury Bill Rate, TBL: Yield on three-month U.S. Treasury bills traded in the secondary market.
- Long-Term Yield, LTY: Yield on long-term government bonds.
- Long-Term Return, LTR: Realized return from investing in long-term government bonds.
- Term Spread, TMS: The difference between long-term government bond yields and short-term Treasury bill rates.
- Default Yield Spread, DFY: The yield spread between BAA-rated and AAA-rated corporate bonds.
- Default Return Spread, DFR: The difference in returns between long-term corporate and long-term government bonds.
- Inflation, INFL: Computed using the Consumer Price Index (CPI) for urban consumers. Following Welch and Goyal (2008), inflation is lagged by one month to account for release timing and used as in the regressions.
To assess the impact of the NS factors, we compare two models: one that includes the NS factors in the combination forecasts and one that excludes them. This approach allows us to isolate the contribution of the NS factors to the predictive accuracy and determine whether they improve the forecasting performance relative to a benchmark set of macro-financial predictors. The out-of-sample period for the analysis spans from January 2011 to December 2020, ensuring consistency with the evaluation of Diebold–Mariano test statistics and Model Confidence Set results.
3. Data Description
For all the assets and portfolios listed below, our sample spans the period January 1990–December 2020, comprising daily US yield curve rates sourced from the US Treasury Department; monthly returns on Treasury bills, notes, and bonds across various maturities; US equity returns spanning 113 distinct portfolios; sector-specific returns for REITs; corporate bond returns from investment-grade issuers in the US; and realized, chained return series for a diverse range of commodity futures.
3.1. US Treasury Yields
We used end-of-month data on the yield curve published by the US Treasury, commonly referred to as Constant Maturity Treasury (CMT) rates. These rates were derived through the interpolation of closing market bids from actively traded on-the-run US Treasury securities in the over-the-counter market. The interpolation employs a cubic spline model with fixed maturities at 1, 2, 3, and 6 months, as well as 1, 2, 3, 5, 7, 10, 20, and 30 years.9 The sample period January 1990–December 2020 was used to construct the monthly yield curve. The analysis was limited to maturities for which the full sample was available, specifically the 3-month, 6-month, and 1-, 2-, 3-, 5-, 7-, and 10-year maturities.10 The maturities used to estimate the NS factors covered a broader range than those utilized for yield curve forecasting. Forecasts were generated for the 3-month, 1-year, 5-year, and 10-year maturities, while the NS factors were estimated using yields from eight distinct maturities.
3.2. US Treasury Returns
We computed monthly returns based on the constant maturity rates published by the US Treasury discussed in the previous section. These returns were derived from holding 3-month, 6-month, and 1-, 2-, 3-, 5-, 7-, and 10-year T-bills and T-notes. The calculations were performed under two different sets of assumptions, depending on the maturity of the securities analyzed.
For coupon Treasury notes with maturities of one year or longer, we assumed the coupon bond was purchased at par at the end of the previous month, with a flat yield curve corresponding to the specified maturity. Monthly returns were then calculated based on changes in the bond’s valuation. Coupons are paid semiannually, on the last day of the month, beginning six months after the purchase date. The total return from holding the security for one month, comprising both the coupon income return and capital appreciation, was calculated as
where is the price of the bond with maturity m (expressed in years of fractions of years) and yield at time t, computed as
where indicates the fraction of time remaining until the next coupon payment in terms of coupon frequency.
For maturities shorter than a year, the approximate total return consists only of capital appreciation. In this case, the T-bill does not pay coupons, and at the beginning of each month, a T-bill is purchased at the prior month’s end price. Monthly, realized returns reflect the change in pricing of the bill under the assumption that the yield curve is flat for the maturity considered, which leads to a simple discounting formula for the residual life of the bill.
3.3. US Equity Returns
We studied the predictability of returns on 113 different US-based equity portfolios. These portfolios, made available through Kenneth French’s website,11 included the market portfolio, 17 industry portfolios, and various other portfolios obtained by sorting the CRSP stock universe (which currently includes all NYSE, NASDAQ, and AMEX-listed shares of stock) by a range of measurable stock characteristics: market capitalization (size) and book-to-market (value); bivariate combinations of size with other characteristics, including investments, operating profitability, cashflows, the dividend yield, the earnings-price ratio, and price momentum.
3.4. US REIT Returns
We also analyzed indices for closed-end fund-style returns derived from investments in real estate equity and debt instruments. REITs are companies that own, operate, or finance income-generating real estate across various sectors and regions, allowing investors to gain indirect exposure to the real estate market. These data were sourced from NAREIT (National Association of Real Estate Investment Trusts) and consist of the FTSE NAREIT U.S. Real Estate Index Series. Our analysis focused on monthly total returns, including those for all REITs combined (equity and mortgage), all equity REITs, all mortgage REITs, and six sector-specific REIT indices (office, industrial, retail, lodging and resorts, and self-storage). The total return series represent the sum of income and capital appreciation returns, calculated before taxes and commissions.
3.5. US Corporate Bond Returns
We assessed the predictability of four ICE Bank of America (BofA) US corporate bond series. The series correspond to portfolios of securities carrying investment-grade ratings of AAA, AA, A, and BBB as of the end of each month in the sample.12 The data, sourced from the St. Louis FED, track total returns for US dollar-denominated investment-grade corporate bonds publicly issued in the US domestic market.
3.6. Commodity Returns
Lastly, we examined the predictability of commodity returns across 17 series, encompassing precious metals, industrial metals, agricultural products, and livestock. The commodities were gold, silver, platinum, copper, Brent crude, gasoil, light crude oil, natural gas, cotton, cocoa, coffee, corn, lumber, soybean oil, soybeans, wheat, and live cattle. The monthly data, all sourced from Refinitiv, represent continuous futures contracts, with monthly returns calculated based on changes in the series levels.
3.7. Summary Statistics
The six panels of Table A1 (collected in Appendix B, to save space, labeled A through F) provide information on the US yield curve rates and the five sets of return series outlined in Section 3.1, Section 3.2, Section 3.3, Section 3.4, Section 3.5 and Section 3.6.
Most return series exhibited left skewness, with the exception of returns on commodity futures and on the US Treasury securities. This negative skewness can be attributed to the various market crashes that occurred in our sample. These sample skewness values emphasized the occurrence of extreme negative price events in our sample and need to be accounted for in what follows.
Table 1 presents summary statistics for the first-stage, estimated NS factors. The 1-, 12-, and 30-month sample autocorrelations ( for k = 1, 12, 30) reported in the table indicate that the level factor, , is the most persistent, as it seems to befit the level of a stationary time series. Its sample mean indeed agrees with a positive annualized risk-free rate of approximately 4.9 percent. In addition, unsurprisingly, the slope factor was negative on average (−2.21 percent), which captured a term structure that tends to be upward sloping in our sample. Finally, the curvature factor tended to be negative on average, which points to a concave yield curve shape. According to the augmented Dickey–Fuller test, neither the slope nor the level factors exhibited unit roots at any standard confidence level on average. However, the curvature factor, , was found to possibly display a unit root at both the 5% and 10% confidence levels.13
Table 1.
Summary statistics for the fitted Nelson–Siegel factors.
All three factors only showed significant partial autocorrelations at the first lag, justifying their modeling with an AR(1) and VAR(1) processes as described earlier, in Section 2.2.
4. Empirical Results
4.1. Yield Curve Forecasting
The NS factors were forecast using either the AR(1) or the VAR(1) model, with the latter model able to account for cross-factor serial correlations. These factor forecasts were then applied to compute yield forecasts for the 3-month, 1-year, 5-year, and 10-year maturities, as implied by Equation (2). While eight maturities were employed to estimate the NS factors, only the four listed maturities were predicted. Table 2 presents the results of the Diebold–Mariano test applied to forecast the bond yields.
Table 2.
Diebold–Mariano tests concerning alternative forecasts of US treasury rates.
We compared the predictive accuracy of the AR(1) for the NS factors against seven benchmark models, including the VAR(1) for NS factors; the random walk for the NS factors; predictive slope regressions, AR(1), and VAR(1) for yields; AR(1) for yield changes; and a direct regression on the first three AR(1) principal components. The comparison spanned three forecast horizons (one-, six-, and twelve-step-ahead) and four yield maturities. The results from the DM one-sided test, implemented at a 5% size, show that the null hypothesis of equal predictive accuracy was only rejected for a limited set of combinations, even though the negative statistics always indicated that using AR(1) for the factors predicted more accurately.
In particular, we found that using the AR(1) model for the NS factors outperformed the direct regression on principal components across most maturities and horizons, except for 3-month yields at the six- and twelve-step horizons. It also performed better than the VAR(1) for NS factors for the one-step-ahead forecasts of 3-month yields. In the case of the VAR(1) for yields, the AR(1) for NS factors performed better at a six-month horizon in the case of the 3-month, 5-year, and 10-year yields, and at a twelve-step horizon for 3-month and 10-year yields. The DM test statistics were positive and large when comparing the AR(1) for NS factors to benchmarks like the AR(1) and VAR(1) for yields, slope regression, and random walk, for one- and six-month horizons. However, despite the poor overall performance of the AR(1) model compared to the benchmarks, the AR(1) turned out not to imply a significant difference in predictive power.14
While the DM tests failed to establish the clear outperformance of the AR(1) for NS factors relative to the benchmarks, an interesting pattern emerges in Table 2: the relative forecasting performance improved as the horizon lengthens. This improvement was more evident for shorter maturities (3-month and 1-year yields) than for longer ones. Overall, the AR(1) for NS factors performed relatively well for long horizons and short maturities, as previously noted by Diebold and Li (2006). However, in contrast to their findings, our results do not indicate significant outperformance against the random walk model at conventional confidence levels. This aligns with Duffee (2013), who also found that the AR(1) process for NS factors failed to outperform the random walk in terms of RMSE, using a more recent sample, as we did in our work.
The MCS procedure yielded results similar to those of the Diebold–Mariano test. Table 3 shows that the outcomes were consistent across both the TR and TMax test statistics (see Appendix A for the meaning of these statistics). The finding that the AR(1) applied to the NS factors performed better for short-term maturities and longer forecasting horizons is confirmed by these tests. The AR and VAR models were instead excluded from the superior set (M*) when forecasting one-step-ahead yields, except for the 10-year yield. However, the models for NS factors were included in the superior set for the 6- and 12-step-ahead forecast horizons, ranking among the top three models for various maturities.
Table 3.
Model confidence set result for US Treasury rates.
The superior set increased in size as the forecast horizon lengthened, indicating a convergence in forecasting accuracy among the models for longer-term forecasts. In particular, the AR(1) model for yields appeared to be one of the best models across all maturities and forecasting horizons. Despite being the most accurate model for one-step-ahead forecasts, the VAR(1) for yields was excluded from the superior set for six-step and 12-step-ahead forecasts.
Additionally, although the TMax and TR test statistics yielded similar results, they differed in the number of models excluded from the superior set. For the six-month-ahead forecasts, the TMax statistic identified a narrower superior set, excluding more models than the TR statistic. This difference arose from the distinct comparison methods employed by the two test statistics: TMax evaluated each model’s predictive accuracy against the average of all other models, while TR compared models pairwise. Consequently, we expected TMax to be more selective in excluding models.
All in all, the evidence presented in Table 2 and Table 3 is consistent with the idea that autoregressive modeling of the NS factors is as accurate as a number of benchmarks. However, at least for horizons of 6 and 12 months, the equal predictive accuracy tests indicate that such models failed to be uniformly dominated by any of the benchmarks, at least with reference to the prediction of the very data from which the NS factors were extracted. This justifies our continued interest in their forecasting performance for asset returns in the reminder of this paper.
Finally, as a robustness check, we compared the combination forecasts following Rapach et al. (2010) for the factors of Welch and Goyal (2008) both with and without the NS factor. Table A4 in Appendix D presents this comparison. This additional analysis allowed us to evaluate the general usefulness of NS predictors in excess asset return forecasting, assessing whether their inclusion enhanced the predictive power of combination models beyond traditional macro-financial factors. The gains were particularly noticeable for short-term maturities, with an increase from 3.24% to 4.77% for 3-month yields and from 3.32% to 4.88% for 1-year yields. This suggests that the NS factor captured valuable information for forecasting short-term interest rates. While the improvement was smaller for longer maturities, such as 5-year and 10-year yields, the inclusion of the NS factor still enhanced the predictive accuracy, albeit to a lesser extent. This pattern aligns with the idea that the NS factor is particularly effective in explaining short-term yield variations, while providing moderate benefits for longer-term forecasts.
4.2. Forecasting Treasury Returns
Table A2 (in Appendix C) shows the results of the Diebold–Mariano test for return series; specifically, Panel A presents the results of the pairwise DM test for the government bond return series.15 The comparison of the predictive performance of the NS factors—themselves predicted from either an AR(1) or a VAR(1) model—with other specifications was performed across three forecasting horizons (one-, six-, and 12-step-ahead) and eight return series from monthly holdings of 3-month, 6-month, and 1-, 2-, 3-, 5-, 7-, and 10-year US Treasuries. The results indicate that we can reject the null hypothesis of equal predictive accuracy for only a limited set of combinations. Specifically, the AR(1) model for NS factors outperformed the AR(1) model for past returns for two maturities (3 and 6 months) and two forecasting horizons (six- and twelve-step-ahead). Additionally, it outperformed the HA model for these two maturities across all three forecasting horizons. However, the AR(1) for NS factors never outperformed the VAR(1) applied to predict NS factors, which turned out to be better suited for forecasting these returns. The VAR(1) performed better for shorter maturities and forecasting horizons, but its performance decreased for longer maturities and forecast horizons. Similarly to bond yields, the forecasting accuracy of the AR(1) NS improved as the forecasting horizon lengthened, particularly when compared to the VAR(1) NS, when their ranking flipped around and once more favored the simpler AR(1) model. However, Table A2 reports a large majority of negative DM test statistics, illustrating that the (predictions of) the NS factors provided serious competition to the AR models and the very random walk for prices, when it comes to forecasting bond returns.
In Table 4, the MCS results, obtained applying the 90% confidence testing level, further confirm the superior performance of the VAR(1) model applied to predict the NS factors compared to the other models. The VAR(1) NS model was always included in the superior set M* across all maturities and forecast horizons, consistently ranking the highest among the four models. The AR(1) NS also performed well but was often excluded from M* due to the VAR(1) model’s superior performance. Nonetheless, the results varied depending on the specific MCS test statistic used, whether TMax or TR. Under TMax, the AR(1) for the NS factors was excluded from the superior set only five times out of 24 tests (across three forecast horizons and eight return series). When we used the TR statistic, the AR(1) was excluded more frequently, six times for one-step-ahead predictions, five times for six-step-ahead, and four times for the 12-month-ahead forecasts. Finally, regarding the comparison of forecast combining models applied to Treasury returns, we observed that the inclusion of the NS factor significantly enhanced the predictive accuracy compared to using only the GW predictors. Specifically, the out-of-sample increased substantially for shorter maturities, such as the 3 M and 6 M Treasury securities, showing a marked improvement that suggests the additional informational value of the NS factor in short-term bond return forecasting. For longer maturities, the impact of the NS factor was more moderate but remained positive for several horizons, indicating its potential to enhance the performance of combined forecasting models.
Table 4.
Model confidence set results for return series.
More importantly for our purposes, both the AR(1) for past returns and the HA benchmarks consistently underperformed compared to the models using NS factor forecasts. These benchmarks were often excluded from the superior set and ranked low when included, especially in comparison to the VAR(1) for NS factors. Therefore, maybe unsurprisingly, predictions of NS factors extracted from Treasury yield data ranked rather highly within a rather classical set of forecast models for Treasury returns, especially for very-short- and very-long-term maturities and for six- and 12-month horizons.
4.3. Forecasting Equity Portfolio Returns
Following the same design used for Treasury returns, the DM test results for equity returns are shown in Panel B of Table A2, in Appendix C. The comparison was made across the three forecast horizons pursued in this paper and with reference to 113 portfolio return series, grouped into eight sub-categories: Fama and French factors portfolios; industry portfolios; various other portfolios sorted on the basis of stock and firm characteristics, as listed in Table A1, Panel C). In the prediction of stock returns one month ahead, the AR(1) model for the NS factors often outperformed the AR(1) applied to past returns and the HA model. Out of 113 equity return series, we rejected the null hypothesis of equal predictive accuracy between the AR(1) NS and the AR(1) for equity returns 52 times, and 49 times vs. the HA model. Such statistics indicate a superior predictive accuracy that largely exceeded the 10 percent we would expect from pure chance. However, this superior performance failed to emerge in the case of six- and 12-step-ahead forecasts, where the AR(1) NS model often underperformed compared to the benchmarks. The VAR(1) NS generally performed better than the AR(1) NS, forecasting more accurately than the latter more often than not. The difference in predictive accuracy between the two models was consistent across all three forecast horizons.
The results of the MCS approach provide a more comprehensive view of a model’s relative performance. A pattern involving both NS factor-based models emerged, consistent with what was reported with reference to the DM test. According to both the TR and TMax statistics, the AR(1) and VAR(1) NS models were the most accurate for one-step-ahead forecasting, rarely being excluded from the superior set. For the one-step-ahead forecast, the AR(1) was included in the superior set with 92% and 81% probabilities for TMax and TR, respectively. In fact, the VAR(1) NS was included with slightly higher probabilities: 93% and 82% for TMax and TR. As seen for US Treasury returns, the TR statistic was more selective, excluding more models from the superior set, and so these findings were to be expected. Moreover, AR(1) and VAR(1) NS turned out to be the best performing models for 53% and 38% of the target series, respectively, being placed as the best or the second best for 83% and 88% of the target series, which is rather an impressive performance. On the contrary, the AR(1) and HA models for returns performed rather poorly and were excluded from the superior set for more than 50% of the return series. Therefore, the short-term predictive power of NS factors for a large majority of the equity portfolio returns appeared to be remarkable and, possibly surprisingly, turned out to exceed what was found in the case of Treasuries.
Finally, the inclusion of NS factors did not significantly enhance the predictive power for equity returns, as it did for yields and government returns. The only notable exception was for Fabricated Products returns, where the inclusion of NS factors led to a visible improvement in predictive performance.
4.4. Forecasting REIT Returns
In contrast, for six- and 12-month-ahead forecasts, both the AR(1) applied directly to returns and the HA outperformed the AR(1) and VAR(1) models based on NS factor predictions. Specifically, the AR(1) for stock returns was included in the superior set with probabilities of 99% and 98% for six-step-ahead forecasts (using TMax and TR statistics, respectively), and 98% in the case of the 12-step-ahead horizon case. The even simpler HA model was included with slightly lower probabilities than the AR(1) but still tended to outperform the NS-based models. Overall, the HA model demonstrated superior accuracy to the AR(1), while the VAR(1) outperformed the AR(1) NS, regardless of the forecast horizon.
The results of DM tests applied to REIT returns are presented in Panel C of Table A3, as usual, across three forecasting horizons and nine return series. Similarly to the findings for equity returns, the AR(1) based on the NS factors outperformed, at the usual one-sided 5% size, the AR(1) estimated on returns and the historical mean models for one-step-ahead forecasts. The AR(1) NS, however, failed to outperform the VAR(1) NS across all REIT return series and forecasting horizons. In the case of the one-step-ahead forecast, the AR(1) NS outperformed the standard AR(1) six times and the HA model three times, out of nine series analyzed. When we compare the performance of the two NS factor-based models, AR(1) outperformed in the case of the month horizon, while VAR(1) came out on top for the 12-step-ahead horizon; we found mixed results for the six-month horizon. As in the case of equity returns, the predictive accuracy of the AR(1) NS deteriorated relative to the benchmarks as the forecast horizon lengthened. Contrary to the equity returns case, where the performance shifted in favor of the AR(1) model on returns and HA for the 12-step-ahead forecasts, the AR(1) NS consistently outperformed the two benchmarks for most REIT indices across all horizons, including the six-step and 12-step horizons.
The findings from the DM tests were reinforced by the results from the MCS procedure, which are presented in Table 4. The AR(1) and VAR(1) driven by NS predictions were rarely excluded from the superior set across all three horizons, irrespective of the test statistic used. Only for one REIT return series and for the one-step-ahead horizon was VAR(1) NS excluded from the superior set, based on the TMax statistic. The classical AR(1) and the HA models were instead frequently excluded from the superior set for the one-step-ahead horizon. Specifically, they were excluded six and seven times, respectively, out of the nine return series considered, regardless of the MCS test statistic used. However, both models were always included in the superior set for 6- and 12-month predictions. Even when they were included in the superior set, these models consistently ranked below the AR(1) and VAR(1) NS across the majority of the nine return series considered. Specifically, for the 6-step and 12-step-ahead horizons, the standard AR(1) never ranked first. The HA model ranked first three times for the 6-step-ahead horizon and once for the 12-step-ahead horizon, according to both MCS test statistics.
The REIT returns were predictable not only at the one-step-ahead horizon but also at the 6-step and 12-month horizons. Both the DM test and the MCS procedure showed a convergence in forecasting performance between the two NS factor-based models and the two benchmarks for the 6- and 12-step-ahead prediction horizons. The forecasts of the dynamics of the yield curve proved to contain more precise information on future REIT returns than for equity returns, indicating a surprisingly high integration between the REIT and U.S. government bond markets. This aligns with the fact that interest rate changes tend to have longer-lasting effects on real estate investments.
4.5. Forecasting Corporate Bond Returns
We analyzed the predictability of four corporate bond return series corresponding to the highest investment grade ratings. In our sample, based on the DM test statistic implemented at a 5% test size level, we failed to reject the null hypothesis of equal predictive accuracy across the three benchmarks assumed against the AR(1) NS factors model, for all forecast horizons (Panel D of Table A2). As observed for both equity and REIT returns (but not for Treasury returns), the forecasting accuracy of the AR(1) NS model decreased as the horizon lengthened, compared to the HA and the AR(1) model applied to corporate bond returns. In contrast, the AR(1) NS model performed better than the VAR(1) NS when forecasting corporate bond returns, although this turned out not to be the case for equity and REIT returns. Overall, while the AR(1) NS tended to lag behind the VAR(1), it outperformed the other two benchmark models.
The results from the MCS procedure, shown in Table 4, Panels A through F, align with those reported for REITs. In most experiments, both the AR(1) and VAR(1) NS models remained inside the superior set across all horizons, for the specific portfolio return series, and regardless of the test statistic used. The only exception concerned the AA-rated corporate bond series, for which the AR(1) NS model happened to be excluded from the superior set in the case of the one-month-ahead predictions, according to the TR statistic. On the contrary, the classical AR(1) and the HA model were typically excluded from the superior set for the one-step-ahead forecasts, but were included for the six- and twelve-month-ahead forecasts. Both models consistently ranked lower than AR(1) and VAR(1) NS, never ranking first across any of the horizons. For the six-step-ahead forecast, the HA model ranked second once and fourth three times, while the standard AR(1) for returns placed third three times and fourth once. For the 12-step-ahead forecasts, the HA model ranked third consistently, with the AR(1) for returns always ranking fourth.
Both the DM test and the MCS procedure confirmed that NS factors can predict U.S. corporate bond returns, though the predictive strength was weaker when compared to REIT returns and less pronounced than for equity returns at the one-step-ahead forecast horizon. Across the four bond return series, we could not reject the null hypothesis of equal predictive accuracy using the DM test. However, for both the 6- and 12-step-ahead predictions, the AR(1) and VAR(1) NS models outperformed the HA and the classical AR(1) and ranked higher according to the MCS procedure. The MCS results were consistent across the different test statistics, with rankings and superior set compositions remaining unchanged under both TMax and TR tests.
4.6. Forecasting Commodity Returns
With reference to the 17 return series for commodity futures, the test statistics presented in Table A2, Panel E, for various comparisons across commodities and three horizons, show that the NS factors performed poorly in predicting commodity futures returns. Similarly to the findings for corporate bond, the AR(1) NS failed to outperform the benchmarks, i.e., the AR(1) for commodity returns and the HA model. On the contrary, AR(1) NS performed similarly to the VAR(1) NS model. As in the case of other asset classes, the forecasting accuracy of the NS-driven models declined as the forecast horizon lengthened, in comparison to the benchmarks. Additionally, no clear trend was observed in the predictive performance across the four commodity subgroups (metals, energy, agricultural, and livestock).
Although this may appear unsurprising in light of the DM tests, according to the MCS procedure, the results of which are presented in Table 4, Panels A through F, forecasting commodity futures returns using NS factors led to partial success at best. The AR(1) and VAR(1) NS models were frequently excluded from the superior set M*; this occurred for approximately 50% of forecasting experiments, for both one- and six-month-ahead forecasts. However, the performance of the two benchmarks was comparable to the HA model, which was indeed often excluded from the superior set as well. The classical AR(1) for returns had the highest predictive accuracy among all models considered, ranking first 14 times for the one-step-ahead horizon and 11 times for the six-step-ahead horizon, out of the 17 series considered. As expected, in the case of the 12-step-ahead horizon, the MCS test excluded the AR(1) and VAR(1) NS from the superior set more frequently than for the shorter prediction horizons. At 12 months, both models were only included in M* once, according to the TMax statistic, and twice according to the TR statistic. When compared to the benchmarks, AR(1) NS outperformed HA for the one- and six-step-ahead forecasts but underperformed for the 12-step-ahead horizon. Therefore, and contrary expectations based on the influence of interest rates on commodity futures prices (Frankel, 2008), both th DM and MCS tests failed to produce significant results in forecasting commodity returns across all horizons.16
5. Discussion and Conclusions
The main goal of this paper was to assess the hypothesis that the three NS factors (level, slope, and curvature) derived from the US yield curve can be used to forecast returns across a variety of financial assets. To test this over-arching hypothesis, we analyzed the predictability of monthly return series covering a range of asset classes, i.e., equity portfolios, REITs, corporate bonds, commodity futures, and US Treasury notes. Given a baseline null hypothesis that, being extracted from yield curve data, the NS factors should contain no (or not much) information for forecasting returns on asset classes that remain essentially different from default risk-free bonds, our results suggest that the three Nelson–Siegel factors do carry some unsuspected predictive power well beyond the government bond market. Although the performance of AR(1) and VAR(1) models incorporating predictions of the NS factors relative to common benchmarks was not always overwhelming and it varied across forecast horizons and asset classes, the results were generally statistically significant. Specifically, we found that, in the case of one-step-ahead forecasts, the AR(1) and VAR(1) NS models frequently outperformed the benchmarks, which included a classical AR(1) model estimated on past returns and the historical mean model (derived from the random walk hypothesis for log-asset prices). However, the predictive accuracy of the AR(1) and VAR(1) NS models tended to decline as the forecast horizon lengthened, as the predictive performance at the 6- and 12-month-ahead horizons turned out to be inferior vs. the short-term horizons. These results indicate that models based on predictions of NS factors may have been more appropriate for short-term forecasting, while the benchmarks became increasingly competitive when applied within medium- to long-term applications.
As one would expect, in our recursive, pseudo OOS experiments, the strongest predictive accuracy was obtained with reference to the prediction of US Treasury returns, when both the AR(1) and VAR(1) NS models performed well across most forecast horizons. In light of the baseline null hypothesis, the results also revealed notable pockets of non-negligible predictive power in the case of corporate bonds, REITs (which are generally linked to the dynamics of interest rates through the mortgage market), and even equities. However, the informational content of the NS factors was weaker in the case of commodity futures returns, where they failed to outperform even simple benchmarks.17
In conclusion, our results underscore the importance of considering the historical dynamics of the yield curve’s level, slope, and curvature when forecasting financial returns, and opens up the possibility that the NS factors expressed by US interest rate data may appear in a structural way in the SDF pricing and hence display predictive power for the returns on all assets. Indeed, the NS factors may be conjectured to describe the yield curve’s dynamics and, in turn, capture shifts in macroeconomic conditions that affect investment opportunities and hence, risk premia. These factors, if sufficiently systematic and pervasive, could be drivers of the SDF, which governs asset pricing through the fundamental condition
where is the SDF and is the return on any traded asset. If the NS factors serve as state variables, because they may affect consumption growth, investor sentiment, or financial frictions, they may be embedded into the SDF representation as
Setting in Equation (18) implies that represents the short-term yield, linking the expectation of directly to the short-term rate. Since this rate is largely determined by the sum of the level and slope factors, it follows that the stochastic discount factor is primarily influenced by and , rather than the curvature factor . This observation suggests that the stronger predictive power observed for financial asset returns likely stems from the combined effect of level and slope factors. Empirically, our results support this interpretation, as models incorporating all three NS factors outperformed those based on individual factors. This indicates that return predictability is not solely driven by curvature effects but rather emerges from broader interactions within the term structure.
This formulation implies that any movement in the yield curve factors propagates into expected returns across asset classes. As argued since Diebold and Li (2006), the level factor () captures the general risk-free rate and long-term discount rates, the slope factor () reflects changes in short-term interest rate expectations and monetary policy, and the curvature factor () accounts for medium-term risk adjustments and liquidity considerations. Consequently, fluctuations in these factors alter discount rates and expected excess returns, influencing not only bonds but also equities, real estate, currencies, and, in principle, alternative assets.
Our findings establish a potential link between NS factors and traditional asset pricing models, such as the Fama–French factor model and the consumption-based asset pricing model (C-CAPM). While the Fama–French framework attributes equity return variations to systematic, aggregate market risk exposure, and the C-CAPM links asset prices to consumption risk, our results suggest that NS factors may similarly encapsulate macroeconomic conditions shaping expected returns across different asset classes. This implies that the NS factors could either complement existing pricing factors and serve as alternative proxies for systematic risk or simply provide instruments that can predict business cycles and hence aggregate asset market valuations. For example, the role of the level factor () in capturing variations in long-term discount rates aligns with the role played by the market risk premium in the Fama–French model. The slope factor () is a well-known predictor of business cycle fluctuations, and therefore may help explain the cross-sectional variation in stock returns, particularly in relation to small-cap (SMB) and value (HML) stocks. Meanwhile, the curvature factor () may be linked to liquidity conditions and short-term risk dynamics, concepts widely explored in liquidity-based asset pricing models (see, e.g., Holmström and Tirole (2001)). From a C-CAPM perspective, the NS factors could act as proxies for time-varying discount rates embedded in the SDF. If yield curve capture conveys shifts in consumption growth expectations and intertemporal risk aversion, then variations in (), (), and () should help predict excess returns across asset classes, which is consistent with the bulk of our empirical findings.
This predictive ability of NS factors aligns with theoretical constraints on return forecastability. For instance, Potì (2018) established a tight bound on the predictability of portfolio returns using features of the SDF, and showed that out-of-sample predictability is constrained by the correlation between the SDF and the returns of any portfolio. In this framework, the upper bound on return predictability is given by
where denotes the correlation between the SDF and the returns on some portfolio , while represents the unconditional variance in the SDF. If the NS factors significantly contribute to the predicted variation in , then their ability to forecast asset returns is dictated by their empirical correlation with the pricing kernel.
Our empirical evidence shows that NS factors often exhibit high predictive power for short-term returns, particularly in fixed-income and equity markets, indicating that they effectively capture fluctuations in risk premia and discount rates. The observed return predictability, if within the derived theoretical bounds, would support the idea that NS factors serve as valid proxies for economic conditions embedded in the SDF. By influencing discount rates and expected excess returns across multiple markets, the NS factors may serve as state variables that should be explicitly incorporated into asset pricing models.
Author Contributions
M.G.: conceptualization, formal analysis, writing—review and editing, supervision, funding acquisition, investigation; S.I.: software, writing—original draft preparation, formal analysis, investigation. All authors have read and agreed to the published version of the manuscript.
Funding
This study was funded by the European Union—NextGenerationEU, in the framework of the GRINS—Growing Resilient, Inclusive and Sustainable project (GRINS PE000018—CUP B43C22000760006). The views and opinions expressed are solely those of the authors and do not necessarily reflect those of the European Union, nor can the European Union be held responsible for them.
Data Availability Statement
The raw data supporting the conclusions of this article will be made available by the authors on request.
Conflicts of Interest
The authors declare no conflicts of interest.
Appendix A. Diebold–Mariano and Model Confidence Set Tests
For yields with maturity m, the forecast error for model i at time t and forecast horizon h is defined as
For returns of asset j, the forecast error for model i is given by
Using the squared forecast error as the loss function, the loss variables for yields and returns are respectively defined as
The loss differential between competing models i and k is defined as
The null hypothesis of equal predictive accuracy between models i and k is tested as follows
against the alternative:
Under the assumption that the loss differential series is covariance stationary and short in memory, the DM test statistic follows a standard normal distribution:
where is the average loss differential. The spectral density is estimated as follows:
For ,
For ,
For ,
where is the k-th autocorrelation of the loss differential series . The null hypothesis is tested for the models defined in Section 2.2 at a 5% significance level using a one-tailed test. A DM test statistic below rejects the null hypothesis.
As regards the MSC procedure, starting from an initial set of models M, the procedure identifies a subset that, with a predetermined confidence level, includes the best forecasting model. Let represent the loss differential between two models i and j, as defined above. Additionally, define the average loss differential between model i and all other models as
The null hypothesis of equal predictive accuracy (EPA) can be tested in two ways. First, we can test that all pairwise loss differentials are zero across all models:
against the alternative hypothesis that there is at least one non-zero pairwise loss differential:
Alternatively, we can test that the average loss differentials are zero for all models:
against the alternative hypothesis that at least one average loss differential is non-zero:
The following statistics are used to test these hypotheses:
where and , while and are bootstrapped estimates of the variances and , respectively.
The two test statistics for these hypotheses are
These test statistics are non-normally distributed, and their asymptotic distributions are estimated using the bootstrap. The procedure was carried out sequentially, eliminating models one at a time from the superior set until the EPA hypothesis was not rejected.
Models were eliminated according to the following rules:
The number of bootstrapped samples was set to 1000, the quadratic loss function was used, and the confidence level was . The MCS procedure was applied for models forecasting yields (models I to VIII in Section 2.2), for each forecasting horizon (1-step-ahead, 6-step-ahead, and 12-step-ahead), and for each yield analyzed (3 months, 1 year, 5 years, and 10 years). The procedure was also applied for models forecasting returns (models I to IV in Section 2.3), for each of the three forecasting horizons (1-step-ahead, 6-step-ahead, and 12-step-ahead), and for each portfolio analyzed across various asset classes (equity, US Treasury securities, corporate bonds, REITs, and commodity futures contracts).
Appendix B. Summary Statistics
Table A1.
Summary statistics for target return series.
Table A1.
Summary statistics for target return series.
| Panel A: Treasury Rates | Observations | Mean | Median | Minimum | Maximum | St. Dev | Skewness |
| 3 Months | 372 | 2.692 | 2.225 | 0.000 | 8.070 | 2.286 | 0.341 |
| 6 Months | 372 | 2.818 | 2.360 | 0.030 | 8.440 | 2.320 | 0.324 |
| 1 Year | 372 | 2.943 | 2.475 | 0.090 | 8.580 | 2.326 | 0.303 |
| 2 Years | 372 | 3.230 | 2.865 | 0.110 | 8.960 | 2.354 | 0.289 |
| 3 Years | 372 | 3.441 | 3.130 | 0.110 | 9.050 | 2.312 | 0.276 |
| 5 Years | 372 | 3.831 | 3.710 | 0.210 | 9.040 | 2.203 | 0.248 |
| 7 Years | 372 | 4.135 | 4.015 | 0.390 | 9.060 | 2.115 | 0.243 |
| 10 Years | 372 | 4.370 | 4.335 | 0.550 | 9.040 | 2.012 | 0.238 |
| Panel B: Treasury Returns | Observations | Mean | Median | Minimum | Maximum | St. Dev | Skewness |
| 3 Months | 371 | 0.226 | 0.193 | −0.017 | 0.698 | 0.195 | 0.377 |
| 6 Months | 371 | 0.242 | 0.197 | −0.060 | 0.876 | 0.217 | 0.556 |
| 1 Year | 371 | 0.445 | 0.205 | −5.024 | 12.214 | 1.985 | 1.326 |
| 2 Years | 371 | 0.483 | 0.336 | −6.957 | 9.110 | 2.327 | 0.355 |
| 3 Years | 371 | 0.505 | 0.398 | −8.362 | 8.550 | 2.480 | 0.065 |
| 5 Years | 371 | 0.534 | 0.485 | −8.637 | 9.317 | 2.542 | −0.025 |
| 7 Years | 371 | 0.551 | 0.480 | −8.611 | 9.892 | 2.481 | −0.064 |
| 10 Years | 371 | 0.562 | 0.413 | −8.346 | 11.054 | 2.380 | 0.085 |
| Panel C | Observations | Mean | Median | Minimum | Maximum | St. Dev | Skewness |
| Fama and French Factor Portfolios | |||||||
| SMB | 370 | 0.092 | 0.115 | −16.720 | 21.130 | 3.143 | 0.633 |
| HML | 370 | 0.191 | −0.050 | −9.050 | 14.110 | 3.074 | 0.761 |
| Mkt | 370 | 0.891 | 1.385 | −17.150 | 13.650 | 4.352 | −0.641 |
| Industry Portfolios | |||||||
| Food | 371 | 0.913 | 1.020 | −14.560 | 15.140 | 3.962 | −0.286 |
| Mines | 371 | 0.846 | 0.740 | −32.740 | 22.450 | 7.919 | −0.284 |
| Oil | 371 | 0.701 | 0.670 | −34.810 | 32.820 | 6.280 | 0.003 |
| Textiles | 371 | 1.040 | 1.280 | −22.140 | 23.270 | 6.150 | −0.216 |
| Consumer Durables | 371 | 0.808 | 1.070 | −25.770 | 29.240 | 5.777 | −0.217 |
| Chemicals | 371 | 0.897 | 1.240 | −21.930 | 22.300 | 5.886 | −0.193 |
| Consumer | 371 | 1.001 | 1.380 | −10.130 | 15.630 | 4.156 | −0.179 |
| Construction | 371 | 1.137 | 1.510 | −20.200 | 17.920 | 5.871 | −0.248 |
| Steel | 371 | 0.669 | 0.710 | −32.910 | 30.670 | 8.333 | −0.230 |
| Fabricated Products | 371 | 1.033 | 1.360 | −23.030 | 18.820 | 5.473 | −0.496 |
| Machinery | 371 | 1.178 | 1.670 | −28.320 | 19.320 | 6.938 | −0.514 |
| Cars | 371 | 1.097 | 0.980 | −28.430 | 38.710 | 7.194 | 0.536 |
| Transportation | 371 | 1.023 | 1.510 | −22.680 | 19.700 | 5.196 | −0.621 |
| Utilities | 371 | 0.799 | 1.230 | −13.020 | 11.720 | 3.965 | −0.645 |
| Retail Stores | 371 | 1.080 | 1.010 | −14.590 | 18.240 | 4.776 | −0.070 |
| Financials | 371 | 0.973 | 1.590 | −22.100 | 17.100 | 5.647 | −0.623 |
| Other | 371 | 0.918 | 1.600 | −17.790 | 15.310 | 4.941 | −0.489 |
| Size and Value | |||||||
| Ptf 1 | 370 | 0.500 | 0.937 | −27.845 | 38.321 | 7.982 | 0.209 |
| Ptf 2 | 370 | 1.080 | 1.454 | −21.521 | 42.421 | 7.013 | 0.442 |
| Ptf 3 | 370 | 0.982 | 1.370 | −21.929 | 21.057 | 5.740 | −0.344 |
| Ptf 4 | 370 | 1.159 | 1.470 | −26.786 | 26.369 | 5.615 | −0.344 |
| Ptf 5 | 370 | 1.166 | 1.330 | −27.820 | 16.381 | 5.801 | −0.721 |
| Ptf 6 | 370 | 0.875 | 1.527 | −25.775 | 28.115 | 7.024 | −0.308 |
| Ptf 7 | 370 | 1.045 | 1.581 | −23.116 | 18.785 | 5.775 | −0.518 |
| Ptf 8 | 370 | 1.042 | 1.529 | −21.257 | 16.062 | 5.191 | −0.661 |
| Ptf 9 | 370 | 0.974 | 1.788 | −23.772 | 16.193 | 5.295 | −0.822 |
| Ptf 10 | 370 | 1.025 | 1.771 | −32.144 | 26.019 | 6.333 | −0.776 |
| Ptf 11 | 370 | 0.929 | 1.721 | −23.097 | 23.130 | 6.485 | −0.436 |
| Ptf 12 | 370 | 1.091 | 1.454 | −21.509 | 18.969 | 5.335 | −0.418 |
| Ptf 13 | 370 | 0.978 | 1.465 | −17.847 | 16.757 | 4.879 | −0.555 |
| Ptf 14 | 370 | 1.077 | 1.522 | −26.854 | 16.936 | 5.168 | −0.752 |
| Ptf 15 | 370 | 1.055 | 1.382 | −31.175 | 19.058 | 5.885 | −0.815 |
| Ptf 16 | 370 | 1.116 | 1.380 | −20.840 | 26.044 | 5.816 | −0.175 |
| Ptf 17 | 370 | 1.054 | 1.364 | −20.165 | 15.899 | 4.873 | −0.713 |
| Ptf 18 | 370 | 0.931 | 1.462 | −25.327 | 15.515 | 5.010 | −0.867 |
| Ptf 19 | 370 | 1.002 | 1.604 | −31.427 | 16.233 | 5.025 | −1.128 |
| Ptf 20 | 370 | 0.899 | 1.832 | −32.802 | 18.336 | 5.830 | −0.961 |
| Ptf 21 | 370 | 1.013 | 1.078 | −14.776 | 15.356 | 4.437 | −0.298 |
| Ptf 22 | 370 | 0.911 | 1.313 | −17.049 | 13.691 | 4.168 | −0.540 |
| Ptf 23 | 370 | 0.931 | 1.469 | −17.308 | 12.570 | 4.246 | −0.623 |
| Ptf 24 | 370 | 0.605 | 1.151 | −27.164 | 15.878 | 5.025 | −1.209 |
| Ptf 25 | 370 | 0.846 | 1.330 | −28.401 | 21.494 | 6.218 | −0.572 |
| Operating Profits and Investments | |||||||
| Ptf 26 | 371 | 0.929 | 1.305 | −25.258 | 27.048 | 6.967 | −0.085 |
| Ptf 27 | 371 | 0.629 | 1.009 | −25.572 | 15.998 | 5.420 | −0.599 |
| Ptf 28 | 371 | 0.970 | 1.266 | −31.152 | 18.713 | 5.712 | −0.764 |
| Ptf 29 | 371 | 0.924 | 1.884 | −26.366 | 18.620 | 6.106 | −0.711 |
| Operating Profits and Investments | |||||||
| Ptf 30 | 371 | 0.485 | 1.201 | −27.414 | 18.763 | 6.832 | −0.598 |
| Ptf 31 | 371 | 0.945 | 1.098 | −19.474 | 21.352 | 5.180 | −0.238 |
| Ptf 32 | 371 | 1.010 | 1.486 | −20.912 | 17.886 | 4.983 | −0.383 |
| Ptf 33 | 371 | 0.824 | 1.244 | −24.368 | 18.783 | 5.243 | −0.920 |
| Ptf 34 | 371 | 1.004 | 1.255 | −20.014 | 23.141 | 5.258 | −0.163 |
| Ptf 35 | 371 | 0.822 | 1.189 | −20.749 | 27.256 | 5.612 | −0.180 |
| Ptf 36 | 371 | 0.932 | 1.393 | −23.239 | 15.767 | 5.105 | −0.572 |
| Ptf 37 | 371 | 1.032 | 1.269 | −22.021 | 16.520 | 4.746 | −0.522 |
| Ptf 38 | 371 | 0.990 | 1.433 | −20.939 | 13.858 | 4.505 | −0.781 |
| Ptf 39 | 371 | 0.975 | 1.296 | −21.704 | 16.741 | 4.745 | −0.643 |
| Ptf 40 | 371 | 0.863 | 1.700 | −24.746 | 16.446 | 5.825 | −0.714 |
| Ptf 41 | 371 | 1.241 | 1.599 | −21.235 | 15.446 | 4.563 | −0.449 |
| Ptf 42 | 371 | 1.034 | 1.227 | −13.247 | 15.350 | 3.826 | −0.171 |
| Ptf 43 | 371 | 0.947 | 1.201 | −14.485 | 12.952 | 4.242 | −0.413 |
| Ptf 44 | 371 | 1.077 | 1.569 | −17.855 | 18.125 | 4.613 | −0.447 |
| Ptf 45 | 371 | 1.103 | 1.342 | −23.784 | 18.896 | 5.565 | −0.412 |
| Ptf 46 | 371 | 1.184 | 1.534 | −17.225 | 15.081 | 4.588 | −0.258 |
| Ptf 47 | 371 | 0.973 | 1.292 | −19.068 | 17.008 | 4.364 | −0.356 |
| Ptf 48 | 371 | 0.914 | 1.142 | −17.249 | 17.431 | 4.253 | −0.354 |
| Ptf 49 | 371 | 1.103 | 1.233 | −16.588 | 12.851 | 4.537 | −0.352 |
| Size and Cash flow | |||||||
| Ptf 50 | 371 | 1.170 | 1.349 | −20.308 | 20.017 | 5.884 | −0.332 |
| Ptf 51 | 371 | 0.992 | 1.690 | −21.650 | 19.520 | 5.876 | −0.375 |
| Ptf 52 | 371 | 1.125 | 1.720 | −22.920 | 16.670 | 4.961 | −0.709 |
| Ptf 53 | 371 | 1.122 | 1.690 | −31.130 | 21.000 | 5.663 | −0.816 |
| Ptf 54 | 371 | 1.024 | 1.390 | −15.670 | 14.360 | 4.522 | −0.355 |
| Size and Dividend yield | |||||||
| Ptf 55 | 371 | 0.898 | 1.350 | −15.380 | 12.570 | 4.139 | −0.573 |
| Ptf 56 | 371 | 0.935 | 1.350 | −23.610 | 15.190 | 4.594 | −0.816 |
| Ptf 57 | 371 | 1.103 | 1.800 | −20.890 | 15.930 | 5.180 | −0.646 |
| Ptf 58 | 371 | 1.010 | 1.560 | −20.440 | 16.210 | 4.821 | −0.633 |
| Ptf 59 | 371 | 0.992 | 1.320 | −23.460 | 21.840 | 4.644 | −0.608 |
| Size and Price to Earnings ratio | |||||||
| Ptf 60 | 371 | 0.894 | 1.210 | −20.310 | 16.100 | 4.861 | −0.492 |
| Ptf 61 | 371 | 0.968 | 1.230 | −15.550 | 12.900 | 4.063 | −0.555 |
| Ptf 62 | 371 | 0.868 | 1.030 | −16.750 | 14.960 | 4.072 | −0.661 |
| Ptf 63 | 371 | 1.000 | 1.700 | −22.510 | 18.860 | 5.928 | −0.415 |
| Ptf 64 | 371 | 1.097 | 1.660 | −22.890 | 17.030 | 4.873 | −0.679 |
| Ptf 65 | 371 | 1.138 | 1.640 | −30.890 | 20.050 | 5.536 | −0.844 |
| Ptf 66 | 371 | 0.955 | 1.230 | −15.940 | 13.800 | 4.451 | −0.375 |
| Ptf 67 | 371 | 0.930 | 1.250 | −15.470 | 13.100 | 4.104 | −0.511 |
| Ptf 68 | 371 | 1.046 | 1.580 | −23.590 | 15.950 | 4.788 | −0.829 |
| Ptf 69 | 371 | 0.573 | 0.610 | −29.210 | 47.520 | 8.846 | 0.858 |
| Ptf 70 | 371 | 0.897 | 1.280 | −23.550 | 27.550 | 5.758 | −0.308 |
| Ptf 71 | 371 | 1.137 | 1.670 | −23.430 | 22.480 | 5.190 | −0.538 |
| Ptf 72 | 371 | 1.333 | 1.690 | −24.340 | 20.730 | 5.157 | −0.548 |
| Ptf 73 | 371 | 1.606 | 2.190 | −22.830 | 31.850 | 6.619 | −0.132 |
| Ptf 74 | 371 | 0.737 | 0.760 | −27.730 | 55.850 | 8.728 | 0.607 |
| Ptf 75 | 371 | 1.037 | 1.480 | −25.210 | 33.740 | 6.095 | −0.156 |
| Ptf 76 | 371 | 1.122 | 1.620 | −22.280 | 18.670 | 5.156 | −0.607 |
| Ptf 77 | 371 | 1.218 | 1.630 | −23.070 | 17.670 | 5.183 | −0.565 |
| Ptf 78 | 371 | 1.404 | 1.780 | −25.600 | 30.160 | 6.799 | −0.132 |
| Ptf 79 | 371 | 0.790 | 0.890 | −27.530 | 44.280 | 8.438 | 0.417 |
| Ptf 80 | 371 | 0.970 | 1.440 | −25.020 | 27.060 | 5.820 | −0.350 |
| Ptf 81 | 371 | 1.047 | 1.350 | −22.200 | 23.520 | 5.012 | −0.482 |
| Ptf 82 | 371 | 1.035 | 1.490 | −22.110 | 16.620 | 4.808 | −0.637 |
| Size and Momentum | |||||||
| Ptf 83 | 371 | 1.297 | 1.640 | −21.460 | 26.390 | 6.215 | −0.267 |
| Ptf 84 | 371 | 0.649 | 0.620 | −38.490 | 46.420 | 8.554 | 0.133 |
| Ptf 85 | 371 | 1.016 | 1.420 | −23.550 | 28.880 | 5.738 | −0.161 |
| Ptf 86 | 371 | 1.092 | 1.610 | −22.240 | 17.480 | 4.802 | −0.580 |
| Ptf 87 | 371 | 1.075 | 1.410 | −19.390 | 12.980 | 4.378 | −0.719 |
| Ptf 88 | 371 | 1.245 | 1.430 | −22.170 | 25.080 | 5.631 | −0.338 |
| Ptf 89 | 371 | 0.600 | 0.620 | −31.450 | 30.950 | 7.720 | 0.163 |
| Ptf 90 | 371 | 0.871 | 0.930 | −20.640 | 24.370 | 5.144 | −0.003 |
| Ptf 91 | 371 | 0.899 | 1.130 | −16.890 | 17.540 | 4.211 | −0.329 |
| Ptf 92 | 371 | 0.964 | 1.210 | −14.190 | 15.800 | 4.019 | −0.263 |
| Ptf 93 | 371 | 1.111 | 1.630 | −19.610 | 18.520 | 4.920 | −0.381 |
| Panel D | Observations | Mean | Median | Minimum | Maximum | St. Dev | Skewness |
| Diversified | 321 | 0.699 | 1.262 | −31.960 | 39.687 | 6.181 | −0.575 |
| Industrial | 321 | 1.232 | 1.528 | −56.188 | 70.483 | 8.254 | 0.264 |
| Lodging/Resorts | 321 | 0.570 | 0.590 | −36.555 | 67.525 | 8.710 | 0.638 |
| Office | 321 | 0.895 | 1.480 | −31.796 | 32.458 | 6.077 | −0.487 |
| Residential | 321 | 1.013 | 1.362 | −26.656 | 22.242 | 5.501 | −0.833 |
| Retail | 321 | 0.846 | 1.086 | −42.678 | 43.516 | 6.701 | −0.985 |
| Self Storage | 321 | 1.326 | 1.616 | −22.244 | 21.928 | 5.584 | −0.364 |
| All REITs | 321 | 0.888 | 1.194 | −30.226 | 27.975 | 5.296 | −0.959 |
| Mortgage REITs | 321 | 0.642 | 1.595 | −53.753 | 19.411 | 6.551 | −2.455 |
| Panel E | Observations | Mean | Median | Minimum | Maximum | St. Dev | Skewness |
| AAA | 372 | 0.536 | 0.586 | −6.544 | 6.391 | 1.473 | −0.185 |
| AA | 372 | 0.586 | 0.565 | −5.774 | 5.357 | 1.339 | −0.258 |
| BBB | 372 | 0.531 | 0.671 | −10.996 | 6.286 | 1.668 | −1.575 |
| A | 372 | 0.565 | 0.649 | −9.672 | 6.750 | 1.524 | −1.063 |
| Panel F | Observations | Mean | Median | Minimum | Maximum | St. Dev | Skewness |
| Gold | 372 | 0.533 | 0.115 | −18.005 | 16.458 | 4.430 | 0.174 |
| Silver | 372 | 0.866 | 0.235 | −27.958 | 30.462 | 8.371 | 0.199 |
| Platinum | 372 | 0.417 | 0.309 | −42.004 | 24.112 | 6.286 | −0.854 |
| Copper | 372 | 0.582 | 0.421 | −36.150 | 34.064 | 7.181 | 0.044 |
| Brent Crude | 372 | 0.660 | 0.829 | −54.988 | 40.074 | 9.540 | −0.361 |
| Gasoil | 372 | 0.557 | 0.566 | −33.617 | 31.172 | 9.286 | −0.092 |
| Light crude oil | 372 | 0.703 | 0.924 | −54.245 | 88.376 | 10.485 | 1.198 |
| Natural Gas | 372 | 1.132 | 0.512 | −41.616 | 62.613 | 15.197 | 0.568 |
| Cotton | 372 | 0.385 | 0.511 | −36.116 | 24.749 | 8.492 | −0.222 |
| Cocoa | 372 | 0.585 | 0.201 | −28.083 | 34.565 | 8.921 | 0.457 |
| Coffee | 372 | 0.638 | −0.727 | −30.391 | 50.595 | 10.622 | 1.007 |
| Corn | 372 | 0.565 | 0.081 | −31.380 | 22.190 | 7.976 | −0.185 |
| Lumber | 372 | 1.091 | 0.485 | −34.030 | 58.416 | 11.205 | 0.674 |
| Soybean Oil | 372 | 0.444 | 0.242 | −24.461 | 26.862 | 6.933 | 0.039 |
| Soybeans | 372 | 0.485 | 0.434 | −32.867 | 19.573 | 7.010 | −0.491 |
| Wheat | 372 | 0.637 | 0.204 | −25.248 | 42.335 | 8.608 | 0.516 |
| Live Cattle | 372 | 0.264 | 0.017 | −25.565 | 40.342 | 5.652 | 0.437 |
The table contains summary statistics for the sample of monthly data used in the analysis. The sample was from January 1990 to December 2020. Each panel represents a different asset class. Panel A shows descriptive statistics of US Constant Maturity Treasury rates, taken to be representative of the US yield curve, at different maturities. Panel B shows statistics for the returns on US Treasury securities. Panels C, D, E, and F show statistics for equities, REIT, corporate bonds, and commodity returns, respectively. In particular, Panel C is split into eight subsections, showing statistics for the returns on the original Fama–French factor portfolios, the 17 industry portfolios, and various bivariate types on size, book-to-market, investments, operating profitability, cash flows, dividend yield, earnings price ratio, and momentum. Values are reported in percentage points.
Appendix C. Diebold–Mariano Test on Asset Return Series
Table A2.
Diebold–Mariano test on return series.
Table A2.
Diebold–Mariano test on return series.
| Panel A—Diebold–Mariano test comparing AR(1) on NS to listed models | |||||||||
| VAR(1) on NS | AR(1) for Returns | Historical Average | |||||||
| h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | |
| 3 Months | 7.283 | 1.736 | 1.517 | −1.826 | −2.140 | −1.875 | −17.772 | −5.762 | −3.419 |
| 6 Months | 6.760 | 2.771 | 2.077 | −0.786 | −4.413 | −2.878 | −16.321 | −5.688 | −3.373 |
| 1 Year | 4.887 | 2.489 | 1.046 | 0.940 | −0.026 | −0.082 | −0.331 | −0.054 | −0.089 |
| 2 Years | 4.056 | 2.256 | 1.130 | 0.903 | 0.405 | −0.026 | 0.384 | 0.402 | −0.021 |
| 3 Years | 3.059 | 1.628 | 0.786 | 0.529 | 0.328 | −0.195 | 0.323 | 0.325 | −0.193 |
| 5 Years | 1.391 | 0.374 | −0.185 | −0.294 | −0.041 | −0.677 | −0.323 | −0.045 | −0.664 |
| 7 Years | 0.559 | −0.208 | −0.630 | −0.595 | −0.177 | −0.924 | −0.810 | −0.186 | −0.896 |
| 10 Years | −0.335 | −0.749 | −0.967 | −0.909 | −0.143 | −0.975 | −1.353 | −0.162 | −0.945 |
| Panel B—Diebold–Mariano test comparing AR(1) on NS vs. listed models | |||||||||
| VAR(1) on NS | AR(1) for Returns | Historical Average | |||||||
| h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | |
| SMB | 0.126 | −0.542 | −1.039 | −0.245 | 0.006 | −0.113 | −1.137 | 0.025 | −0.059 |
| HML | 1.314 | 1.387 | 1.107 | −1.489 | −1.039 | −2.176 | −2.247 | −1.869 | −1.048 |
| Mkt | 0.472 | 1.218 | 1.220 | −1.922 | 2.333 | 1.606 | −1.395 | 0.745 | 2.549 |
| Food | 2.698 | 2.504 | 1.742 | 0.930 | 1.613 | 1.606 | 0.945 | 1.679 | 1.659 |
| Mines | −2.193 | −1.363 | −1.399 | 1.068 | 2.369 | 2.022 | 1.184 | 2.364 | 2.030 |
| Oil | −1.573 | −1.955 | −0.684 | 0.192 | 0.460 | 0.935 | −1.081 | −0.519 | 1.048 |
| Textiles | 2.530 | 2.341 | 1.697 | −0.368 | 1.777 | 1.735 | 0.442 | 1.759 | 1.835 |
| Consumer Durables | −2.381 | −1.343 | 0.545 | 0.592 | 1.017 | 1.302 | −0.038 | 1.077 | 1.366 |
| Chemicals | 1.407 | 1.458 | 1.039 | −0.490 | 0.644 | 1.001 | −0.351 | −0.675 | 1.085 |
| Steel | 1.068 | 1.902 | −1.839 | −2.504 | 0.554 | 1.156 | −0.902 | −1.129 | −0.678 |
| Consumer | 0.826 | −0.061 | −0.094 | 0.592 | 1.017 | 1.302 | −0.038 | 1.077 | 1.366 |
| Construction | 0.826 | −0.061 | −0.094 | 0.592 | 1.017 | 1.302 | −0.038 | 1.077 | 1.366 |
| Steel | 1.474 | 1.525 | 0.807 | −0.435 | 1.696 | 2.370 | −0.848 | −1.662 | 2.197 |
| Fabricated Products | −1.408 | −1.357 | −1.381 | −2.364 | 1.399 | 1.588 | −0.957 | 1.496 | −1.670 |
| Machinery | 1.263 | 1.027 | 0.209 | −0.567 | 1.386 | 1.364 | −0.723 | −1.931 | −1.384 |
| Cars | −1.498 | −1.579 | −1.417 | −1.068 | 1.227 | 1.525 | −1.566 | 1.249 | 1.544 |
| Transportation | 1.397 | 1.282 | 1.070 | −1.066 | 1.209 | 1.756 | −0.686 | 1.235 | 1.904 |
| Utilities | −0.712 | −1.517 | −2.243 | −2.775 | 1.277 | 1.254 | −2.444 | −1.877 | −1.087 |
| Retail Stores | 1.269 | 1.430 | 1.833 | −2.512 | 1.234 | 1.634 | −1.742 | 1.092 | 1.643 |
| Financials | 2.085 | 2.302 | 2.081 | 0.210 | 1.304 | 1.724 | 0.293 | 1.322 | 1.770 |
| Other | −0.728 | −0.042 | 0.631 | −2.162 | −0.607 | 1.300 | −2.668 | −1.443 | 1.034 |
| Ptf 1 | −1.042 | −0.875 | −0.369 | −1.516 | 0.655 | 2.206 | −3.251 | 0.537 | 2.125 |
| Ptf 2 | −0.800 | −0.970 | −1.421 | −2.910 | 0.702 | 1.622 | −1.624 | −0.732 | 1.718 |
| Ptf 3 | −0.656 | −0.750 | −0.769 | −2.638 | 0.126 | 0.981 | −1.943 | 0.170 | 1.060 |
| Ptf 4 | −1.106 | −1.035 | −0.711 | −2.329 | −0.243 | 0.221 | −0.944 | −0.208 | 0.267 |
| Ptf 5 | −1.107 | −1.126 | −0.897 | −1.974 | −0.305 | 0.234 | −1.870 | −0.259 | 0.292 |
| Ptf 6 | −1.461 | −0.863 | −0.715 | −2.311 | 1.153 | 2.646 | −2.678 | 1.091 | 2.213 |
| Ptf 7 | 0.410 | 0.578 | −0.046 | −1.534 | 1.470 | 0.902 | −2.169 | 1.388 | 0.773 |
| Ptf 8 | −0.857 | −0.637 | −0.116 | −1.807 | 0.067 | 0.541 | −1.868 | 0.145 | 0.608 |
| Ptf 9 | −0.988 | −0.958 | −0.691 | −2.272 | −0.424 | 0.112 | −1.144 | −0.375 | 0.175 |
| Ptf 10 | −0.353 | −0.442 | −0.515 | −2.196 | −0.125 | 0.724 | −1.203 | −0.047 | −0.885 |
| Ptf 11 | −1.282 | −1.189 | −1.529 | −1.493 | 1.198 | 2.775 | −1.886 | 1.228 | 3.369 |
| Ptf 12 | −1.243 | −0.533 | 0.027 | −1.797 | 0.654 | 1.309 | −0.895 | −0.715 | 1.448 |
| Ptf 13 | −0.337 | 0.038 | −0.277 | −2.253 | −0.058 | 0.709 | −1.799 | 0.074 | 0.939 |
| Ptf 14 | −0.742 | 0.196 | 0.490 | −1.419 | 0.375 | 0.770 | −0.525 | −0.438 | 0.845 |
| Ptf 15 | 0.046 | 0.156 | −0.111 | −2.848 | −1.122 | −0.557 | −1.719 | −1.055 | −0.499 |
| Ptf 16 | −1.201 | −0.746 | −0.997 | −1.978 | 0.854 | 3.021 | −2.192 | 0.790 | 3.791 |
| Ptf 17 | −0.422 | −0.352 | −0.067 | −1.749 | 1.401 | 1.541 | −2.793 | 1.246 | 3.655 |
| Ptf 18 | −0.454 | 0.568 | 0.206 | −1.764 | 0.678 | 1.827 | −1.337 | 0.824 | 2.230 |
| Ptf 19 | −0.992 | −1.519 | −0.867 | −2.611 | −0.730 | −0.021 | −1.821 | −0.659 | 0.083 |
| Ptf 20 | 0.626 | 1.153 | 1.022 | −1.359 | 0.453 | 0.976 | −0.596 | 0.479 | 1.059 |
| Ptf 21 | 1.473 | 2.740 | 1.991 | −1.669 | 0.307 | 1.786 | −1.669 | 0.151 | 1.706 |
| Ptf 22 | 1.714 | 2.310 | 1.483 | 0.170 | 1.420 | 1.812 | −0.092 | 1.425 | 1.888 |
| Ptf 23 | 0.421 | 0.096 | 0.251 | −2.148 | 0.134 | 1.715 | −1.478 | −0.205 | 2.009 |
| Ptf 24 | 1.251 | 0.664 | 0.471 | −0.743 | 0.895 | 1.388 | −0.368 | 0.930 | 1.471 |
| Ptf 25 | 1.097 | 0.768 | 0.838 | −0.521 | 0.824 | 1.455 | −1.851 | 0.833 | 1.509 |
| Ptf 26 | −1.036 | −0.979 | −0.027 | −1.200 | 1.202 | 1.594 | −2.065 | 1.148 | 1.483 |
| Ptf 27 | −0.432 | 0.398 | 2.164 | −0.917 | 0.701 | 1.209 | −0.667 | 0.712 | 1.280 |
| Ptf 28 | 2.111 | 1.729 | 1.234 | −0.718 | 0.782 | 1.019 | −0.104 | 0.798 | 1.033 |
| Ptf 29 | −0.572 | −1.026 | −1.557 | −2.588 | 0.352 | 2.803 | −2.962 | −0.285 | 3.126 |
| Ptf 30 | −1.391 | −0.508 | −0.470 | −1.806 | −0.693 | −0.189 | −2.648 | −0.842 | 0.287 |
| Ptf 31 | 0.110 | 0.357 | −0.005 | −2.030 | −4.420 | −0.751 | −1.676 | −4.702 | −0.713 |
| Ptf 32 | −1.084 | −3.046 | −1.119 | −1.280 | −0.560 | 0.632 | −1.904 | −0.519 | 0.783 |
| Ptf 33 | −0.158 | −0.226 | −0.373 | −1.210 | −0.088 | 0.400 | −2.192 | −0.157 | 0.329 |
| Ptf 34 | 0.295 | 0.865 | 2.266 | −1.154 | 1.115 | 3.148 | −1.695 | 1.112 | 2.939 |
| Ptf 35 | −1.499 | −2.748 | −2.188 | −1.455 | 0.147 | 1.578 | −1.325 | −0.009 | 1.453 |
| Ptf 36 | −0.679 | 0.379 | 1.114 | −1.394 | 1.406 | 5.678 | −1.915 | 1.463 | 5.460 |
| Ptf 37 | 0.714 | 0.615 | 0.486 | −0.812 | −0.246 | 0.248 | −0.835 | −0.175 | 0.323 |
| Ptf 38 | −0.758 | 0.455 | 0.382 | −1.869 | 1.210 | 2.144 | −2.526 | 1.017 | 1.870 |
| Ptf 39 | 3.157 | 2.732 | 2.057 | 1.301 | 2.064 | 2.237 | 0.951 | 2.083 | 2.283 |
| Ptf 40 | −0.319 | −0.968 | −1.168 | −1.693 | −0.805 | −0.052 | −2.180 | −0.830 | −0.082 |
| Ptf 41 | 0.392 | 0.942 | 0.603 | −0.603 | 1.098 | 1.149 | −0.908 | 1.090 | 1.157 |
| Ptf 42 | −0.727 | −0.543 | −0.174 | −0.974 | 0.595 | 2.561 | −2.044 | 0.647 | 2.729 |
| Ptf 43 | 0.655 | 0.174 | 0.421 | −0.637 | 0.600 | 1.712 | −1.266 | 0.662 | 1.985 |
| Ptf 44 | −0.799 | −0.273 | 0.239 | −2.300 | 0.876 | 3.570 | −2.645 | 0.884 | 3.764 |
| Ptf 45 | −1.234 | −3.087 | −2.192 | −0.644 | 0.666 | 4.135 | −2.237 | 0.497 | 4.996 |
| Ptf 46 | 3.401 | 2.968 | 2.602 | 1.590 | 2.107 | 2.423 | 1.252 | 2.167 | 2.472 |
| Ptf 47 | 3.901 | 4.542 | 4.220 | 1.632 | 2.609 | 4.120 | 1.646 | 2.599 | 4.209 |
| Ptf 48 | 0.423 | −0.003 | 0.120 | −1.838 | 0.876 | 1.364 | −1.862 | 0.951 | 1.551 |
| Ptf 49 | −0.191 | 0.074 | 0.414 | −1.104 | 0.831 | 1.579 | −2.720 | 0.824 | 1.669 |
| Ptf 50 | 0.864 | 1.174 | 1.042 | −1.441 | 1.772 | 2.319 | −2.431 | 1.672 | 2.646 |
| Ptf 51 | −0.816 | −0.897 | −1.052 | −2.446 | 0.652 | 2.013 | −1.861 | −0.720 | 2.253 |
| Ptf 52 | −0.717 | −0.617 | −0.611 | −2.865 | −0.231 | 0.326 | −1.236 | −0.147 | 0.430 |
| Ptf 53 | −0.167 | 0.094 | −0.064 | −2.077 | −0.404 | 0.179 | −1.763 | −0.310 | 0.343 |
| Ptf 54 | 1.091 | 1.944 | 1.818 | −1.892 | 0.284 | 1.620 | −1.835 | 0.180 | 1.602 |
| Ptf 55 | 1.830 | 1.968 | 1.374 | −0.031 | 1.395 | 2.055 | −0.017 | 1.402 | 2.163 |
| Ptf 56 | 0.528 | −0.106 | −0.019 | −1.314 | 0.217 | 1.081 | −1.004 | −0.282 | 1.233 |
| Ptf 57 | −0.742 | −0.627 | −0.521 | −2.514 | 0.073 | 0.750 | −1.260 | 0.169 | 0.909 |
| Ptf 58 | −1.143 | −1.136 | −0.893 | −2.518 | 0.196 | 0.710 | −1.842 | 0.247 | 0.803 |
| Ptf 59 | −1.942 | −1.436 | −0.650 | −1.676 | 0.544 | 0.743 | −0.240 | 0.588 | 0.804 |
| Ptf 60 | 1.476 | 1.782 | 1.649 | −0.716 | 1.099 | 1.806 | −0.519 | 1.107 | 1.867 |
| Ptf 61 | 1.224 | 1.042 | 0.967 | −0.076 | 1.001 | 2.157 | −0.760 | 0.993 | 2.301 |
| Ptf 62 | 1.042 | 0.545 | 0.204 | −1.845 | 0.227 | 0.948 | −0.677 | 0.299 | 1.075 |
| Ptf 63 | −0.498 | −0.570 | −0.923 | −2.505 | 0.496 | 1.956 | −1.849 | 0.581 | 2.327 |
| Ptf 64 | −0.762 | −0.562 | −0.491 | −2.937 | −0.079 | 0.564 | −1.298 | −0.008 | 0.686 |
| Ptf 65 | −0.581 | −0.445 | −0.451 | −2.253 | −0.397 | 0.174 | −1.930 | −0.335 | 0.297 |
| Ptf 66 | 1.324 | 2.013 | 1.910 | −1.603 | 0.824 | 1.848 | −1.250 | 0.780 | 1.887 |
| Ptf 67 | 1.307 | 1.409 | 1.063 | −0.253 | 0.990 | 2.175 | −0.610 | 0.981 | 2.323 |
| Ptf 68 | 1.101 | 0.817 | 0.638 | −0.777 | 0.521 | 1.096 | −0.655 | 0.568 | 1.188 |
| Ptf 69 | −0.111 | −0.283 | −0.398 | −1.807 | 1.030 | 1.494 | −1.932 | 0.972 | 1.371 |
| Ptf 70 | −1.116 | −1.208 | −0.735 | −2.382 | 0.443 | 0.752 | −0.774 | 0.443 | 0.793 |
| Ptf 71 | −1.498 | −1.899 | −1.178 | −2.323 | 0.621 | 1.462 | −0.584 | 0.641 | 1.559 |
| Ptf 72 | −0.960 | −1.210 | −1.001 | −2.172 | −0.268 | 0.414 | −1.021 | −0.199 | 0.490 |
| Ptf 73 | −2.198 | −1.564 | −1.096 | −0.993 | 0.388 | 0.758 | −0.202 | −0.426 | 0.801 |
| Ptf 74 | 1.744 | 1.638 | 0.936 | −1.151 | 1.580 | 1.435 | 0.165 | 1.518 | 1.379 |
| Ptf 75 | 1.075 | 1.237 | 0.719 | −1.727 | 1.224 | 0.937 | −0.869 | 1.169 | 0.873 |
| Ptf 76 | −0.516 | −0.138 | −0.254 | −2.455 | 0.233 | 1.672 | −1.551 | 0.312 | 1.951 |
| Ptf 77 | −0.949 | −0.871 | −0.824 | −2.308 | −0.142 | 0.409 | −1.901 | −0.050 | 0.495 |
| Ptf 78 | −1.170 | −0.932 | −1.037 | −1.031 | 0.166 | 0.830 | −0.831 | 0.252 | 0.914 |
| Ptf 79 | 0.737 | 0.563 | 0.128 | −1.576 | 1.334 | 1.634 | −1.342 | 1.294 | 1.475 |
| Ptf 80 | −0.205 | 0.183 | −0.011 | −2.024 | 0.023 | 1.820 | −2.325 | −0.008 | 1.401 |
| Ptf 81 | 1.066 | 1.040 | 1.062 | −1.871 | 0.422 | 0.783 | −0.637 | −0.483 | 0.881 |
| Ptf 82 | −1.342 | 0.049 | 0.451 | −0.290 | 0.493 | 0.730 | −0.198 | 0.563 | 0.796 |
| Ptf 83 | −1.518 | 1.009 | 0.683 | −0.075 | 0.674 | 1.261 | −0.195 | 0.745 | 1.317 |
| Ptf 84 | −1.058 | −0.955 | −1.135 | −1.717 | 0.655 | 1.411 | −1.820 | 0.708 | 1.592 |
| Ptf 85 | −0.515 | 0.147 | 0.219 | −2.104 | 0.307 | 1.354 | −1.506 | 0.307 | 1.562 |
| Ptf 86 | 0.602 | 0.387 | 0.536 | −1.522 | 0.612 | 1.581 | −1.261 | 0.674 | 1.862 |
| Ptf 87 | 0.364 | 0.160 | 0.322 | −1.500 | 0.592 | 1.944 | −1.749 | −0.687 | 2.414 |
| Ptf 88 | −1.199 | −0.564 | −0.862 | −0.562 | 0.636 | 1.475 | −0.749 | 0.707 | 1.565 |
| Ptf 89 | 1.467 | 1.768 | 1.516 | −1.580 | 1.298 | 2.332 | −0.615 | 1.291 | 2.392 |
| Ptf 90 | −0.536 | −0.323 | 0.125 | −1.901 | 0.022 | 1.081 | −2.532 | −0.022 | 1.036 |
| Ptf 91 | 0.895 | 1.284 | 1.055 | −2.489 | 0.151 | 2.307 | −2.666 | −0.143 | 2.191 |
| Ptf 92 | 0.170 | 0.218 | 0.617 | −0.508 | 0.246 | 1.882 | −1.812 | −0.232 | 2.093 |
| Ptf 93 | 1.514 | 2.127 | 1.807 | 0.048 | 1.230 | 1.880 | −0.436 | 1.257 | 1.981 |
| Panel C—Diebold–Mariano test comparing AR(1) on NS vs. listed models | |||||||||
| VAR(1) on NS | AR(1) for Returns | Historical Average | |||||||
| h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | |
| Diversified | −0.669 | −0.527 | 1.117 | −2.594 | −0.534 | −1.274 | −1.973 | −0.515 | −1.269 |
| Industrial | −0.576 | 0.370 | 1.847 | −0.371 | 0.349 | 0.390 | −1.499 | −0.353 | 0.375 |
| Lodging/Resorts | −0.994 | 0.204 | 0.710 | −1.736 | −0.142 | −0.337 | −1.824 | −0.139 | −0.323 |
| Office | −0.885 | −0.057 | −0.064 | −2.286 | −1.595 | −0.803 | −1.701 | −1.598 | −0.774 |
| Residential | −0.204 | −0.116 | 0.449 | −1.676 | −0.698 | −1.058 | −2.513 | −0.666 | −1.054 |
| Retail | −1.580 | −0.668 | −0.362 | −1.893 | −0.883 | −1.768 | −1.178 | −0.877 | −1.586 |
| Self Storage | −0.363 | 0.963 | 0.036 | −0.131 | 0.058 | −0.457 | −2.379 | −0.063 | −0.425 |
| All REITs | −0.572 | 0.158 | 1.728 | −2.790 | 0.506 | −0.974 | −1.824 | 0.539 | −0.916 |
| Mortgage REITs | 0.243 | 0.850 | 1.405 | 0.992 | 0.973 | 0.267 | 0.091 | 1.107 | 0.290 |
| Panel D—Diebold–Mariano test comparing AR(1) vs. NS to listed models | |||||||||
| VAR(1) on NS | AR(1) for Returns | Historical Average | |||||||
| h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | |
| AAA | 0.720 | 0.357 | 0.386 | −1.169 | −0.076 | −0.314 | −1.117 | −0.072 | −0.290 |
| AA | 1.832 | 0.472 | −0.236 | −1.113 | −0.364 | −0.850 | −0.430 | −0.354 | −0.829 |
| BBB | 1.482 | 0.178 | −0.169 | −0.954 | 0.012 | −0.350 | −0.217 | 0.020 | −0.302 |
| A | 1.505 | 0.219 | −0.340 | −1.884 | −0.291 | −0.710 | −0.484 | −0.289 | −0.683 |
| Panel E—Diebold–Mariano test comparing AR(1) on NS vs. listed models | |||||||||
| VAR(1) on NS | AR(1) for Returns | Historical Average | |||||||
| h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | |
| Gold | −1.159 | −1.594 | −1.839 | 0.787 | 1.858 | 1.572 | 0.607 | 1.683 | 1.570 |
| Silver | −0.100 | −1.614 | −1.032 | 0.152 | 5.875 | 2.313 | −0.136 | 3.118 | 2.166 |
| Platinum | 2.183 | 2.147 | 2.748 | 1.460 | 2.563 | 5.188 | 1.893 | 2.549 | 4.752 |
| Copper | −0.346 | −0.214 | −0.283 | 0.515 | 2.712 | 2.393 | 1.747 | 2.605 | 2.412 |
| Brent Crude | 0.785 | 0.171 | −0.004 | 1.339 | 2.513 | 2.538 | 0.729 | 2.498 | 2.458 |
| Gasoil | 0.660 | 0.081 | −0.104 | 1.745 | 1.892 | 2.567 | 0.907 | 1.725 | 2.438 |
| Light crude oil | 0.704 | 0.036 | -0.063 | 0.863 | 2.292 | 3.414 | 0.238 | 2.386 | 2.919 |
| Natural Gas | 0.662 | −1.216 | 0.131 | 0.205 | −1.102 | 1.434 | −0.059 | −1.539 | 1.604 |
| Cotton | −1.218 | −1.386 | −0.857 | 1.402 | 1.283 | 3.114 | 1.158 | 1.201 | 3.421 |
| Cocoa | 0.451 | −0.007 | −0.206 | 1.821 | −0.763 | 0.437 | −0.362 | −0.871 | 0.535 |
| Coffee | −2.830 | −1.829 | −0.898 | 0.068 | 0.411 | 1.632 | −0.295 | 0.104 | 1.685 |
| Corn | −0.061 | 0.470 | 3.181 | 0.325 | −0.647 | 1.662 | −1.077 | −1.144 | 1.776 |
| Lumber | 0.123 | 0.540 | 1.536 | −0.377 | −0.213 | 2.186 | −0.795 | −0.544 | 2.015 |
| Soybean Oil | −0.660 | −0.299 | −0.354 | 1.144 | 1.306 | 2.033 | 0.226 | 0.501 | 1.983 |
| Soybeans | 0.085 | 0.303 | 1.393 | 0.191 | −0.439 | 1.861 | −0.521 | −0.683 | 2.062 |
| Wheat | 0.460 | −0.752 | −0.351 | 0.926 | −0.848 | 2.093 | −0.802 | −0.918 | 2.059 |
| Live Cattle | 0.923 | 0.986 | −0.020 | 1.339 | 0.746 | 1.791 | 0.410 | 0.525 | 1.882 |
| Panel F—Summary of Diebold–Mariano test results | |||||||||
| VAR(1) on NS | AR(1) for Returns | Historical Average | |||||||
| h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | h = 1 | h = 6 | h = 12 | |
| Equity | 4 | 5 | 4 | 52 | 1 | 1 | 49 | 4 | 0 |
| Treasuries | 0 | 0 | 0 | 1 | 2 | 2 | 2 | 2 | 2 |
| REITs | 1 | 0 | 0 | 6 | 0 | 1 | 6 | 0 | 0 |
| Corporate Bonds | 0 | 0 | 0 | 1 | 0 | 0 | 0 | 0 | 0 |
| Commodities | 1 | 1 | 1 | 0 | 0 | 0 | 0 | 0 | 0 |
The table contains results of the Diebold–Mariano test. In particular, the table shows test statistics for the three models (VAR(1) on NS factors, AR(1) on returns, and Historical Average) considered for forecasting returns of US Treasury securities, equities, REIT, corporate bonds, and commodity against an AR(1) on Nelson–Siegel factors, in the pseudo out-of-sample period going from January 2011 to September 2020. Test statistics are shown according to three dimensions: (i) the model used to compute the forecasts, (ii) the returns series considered (one of 151 among the five asset classes analyzed), and (iii) the forecast horizon (one-step-ahead, six-step-ahead, and twelve-step-ahead forecasts). Boldfaced values represent series for which we can reject the null hypothesis of equal predictive accuracy in a 5%-sized, one-tailed test. Panels A to E show DM test statistics for each of the five asset classes considered, while Panel F shows a summary of DM tests by asset class and forecast horizon. Panel F summarizes how many times we can reject the null hypothesis of equal predictive accuracy in a 5%-sized, one-tailed test. Numbers in brackets in Panel F represent the number of series analyzed for each asset class.
Table A3.
Root mean squared forecast errors (RMSE) for government bond yields.
Table A3.
Root mean squared forecast errors (RMSE) for government bond yields.
| Model | RMSE (h = 1) | RMSE (h = 6) | RMSE (h = 12) |
|---|---|---|---|
| VAR(1) on NS | 1.88 | 0.22 | 0.03 |
| AR(1) on Yields | 9.30 | 0.93 | 0.44 |
| VAR(1) on Yield Changes | 14.44 | 1.56 | 0.77 |
| Random Walk | 8.07 | 1.03 | 0.54 |
| Slope Regression | 12.75 | 0.60 | 0.84 |
| VAR(1) on Yields | 10.88 | 21.90 | 0.77 |
| Regression on Principal Components | 177.89 | 24.79 | 0.77 |
Note: The table presents root mean squared forecast errors (RMSEs) for government bond yields at different maturities (3 months, 1 year, 5 years, and 10 years) across three forecast horizons: one-step-ahead (h = 1), six-step-ahead (h = 6), and twelve-step-ahead (h = 12). The RMSEs were computed for different forecasting models, including VAR(1) on Nelson–Siegel (NS) factors, AR(1) on yields, and benchmark methods such as the Random Walk and Principal Component Regression. Lower RMSE values indicate better forecasting accuracy.
Appendix D. Combining Forecast with Goyal and Welch Predictors
Table A4.
Combining out-of-sample (%) for 6-month horizon.
Table A4.
Combining out-of-sample (%) for 6-month horizon.
| Yield Curve | Treasury Returns | Equity Returns | ||||||
| Maturity | GW Predictors | GW + NS Factor | Maturity | GW Predictors | GW + NS Factor | Sector | GW Predictors | GW + NS Factor |
| 3 M | 3.24 *** | 4.77 *** | 3 M | 0.71 *** | 5.77 *** | Mkt | 0.25 ** | 0.13 * |
| 1 Y | 3.32 *** | 4.88 *** | 6 M | 3.38 *** | 5.66 *** | Food | 0.25 | 0.09 |
| 5 Y | 2.54 *** | 3.28 *** | 1 Y | 0.87 *** | 0.95 *** | Mines | 0.06 | 0.13 * |
| 10 Y | 2.25 *** | 3.03 *** | 2 Y | 0.14 | 0.66 ** | Oil | −0.09 | −0.11 |
| 3 Y | 0.08 | 0.32 * | Textiles | 0.25 | 0.06 | |||
| 5 Y | −0.01 | 1.52 ** | Consumer Durables | 0.53 *** | 0.31 * | |||
| 7 Y | 0.04 | 0.08 | Chemicals | 0.14 | 0.09 | |||
| 10 Y | 0.14 | 0.24 | Consumer | 0.45 ** | 0.40 ** | |||
| Construction | 0.21 * | 0.14 | ||||||
| Steel | 0.03 | 0.01 | ||||||
| Fabricated Products | 0.13 | 0.26 * | ||||||
| Machinery | 0.03 | 0.01 | ||||||
| Cars | 0.31 ** | −0.01 | ||||||
| Transportation | 0.16 * | 0.01 | ||||||
| Utilities | −0.09 | −0.17 | ||||||
| Retail Stores | 0.08 | 0.13 | ||||||
| Financials | 0.38 ** | 0.26 * | ||||||
| Other | 0.52 *** | 0.53 ** | ||||||
| REITs | Corporate Bonds | Commodities | ||||||
| Sector | GW Predictors | GW + NS Factor | Rating | GW Predictors | GW + NS Factor | Commodity | GW Predictors | GW + NS Factor |
| Diversified | −0.18 | −0.27 | AAAA | 0.12 | 0.16 | Gold | 0.13 | 0.19 |
| Industrial | −0.09 | −0.79 | AA | 0.45 * | 0.55 * | Silver | 0.10 | 0.04 |
| Lodging/Resorts | 0.33 | 0.32 | BBB | 0.46 * | 0.33 | Platinum | −0.34 | −0.39 |
| Office | −0.21 | −0.23 | A | 0.85 ** | 0.74 * | Copper | −0.30 | −0.33 |
| Residential | −0.07 | −0.23 | Brent Crude | −0.12 | −0.21 | |||
| Retail | −0.17 | −0.07 | Gasoil | −0.26 | −0.35 | |||
| Self Storage | −0.19 | −0.07 | Light crude oil | −0.17 | 0.16 | |||
| All REITs | −0.23 | −0.04 | Natural Gas | 0.01 | 0.20 | |||
| Mortgage REITs | 0.15 * | 0.03 | Cotton | 0.35 ** | 0.09 | |||
| Cocoa | −0.03 | −0.02 | ||||||
| Coffee | −0.15 | −0.18 | ||||||
| Corn | −0.09 | −0.17 | ||||||
| Lumber | 0.38 *** | 0.24 ** | ||||||
| Soybean Oil | −0.06 | −0.20 | ||||||
| Soybeans | −0.09 | −0.15 | ||||||
| Wheat | −0.04 | −0.03 | ||||||
| Live Cattle | −0.04 | −0.11 | ||||||
The table reports the out-of-sample (%) for a 6-month horizon, computed as the median of the combination forecasts. The predictive variables included in the combination models consist of various macro-financial indicators. These include valuation ratios such as the dividend–price ratio (D/P), the dividend yield (D/Y), the earnings–price ratio (E/P), and the dividend–payout ratio (D/E). Additionally, measures of market risk and performance such as stock variance (SVAR), the book-to-market ratio (B/M), and net equity expansion (NTIS) are considered. The analysis also incorporates interest rate variables, including the Treasury bill rate (TBL), the long-term yield (LTY), the long-term return (LTR), and the term spread (TMS), along with credit risk indicators such as the default yield spread (DFY) and the default return spread (DFR). Finally, inflation (INFL) is included, following Welch and Goyal (2008), with the predictive regression using the previous month’s inflation rate due to the lag in data availability. The table compares combination forecasts with and without NS factors to assess their impact on predictive accuracy. The out-of-sample period spans from January 2011 to December 20200. Statistical significance for the statistic is based on the p-value for the Clark and West (2007) out-of-sample MSPE-adjusted statistic. The test corresponds to a one-sided hypothesis where the null states that the competing forecasting model has an expected squared prediction error equal to that of the historical average benchmark forecasting model. The alternative hypothesis posits that the competing forecasting model exhibits a lower expected squared prediction error relative to the benchmark. Statistical significance is denoted as follows: for the 10% level, for the 5% level, and for 1%.
Notes
| 1 | The stochastic discount factor, also called the pricing kernel, is a random variable used in asset pricing to discount future payoffs. It ensures that the expected discounted payoffs equal current asset prices. The SDF captures time value, risk preferences, and market conditions. |
| 2 | Fixing allows us to avoid estimating the baseline NS specification using nonlinear least squares, enhancing the simplicity and robustness of the forecasts obtained from the model. In fact, Diebold et al. (2006) emphasized that NLS estimation can encounter issues such as local minima or convergence failures. |
| 3 | In Equation (4) and in all subsequent forecasting models, we use a caret to denote the final predictions (of factors and yields) that we assign an economic meaning to and that we use to assess the forecast accuracy of the different models. A tilde is used to indicate intermediate-step parameter estimates that are instrumental in computing the final predictions. |
| 4 | In unreported tests, we tried to perform formal model specification searches concerning the number of VAR lags, and in particular investigated the performance for the yield series of a VAR(2) model, finding a similar or uniformly worse predictive performance. |
| 5 | The use of cointegrated VAR models for interest rate forecasting does not consistently outperform other methods. The inclusion of many yields increases model complexity, which can lead to overfitting and poor out-of-sample predictions. Despite improvements like cubic splines and ECM approaches, these models often overlook the fact that bond prices (yields) reflect all available information about future interest rates, limiting their forecasting effectiveness (Duffee, 2013). |
| 6 | In the case of Treasury data, even though an obvious (yet non linear) relationship exists between (the changes in ex ante) yields and (ex post, realized) returns, even when the NS factors are used, the forecast results obtained for returns are not presumed to be the same as the ones obtainable for yields. |
| 7 | Some literature labels this third predictive benchmark as a ”random walk”, which obviously is at odds with the restriction applied to Equation (15). The label derives from the fact that restricting in one obtains the model in (15) with . |
| 8 | These predictors are available at Goyal’s personal site: https://sites.google.com/view/agoyal145/home?authuser=0, accessed on 20 March 2025. |
| 9 | Negative yields, which may arise due to the low interest rate environment or specific technical market factors, such as cash and repurchase agreements in US Treasury markets, were excluded from the CMT calculation. This ensures that the dataset included only positive and economically meaningful yields. |
| 10 | Maturities such as the 1-month, 2-month, 20-year, and 30-year constant maturity rates, which were discontinued during the study period, were excluded. |
| 11 | https://mba.tuck.dartmouth.edu/pages/faculty/ken.french/index.html, accessed on 20 March 2025. |
| 12 | To qualify for inclusion, bonds had to have an investment-grade rating (based on an average of Moody’s, S&P, and Fitch) and meet specific criteria regarding maturity (a remaining maturity of at least one year at the time of inclusion in the index), a fixed coupon schedule (i.e., zero coupon bonds are excluded), and minimum amount outstanding (USD 250 million). The index was capitalization-weighted based on the outstanding amount of each security at the end of each month. |
| 13 | Nonetheless, this is considered partially unrealistic, also in light of the recent results in Kim and Choi (2017). |
| 14 | The sample autocorrelation in the forecast errors further supports these findings, with errors from the AR(1) and VAR(1) models for yields being serially correlated for most maturities and horizons, except for longer maturities (3-, 5-, 7-, and 10-year yields) at one-step-ahead horizons. |
| 15 | Because the results for MCS test are starker and allow us to draw sharper conclusions, in the main text, we present the tables documenting the findings from the MCS methodology and report in an Appendix those from the simpler, but intuitive DM tests. |
| 16 | While the two test statistics used in the MCS procedure, TMax and TR, produced similar results, they differed in the total number of models excluded from the superior set. The TMax statistic tended to exclude fewer models from M* across all forecasting horizons compared to the TR statistic. Using the TMax statistic, we excluded 187, 62, and 181 models from M* across all return series for the one-step-ahead, six-step-ahead, and 12-step-ahead forecasts, respectively. Using the TR statistic, we excluded 229, 76, and 188 models from M* for the same horizons. In the case of the return series, we observed the opposite of what was found for the bond yield series, with the TMax test appearing less selective than the TR statistic. This difference was likely due to the fact that only four models were compared when forecasting returns, whereas up to eight models were compared when forecasting bond yields, making pairwise comparisons based on the TR statistic more powerful than the multivariate comparisons used by the TMax statistic. |
| 17 | As a sub-question, we also explored whether an AR(1) or a VAR(1) model based on (predicted) NS factors could be preferable in forecasting returns. Although no definitive conclusion emerged, our analysis suggested that the two types of models did not exhibit equivalent performance across different asset classes and forecast horizons. In overall terms, the VAR(1) NS tended to outperform the AR(1) NS, particularly in the case of Treasury securities, equities, and corporate bonds. However, the AR(1) NS returned a slightly superior performance in the prediction of REIT returns. As far as the forecast horizon is concerned, the AR(1) NS performance relative to the VAR(1) improved as the horizon lengthened, especially for US Treasuries and corporate bonds. Conversely, the VAR(1) model proved more effective in forecasting REIT returns. |
References
- Campbell, J. Y., & Thompson, S. B. (2008). Predicting excess stock returns out of sample: Can anything beat the historical average? The Review of Financial Studies, 21(4), 1509–1531. [Google Scholar] [CrossRef]
- Chen, Y., & Niu, L. (2014). Adaptive dynamic Nelson–Siegel term structure model with applications. Journal of Econometrics, 180(1), 98–115. [Google Scholar] [CrossRef]
- Christensen, J. H. E., Diebold, F. X., & Rudebusch, G. D. (2011). The affine arbitrage-free class of Nelson–Siegel term structure models. Journal of Econometrics, 164(1), 4–20. [Google Scholar] [CrossRef]
- Clark, T. E., & West, K. D. (2007). Approximately normal tests for equal predictive accuracy in nested models. Journal of Econometrics, 138(1), 291–311. [Google Scholar] [CrossRef]
- Diebold, F. X., & Li, C. (2006). Forecasting the term structure of government bond yields. Journal of Econometrics, 130(2), 337–364. [Google Scholar] [CrossRef]
- Diebold, F. X., & Mariano, R. S. (2002). Comparing predictive accuracy. Journal of Business & Economic Statistics, 20(1), 134–144. [Google Scholar]
- Diebold, F. X., & Rudebusch, G. D. (2013). Yield curve modeling and forecasting: The dynamic Nelson-Siegel approach. Princeton University Press. [Google Scholar]
- Diebold, F. X., Rudebusch, G. D., & Aruoba, S. B. (2006). The macroeconomy and the yield curve: A dynamic latent factor approach. Journal of Econometrics, 131(1–2), 309–338. [Google Scholar]
- Drobetz, W., Stürmer, S., & Zimmermann, H. (2002). Conditional asset pricing in emerging stock markets. Swiss Journal of Economics and Statistics (SJES), 138(IV), 507–526. [Google Scholar]
- Duffee, G. (2013). Forecasting interest rates. In G. Elliott, & A. Timmermann (Eds.), Handbook of economic forecasting (Vol. 2, pp. 385–426). North Holland. [Google Scholar]
- Favero, C. A., Niu, L., & Sala, L. (2012). Term structure forecasting: No-arbitrage restrictions versus large information set. Journal of Forecasting, 31(2), 124–156. [Google Scholar] [CrossRef]
- Fernandes, M., & Vieira, F. (2019). A dynamic Nelson–Siegel model with forward-looking macroeconomic factors for the yield curve in the US. Journal of Economic Dynamics and Control, 106, 103720. [Google Scholar] [CrossRef]
- Frankel, J. A. (2008). The effect of monetary policy on real commodity prices. In Asset prices and monetary policy (pp. 291–333). University of Chicago Press. [Google Scholar]
- Guidolin, M., McMillan, D. G., & Wohar, M. E. (2013). Time varying stock return predictability: Evidence from US sectors. Finance Research Letters, 10(1), 34–40. [Google Scholar] [CrossRef]
- Guidolin, M., & Pedio, M. (2019). Forecasting and trading monetary policy effects on the riskless yield curve with regime switching Nelson–Siegel models. Journal of Economic Dynamics and Control, 107, 103723. [Google Scholar]
- Hansen, P. R., Lunde, A., & Nason, J. M. (2011). The model confidence set. Econometrica, 79(2), 453–497. [Google Scholar] [CrossRef]
- Hillebrand, E., Huang, H., Lee, T.-H., & Li, C. (2018). Using the entire yield curve in forecasting output and inflation. Econometrics, 6(3), 40. [Google Scholar] [CrossRef]
- Holmström, B., & Tirole, J. (2001). LAPM: A liquidity-based asset pricing model. The Journal of Finance, 56(5), 1837–1867. [Google Scholar] [CrossRef]
- Kim, J. H., & Choi, I. (2017). Unit roots in economic and financial time series: A re-evaluation at the decision-based significance levels. Econometrics, 5(3), 41. [Google Scholar] [CrossRef]
- Nelson, C. R., & Siegel, A. F. (1987). Parsimonious modeling of yield curves. Journal of Business, 473–489. [Google Scholar] [CrossRef]
- Nymand-Andersen, P. (2018). Yield curve modelling and a conceptual framework for estimating yield curves: Evidence from the European Central Bank’s yield curves. In ECB statistics paper. European Central Bank. [Google Scholar]
- Pesando, J. E. (1979). On the random walk characteristics of short-and long-term interest rates in an efficient market. Journal of Money, Credit and Banking, 11(4), 457–466. [Google Scholar] [CrossRef]
- Potì, V. (2018). A new tight and general bound on return predictability. Economics Letters, 162, 140–145. [Google Scholar]
- Rapach, D., Strauss, J., & Zhou, G. (2010). Out-of-sample equity premium prediction: Consistently beating the historical average. Review of Financial Studies, 23(2), 821–862. [Google Scholar] [CrossRef]
- Rapach, D., & Zhou, G. (2013). Forecasting stock returns. In Handbook of economic forecasting (Vol. 2, pp. 328–383). Elsevier Science & Technology. [Google Scholar]
- Steeley, J. M. (2014). Forecasting the term structure when short-term rates are near zero. Journal of Forecasting, 33(5), 350–363. [Google Scholar] [CrossRef]
- Toczydlowska, D., & Peters, G. W. (2018). Financial big data solutions for state space panel regression in interest rate dynamics. Econometrics, 6(3), 34. [Google Scholar] [CrossRef]
- Welch, I., & Goyal, A. (2008). A comprehensive look at the empirical performance of equity premium prediction. The Review of Financial Studies, 21(4), 1455–1508. [Google Scholar] [CrossRef]
- Xiang, J., & Zhu, X. (2013). A regime-switching Nelson–Siegel term structure model and interest rate forecasts. Journal of Financial Econometrics, 11(3), 522–555. [Google Scholar] [CrossRef]
- Yu, W.-C., & Zivot, E. (2011). Forecasting the term structures of Treasury and corporate yields using dynamic Nelson-Siegel models. International Journal of Forecasting, 27(2), 579–591. [Google Scholar] [CrossRef]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2025 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).


