3.1. The Data Set
We applied our local-linear smoother to annual US stock-market data. This dataset was provided by Robert Shiller and is made available from
http://www.econ.yale.edu/~shiller/data.htm. It includes, among other variables, long-term changes of the Standard and Poor’s (S&P) Composite Stock Price Index, consumer price index changes, and interest rate data from 1872 to 2019. This is an updated and revised version of (Shiller [
21], Chapter 26), which provides a detailed description of the data.
Note that the extension of the risk-free rate series (based on the six-month commercial paper rate until 1997 and, afterward, on the six-month certificate of deposit rate, secondary market) was not possible as it was discontinued in 2013. Here, we followed the strategy of Welch and Goyal [
22] and Mammen et al. [
23] and replace this variable by an annual yield based on the six-month Treasury-bill rate, secondary market, from
https://fred.stlouisfed.org/series/TB6MS. This new series was only available from 1958 to 2019. In the absence of information prior to 1958, we had to estimate it. To this end, we regressed the Treasury-bill rate on the risk-free rate from Shiller’s data for the overlapping period 1958 to 2013. Assuming a linear relationship and using an ordinary least squares regression, we obtained the estimated equation:
with an
of 98.6%. Therefore, we instrumented the risk-free rate from 1872 to 1957 with the predicted regression equation. The correlation between the actual Treasury-bill rate and the predictions for the estimation period was 99.3%.
3.2. Descriptive Analysis
There is much research on the predictability of returns and a lot is known about the characteristics of short-horizon stock returns. For example, stylized facts about daily and monthly returns include excess kurtosis, distributions which are not normal, and volatility clustering [
24]. However, less is known about distributions of long-horizon returns. However, such characteristics are of central interest to investors saving for distant pay-offs.
Figure 2 shows the time series of the one-year returns (left) and five-year returns (right), both in excess of the risk-free benchmark
R, which are displayed on the same scale for the sake of comparison. The five-year series exhibits larger positive returns, which is not surprising as a longer period under risk should be paid-off with a higher risk premium. The autoregressive structure of the five-year returns can be easily seen in comparison to the assumed weak dependence of one-year returns.
Figure 3 shows histograms of the one-year returns (left) and five-year returns (right) together with a kernel density estimate (red) and a fitted normal distribution (green). One notes again that the distribution of the five-year returns is shifted to the right but has a higher volatility. A Jarque–Bera test of the hypothesis of normality does reject for one-year returns (
p-value
) but does not reject for five-year returns (
p-value
). Furthermore, the density estimate for the five-year returns indicates more a mixture of normals than a single normal distribution which represents some evidence of a possible structural break in the data-generating process.
Including structural changes in the modelling process could be beneficial, as shown in the literature especially for higher-frequency returns [
25,
26]. However, this comes with additional effort which is beyond the scope of our article and is left for future research. Several important points would have to be taken into consideration, for example: (i) it is not clear for which point in time one should incorporate a structural break in our annual data (the Great Recession, the Second World War, the Global Financial Crisis, the Bretton Woods agreement, etc.); (ii) a simple sample split would result in even smaller and not balanced data sets. From a statistical perspective, as we apply a fully nonparametric method, this would lead to mostly one-dimensional and potentially linear models. This way, we would lose the analysis of higher-dimensional models and nonlinear relationships between excess stock returns and their predictor variables.
This section is concluded with
Table 1 which displays standard descriptive statistics for one-year and five-year returns as well as the available covariates. Both the one-year and five-year excess returns had a negative skewness, that is, the left tail of the distribution (large negative returns) was longer or fatter than the right tail (large positive returns). Note that this is more pronounced in the case of one-year rather than five-year returns. While one-year returns were leptokurtic (positive excess kurtosis of
), five-year returns exhibited a small negative excess kurtosis of
.
Similar plots to those in
Figure 2 and
Figure 3 and information as in
Table 1 for the other benchmarks are available upon request by the authors. In the next sections, we analyzed the predictability of one-year and five-year stock returns in excess of the different benchmarks.
3.3. The Single Benchmarking Approach
In this section, we considered a single benchmarking approach as in Kyriakou et al. [
6] where only the variable
was adjusted according to some benchmark
, as shown in (
1), while the independent variable(s) is (are) measured on the original nominal scale. The models (
3) and (9) are estimated with a local-linear kernel smoother using the quartic kernel and the optimal bandwidth is chosen by cross-validation, that is, by maximizing the
given by (
15) and (
16). Moreover, it should be kept in mind that the nonparametric method can estimate linear functions without any bias, given that we apply a local-linear smoother. Thus, the linear model is automatically embedded in our approach. We study the empirical findings of
values based on different validated scenarios shown for the one-year horizon in
Table 2 and the five-year horizon in
Table 3. Note that the one-year predictions may differ from those originally reported in Kyriakou et al. [
6] due the updated data set and the replacement of the commercial paper rate by the Treasury-bill rate; nevertheless, the models remain similarly ranked.
We found that in the case of the five-year returns, which was the focal point of this paper, the term spread s was, overall, the most powerful predictive variable for excess stock returns; this superior performance was also observed in the one-year case. More in detail, with the prediction constrained to using only one-dimensional covariates, the term spread is the best predictor for the one-year and five-year horizon under the short interest benchmark with, respectively, and , but this also does quite well in the one-year case under the long rate and earnings-by-price benchmarks, and , with (for the best is with ). In the five-year case under and , the term spread s yields a high whereas under the dividend-by-price ratio d gains ground with .
In light of these remarks, we therefore, focused the spotlight on the relationship between the spread and the excess stock returns. We present in the top panel of
Figure 4 the estimated functions
(red solid line) for the one-year horizon (left) and the five-year horizon (right) under the single risk-free benchmark together with a corresponding linear model (dash-dotted green line), and a 45-degree line (dashed black line).
Figure 4 shows thereby the three single covariates with the largest
(
and
): the term spread (9.7% and 15.5%), the short-term interest rate (3.0% and 7.8%), and the long-term interest rate (0.0% and 1.4%). Our findings for both horizons conformed to the fact that an increase in the spread corresponds to an increase in the excess stock return. While a positive spread corresponds to a positive return for the one-year case, a spread larger than
gave, on average, a positive five-year return. This finding is in line with, for example, Resnick and Shoesmith [
27] who find that the value of the yield spread holds important information about the probability of a bear stock market. Regarding our validation procedure,
Figure 4 also confirms that our approach of correcting for autocorrelation in the five-year prediction problem was successful. The estimated functions are quite smooth indicating that the chosen bandwidth is not too small and that the resulting fit and validated
are reasonable.
Back to our discussion of the results in
Table 2 and
Table 3, in broad terms, five-year predictability improved over one-year: 67 out of 112 models achieve a larger
, and we observed 64 (five-year) versus 52 (one-year) models with nonnegative
, that is, our proposed predictor-based regression model for the longer forecast horizon in this application surpassed the historical average excess return in the majority of cases. In addition, combining the term spread
s with the dividend-by-price
d results in uplifted predictability to
; this combination is, in fact, the best-performing one for 3 out of 4 benchmarks (
). In particular, imposing an additional covariate to
s results in one-year
in the range 6–
under
; under other benchmarks, such as
, the one-year
is in the range 3–
(approx.); changing to the
benchmark results in
in the range
–
. Interestingly, for a five-year horizon, we observed a substantially improved predictive power with our cross-validated
ranking some two-dimensional better than one-dimensional models, in fact, more than for a one-year horizon: in particular, as possibly anticipated by the aforementioned performances of
d and
s, the two-dimensional covariate
boosts
to
under
, performs best with
under
, and comes second with
under
being beaten by
with
.
In the one-year case, quite remarkable is the predictor , either in itself or combined with covariates , under the inflation benchmark leading to in the range –. In addition, when put together with the term spread, the resulting combination under is the clear winner reaching up to . This is probably good news in an actuarial context where the inflation benchmark can be seen as an important one in pension product applications. In the five-year case, still does quite well in the range – (with an exception of for ) and remains generally the best predictor under , nevertheless, it is no longer the globally best one.
3.4. The Full Benchmarking Approach
The next step now is to analyze whether transforming the explanatory variables can improve predictions. Recall that fully nonparametric models suffer in general by the curse of dimensionality, as in our framework where we confront sparsely distributed annual observations in higher dimensions. Importing more structure in the estimation process can help reduce or circumvent such problems.
Here, we extend the study in
Section 3.3 using economic structure in the sense that we consider adjusting both the independent and dependent variables according to the same benchmark. To this end, in our full (double) benchmarking approach, the prediction problems are reformulated as
where we use transformed predictive variables
This approach can be interpreted as a way of reducing dimensionality of the estimation procedure as
encompasses an additional predictive variable.
Results of this empirical study are presented for the one-year horizon in
Table 4 and for the five-year horizon in
Table 5. In addition,
Figure 5 presents the three single covariates with the largest
(
and
) for the double inflation benchmark case: the earnings-by-price ratio (12.2% and 12.4%), the dividend-by-price ratio (10.4% and 10.9%), and the long-term interest rate (10.5% and 8.7%).
We find that, in the majority of cases, the full outruns the single benchmarking approach, even more when we consider a longer horizon, and the number of models with nonnegative (that is, cases of beating the historical average excess return) increases: 68 out of 82 models (full benchmarking, five-year); 55 out of 82 (full benchmarking, one-year); 64 out of 112 (single benchmarking, five-year); and 52 out of 112 (single benchmarking, one-year).
The pair in the full benchmarking approach for a five-year horizon yields against in the single benchmarking under , whereas in the full benchmarking approach for a one-year horizon yields against using the predictor in the single benchmarking under . It, therefore, seems that s is an important predictor, whose power is mostly unveiled when combining with another predictor depending on the benchmark choice and the forecast horizon. In addition, although under and , full benchmarking does not improve predictability, it does under and, especially, which is important if we aim to identify a likely common well-performing benchmark and predictor, that is, , independently of the horizon length. For full benchmarking, lies in the range –% (five-year) and –% (one-year), which are both an improvement from single benchmarking yielding in –% (five-year) and –% (one-year), that is, a maximum width reduction by a factor of almost 3 for the five-year horizon.
Overall, we conclude that the term spread is a good predictor; if we aim to homogenize our choice of predictor and benchmark with respect to the horizon length, then the earnings-by-price and the term spread under the inflation benchmark would be an ideal compromise, even if not the winning one. This is welcoming, as, for example, in pension research or other long-term saving strategies, it is sensible to look at real value and employ such a model with all returns and covariates net-of-inflation.
3.5. Real-Income Long-Term Pension Prediction
In long-term pension planning, real-income protection is often an important aspect [
28,
29,
30]. When optimizing investment asset allocation for the long term, one therefore needs a good econometric model in real terms. Based on the research in this paper and in Kyriakou et al. [
6], we are able to conclude that, in the natural double benchmark setting for real-income econometrics, it looks like earnings divided by price is the natural covariate to consider. In
Table 2 of Kyriakou et al. [
6] and in
Table 4 of this paper, it is concluded that earnings divided by price is the best single covariate to use in the double inflation benchmark case and, in this paper’s
Table 5, this is also the case in the five-year view. On balance, we therefore conclude that the intuitively appealing earnings divided by price is a good long-term predictor for real-income forecasting. In the one-year view of Kyriakou et al. [
6], the nonparametric smoother estimated for the relationship between earnings divided by price and return in the inflation double benchmarking case has the exact functional form of a simple line. So, even though we consider a nonparametric estimator that can pick any functional form, the resulting functional form is a simple line. This provides us with a strong argument for using the simple line in this case. The functional form of a line has been picked via a validation procedure against all functional forms. The linear expression is
Notice that a very good long-term predictor of real income can, therefore, be expressed as a simple linear relationship, where the expected return adds first 12% to the earnings divided by price and then another 0.5%. This is a very simple relationship that is easy for long-term investors to remember. Similarly to the one-year view, our validation procedure exactly picks a line against all other functional alternatives in the five-year view. The linear form for the five-year view is
The top panel of
Figure 5 shows the estimated nonparametric function
(red solid line) for the one-year horizon (left) and five-year horizon (right) under the double inflation benchmark for the earnings-by-price covariate together with the corresponding linear model (dash-dotted green line), and a 45-degree line (dashed black line). Note that the linear relationship discovered for the earnings-by-price predictor must not hold true for other covariates or their combinations. In those cases, the full benefit of our approach comes into its own. For example, the bottom panel of
Figure 5 clearly shows nonlinearities for the five-year case when the long-term interest rate is considered. For a suitable statistical smoothing-based test (nonparametric versus linear model), see, for example, the test based on wild bootstrap proposed in Scholz et al. [
8] or the discussion in the survey of González-Manteiga and Crujeiras [
31].
3.6. One-Year ahead Real-Time Predictions
The four benchmarks proposed in this paper are useful in different situations. While in
Section 3.5 we focused on real-income long-term predictions based on models using the inflation-double benchmark, we now want to explore the development of ‘pure’ stock returns
S (without a benchmark) in the near future. Kyriakou et al. [
6] found the model using the earnings benchmark with the term spread as the covariate, that is,
, to perform best in terms of
when the predicted values are back-transformed and validated on stock returns
S. We use this simple model to illustrate the usefulness of the earnings benchmark. For this purpose, we estimate this model over the full sample as before and predict the stock returns in excess of earnings using the current spread (in the period September 2018–March 2020). Finally, we back-transform those predictions to get a one-year ahead forecast for nominal stock returns
from
As a by-product, we also calculate a prediction for the real-stock return,
, and the risk premium,
, of holding stocks versus a risk-free asset.
Table 6 shows the results of this forecasting exercise. The considered forecasting period is of interest for two specific features: (i) the term spread is U-shaped, that is, it reduces, gets negative (an inverted yield curve in August and September 2019), and finally increases again; and (ii) the external shock to the market caused by the Covid-19 pandemic leads to large negative returns starting in March 2020.
We find (i) that for the (slightly) negative spread the predicted stock return in excess of earnings is also negative. Nevertheless, the predicted nominal stock return is positive (around 4.1% and mainly driven by the earnings of around 4.5%) as well as the real return (around 2.4%) and the risk-premium (around 2.3%). Note (ii) that the one-year ahead predictions in March 2020 are not frightening even though we are at the beginning of a worldwide economic crisis and recession. They seem to reflect optimistic market expectations. Of course, we cannot incorporate external shocks in our model but the key variables show comforting figures as both, the term spread and the earnings-by-price, are at their second-highest values in the last 20 months. Thus, our model predicts that compared to the month before the crises started nominal one-year stock returns increase by 2.3%, real returns increase by 3.1%, and the risk-premium increases by 3.4%. Low inflation, low short-term interest rates (the latter being almost zero), and increasing prices will bring back the market to past performance such that this prediction was in line with what happened.