Short-Term Exuberance and Long-Term Stability: A Simultaneous Optimization of Stock Return Predictions for Short and Long Horizons

Kyriakou, Ioannis; Mousavi, Parastoo; Nielsen, Jens Perch; Scholz, Michael

doi:10.3390/math9060620


¹ Faculty of Actuarial Science and Insurance, Cass Business School, University of London, 106 Bunhill Row, London EC1Y 8TZ, UK

² Department of Economics, University of Graz, Universitätsstraße 15/F4, 8010 Graz, Austria

* Author to whom correspondence should be addressed.

Mathematics 2021, 9(6), 620; https://doi.org/10.3390/math9060620

Submission received: 2 March 2021 / Revised: 10 March 2021 / Accepted: 11 March 2021 / Published: 15 March 2021

(This article belongs to the Special Issue Advances in Multivariate Analysis and Their Applications in Actuarial and Financial Economics)


Abstract

The fundamental interest of investors in econometric modeling for excess stock returns usually focuses either on short- or long-term predictions to individually reduce the investment risk. In this paper, we present a new and simple model that contemporaneously accounts for short- and long-term predictions. By combining the different horizons, we exploit the lower long-term variance to further reduce the short-term variance, which is susceptible to speculative exuberance. As a consequence, the long-term pension-saver avoids an over-conservative portfolio with implied potential upside reductions given their optimal risk appetite. Different combinations of short and long horizons as well as definitions of excess returns, for example, concerning the traditional short-term interest rate but also the inflation, are easily accommodated in our model.

Keywords: finance; investment analysis; stock returns; cross-validation; variation reduction

JEL Classification: C14; C53; C58; G17; G22

1. Introduction

Considerable practical and theoretical effort is being channelled into understanding the movements of the stock market. This is natural as this is, perhaps, the most significant driver of returns providing long-term savers with sufficient wealth at retirement. Long-term predictability strongly impacts investors’ welfare, as pointed out, for example, by Lioui and Poncet [1]. Recent years have witnessed the emergence of research on pension products, marking the need to provide an econometric model for planning long-term savings [2,3,4]. Such a model should be able to forecast the future, serving, for example, institutional investors like pension funds that react dynamically to market information. An accurate econometric approach to long-term savings needs to consider the long and short terms concurrently, aiming to align the long-term projections while circumventing inaccurate trading due to short-term bubbles in the market. Notably, long-horizon predictability has also been studied in other contexts, e.g., by Carmona et al. [5], who analyzed return predictability effects on the fair value of long-term executive stock options, and Bodnar et al. [6], who studied the multi-period (long-run) portfolio choice problem under return predictability.

Our paper provides a general strategy to support such a novel econometric model. We consider the standard case of returns in excess of the short-term interest rate and the perhaps more relevant case of returns in excess of inflation (i.e., real returns), as suggested by Merton [2]. In our empirical application, we consider a short-term period of one year and a long-term period of five years. Owing to its generality, our approach lends itself to any benchmark, not just the short-term interest rate or inflation, and accommodates any definition of the short and long term. The baseline provides a model for the earnings-by-price ratio, an intuitive and attractive quantity that can be compared with interest rates and other returns and is one of the most important drivers of return predictability [7]. A final correction is applied to ensure that the model is capable of capturing the returns trend.

An accurate model does not only provide a better understanding of the expected return, but also reduced variation. Our contribution is twofold: First, the application of predictive regressions for two different horizons individually reduces the noise in short- and long-term investments. Second, by combining predictions of different horizons, we further reduce the noise for short-term investments. This confirms the view of long-horizon predictability defenders, put forward by Lioui and Poncet [1], that the use of long-term returns reduces the noise in asset returns. The reason is that even one-year returns, for example, still include a large amount of speculative variation, which is clearly reduced in longer-horizon investments. However, our simple model is able to optimize the one-year investments according to the bubble-free long-term variance and reduce variation for the short-term predictions after year one. We focus on two findings that are particularly interesting and intuitively appealing. With the aid of our optimal predictive model, we perceptibly reduce the standard deviation of the one-year returns by 10%, from about 18% to 16%. The prediction with incorporated long-term modeling also has a standard deviation beyond the short term of only around 14%. Therefore, a long-term investor who optimizes pensions or other long-term savings should rely on this value based on the available information, rather than the typically used standard deviation of 16% or even 18%. In general, a larger, out-of-line standard deviation would lead to an over-conservative portfolio with implied potential upside reductions for the long-term saver given their optimal risk appetite.

The remainder of the paper is structured as follows: In Section 2, we gradually build our proposed framework, from short-term and long-term nonlinear predictive modeling to their merger into a single model that aims to reduce the investment risk. Section 3 focuses on our empirical application to one- and five-year excess stock returns based on historical U.S. market data. Section 4 concludes the paper.

2. Materials and Methods

Linear regression models are popular in predictive modeling as these classical benchmarks are easy to estimate and interpret. However, the fixed functional form of the relationship between stock returns and predictive variables leads to inferior predictive power compared with nonlinear approaches [8,9,10,11,12]. Therefore, we focus on potentially nonlinear predictive relationships between returns over the next T years in excess of a reference rate (or benchmark) and a set of economic predictors relevant for the long-term investor using a fully nonparametric smoother. We analyze the two most important benchmark models of Kyriakou et al. [7,13]: the short-term interest rate and the inflation rate. Note that the former directly corresponds to the prediction of the risk premium (over a risk-free investment), whereas the latter refers to the forecast of real returns. We aim, first, to investigate their predictability over horizons of one year and five years separately and then provide an intuitive single econometric model that combines both predictive horizons.

2.1. One-Year Predictions

We start with annual nominal stock returns defined by $S_{t} := (P_{t} + D_{t}) / P_{t-1}$, where $P_{t}$ is the stock price at the end of year $t$ and $D_{t}$ denotes the dividends paid during year $t$. We focus on returns in excess (log-scale) of a given benchmark $B_{t-1}^{(A)}$ with $A \in \{R, C\}$:

$$Y_{t}^{(A)} = \ln \frac{S_{t}}{B_{t-1}^{(A)}}, \tag{1}$$

where $B_{t}^{(R)} := 1 + R_{t}/100$ and $B_{t}^{(C)} := 1 + \pi_{t}$, with $R_{t}$ denoting the short-term interest rate and $\pi_{t} := (\mathit{CPI}_{t} - \mathit{CPI}_{t-1}) / \mathit{CPI}_{t-1}$ the inflation rate for the consumer price index $\mathit{CPI}_{t}$ for year $t$.
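Equation (1) can be illustrated with a few lines of Python. This is a minimal sketch, not the authors' code: the function names `rate_benchmark` and `excess_log_returns` and the array layout (one entry per year, in chronological order) are assumptions made for the example.

```python
import numpy as np

def rate_benchmark(short_rate_pct):
    """B_t^(R) = 1 + R_t / 100, with R_t quoted in percent."""
    return 1.0 + np.asarray(short_rate_pct, dtype=float) / 100.0

def excess_log_returns(prices, dividends, benchmark):
    """Y_t = ln(S_t / B_{t-1}) with S_t = (P_t + D_t) / P_{t-1}, as in Equation (1).

    All inputs hold one value per year; the result has one entry fewer
    because the first year has no predecessor.
    """
    p = np.asarray(prices, dtype=float)
    d = np.asarray(dividends, dtype=float)
    b = np.asarray(benchmark, dtype=float)
    gross = (p[1:] + d[1:]) / p[:-1]   # S_t
    return np.log(gross / b[:-1])      # divide by the lagged benchmark B_{t-1}
```

For the inflation benchmark, the same function would be called with an array holding $1 + \pi_{t}$ in place of `rate_benchmark(...)`.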

Our predictive nonparametric regression model for the one-year ($1y$) excess returns defined in Equation (1) is now given by

$$Y_{t}^{(A)} = m_{1y}(X_{t-1}^{(A)}) + \xi_{t}. \tag{2}$$

Note that the conditional mean in Equation (2),

$$m_{1y}(x^{(A)}) := E(Y^{(A)} \mid X^{(A)} = x^{(A)}), \quad x^{(A)} \in \mathbb{R}^{q}, \tag{3}$$

is unknown and its functional form is not predetermined, for example, to be linear, but can take any shape. Our preferred nonparametric method to estimate this function $m_{1y}$ is the local-linear smoother because of its flexibility and well-known statistical properties. For example, the linear function can be estimated without any bias and is thus automatically embedded in our analysis; that is, if the data-generating process is linear, we expose this simple functional form. Note further that the error terms $\xi_{t}$ in Equation (2) form a martingale difference process, i.e., $\xi_{t}$ are serially uncorrelated random variables with zero mean, given the past, and the unknown conditionally heteroscedastic variance of the form $\sigma_{1y}^{2}(x^{(A)})$. The elements of the $q$-dimensional vector $X_{t-1}^{(A)}$ in Equation (2), which collects the explanatory variables, are also transformed under the chosen benchmark $A$ according to

$$X_{t-1}^{(A)} := \begin{cases} \dfrac{1 + X_{t-1}}{B_{t-1}^{(A)}}, & X_{t-1} \in \{d_{t-1}, e_{t-1}, r_{t-1}, l_{t-1}, \pi_{t-1}\}, \\[1ex] \dfrac{s_{t-1}}{B_{t-1}^{(A)}} = \dfrac{l_{t-1} - r_{t-1}}{B_{t-1}^{(A)}}. \end{cases} \tag{4}$$

Therefore, $X_{t-1}^{(A)}$ contains (combinations of transformed) popular time-lagged predictive variables based on the: (i) dividend-by-price ratio $d_{t-1} = D_{t-1} / P_{t-1}$; (ii) earnings-by-price ratio $e_{t-1} = E_{t-1} / P_{t-1}$, where $E_{t}$ denotes the earnings accruing to the index in year $t$; (iii) short-term interest rate $r_{t-1} = R_{t-1} / 100$; (iv) long-term interest rate $l_{t-1} = L_{t-1} / 100$; (v) inflation rate $\pi_{t-1}$; and (vi) term spread $s_{t-1} = l_{t-1} - r_{t-1}$. The use of such a transformation is one example of the careful imposition of an additional structure in the statistical modeling process, which has shown promising results in previous works [10,11,14]. We call this adjustment of both the independent and dependent variables according to the same benchmark double (or full) benchmarking.
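The transformation in Equation (4) can be sketched as follows. The helper names `benchmark_level` and `benchmark_spread` and the numerical rates are made up for illustration; only the two formulas are taken from the text.

```python
import numpy as np

def benchmark_level(x, benchmark):
    """First case of Equation (4): (1 + X_{t-1}) / B_{t-1} for d, e, r, l, or pi."""
    return (1.0 + np.asarray(x, dtype=float)) / np.asarray(benchmark, dtype=float)

def benchmark_spread(long_rate, short_rate, benchmark):
    """Second case of Equation (4): (l_{t-1} - r_{t-1}) / B_{t-1}."""
    l = np.asarray(long_rate, dtype=float)
    r = np.asarray(short_rate, dtype=float)
    return (l - r) / np.asarray(benchmark, dtype=float)

# Illustrative (made-up) rates in decimal form.
r = np.array([0.03, 0.01])
l = np.array([0.05, 0.04])
B_R = 1.0 + r                        # short-term interest rate benchmark

l_R = benchmark_level(l, B_R)        # (1 + l) / (1 + r)
s_R = benchmark_spread(l, r, B_R)    # (l - r) / (1 + r)
```

Under the short-term interest rate benchmark, `s_R` and `l_R` differ only by the constant one, so a smoother that is invariant to shifts of the regressor treats them identically.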

2.2. Longer-Horizon Predictions

A main contribution of our work is the combination of short- and long-term predictions into one single model. Hence, we introduce, in addition to the short one-year predictions, our version of long-horizon predictions. We highlight three important points that fundamentally distinguish the two cases: first, the autoregressive behavior of the underlying predictive variable in Equation (6), which is also used as the building block of our econometric model in Section 2.4; second, the more complicated error structure (serial correlation by construction) in the predictive relationship (8); and, third, closely related to the last point, a more complicated smoothing parameter selection for the correct estimation of $m_{Ty}$ in Equation (9).

For longer horizons $T$, with $T > 1$, we consider the sum of annual continuously compounded returns defined in Equation (1), that is,

$$Z_{t}^{(A)} := \sum_{i=0}^{T-1} Y_{t+i}^{(A)}. \tag{5}$$
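As a small illustration of Equation (5), the overlapping $T$-year sums can be computed as a moving sum of the one-year excess returns. The function name `horizon_sums` is an assumption for this sketch.

```python
import numpy as np

def horizon_sums(y, T):
    """Z_t = Y_t + Y_{t+1} + ... + Y_{t+T-1} for every t with a full window."""
    y = np.asarray(y, dtype=float)
    if T < 1 or T > len(y):
        raise ValueError("T must lie between 1 and the number of observations")
    # A moving sum with a window of T ones; "valid" keeps complete windows only.
    return np.convolve(y, np.ones(T), mode="valid")
```

Neighbouring values of the result share $T - 1$ of their $T$ summands, which is the overlap that induces serial correlation in the long-horizon errors.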

Here, careful econometric modeling is necessary because of the overlapping nature of the returns $Z_{t}^{(A)}$ (refer also to Appendix A). For ease of illustration, assume a linear model for $Y_{t}^{(A)}$ in Equation (2) as well as linear autoregressive behavior of order one for the forecasting variable $X_{t-1}^{(A)}$:

$$Y_{t}^{(A)} = \beta_{0} + \beta_{1} X_{t-1}^{(A)} + \xi_{t} \quad \text{and} \quad X_{t}^{(A)} = \gamma_{0} + \gamma_{1} X_{t-1}^{(A)} + \eta_{t}, \tag{6}$$

with $\xi_{t}$ as in Equation (2), $\eta_{t}$ being white noise, and regression parameters $\beta_{0}$, $\beta_{1}$, $\gamma_{0}$, and $\gamma_{1}$. A simple linear model for the $T$-year ($Ty$) regression problem that directly follows from Equations (5) and (6) is then

$$Z_{t}^{(A)} = \phi_{0} + \phi_{1} X_{t-1}^{(A)} + \nu_{t}, \tag{7}$$

with parameters $\phi_{0}$ and $\phi_{1}$, and error terms $\nu_{t}$ (more details are deferred to Appendix A). Equation (7) shows that the excess stock return over the next $T$ years, starting in year $t$, can be decomposed into two parts: a predictive linear part that depends only on the variable $X_{t-1}^{(A)}$, the same predictive variable as in the one-year case, and unpredictable error terms $\nu_{t}$, which are now serially correlated by construction.
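The step from Equations (5) and (6) to Equation (7) can be sketched by iterating the autoregression. The closed forms below are worked out here from (5) and (6) under the stated linear assumptions; they are an illustration and are not quoted from Appendix A, which remains the authoritative derivation.

```latex
% Iterating the AR(1) part of (6) for i >= 1 (superscript (A) dropped):
X_{t+i-1} = \gamma_0 \sum_{j=0}^{i-1} \gamma_1^{j} + \gamma_1^{i} X_{t-1}
            + \sum_{j=0}^{i-1} \gamma_1^{j}\, \eta_{t+i-1-j}.

% Summing Y_{t+i} = \beta_0 + \beta_1 X_{t+i-1} + \xi_{t+i} over i = 0, ..., T-1 as in (5):
Z_t = \underbrace{T\beta_0 + \beta_1 \gamma_0 \sum_{i=1}^{T-1} \sum_{j=0}^{i-1} \gamma_1^{j}}_{\phi_0}
      + \underbrace{\beta_1 \sum_{i=0}^{T-1} \gamma_1^{i}}_{\phi_1}\, X_{t-1}
      + \underbrace{\sum_{i=0}^{T-1} \xi_{t+i}
      + \beta_1 \sum_{i=1}^{T-1} \sum_{j=0}^{i-1} \gamma_1^{j}\, \eta_{t+i-1-j}}_{\nu_t}.
```

Because $\nu_{t}$ and $\nu_{t+1}$ share the terms $\xi_{t+1}, \dots, \xi_{t+T-1}$, the errors of consecutive overlapping windows are correlated even when $\xi_{t}$ itself is serially uncorrelated.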

As the linear setup of Equation (6) could be misspecified and thus not account for important nonlinearities, we model the functional relationship between the predictive variable $X_{t-1}^{(A)}$ and the $T$-year excess stock returns $Z_{t}^{(A)}$ in a more flexible nonparametric way, analogous to Equation (2):

$$Z_{t}^{(A)} = m_{Ty}(X_{t-1}^{(A)}) + \nu_{t}, \tag{8}$$

where

$$m_{Ty}(x^{(A)}) := E(Z_{t}^{(A)} \mid X^{(A)} = x^{(A)}), \quad x^{(A)} \in \mathbb{R}^{q}, \tag{9}$$

is an unknown smooth function. Note again the important difference between the error terms of Model (2) and Model (8): while $\xi_{t}$ is a martingale difference process, $\nu_{t}$ is serially correlated by construction. This property has to be considered when estimating the unknown conditional mean function $m_{Ty}$; otherwise, fundamental problems occur: the estimators are still consistent but less efficient than those correcting for autocorrelation [15,16,17,18]; and, more importantly, the commonly applied automatic smoothing parameter selection procedures (such as cross-validation and plug-in) break down [19,20]. In the empirical part of our work, we overcome the aforementioned problems using a special leave-$l$-out cross-validation strategy, which is closely related to our method of measuring predictive power. Our approach to this issue is discussed in detail in the next section.

Before we proceed, we summarize what we have discussed so far: the nonparametric Models (2) and (8) for one-year and T-year returns, the autoregressive behavior of order one for the predictive variable in (6), and the necessity of a leave-l-out cross-validation in the estimation procedure.

2.3. Predictive Power, Variable Selection, and Smoothing Parameter Choice

For our nonparametric one- and $T$-year models defined earlier, we need an adequate measure that (a) quantifies and validates the predictive power, (b) allows for comparisons and ranking of models when different sets of explanatory variables are used (variable selection), and (c) best selects the bandwidth(s) and thus determines the functional form of the conditional mean for the given predictive variables (smoothing parameter choice). In our work, we apply the validated R-squared ($R_{V}^{2}$) of Nielsen and Sperlich [14], which conforms to these requirements. It directly aims to estimate the $k$-year ($ky$)-ahead prediction error based on a leave-$l$-out cross-validation (with $l := 2k - 1$) and can thus be used for both variable and smoothing parameter selection. In our notation, the validated $R^{2}$ is defined as

$$R_{V,ky}^{2} = 1 - \frac{\sum_{t} (W_{t} - \hat{m}_{-t,ky})^{2}}{\sum_{t} (W_{t} - \bar{W}_{-t})^{2}}, \tag{10}$$

where both estimators leave out $l$ observations around the $t$th point in time: $\hat{m}_{-t,ky}$ estimates the conditional mean function $m_{ky}$ from Equation (2) or (8) with $k \in \{1, T\}$, and $\bar{W}_{-t}$ estimates the unconditional (historical) mean of $W_{t}$, that is, the $k$-year return to predict (equal to $Y_{t}^{(A)}$ for $k = 1$ and $Z_{t}^{(A)}$ for $k = T$). To keep the notation simple, we drop an extra subscript for the bandwidth $h$ used in the calculation of $\hat{m}_{-t,ky}$, as we always choose $h$ in the numerator in Equation (10) so that the prediction error is minimized and thus the largest possible $R_{V}^{2}$ is achieved for the given predictive variables. Note that $R_{V}^{2}$ measures the predictive power of a given model against a benchmark (here, the cross-validated historical mean). For our setup, this means that when $R_{V}^{2}$ is positive, the predictor-based regression Model (2) or (8) outperforms the corresponding historical mean forecast.
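The validation criterion in Equation (10) can be sketched as a leave-$l$-out loop. This is an illustration under assumptions: the function names are hypothetical, a single regressor is used, and a plain OLS line stands in for the local-linear smoother so that the example stays short; any function with the same `fit_predict(x_train, y_train, x0)` signature could be passed instead.

```python
import numpy as np

def ols_predict(x_train, y_train, x0):
    """Stand-in predictor (an OLS line), not the paper's local-linear smoother."""
    slope, intercept = np.polyfit(x_train, y_train, 1)
    return intercept + slope * x0

def validated_r2(x, w, k=1, fit_predict=ols_predict):
    """Validated R^2 of Equation (10) with leave-l-out, l = 2k - 1.

    x[t] is the lagged predictor paired with the k-year return w[t].
    Near the ends of the sample fewer than l points exist to be dropped.
    """
    x = np.asarray(x, dtype=float)
    w = np.asarray(w, dtype=float)
    idx = np.arange(len(w))
    num = 0.0
    den = 0.0
    for t in idx:
        keep = np.abs(idx - t) > k - 1          # drop t and its k - 1 neighbours on each side
        num += (w[t] - fit_predict(x[keep], w[keep], x[t])) ** 2
        den += (w[t] - w[keep].mean()) ** 2     # cross-validated historical mean
    return 1.0 - num / den
```

A positive value means the predictor-based fit beats the cross-validated historical mean; a bandwidth search would repeat this calculation for each candidate bandwidth and keep the maximizer.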

In a time-series context, out-of-sample evaluations are often proposed where a fraction of the data from the end of the time-series is not used for estimation but is withheld for evaluation. In the case of uncorrelated errors, Bergmeir et al. [20] showed that cross-validation, as proposed in this section, is preferred to out-of-sample evaluation. Another advantage is that cross-validation involves various evaluations, whereas out-of-sample analysis can test the data only once. This property is especially beneficial when the number of recorded observations is small, as in our case with annual stock market data. When errors are correlated, as discussed in Section 2.2 for our $T$-year predictions, it may be necessary to omit more than a single point and apply leave-$l$-out cross-validation (with $l > 1$). This strategy avoids model fits that are progressively under-smoothed because of too-small bandwidths [21]. Alternative approaches, for example, involve using bimodal kernels [22] or the correlation-corrected cross-validation [19]. Note that in the case of a large fraction of skipped data, additional corrections might be required [23].

2.4. An Econometric Model for Combined Short- and Long-Term Predictions

In this section, we present a simple method of combining short- and long-term predictions. Our model builds on the autoregressive development of the earnings variable $e^{(A)}$ or, more precisely, on the change in earnings growth, which has been identified as one of the key drivers of stock prices $P$. Other important factors, such as the dividend yield $d^{(A)}$, can be easily incorporated in our model as well, for example, as covariates in the one- or five-year conditional mean regressions in (2) or (8), which will be used to calibrate our model. The important contribution of our approach is twofold. First, the application of predictive regressions for two different horizons individually reduces the noise or risk for short- and long-term investments. Second, the combination of predictions of different horizons further reduces the noise or risk for the short-term investment. The reason is that even one-year returns, for example, still include a large amount of speculative variation, which is clearly reduced in longer-horizon investments. When such $T$-year predictions are used in combination with the one-year ones, the latter benefit from the former as they are forced to sum up to the long-term predictions after our model is calibrated. In other words, our model provides one- and $T$-year predictions that are equal to the conditional mean forecasts based on regressions (2) and (8) (and thus with an interpolation argument for the horizons in between), and reduces the variation in the short-term predictions after year one.

We start with the linear formulations of the autoregressive behavior of order one of the predictive variable and the linear model version of one-year return predictions in Equation (6). Here, we consider the earnings variable $e^{(A)}$ to be this special predictor and estimate the linear models by ordinary least squares (OLS). In a first step, we obtain:

$$e_{t}^{(A)} - e_{t-1}^{(A)} = \rho (e_{t-1}^{(A)} - \bar{e}^{(A)}) + \eta_{t} \;\Leftrightarrow\; e_{t}^{(A)} = \gamma_{0} + \gamma_{1} e_{t-1}^{(A)} + \eta_{t} \tag{11}$$

with unknown parameters $\gamma_{0} := -\rho \bar{e}^{(A)}$ and $\gamma_{1} := \rho + 1$, sample average of earnings $\bar{e}^{(A)}$, and independent and identically distributed error terms $\eta_{t}$. The OLS estimates of $\gamma_{0}$ and $\gamma_{1}$ are denoted by $c_{0}$ and $c_{1}$, respectively. In a second step, we apply the linear version of Equation (2) for the earnings variable $e^{(A)}$:

$$Y_{t+1}^{(A)} = \beta_{0} + \beta_{1} e_{t}^{(A)} + \xi_{t+1} \tag{12}$$

with unknown parameters $\beta_{0}$ and $\beta_{1}$, which are again estimated by OLS; their estimates are denoted by $b_{0}$ and $b_{1}$, respectively. Recall that we have $n$ observations in our records. Thus, with Equations (11) and (12) and the corresponding OLS estimates, which we keep fixed in the following steps, we now forecast out-of-sample $\hat{Y}_{n+1}^{(A)}, \hat{Y}_{n+2}^{(A)}, \dots, \hat{Y}_{n+T}^{(A)}$.
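The two OLS steps and the out-of-sample recursion can be sketched as follows. The function names and the convention that `e[t]` and `y[t]` refer to the same year are assumptions of this example, not part of the paper.

```python
import numpy as np

def fit_ols(x, y):
    """Return (intercept, slope) of a simple OLS regression of y on x."""
    slope, intercept = np.polyfit(x, y, 1)
    return intercept, slope

def forecast_returns(e, y, T):
    """Uncorrected forecasts Y_hat_{n+1}, ..., Y_hat_{n+T} from Equations (11) and (12)."""
    e = np.asarray(e, dtype=float)
    y = np.asarray(y, dtype=float)
    c0, c1 = fit_ols(e[:-1], e[1:])   # Equation (11): e_t on e_{t-1}
    b0, b1 = fit_ols(e[:-1], y[1:])   # Equation (12): Y_{t+1} on e_t
    forecasts = []
    e_k = e[-1]                       # start from the last observed earnings value e_n
    for _ in range(T):
        forecasts.append(b0 + b1 * e_k)
        e_k = c0 + c1 * e_k           # roll earnings forward with estimates held fixed
    return np.array(forecasts), (c0, c1, b0, b1)
```

The first forecast uses the observed $e_{n}^{(A)}$ directly, and each later forecast uses an earnings value that has been iterated one more step through the fitted autoregression.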

Our aim is to construct an econometric model that reflects one-year and $T$-year predictions (from the preferred Models (2) and (8) at hand) simultaneously. For this reason, we correct $\hat{Y}_{n+1}^{(A)}, \hat{Y}_{n+2}^{(A)}, \dots, \hat{Y}_{n+T}^{(A)}$ in the following linear way:

$$\hat{Y}_{n+1}^{(A),c} = \alpha_{0} + \alpha_{1} \hat{Y}_{n+1}^{(A)} + \varepsilon_{n+1}, \tag{13}$$

$$\hat{Y}_{n+2}^{(A),c} = \alpha_{0} + \alpha_{1} \hat{Y}_{n+2}^{(A)} + \varepsilon_{n+2}, \tag{14}$$

$$\vdots$$

$$\hat{Y}_{n+T}^{(A),c} = \alpha_{0} + \alpha_{1} \hat{Y}_{n+T}^{(A)} + \varepsilon_{n+T}, \tag{15}$$

where $\alpha_{0}$ and $\alpha_{1}$ are unknown parameters, and $\varepsilon_{n+1} \sim N(0, \sigma_{1}^{2})$ and $\varepsilon_{n+2}, \dots, \varepsilon_{n+T} \sim N(0, \sigma_{2}^{2})$ are independent error terms with unknown variances $\sigma_{1}^{2}$ and $\sigma_{2}^{2}$. Note that we allow for a different variation in the first corrected one-year-ahead prediction $\hat{Y}_{n+1}^{(A),c}$ in Equation (13) compared with the second to $T$th corrected one-year-ahead predictions $\hat{Y}_{n+2}^{(A),c}, \dots, \hat{Y}_{n+T}^{(A),c}$ in Equations (14) to (15). This way, our model can account for the lower variation in longer-horizon returns relative to one-year returns. In other words, after calibrating the model, we expect $\sigma_{2}^{2}$ to be smaller than $\sigma_{1}^{2}$. Note further that from Equations (13)–(15), we directly obtain an expression for the corrected $T$-year return $Z_{n+T}^{(A),c}$:

$$Z_{n+T}^{(A),c} = \sum_{k=1}^{T} \hat{Y}_{n+k}^{(A),c} = \alpha_{0} T + \alpha_{1} \sum_{k=1}^{T} \hat{Y}_{n+k}^{(A)} + \sum_{k=1}^{T} \varepsilon_{n+k}. \tag{16}$$

Next, we adequately calibrate Equations (13)–(16), i.e., choose the model parameters $\alpha_{0}$, $\alpha_{1}$, $\sigma_{1}^{2}$, and $\sigma_{2}^{2}$, and, based on these, obtain the corrected one-year and $T$-year returns. Here, we use the recursive representation of the earnings $e^{(A)}$ from Equation (11) with the starting value $e_{n}^{(A)}$ (the last earnings observation in our records) together with the linear predictive model (12) and the corresponding OLS estimates $c_{0}$, $c_{1}$, $b_{0}$, $b_{1}$. Plugging in for $\hat{Y}_{n+1}^{(A)}, \dots, \hat{Y}_{n+T}^{(A)}$ in the corrected predictions and in $Z_{n+T}^{(A),c}$ gives:

$$\hat{Y}_{n+1}^{(A),c} = \alpha_{0} + \alpha_{1} \left(b_{0} + b_{1} e_{n}^{(A)}\right) + \varepsilon_{n+1}, \tag{17}$$

$$\hat{Y}_{n+2}^{(A),c} = \alpha_{0} + \alpha_{1} \left(b_{0} + b_{1} (c_{0} + c_{1} e_{n}^{(A)})\right) + \varepsilon_{n+2}, \tag{18}$$

$$\vdots$$

$$\hat{Y}_{n+T}^{(A),c} = \alpha_{0} + \alpha_{1} \left(b_{0} + c_{0} b_{1} \sum_{i=0}^{T-2} c_{1}^{i} + c_{1}^{T-1} b_{1} e_{n}^{(A)}\right) + \varepsilon_{n+T}, \tag{19}$$

and

$$Z_{n+T}^{(A),c} = \alpha_{0} T + \alpha_{1} b_{0} T + \alpha_{1} c_{0} b_{1} \sum_{k=2}^{T} \sum_{i=0}^{k-2} c_{1}^{i} + \alpha_{1} b_{1} e_{n}^{(A)} \sum_{i=0}^{T-1} c_{1}^{i} + \sum_{k=1}^{T} \varepsilon_{n+k}. \tag{20}$$

Now, we fix the first and second moments of $\hat{Y}_{n+1}^{(A),c}$ and $Z_{n+T}^{(A),c}$ with the estimated values from our preferred (best) one- and $T$-year predictive Models (2) and (8). By doing so, we obtain a linear equation system with four equations, which can be easily solved for the four unknown parameters $\alpha_{0}$, $\alpha_{1}$, $\sigma_{1}^{2}$, and $\sigma_{2}^{2}$. For this purpose, let $\hat{\mu}_{1y}$ and $\hat{\sigma}_{1y}^{2}$ be the conditional mean forecast and its estimated variance from Equation (2), respectively; and $\hat{\mu}_{Ty}$ and $\hat{\sigma}_{Ty}^{2}$ be the conditional mean forecast and its estimated variance from Equation (8), respectively. Note that $\hat{\sigma}_{1y}^{2}$ and $\hat{\sigma}_{Ty}^{2}$ can be readily calculated from the $R_{V}^{2}$ of the predictive regressions (2) and (8). A closer inspection of Equation (10) shows that the ratio in our validation criterion compares the sample variance of the estimated residuals from the preferred predictive model (the numerator) with the sample variance of the benchmarked returns (the denominator). Algebraically, we therefore have that $R_{V,1y}^{2} = 1 - \hat{\sigma}_{1y}^{2} / \sigma_{Y^{(A)}}^{2}$ and $R_{V,Ty}^{2} = 1 - \hat{\sigma}_{Ty}^{2} / \sigma_{Z^{(A)}}^{2}$, and

$$\hat{\sigma}_{1y}^{2} = (1 - R_{V,1y}^{2})\, \sigma_{Y^{(A)}}^{2}, \tag{21}$$

$$\hat{\sigma}_{Ty}^{2} = (1 - R_{V,Ty}^{2})\, \sigma_{Z^{(A)}}^{2}. \tag{22}$$
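Equations (21) and (22) amount to a one-line rescaling of the return variance. The sketch below uses a hypothetical function name and purely illustrative numbers that are not taken from the paper's tables.

```python
import math

def residual_variance(r2_validated, return_variance):
    """Equations (21)/(22): sigma_hat^2 = (1 - R_V^2) * sigma^2 of the benchmarked return."""
    return (1.0 - r2_validated) * return_variance

# Illustrative numbers only: a return standard deviation of 0.20 and R_V^2 = 0.19.
sigma_hat = math.sqrt(residual_variance(0.19, 0.20 ** 2))
```

With these made-up inputs the predictive standard deviation is 90% of the unconditional one, because the square root of 1 - 0.19 is 0.9.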

Given

$$E(\hat{Y}_{n+1}^{(A),c}) = \hat{\mu}_{1y}, \quad E(Z_{n+T}^{(A),c}) = \hat{\mu}_{Ty}, \quad \mathrm{Var}(\hat{Y}_{n+1}^{(A),c}) = \hat{\sigma}_{1y}^{2}, \quad \mathrm{Var}(Z_{n+T}^{(A),c}) = \hat{\sigma}_{Ty}^{2}, \tag{23}$$

the solution of the equation system (23) is

$$\alpha_{0} = \hat{\mu}_{1y} - \alpha_{1} \left(b_{0} + b_{1} e_{n}^{(A)}\right), \tag{24}$$

$$\alpha_{1} = \frac{\hat{\mu}_{Ty} - T \hat{\mu}_{1y}}{S - b_{0} T - b_{1} T e_{n}^{(A)}}, \tag{25}$$

where

$$S := b_{0} T + c_{0} b_{1} \sum_{k=2}^{T} \sum_{i=0}^{k-2} c_{1}^{i} + b_{1} e_{n}^{(A)} \sum_{i=0}^{T-1} c_{1}^{i},$$

and

$$\sigma_{1}^{2} = \hat{\sigma}_{1y}^{2}, \tag{26}$$

$$\sigma_{2}^{2} = \frac{1}{T-1} \left(\hat{\sigma}_{Ty}^{2} - \hat{\sigma}_{1y}^{2}\right). \tag{27}$$

Equation (27) follows because the error terms in Equation (16) are independent, so that $\mathrm{Var}(Z_{n+T}^{(A),c}) = \sigma_{1}^{2} + (T-1) \sigma_{2}^{2}$.

The a priori expectations about our model are the following: First, when the autoregressive behavior of the earnings in Model (11) and the linear model for stock returns (12) produce reasonable predictions $\hat{Y}_{n+1}^{(A)}, \hat{Y}_{n+2}^{(A)}, \dots, \hat{Y}_{n+T}^{(A)}$, only a marginal correction is necessary, i.e., $\alpha_{0}$ is close to zero and $\alpha_{1}$ close to one. Second, when $T \hat{\mu}_{1y} > \hat{\mu}_{Ty}$, one-year returns should diminish over time (as the expected sum of the corrected predictions $\hat{Y}_{n+1}^{(A),c}, \dots, \hat{Y}_{n+T}^{(A),c}$ still has to be equal to $\hat{\mu}_{Ty}$) and $\alpha_{1}$ becomes negative. Now $\alpha_{0}$ takes the role of an upper limit (larger than $\hat{\mu}_{1y}$), from which increasing values (over time) are subtracted to match the $T$-year prediction $\hat{\mu}_{Ty}$. Finally, note that by construction, $\sigma_{1}^{2} > \sigma_{2}^{2}$ if and only if $T \hat{\sigma}_{1y}^{2} > \hat{\sigma}_{Ty}^{2}$, that is, the cumulated risk over $T$ periods of short-term investments exceeds the risk of a $T$-year investment (as discussed earlier).
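The calibration in Equations (24)–(27) can be sketched as follows. The function name `calibrate` and its argument names are assumptions of this example; the sum $S$ is computed as the sum of the uncorrected forecasts, which is algebraically the same quantity as the closed form given for $S$.

```python
import numpy as np

def calibrate(mu_1y, mu_Ty, var_1y, var_Ty, b0, b1, c0, c1, e_n, T):
    """Solve the moment conditions (23) for alpha_0, alpha_1, sigma_1^2, sigma_2^2."""
    raw = []
    e_k = e_n
    for _ in range(T):
        raw.append(b0 + b1 * e_k)      # uncorrected forecast for the next year
        e_k = c0 + c1 * e_k            # iterate the fitted earnings autoregression
    raw = np.array(raw)
    S = raw.sum()                      # equals S as defined below Equation (25)
    denom = S - T * raw[0]             # S - b0*T - b1*T*e_n
    if np.isclose(denom, 0.0):
        raise ValueError("flat forecast path: alpha_1 is not identified")
    alpha1 = (mu_Ty - T * mu_1y) / denom          # Equation (25)
    alpha0 = mu_1y - alpha1 * raw[0]              # Equation (24)
    sigma1_sq = var_1y                            # Equation (26)
    sigma2_sq = (var_Ty - var_1y) / (T - 1)       # Equation (27); negative if var_Ty < var_1y
    corrected = alpha0 + alpha1 * raw             # means of the corrected predictions
    return alpha0, alpha1, sigma1_sq, sigma2_sq, corrected
```

By construction, the first corrected mean equals the one-year forecast and the corrected means sum to the $T$-year forecast, which is a quick sanity check for any implementation.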

2.5. Data Sources and Descriptive Statistics

Our empirical application is based on historical U.S. stock market data at annual frequency. The dataset includes, among other variables, the Standard and Poor’s (S&P) Composite Stock Price Index, dividends and earnings accruing to the index, as well as macroeconomic measures like the short-term interest rate, the long-term interest rate, and the consumer price index, covering the period from 1872 to 2020. Table 1 exhibits their basic descriptive statistics.

We here use an updated and revised version of Shiller’s ([24], Chapter 26) data, which are available from http://www.econ.yale.edu/~shiller/data.htm (accessed on 16 April 2020). Note that a simple extension of the risk-free rate series was not possible because the underlying 6-month certificate of deposit rate (secondary market) was discontinued in 2013. We thus followed the strategy of Welch and Goyal [25] and replaced this variable by an annual risk-free rate based on the 6-month treasury-bill rate (secondary market) from https://fred.stlouisfed.org/series/TB6MS (accessed on 16 April 2020). As this series is available only from 1958, we had to estimate the information prior to 1958 using results from an OLS regression of the treasury-bill rate on the risk-free rate from Shiller’s data for the overlapping period 1958 to 2013. With the estimated linear model ($R^{2}$ of 98.6%, estimated standard errors in brackets) of

$$\text{Treasury-bill rate} = \underset{(0.1009)}{0.0961} + \underset{(0.0146)}{0.8648} \times \text{commercial paper rate},$$

we finally instrumented the risk-free rate from 1872 to 1957. The high correlation of 99.3% between the actual treasury-bill rate and the predictions for the estimation period verified the usefulness of this approach.
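Applying the fitted line to the earlier years is a single affine map. In the sketch below, the two coefficients are the ones reported above; the function name and the assumption that both rates are quoted in percent are illustrative choices, not stated in the text.

```python
INTERCEPT = 0.0961   # estimated intercept reported in the text
SLOPE = 0.8648       # estimated slope reported in the text

def backcast_tbill(commercial_paper_rate):
    """Predicted treasury-bill rate for a given commercial paper rate (assumed in percent)."""
    return INTERCEPT + SLOPE * commercial_paper_rate
```

For example, an input rate of 5.0 maps to a predicted treasury-bill rate of about 4.42.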

This section is concluded with Table 2, which displays the standard descriptive statistics for the transformed variables according to Equations (1), (4) and (5). The predictive variables under the inflation benchmark are more spread out, with a wider range and a higher standard deviation than the variables under the risk-free rate benchmark. This property of the inflation benchmark could be beneficial for the estimation process because a larger variability in the regressors usually leads to a more efficient predictor.

However, the returns transformed with the two benchmarks differ only slightly. A small upward shift under the inflation benchmark is noticeable in Figure 1, which shows density plots of the benchmarked returns for both the one- and five-year horizons.

3. Results and Discussion

3.1. One- and Five-Year Excess Stock Return Predictability

In what follows, we apply the double benchmarking approach introduced in Section 2 to the annual U.S. stock market data. Models (2) and (8) are estimated with a local-linear kernel smoother using the quartic kernel. The optimal bandwidths were chosen by cross-validation, that is, by maximizing the $R_{V}^{2}$ introduced in Equation (10). Note that the linear model is automatically embedded in our approach because of the ability of the local-linear smoother to estimate this simple functional form without any bias. Remember that the $R_{V}^{2}$ value compares the predictive power of a specific model (as a combination of predictive variables) with the predictive power of the historical mean. Thus, the largest positive $R_{V}^{2}$ under each benchmark indicates our favored model with the highest predictive power. We study the empirical findings of $R_{V}^{2}$ values based on different validated scenarios shown for the one- and five-year horizons in Table 3 and Table 4, respectively.
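A local-linear smoother with the quartic kernel can be sketched for a single regressor as follows. This is a textbook weighted-least-squares formulation written for illustration, not the authors' implementation; the function names and the univariate restriction are assumptions (the paper also uses pairs of predictors).

```python
import numpy as np

def quartic_kernel(u):
    """Quartic (biweight) kernel: (15/16) * (1 - u^2)^2 for |u| <= 1, zero elsewhere."""
    u = np.asarray(u, dtype=float)
    return np.where(np.abs(u) <= 1.0, 15.0 / 16.0 * (1.0 - u ** 2) ** 2, 0.0)

def local_linear(x_train, y_train, x0, h):
    """Local-linear estimate of E(Y | X = x0) with bandwidth h."""
    x_train = np.asarray(x_train, dtype=float)
    y_train = np.asarray(y_train, dtype=float)
    d = x_train - x0
    w = quartic_kernel(d / h)
    # Moments of the kernel-weighted least-squares problem in (intercept, slope).
    s0, s1, s2 = w.sum(), (w * d).sum(), (w * d * d).sum()
    t0, t1 = (w * y_train).sum(), (w * d * y_train).sum()
    det = s0 * s2 - s1 ** 2
    if np.isclose(det, 0.0):
        raise ValueError("too few observations within the bandwidth")
    return (s2 * t0 - s1 * t1) / det   # fitted intercept = estimate at x0
```

On exactly linear data the estimate reproduces the line for any admissible bandwidth, including at the boundary of the sample, which is the sense in which the linear model is embedded in the local-linear approach.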

For almost all the variable combinations in the one- and five-year cases as well as under both benchmarks, we found a positive $R_{V}^{2}$, that is, a better predictive power compared with the historical mean. Only the inflation rate as a single covariate under the short-term interest rate benchmark has a negative $R_{V}^{2}$ for both horizons and thus no predictive power.

When comparing one- with five-year predictions for the risk-free rate benchmark, we confirmed the findings of Rapach and Zhou [26] that longer-horizon predictions produce more accurate estimates than shorter horizons. All considered combinations of predictive variables have higher $R_{V}^{2}$ values for the five-year case. However, for the inflation benchmark, we observed the contrary, that is, almost all models for the one-year horizon have a higher predictive power. The only exception is the earnings-by-price variable with a slightly increased $R_{V}^{2}$ value in the five-year case.

Under the short interest benchmark

B^{(R)}

, the term spread

s^{(R)}

is the most powerful predictive variable for excess stock returns. In detail, with the prediction constrained to using only single covariates, the term spread is the best predictor for the one- and five-year horizon with

R_{V}^{2} = 9.6 %

and

15.9 %

, respectively. Note that

s^{(R)}

and

l^{(R)}

(and their combinations with

d^{(R)}, e^{(R)}, π^{(R)}

) have the same

R_{V}^{2}

by construction of the transformed spread according to (4). For example,

s_{t - 1}^{(R)} = (l_{t - 1} - r_{t - 1}) / B_{t - 1}^{(R)} = (1 + l_{t - 1}) / (1 + r_{t - 1}) - 1

and

l_{t - 1}^{(R)} = (1 + l_{t - 1}) / (1 + r_{t - 1})

. Both differ by a constant shift of one, which has no impact on the estimation process with the local-linear smoother. Considering now the models with combined predictive variables, we find in the one-year case that

(e^{(R)}, s^{(R)})

yields

R_{V}^{2} = 10.7 %

, whereas in the five-year case,

(d^{(R)}, s^{(R)})

and

(e^{(R)}, s^{(R)})

perform closely with

R_{V}^{2} = 22.2 %

and

21.8 %

, respectively; for both cases, there is thus increased predictive power compared with the best model with the single term spread covariate.
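The remark above that a constant shift of the covariate leaves the local-linear estimate unchanged can be checked numerically. The sketch below is illustrative only: the Gaussian kernel, the bandwidth `h`, the function name `local_linear`, and the simulated data are assumptions and do not reproduce the paper's estimator settings.

```python
import numpy as np

def local_linear(x, y, x0, h):
    """Local-linear estimate of E[Y | X = x0] with a Gaussian kernel."""
    u = (x - x0) / h
    w = np.exp(-0.5 * u ** 2)
    design = np.column_stack([np.ones_like(x), x - x0])
    weighted = design * w[:, None]
    # Weighted least squares: solve (X'WX) beta = X'Wy; beta[0] is the fit at x0.
    beta = np.linalg.solve(design.T @ weighted, weighted.T @ y)
    return beta[0]

rng = np.random.default_rng(1)
s = rng.normal(0.5, 1.3, size=100)          # stand-in for the spread s^{(R)}
y = 4.0 + 2.0 * s - 0.5 * s ** 2 + rng.normal(scale=3.0, size=100)
l = s + 1.0                                 # shifted covariate, as for l^{(R)}

grid = np.linspace(-1.0, 2.0, 7)
fit_s = np.array([local_linear(s, y, g, h=0.8) for g in grid])
fit_l = np.array([local_linear(l, y, g + 1.0, h=0.8) for g in grid])
max_gap = np.max(np.abs(fit_s - fit_l))
print(max_gap)
```

Because the estimator depends on the covariate only through the differences `x - x0`, shifting both the data and the evaluation point by one leaves every weight and every design row unchanged, which is why the two covariates yield the same validated R².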

Under the inflation benchmark

B^{(C)}

, the earnings variable

e^{(C)}

is the most powerful single predictor for the one- and five-year horizons with

R_{V}^{2} = 12.0 %

and

12.4 %

, respectively. In the one-year case, the pair

(e^{(C)}, s^{(C)})

further boosts the predictive power to

R_{V}^{2} = 17.5 %

, whereas for the five-year horizon, we find the same variable combination to be the most predictive model with

R_{V}^{2} = 14.9 %

.

In our model, which combines both one-year and five-year predictions, we use the optimal combination of predictive variables for each benchmark and horizon. For convenience, Table 5 summarizes the best models. For consistency and to examine the robustness of our results, we additionally consider the second-best set of predictors under the risk-free rate benchmark for the five-year horizon, that is,

(e^{(R)}, s^{(R)})

.

To obtain deeper insights into the relationship between excess stock returns and the predictive variables for the different benchmarks and horizons discussed above, Figure 2 and Figure 3 show the estimated nonparametric function

\hat{m}

(light blue surface) together with the underlying observations (dark blue balls). A nonlinear relationship is noticeable, especially for the risk-free rate benchmark. However, the estimated function seems to be more stable under the inflation benchmark, that is, it is very similar for the one- and five-year horizons. All four plots indicate that with an increase in the spread, the predicted return increases, holding other factors fixed. Likewise, an increase in the earnings predicts an increase in the return; this effect is stronger for the inflation than for the risk-free rate benchmark. The dividend-by-price versus excess stock return relation for a fixed spread under the risk-free rate benchmark and the five-year horizon is U-shaped.

3.2. Short-Term Exuberance and Long-Term Stability: Combining Predictions of Short and Long Horizons

In this section, we illustrate the main empirical contribution of our paper. Recall that the simple econometric model introduced in Section 2.4, which combines short- and long-term predictions, builds on (a) the predictive power of earnings for excess stock returns with its linear model formulation of Equation (12) and (b) the autoregressive development of order one of the earnings in Equation (11). Table 6 shows the estimated OLS coefficients and regression summaries of the linear Models (11) and (12). We find that the earnings variable has more predictive power for excess stock returns under the inflation benchmark than the short-term benchmark: the

R^{2}

of the former is more than twice as large as that of the latter (

13.4 %

versus

5.8 %

). The autoregressive behavior of the earnings is stronger in terms of

R^{2}

for the short-term benchmark than the inflation benchmark (

63.8 %

versus

8.2 %

). However, the much smaller estimated coefficient under the inflation benchmark (

0.286

versus

0.798

) indicates a more stable variation in the earnings around the scaled historical mean. The estimated intercept of the autoregressive model is also significantly different from zero for both benchmarks. Figure 4 and Figure 5 show the linear Models (11) and (12) for both benchmarks (solid red line) together with estimates of the local-linear smoother (dashed green line) and the

45^{\circ}

-line (dotted black line). These illustrations support the use of linear functions in our econometric model in Section 2.4.

The next step in running our model is its calibration to the conditional mean and variance estimates for the one- and five-year horizons (the right-hand side values in Equation (23)); Table 7 shows those estimates for both benchmarks. Note that we used out-of-sample predictions from the optimal models discussed in Section 3.1 for both horizons (see also Table 5), that is,

{\hat{μ}}_{1 y}

and

{\hat{μ}}_{5 y}

are based on the newest predictive variables in our records (corresponding to December 2019 values). For the short-term benchmark, the optimal models predict returns of

4.30

(1 year) and

18.81

(5 years). Note that the average annual return for the five-year horizon of

18.81 / 5 = 3.76

is smaller than the predicted return for the one-year horizon of

4.30

. The econometric model should be able to capture such a decline in annual returns adequately, and it does so, as we show later, via the simple linear correction proposed in Section 2.4. For the inflation benchmark, the corresponding predictions are

4.15

(1 year) and

27.41

(5 years). Although the picture is similar for the one-year horizon, the behavior of the five-year predictions is different. Here, we forecast an increase in one-year real returns, since the average annual return for the five-year horizon of

27.41 / 5 = 5.48

is now larger than the predicted return for the one-year horizon of

4.15

.

From the upper panel of Table 7, the standard deviations of predicted one-year and five-year returns,

{\hat{σ}}_{1 y}

and

{\hat{σ}}_{5 y}

, respectively, are reduced by the statistical modeling process relative to the standard deviation

σ

of observed returns for both benchmarks (Equations (21) and (22)). Under the short-term benchmark, we obtain a reduction from

17.28

to

16.34

(1 year) and

36.65

to

32.33

(5 years), whereas under the inflation benchmark, from

18.04

to

16.38

(1 year) and

36.33

to

33.52

(5 years). Note that our model combining the one- and five-year horizons further reduces the uncertainty and thus the risk for one-year returns under both benchmarks, as we explain below.
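The reductions quoted above can be related to the validated R² values. Equations (21) and (22) are not reproduced in this section, so the relation used below, namely that the predicted standard deviation equals the sample standard deviation times the square root of one minus the validated R², is an assumption; it is shown only because it reproduces the Table 7 entries up to rounding. The dictionary layout and variable names are hypothetical.

```python
import math

# Inputs copied from Table 7 (all in %): sample sd, validated R^2, reported sd-hat.
table7 = {
    ("short-term rate", "1y"): {"sigma": 17.28, "r2_v": 10.67, "sigma_hat": 16.34},
    ("short-term rate", "5y"): {"sigma": 36.65, "r2_v": 22.18, "sigma_hat": 32.33},
    ("inflation", "1y"): {"sigma": 18.04, "r2_v": 17.53, "sigma_hat": 16.38},
    ("inflation", "5y"): {"sigma": 36.33, "r2_v": 14.85, "sigma_hat": 33.52},
}

implied = {}
for key, row in table7.items():
    # Assumed relation: sigma_hat = sigma * sqrt(1 - R_V^2), with R_V^2 as a fraction.
    implied[key] = row["sigma"] * math.sqrt(1.0 - row["r2_v"] / 100.0)
    print(key, round(implied[key], 2), "reported:", row["sigma_hat"])
```

Read this way, a higher validated R² translates directly into a lower residual standard deviation, which is the sense in which better prediction reduces the risk attached to the forecast.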

Using the estimated coefficients of the linear Models (11) and (12) (

c_{0}

,

c_{1}

,

b_{0}

, and

b_{1}

) as well as the predicted one-year and five-year returns and estimated variation (

{\hat{μ}}_{1 y}

,

{\hat{μ}}_{5 y}

,

{\hat{σ}}_{1 y}^{2}

, and

{\hat{σ}}_{5 y}^{2}

), we solve the equation system (23) and obtain the estimates

{\hat{α}}_{0}

,

{\hat{α}}_{1}

,

{\hat{σ}}_{1}

, and

{\hat{σ}}_{2}

(Equations (24)–(27)) reported in the lower panel of Table 7 for both benchmarks. For the inflation benchmark, we obtain

{\hat{α}}_{0} = - 0.17

and

{\hat{α}}_{1} = 0.95

, i.e., the intercept of our simple linear correction of predicted one-year returns (13)–(15) is close to zero and its slope is near one. This implies that only a slight correction suffices in combining optimal one-year and five-year stock return predictions. However, under the short-term benchmark, a much stronger correction is necessary to model the decline in the one-year returns over time:

{\hat{α}}_{0} = 13.70

and

{\hat{α}}_{1} = - 2.30

. Table 8 shows the development of the one-year returns for the periods of interest, i.e., from

n + 1

to

n + 5

for both benchmarks. Note that the corrected risk premium

{\hat{Y}}^{(R), c}

equals

4.30

in period

n + 1

by construction, reduces over time to

3.33

in period

n + 5

, and sums up over the five-year horizon to

18.81

again by construction. Similarly, the corrected real return

{\hat{Y}}^{(C), c}

equals

4.15

in period

n + 1

by construction, increases from year to year to

5.98

in period

n + 5

, and sums up to the five-year prediction of

27.41

. The underlying development of the earnings and the simple return predictions from Models (11) and (12) are also shown in Table 8.
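The path of corrected one-year returns described above can be sketched from the reported coefficients. Equations (11)–(15) are not reproduced in this section, so the recursion below is an assumption: earnings follow the AR(1) Model (11), the simple return prediction is the linear Model (12) applied to lagged earnings, the corrected return in period n + 1 is set to the one-year prediction, and for later periods it is the linear correction with intercept α̂₀ and slope α̂₁. Coefficients come from Table 6 (converted to %) and Table 7; the starting earnings values are the period-n entries of Table A2. The function name `project` is hypothetical.

```python
def project(e_n, c0, c1, b0, b1, alpha0, alpha1, mu_1y, horizon=5):
    """Return (earnings path, simple predictions, corrected predictions) in %."""
    earnings = [e_n]
    simple, corrected = [], []
    for k in range(1, horizon + 1):
        y_hat = b0 + b1 * earnings[-1]           # Model (12): return from lagged earnings
        simple.append(y_hat)
        # Assumed correction: period n+1 is pinned to mu_1y, later periods are linear in y_hat.
        corrected.append(mu_1y if k == 1 else alpha0 + alpha1 * y_hat)
        earnings.append(c0 + c1 * earnings[-1])  # Model (11): AR(1) step for earnings
    return earnings, simple, corrected

# Short-term interest rate benchmark (intercepts from Table 6 scaled to %).
_, simple_R, corrected_R = project(
    e_n=2.76, c0=0.66, c1=0.7976, b0=0.35, b1=1.3522,
    alpha0=13.70, alpha1=-2.30, mu_1y=4.30,
)
# Inflation benchmark.
_, simple_C, corrected_C = project(
    e_n=3.47, c0=3.73, c1=0.2859, b0=0.69, b1=1.1144,
    alpha0=-0.17, alpha1=0.95, mu_1y=4.15,
)
sum_R, sum_C = sum(corrected_R), sum(corrected_C)
print([round(v, 2) for v in corrected_R], round(sum_R, 2))
print([round(v, 2) for v in corrected_C], round(sum_C, 2))
```

Because the inputs are rounded table values, and the slope of −2.30 amplifies small rounding differences, the five-year sums match the reported 18.81 and 27.41 only approximately. The qualitative pattern is the same: corrected returns decline under the short-term benchmark and rise under the inflation benchmark.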

Another important outcome of our econometric model is the additionally reduced variation in the corrected one-year returns of the periods

n + 2

to

n + 5

. For the short-term benchmark, Table 7 reports a reduction from

{\hat{σ}}_{1} = 16.34

to

{\hat{σ}}_{2} = 13.95

, that is, a drop in variation of

19.2 %

compared with the sample standard deviation of one-year returns of

σ = 17.28

—the starting point of our analysis. Similarly, for the inflation benchmark,

{\hat{σ}}_{1} = 16.38

reduces to

{\hat{σ}}_{2} = 14.62

, that is, a decrease in variation of

19.0 %

from the sample standard deviation of one-year returns of

σ = 18.04

.
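As a quick arithmetic check of the percentages above, the drop can be computed as the relative difference between the sample standard deviation and the second-period estimate. The helper name `pct_drop` is hypothetical, and the inputs are the rounded values from Table 7, so the results agree with the reported 19.2% and 19.0% only up to rounding.

```python
def pct_drop(sigma, sigma_2):
    """Relative reduction (in %) from the sample sd to the corrected sd."""
    return 100.0 * (sigma - sigma_2) / sigma

drop_R = pct_drop(17.28, 13.95)  # short-term interest rate benchmark
drop_C = pct_drop(18.04, 14.62)  # inflation benchmark
print(round(drop_R, 2), round(drop_C, 2))
```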

{\hat{σ}}_{1} > {\hat{σ}}_{2}

tells us that in predicted pure one-year returns (i.e., ignoring the long-term view), a sort of bubble is still present. In other words, even short-term predictions of the one-year horizon are prone to speculative exuberance. However, our simple model optimizes the one-year investments according to the bubble-free long-term variance, reducing the variation/risk. This finding is relevant for long-term investors (above one year), i.e., for the majority of us via our pensions. Figure 6 illustrates this discussion and shows reward-to-variability ratios for each benchmark A based on the corrected one-year predictions

{\hat{Y}}^{(A), c}

for the periods

n + 2

to

n + 5

and the three standard deviations in the model

σ

,

σ_{1}

, and

σ_{2}

.
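The comparison in Figure 6 can be illustrated with a few lines of arithmetic. The exact ratio used for the figure is not spelled out in this section; the sketch assumes the reward-to-variability ratio is the corrected one-year prediction divided by the respective standard deviation. The corrected real returns are the inflation-benchmark entries of Table A2 (whose inflation columns use the same predictors as the main analysis), and the standard deviations are from Table 7.

```python
# Corrected real-return predictions (in %), inflation benchmark, periods n+2 to n+5.
corrected_real_returns = {"n+2": 5.47, "n+3": 5.85, "n+4": 5.95, "n+5": 5.98}
# Sample sd, one-year model sd, and sd from the combined model (in %).
std_devs = {"sigma": 18.04, "sigma_1": 16.38, "sigma_2": 14.62}

# Assumed definition: ratio = predicted excess return / standard deviation.
ratios = {
    name: {period: y / sd for period, y in corrected_real_returns.items()}
    for name, sd in std_devs.items()
}
for name, by_period in ratios.items():
    print(name, {p: round(r, 3) for p, r in by_period.items()})
```

For a fixed predicted return, the ratio rises as the standard deviation falls, so the combined model's σ̂₂ gives the most favorable ratio in every period.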

Finally, we repeat the model calibration for an alternative set of predictors under the risk-free rate benchmark for the five-year case, that is, the combination of earnings and term spread, aiming for congruity in the choice of the baseline set of predictors across benchmarks and horizons. Analogous to the reports in Table 7 and Table 8, Table A1 presents our estimates, which are minimally affected by this choice, whereas Table A2 exhibits the development of the earnings and return predictions, which remain qualitatively similar.

3.3. A Final Comment on the Performance and the Choice of the Benchmark

Notice that our underlying estimates under the inflation benchmark are more stable than in the equivalent short-term interest case. The autoregressive earnings model is also more stable in the inflation case compared with the short-term interest case, with a much higher mean-reversion. The modeling of excess returns shows a linear shape in the inflation case (Figure 3) but has considerable variability in the short-term interest case (Figure 2). Adjusting under the inflation benchmark from a one-year model to a five-year model is non-dramatic, contrary to the complete change involved in the short-term interest case. Although both models perform similarly after validation, with the short-term interest rate benchmark being ahead in the long-term case, we tend to prefer the stable and intuitive inflation benchmark when providing our long-term and short-term model of stock returns. Ultimately, the choice of the benchmark depends on the application. If one follows, for example, one of the key messages of Merton [2], then forecasts, especially for pensions, should be net of inflation.

4. Concluding Remarks

We propose a state-of-the-art econometric model that accounts contemporaneously for short- and long-term predictions. Therefore, it serves for short-term market timing as well as a long-term asset-allocation strategy for the long-term saver. The combination of the one- and five-year investment horizons thereby reduces the short-term variation by almost 20%. This finding has several implications: First, the high sample standard deviation of short-term returns indicates the presence of bubbles even in one-year returns. Second, institutional long-term investors such as pension funds should disregard pure short-term econometric models when deciding on their long-term asset allocation. Third, for a given risk appetite, the ability to add equity exposure and thereby increase long-term savers’ portfolio returns is significant, as it provides better pensions for everyone (see also [2,3]). Fourth, we found the inflation benchmark that expresses everything in real terms to be more stable than the often-used short-term interest rate benchmark. The former links directly with Merton’s [2] pension vision and provides good predictive power based on our empirical results.

We applied our framework to U.S. stock market excess returns and common predictors based on the short-term interest rate or the inflation benchmark for the one- and five-year horizons. Of potential interest are reference rates [13], longer horizons (say ten years) [27], or econometric modeling of the conditional variance [28], but these tasks remain for future research.

Author Contributions

Conceptualization, I.K., P.M., J.P.N. and M.S.; Formal analysis, I.K., P.M., J.P.N. and M.S.; Funding acquisition, J.P.N.; Investigation, P.M.; Methodology, M.S.; Software, M.S.; Supervision, J.P.N.; Writing—original draft, I.K., P.M., J.P.N. and M.S. All authors contributed equally to this work. All authors have read and agreed to the published version of the manuscript.

Funding

The authors thank the University of Graz for the open access funding.

Institutional Review Board Statement

Not applicable.

Informed Consent Statement

Not applicable.

Data Availability Statement

Financial data was obtained from http://www.econ.yale.edu/~shiller/data.htm (accessed on 16 April 2020) and https://fred.stlouisfed.org/series/TB6MS (accessed on 16 April 2020).

Acknowledgments

The authors thank the Open Access Funding by the University of Graz.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A

In Section 2.2, we introduce our setup for the T-year predictions. Here, we describe the important individual steps in more detail. Equation (5) defines

Z_{t}^{(A)}

as the sum of annual continuously compounded returns, which are of an overlapping nature:

\begin{matrix} Z_{1}^{(A)} & = Y_{1}^{(A)} + Y_{2}^{(A)} + Y_{3}^{(A)} + \dots + Y_{T}^{(A)} \\ Z_{2}^{(A)} & = Y_{2}^{(A)} + Y_{3}^{(A)} + Y_{4}^{(A)} + \dots + Y_{T + 1}^{(A)} \\ Z_{3}^{(A)} & = Y_{3}^{(A)} + Y_{4}^{(A)} + Y_{5}^{(A)} + \dots + Y_{T + 2}^{(A)} \\ ⋮ \\ Z_{n - T + 1}^{(A)} & = Y_{n - T + 1}^{(A)} + Y_{n - T + 2}^{(A)} + Y_{n - T + 3}^{(A)} + \dots + Y_{n}^{(A)}, \end{matrix}

where n is the number of observations of one-year returns. Using the relations stated in Equation (6), we easily obtain

\begin{matrix} Z_{t}^{(A)} & = (β_{0} + β_{1} X_{t - 1}^{(A)} + ξ_{t}) + \dots + (β_{0} + β_{1} X_{t + T - 2}^{(A)} + ξ_{t + T - 1}) \\  & = β_{0} T + β_{1} γ_{0} \sum_{i = 0}^{T - 1} \sum_{j = 0}^{T - 2 - i} γ_{1}^{j} + β_{1} X_{t - 1}^{(A)} \sum_{i = 0}^{T - 1} γ_{1}^{i} + β_{1} \sum_{i = 0}^{T - 1} \sum_{j = 0}^{T - 2 - i} γ_{1}^{j} η_{t + i} + \sum_{i = 0}^{T - 1} ξ_{t + i} \\  & = ϕ_{0} + ϕ_{1} X_{t - 1}^{(A)} + ν_{t}, \end{matrix}

where

\begin{matrix} ϕ_{0} & : = β_{0} T + β_{1} γ_{0} \sum_{i = 0}^{T - 1} \sum_{j = 0}^{T - 2 - i} γ_{1}^{j}, \\ ϕ_{1} & : = β_{1} \sum_{i = 0}^{T - 1} γ_{1}^{i}, \\ ν_{t} & : = β_{1} \sum_{i = 0}^{T - 1} \sum_{j = 0}^{T - 2 - i} γ_{1}^{j} η_{t + i} + \sum_{i = 0}^{T - 1} ξ_{t + i} . \end{matrix}
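The closed-form expressions above can be verified numerically by iterating the one-year relations directly. Equation (6) is not reproduced in this appendix, so the sketch assumes it has the form Y_t = β₀ + β₁X_{t−1} + ξ_t and X_t = γ₀ + γ₁X_{t−1} + η_t. The function names, the random shocks, and the use of the Table 6 short-term estimates as parameter values are illustrative choices only.

```python
import numpy as np

def phi_coefficients(beta0, beta1, gamma0, gamma1, T):
    """phi_0 and phi_1 as defined above (inner sum runs over j = 0, ..., T-2-i)."""
    double_sum = sum(gamma1 ** j for i in range(T) for j in range(T - 1 - i))
    phi0 = beta0 * T + beta1 * gamma0 * double_sum
    phi1 = beta1 * sum(gamma1 ** i for i in range(T))
    return phi0, phi1

def nu_term(beta1, gamma1, eta, xi):
    """Aggregated error nu_t built from the shocks eta_{t+i} and xi_{t+i}."""
    T = len(xi)
    carried = sum(gamma1 ** j * eta[i] for i in range(T) for j in range(T - 1 - i))
    return beta1 * carried + float(np.sum(xi))

def iterate_Z(x_prev, beta0, beta1, gamma0, gamma1, eta, xi):
    """Sum T one-year returns by direct recursion, starting from X_{t-1}."""
    x, z = x_prev, 0.0
    for i in range(len(xi)):
        z += beta0 + beta1 * x + xi[i]      # Y_{t+i} uses X_{t+i-1}
        x = gamma0 + gamma1 * x + eta[i]    # X_{t+i} from the assumed AR(1) relation
    return z

beta0, beta1, gamma0, gamma1, T = 0.0035, 1.3522, 0.0066, 0.7976, 5
rng = np.random.default_rng(2)
eta = rng.normal(scale=0.02, size=T)
xi = rng.normal(scale=0.17, size=T)
x_prev = 0.0276

phi0, phi1 = phi_coefficients(beta0, beta1, gamma0, gamma1, T)
z_closed = phi0 + phi1 * x_prev + nu_term(beta1, gamma1, eta, xi)
z_direct = iterate_Z(x_prev, beta0, beta1, gamma0, gamma1, eta, xi)
print(z_closed, z_direct)
```

The two values coincide up to floating-point error, confirming that the T-year return is again linear in the lagged covariate with an aggregated error term.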

Table A1. Estimated parameters of the econometric model under the short-term interest rate benchmark or the inflation benchmark based on common predictive variables (earnings and spread) for one- and five-year horizons. Changes (compared to Table 7) are provided in boldface. See also the notes in Table 7.

Benchmark	Short-Term Interest Rate				Inflation Rate
	${\hat{μ}}_{ky}$	${\hat{σ}}_{ky}$	$R_{V, ky}^{2}$	$σ$	${\hat{μ}}_{ky}$	${\hat{σ}}_{ky}$	$R_{V, ky}^{2}$	$σ$
One-year ( $k = 1$ )	4.30	16.34	10.67	17.28	4.15	16.38	17.53	18.04
Five-year ( $k = 5$ )	20.93	32.41	21.78	36.65	27.41	33.52	14.85	36.33
	${\hat{α}}_{0}$	${\hat{α}}_{1}$	${\hat{σ}}_{1}$	${\hat{σ}}_{2}$	${\hat{α}}_{0}$	${\hat{α}}_{1}$	${\hat{σ}}_{1}$	${\hat{σ}}_{2}$
Parameter estimate	6.27	−0.48	16.34	14.00	−0.17	0.95	16.38	14.62

Table A2. Predicted excess stock returns from the econometric model in Section 2.4 under the short-term interest rate benchmark or the inflation rate benchmark based on common predictive variables (earnings and spread). Changes (compared to Table 8) are provided in boldface.

Benchmark	Short-Term Interest Rate			Inflation Rate
Period	$e^{(R)}$	${\hat{Y}}^{(R)}$	${\hat{Y}}^{(R), c}$	$e^{(C)}$	${\hat{Y}}^{(C)}$	${\hat{Y}}^{(C), c}$
n	2.76	–	–	3.47	–	–
$n + 1$	2.87	4.08	4.30	4.72	4.56	4.15
$n + 2$	2.95	4.23	4.23	5.08	5.95	5.47
$n + 3$	3.02	4.34	4.17	5.18	6.35	5.85
$n + 4$	3.07	4.43	4.13	5.21	6.47	5.95
$n + 5$	–	4.50	4.10	–	6.50	5.98

References

1. Lioui, A.; Poncet, P. Long horizon predictability: An asset allocation perspective. Eur. J. Oper. Res. 2019, 278, 961–975.
2. Merton, R.C. The crisis in retirement planning. Harv. Bus. Rev. 2014, 92, 43–50.
3. Gerrard, R.; Hiabu, M.; Kyriakou, I.; Nielsen, J.P. Communication and personal selection of pension saver’s financial risk. Eur. J. Oper. Res. 2019, 274, 1102–1111.
4. Gerrard, R.; Hiabu, M.; Nielsen, J.; Vodička, P. Long-term real dynamic investment planning. Insur. Math. Econ. 2020, 92, 90–103.
5. Carmona, J.; León, A.; Vaello-Sebastià, A. Does stock return predictability affect ESO fair value? Eur. J. Oper. Res. 2012, 223, 188–202.
6. Bodnar, T.; Parolya, N.; Schmid, W. On the exact solution of the multi-period portfolio choice problem for an exponential utility under return predictability. Eur. J. Oper. Res. 2015, 246, 528–542.
7. Kyriakou, I.; Mousavi, P.; Nielsen, J.P.; Scholz, M. Longer-Term Forecasting of Excess Stock Returns–The Five-Year Case. Mathematics 2020, 8, 927.
8. Lettau, M.; Van Nieuwerburgh, S. Reconciling the return predictability evidence. Rev. Financ. Stud. 2008, 21, 1601–1652.
9. Chen, Q.; Hong, Y. Predictability of Equity Returns over Different Time Horizons: A Nonparametric Approach; Working Paper; Cornell University/Department of Economics: Ithaca, NY, USA, 2009.
10. Scholz, M.; Nielsen, J.P.; Sperlich, S. Nonparametric prediction of stock returns based on yearly data: The long-term view. Insur. Math. Econ. 2015, 65, 143–155.
11. Scholz, M.; Sperlich, S.; Nielsen, J.P. Nonparametric long term prediction of stock returns with generated bond yields. Insur. Math. Econ. 2016, 69, 82–96.
12. Cheng, T.; Gao, J.; Linton, O. Nonparametric Predictive Regressions for Stock Return Predictions; Cambridge Working Papers in Economics: 1932; Faculty of Economics, University of Cambridge: Cambridge, UK, 2019.
13. Kyriakou, I.; Mousavi, P.; Nielsen, J.P.; Scholz, M. Forecasting benchmarks of long-term stock returns via machine learning. Ann. Oper. Res. 2021, 287, 221–240.
14. Nielsen, J.P.; Sperlich, S. Prediction of stock returns: A new way to look at it. ASTIN Bull. 2003, 33, 399–417.
15. Xiao, Z.; Linton, O.B.; Carroll, R.J.; Mammen, E. More efficient local polynomial estimation in nonparametric regression with autocorrelated errors. J. Am. Stat. Assoc. 2003, 98, 980–992.
16. Su, L.; Ullah, A. More efficient estimation in nonparametric regression with nonparametric autocorrelated errors. Econom. Theory 2006, 22, 98–126.
17. Linton, O.B.; Mammen, E. Nonparametric transformation to white noise. J. Econom. 2008, 142, 241–264.
18. Geller, J.; Neumann, M.H. Improved local polynomial estimation in time series regression. J. Nonparametr. Stat. 2018, 30, 1–27.
19. De Brabanter, K.; De Brabanter, J.; Suykens, J.; De Moor, B. Kernel regression in the presence of correlated errors. J. Mach. Learn. Res. 2011, 12, 1955–1976.
20. Bergmeir, C.; Hyndman, R.J.; Koo, B. A note on the validity of cross-validation for evaluating autoregressive time series predictions. Comput. Stat. Data Anal. 2018, 120, 70–83.
21. Opsomer, J.; Wang, Y.; Yang, Y. Nonparametric regression with correlated errors. Stat. Sci. 2001, 16, 134–153.
22. Chu, C.K.; Marron, J.S. Comparison of two bandwidth selectors with dependent errors. Ann. Stat. 1991, 19, 1906–1918.
23. Burman, P.; Chow, E.; Nolan, D. A cross-validatory method for dependent data. Biometrika 1994, 81, 351–358.
24. Shiller, R.J. Market Volatility; MIT Press: Cambridge, MA, USA, 1989.
25. Welch, I.; Goyal, A. A comprehensive look at the empirical performance of equity premium prediction. Rev. Financ. Stud. 2008, 21, 1455–1508.
26. Rapach, D.; Zhou, G. Forecasting Stock Returns. In Handbook of Economic Forecasting, 2nd ed.; Elliott, G., Timmermann, A., Eds.; Elsevier: Amsterdam, The Netherlands, 2013; pp. 328–383.
27. Munk, C.; Rangvid, J. New Assumptions of a pension forecast model: Background, level and consequences for individuals forecasted pension. Finans/Invest 2018, 6, 6–14.
28. Mammen, E.; Nielsen, J.P.; Scholz, M.; Sperlich, S. Conditional Variance Forecasts for Long-Term Stock Returns. Risks 2019, 7, 113.

Figure 1. Kernel density estimates of the probability density function of returns transformed with the risk-free rate benchmark (solid) and the inflation benchmark (dotted). (Left) One-year horizon. (Right) Five-year horizon. Period: 1872–2020. Data: annual S&P 500.

Figure 2. Risk-free rate benchmark. The relation between excess stock returns and predictive variables: the earnings-by-price ratio and the spread, one-year horizon (left); the dividend-by-price ratio and the spread, five-year horizon (right). Estimated nonparametric function

\hat{m}

(light blue surface) and observations (dark blue balls). Period: 1872–2020. Data: annual S&P 500.

Figure 3. Inflation benchmark. Relation between excess stock returns and predictive variables: the earnings-by-price ratio and the spread, one-year horizon (left); the earnings-by-price ratio and the spread, five-year horizon (right). Estimated nonparametric function

\hat{m}

(light blue surface) and observations (dark blue balls). Period: 1872–2020. Data: annual S&P 500.

Figure 4. Relation between excess stock returns and the earnings-by-price ratio. (Left) Short-term interest rate benchmark. (Right) Inflation benchmark. Shown are estimates of a linear function (solid red), the local-linear smoother (dashed green), and the

45^{\circ}

-line (dotted black). Period: 1872–2020. Data: annual S&P 500.

Figure 5. Autoregressive behavior of the earnings-by-price ratio. (Left) Short-term interest rate benchmark. (Right) Inflation benchmark. Shown are estimates of a linear function (solid red), the local-linear smoother (dashed green), and the

45^{\circ}

-line (dotted black). Period: 1872–2020. Data: annual S&P 500.

Figure 6. Comparison of reward-to-variability ratios based on corrected one-year predictions for periods

n + 2, \dots, n + 5

and

σ

(red),

σ_{1}

(yellow), and

σ_{2}

(green). (Left) Short-term interest rate benchmark. (Right) Inflation benchmark.

Table 1. U.S. market data (1872–2020).

	Max	Min	Mean	SD	Skew	Exc. kurt
S&P stock price index	3278.20	3.25	297.62	607.94	2.57	6.62
Dividend accruing to index	58.24	0.18	6.39	11.36	2.52	6.35
Earnings accruing to index	139.47	0.16	14.80	28.17	2.47	5.66
Short-term interest rate	14.93	0.07	3.99	2.49	0.94	2.27
Long-term interest rate	14.59	1.76	4.51	2.25	1.80	3.68
Consumer price index	257.97	6.47	60.32	74.02	1.34	0.35

Table 2. Summary statistics of transformed variables (in percentages). Panel (a) shows the available variables transformed according to the short-term interest rate, e.g., excess returns corresponding to the risk premium. Panel (b) shows the available variables net of inflation, i.e., in real terms.

l^{(R)}

equals

s^{(R)}

by construction as explained in the footnote in Section 3.1.

	Max	Min	Mean	SD	Skew	Exc. kurt
(a) Benchmark: short-term interest rate ( $A \equiv R$ )
One-year excess stock returns $Y^{(R)}$	42.39	−58.26	4.71	17.28	−0.58	0.68
Five-year excess stock returns $Z^{(R)}$	107.27	−78.54	23.69	36.65	−0.16	−0.37
Dividend-by-price $d^{(R)}$	7.26	−8.96	0.37	2.78	−0.15	0.15
Earnings-by-price $e^{(R)}$	13.25	−3.29	3.22	3.07	0.96	1.18
Long-term interest rate $l^{(R)}$	3.55	−3.46	0.55	1.27	0.01	−0.04
Inflation $π^{(R)}$	17.00	−19.16	−1.64	5.62	0.24	1.79
Spread $s^{(R)}$	3.55	−3.46	0.55	1.27	0.01	−0.04
(b) Benchmark: inflation rate ( $A \equiv C$ )
One-year excess stock returns $Y^{(C)}$	54.04	−48.81	6.52	18.04	−0.41	0.64
Five-year excess stock returns $Z^{(C)}$	122.96	−57.34	32.47	36.33	−0.06	−0.39
Dividend-by-price $d^{(C)}$	25.49	−13.90	2.38	6.51	0.93	1.77
Earnings-by-price $e^{(C)}$	29.50	−10.98	5.23	5.93	0.94	2.13
Short-term interest rate $r^{(C)}$	23.70	−14.53	2.00	5.85	0.40	1.84
Long-term interest rate $l^{(C)}$	23.70	−13.81	2.55	5.78	0.23	2.17
Spread $s^{(C)}$	3.51	−3.45	0.55	1.28	−0.02	−0.05

Table 3. Predictive power (%) for the one-year excess stock returns

Y_{t}^{(A)}

corresponding to the prediction problem defined in (2). The predictive power is measured by

R_{V}^{2}

as defined in (10). The benchmarks

B^{(A)}

considered are based on the short-term interest rate (

A \equiv R

) and the inflation rate (

A \equiv C

). The predictive variables used are

X_{t - 1}^{(A)}

, using the indicated benchmark

B_{t - 1}^{(A)}

as shown in (4). “–” indicates not applicable cases of matched covariate with benchmark.

l^{(R)}

equals

s^{(R)}

by construction as explained in the footnote in Section 3.1.

Benchmark $B^{(A)}$	Explanatory Variable(s) $X_{t - 1}^{(A)}$
	$d^{(A)}$	$e^{(A)}$	$r^{(A)}$	$l^{(A)}$	$π^{(A)}$	$s^{(A)}$
Short-term rate	3.0	5.0	–	9.6	−1.3	9.6
Inflation	10.2	12.0	7.1	10.4	–	6.6
	$(d^{(A)}, e^{(A)})$	$(d^{(A)}, r^{(A)})$	$(d^{(A)}, l^{(A)})$	$(d^{(A)}, π^{(A)})$	$(d^{(A)}, s^{(A)})$
Short-term rate	2.3	–	9.8	1.4	9.8
Inflation	10.1	9.3	9.8	–	15.4
	$(e^{(A)}, r^{(A)})$	$(e^{(A)}, l^{(A)})$	$(e^{(A)}, π^{(A)})$	$(e^{(A)}, s^{(A)})$
Short-term rate	–	10.7	3.3	10.7
Inflation	11.2	11.1	–	17.5
	$(r^{(A)}, l^{(A)})$	$(r^{(A)}, π^{(A)})$	$(r^{(A)}, s^{(A)})$
Short-term rate	–	–	–
Inflation	13.6	–	14.7
	$(l^{(A)}, π^{(A)})$	$(l^{(A)}, s^{(A)})$
Short-term rate	7.2	–
Inflation	–	14.6
	$(π^{(A)}, s^{(A)})$
Short-term rate	7.2
Inflation	–

Table 4. Predictive power (%) for the five-year excess stock returns

Z_{t}^{(A)}

corresponding to the prediction problem defined in (8). For additional notes, refer to Table 3.

Benchmark $B^{(A)}$	Explanatory Variable(s) $X_{t - 1}^{(A)}$
	$d^{(A)}$	$e^{(A)}$	$r^{(A)}$	$l^{(A)}$	$π^{(A)}$	$s^{(A)}$
Short-term rate	11.7	10.6	–	15.9	−2.4	15.9
Inflation	10.8	12.4	5.5	8.6	–	1.0
	$(d^{(A)}, e^{(A)})$	$(d^{(A)}, r^{(A)})$	$(d^{(A)}, l^{(A)})$	$(d^{(A)}, π^{(A)})$	$(d^{(A)}, s^{(A)})$
Short-term rate	8.6	–	22.2	7.4	22.2
Inflation	9.5	4.2	1.7	–	13.1
	$(e^{(A)}, r^{(A)})$	$(e^{(A)}, l^{(A)})$	$(e^{(A)}, π^{(A)})$	$(e^{(A)}, s^{(A)})$
Short-term rate	–	21.8	8.4	21.8
Inflation	8.6	4.9	–	14.9
	$(r^{(A)}, l^{(A)})$	$(r^{(A)}, π^{(A)})$	$(r^{(A)}, s^{(A)})$
Short-term rate	–	–	–
Inflation	10.8	–	10.1
	$(l^{(A)}, π^{(A)})$	$(l^{(A)}, s^{(A)})$
Short-term rate	16.3	–
Inflation	–	10.2
	$(π^{(A)}, s^{(A)})$
Short-term rate	16.3
Inflation	–

Table 5. Summarized optimal combinations of predictive variables and their predictive power

R_{V}^{2}

(%).

Horizon	Risk-Free Rate Benchmark		Inflation Benchmark
	Variables	$R_{V}^{2}$	Variables	$R_{V}^{2}$
One-year	$(e^{(R)}, s^{(R)})$	10.7	$(e^{(C)}, s^{(C)})$	17.5
Five-year	$(d^{(R)}, s^{(R)})$	22.2	$(e^{(C)}, s^{(C)})$	14.9

Table 6. Estimated parameters (and standard errors in parentheses) of the linear Models (11) and (12) used for the econometric model in Section 2.4 under the short-term interest rate benchmark or the inflation benchmark.

R^{2}

, the standard coefficient of determination of a linear model; Adj.

R^{2}

, the adjusted

R^{2}

; Num. obs., the number of observations used in the regression; RMSE, root mean square error.

Benchmark	Short-Term Interest Rate		Inflation Rate
Dependent Variable	$e_{t + 1}^{(R)}$	$Y_{t + 1}^{(R)}$	$e_{t + 1}^{(C)}$	$Y_{t + 1}^{(C)}$
Intercept	0.0066 **	0.0035	0.0373 ***	0.0069
	(0.0022)	(0.0201)	(0.0063)	(0.0186)
$e_{t}^{(A)}$	0.7976 ***	1.3522 **	0.2859 ***	1.1144 ***
	(0.0500)	(0.4531)	(0.0799)	(0.2350)
$R^{2}$	0.6384	0.0579	0.0817	0.1343
Adj. $R^{2}$	0.6359	0.0514	0.0753	0.1283
Num. obs.	146	147	146	147
RMSE	0.0186	0.1683	0.0572	0.1684

*** p < 0.001, ** p < 0.01, * p < 0.05.
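
The adjusted $R^{2}$ values in Table 6 can be cross-checked from the reported $R^{2}$ and the number of observations. The sketch below assumes the textbook definition of the adjusted coefficient of determination and that each regression has an intercept and a single regressor, $e_{t}^{(A)}$, as the table layout suggests. The function name `adjusted_r2` is illustrative and not taken from the paper.

```python
def adjusted_r2(r2, n_obs, n_regressors=1):
    """Textbook adjusted R^2: 1 - (1 - R^2) * (n - 1) / (n - p - 1).

    n_regressors counts slope coefficients only (the intercept is excluded).
    """
    return 1.0 - (1.0 - r2) * (n_obs - 1) / (n_obs - n_regressors - 1)


# (R^2, Num. obs.) pairs as printed in Table 6, keyed by dependent variable.
TABLE6 = {
    "e^(R)": (0.6384, 146),
    "Y^(R)": (0.0579, 147),
    "e^(C)": (0.0817, 146),
    "Y^(C)": (0.1343, 147),
}

if __name__ == "__main__":
    for name, (r2, n_obs) in TABLE6.items():
        print(f"{name}: adjusted R^2 = {adjusted_r2(r2, n_obs):.4f}")
```

Because the inputs are the rounded table entries, the recomputed values should agree with the Adj. $R^{2}$ row only up to rounding in the last reported digit.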

Table 7. Estimated parameters of the econometric model under the short-term interest rate benchmark or the inflation benchmark (in %). For (conditional) predictions of the mean ${\hat{μ}}_{ky}$, the variable combination with the largest $R_{V, ky}^{2}$ is used (Table 5). ${\hat{σ}}_{ky}$ denotes the estimated standard deviation of the predictions ${\hat{Y}}^{(A), c}$ or ${\hat{Z}}^{(A), c}$; $σ$ is the sample standard deviation of $Y^{(A)}$ or $Z^{(A)}$ (Table 2). ${\hat{α}}_{0}$, ${\hat{α}}_{1}$, ${\hat{σ}}_{1}$, and ${\hat{σ}}_{2}$ are the parameter estimates of the econometric model in Section 2.4.

Benchmark	Short-Term Interest Rate				Inflation Rate
	${\hat{μ}}_{ky}$	${\hat{σ}}_{ky}$	$R_{V, ky}^{2}$	$σ$	${\hat{μ}}_{ky}$	${\hat{σ}}_{ky}$	$R_{V, ky}^{2}$	$σ$
One-year ( $k = 1$ )	4.30	16.34	10.67	17.28	4.15	16.38	17.53	18.04
Five-year ( $k = 5$ )	18.81	32.33	22.18	36.65	27.41	33.52	14.85	36.33
	${\hat{α}}_{0}$	${\hat{α}}_{1}$	${\hat{σ}}_{1}$	${\hat{σ}}_{2}$	${\hat{α}}_{0}$	${\hat{α}}_{1}$	${\hat{σ}}_{1}$	${\hat{σ}}_{2}$
Parameter estimate	13.70	–2.30	16.34	13.95	–0.17	0.95	16.38	14.62

Table 8. Predicted excess stock returns from the econometric model in Section 2.4 under the short-term interest rate benchmark or the inflation rate benchmark. $e_{n}^{(A)}$ is the last earnings-by-price observation in our records (transformed according to the benchmark $A$) and corresponds to December 2019. ${\hat{Y}}^{(A)}$ denotes the one-year predictions of excess stock returns from the linear Model (12) (parameter estimates in Table 6), and ${\hat{Y}}^{(A), c}$ denotes their corrected counterparts based on (13)–(15) (parameter estimates in Table 7).

Benchmark	Short-Term Interest Rate			Inflation Rate
Period	$e^{(R)}$	${\hat{Y}}^{(R)}$	${\hat{Y}}^{(R), c}$	$e^{(C)}$	${\hat{Y}}^{(C)}$	${\hat{Y}}^{(C), c}$
n	2.76	–	–	3.47	–	–
$n + 1$	2.87	4.08	4.30	4.72	4.56	4.15
$n + 2$	2.95	4.23	3.97	5.08	5.95	5.47
$n + 3$	3.02	4.34	3.71	5.18	6.35	5.85
$n + 4$	3.07	4.43	3.50	5.21	6.47	5.95
$n + 5$	–	4.50	3.33	–	6.50	5.98
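
The uncorrected columns of Table 8 can be approximately reproduced from the estimates in Table 6. The sketch below assumes that Models (11) and (12) have the first-order linear forms $e_{t+1}^{(A)} = a + b\, e_{t}^{(A)}$ and $Y_{t+1}^{(A)} = c + d\, e_{t}^{(A)}$, which is what the dependent variables and the single regressor in Table 6 suggest, and that the entries of Table 8 are percentages. The names `iterate_forecasts`, `a`, `b`, `c`, and `d` are illustrative and do not appear in the paper. The correction step (13)–(15) is not covered here.

```python
def iterate_forecasts(e_last, a, b, c, d, steps=5):
    """Iterate the two assumed linear models forward from the last observation.

    e_{t+1} = a + b * e_t   (assumed form of Model (11))
    Y_{t+1} = c + d * e_t   (assumed form of Model (12))

    Returns (e_path, y_path) in decimal units, where index j holds the
    forecast for period n + j + 1.
    """
    e_path, y_path = [], []
    e = e_last
    for _ in range(steps):
        y_path.append(c + d * e)  # next-period return from the current e
        e = a + b * e             # roll the earnings-by-price variable forward
        e_path.append(e)
    return e_path, y_path


# Rounded estimates from Table 6 and the last observation e_n from Table 8
# (2.76% and 3.47%, written here as decimals).
PARAMS = {
    "R": dict(e_last=0.0276, a=0.0066, b=0.7976, c=0.0035, d=1.3522),
    "C": dict(e_last=0.0347, a=0.0373, b=0.2859, c=0.0069, d=1.1144),
}

if __name__ == "__main__":
    for name, p in PARAMS.items():
        e_path, y_path = iterate_forecasts(**p)
        print(name, "e (%):", [round(100 * e, 2) for e in e_path])
        print(name, "Y (%):", [round(100 * y, 2) for y in y_path])
```

Since the inputs are the rounded coefficients as printed, the iterated values may differ from the tabulated ones in the second decimal place, more so at longer horizons where rounding errors accumulate.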

Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

