Debt-by-Price Ratio, End-of-Year Economic Growth, and Long-Term Prediction of Stock Returns

: With the prominent role of government debt in economic growth in recent decades, one would expect that government debt alongside economic growth to be a risk factor priced in the time series of stock returns. In this paper, this idea is investigated by applying a nonparametric model, namely, a local-linear kernel smoother with the aim of forecasting long-term stock returns where the model and smoothing parameters are chosen by cross-validation. While a wide range of predictive variables are examined, we ﬁnd that our newly introduced debt-by-price ratio and the third to fourth quarter economic growth are robust predictors of stock returns, beating the well-known predictive variables in the literature by a signiﬁcant difference. The combination of these two covariates can explain almost 30% variation of stock returns at a one-year horizon. This is very crucial considering the difﬁculty in capturing even a small proportion of movements in stock returns.


Introduction
Following World War II, the United States experienced a high and steady economic growth from 1949 to 1975. This period of real growth was accompanied by a high growth of the stock market [1]. After this period, as shown in Figure 1, the economic growth has been significantly characterised by the augmentation of public debt reaching a debt-to-GDP ratio of 135.6% in the second quarter of 2020 [2]. The policies carried out to ease the Great Recession and the COVID-19 pandemic led the economy to larger amounts of debt as well as near-zero interest rates, making us ponder if the high growth on the stock market is only to a limited degree related to the real economic growth and might as well be linked to the growing debt and low interest rates. Meanwhile, the consecutive years with a low bond yield environment have been very challenging for the long-term savers see, e.g., [3]. The objective of this research is to provide a long-term forecasting approach to improve strategic asset allocation, also when interest rates are low, by investigating the hypothesis that both economic growth and government debt are risk factors priced in the time series of stock returns.
Many studies have attempted to predict stock returns using macroeconomic variables, mainly because they can affect the firm's future cash flows by variation in the investment opportunities and consumption. Examples include default and term spreads on bonds [4]; interest rates [5,6]; inflation rate [7]; consumption-wealth ratio [8]; investment-capital ratio [9]; investment plans [10]; ratio of labor income to total income [11]; ratio of housing to total consumption [12], out-put gap [13] and end-of-year economic growth [14]. Regardless of this substantial effort, it has been empirically puzzling to find signs of macroeconomic variables when forecasting stock returns. There is a lack of robust forecasting power within macroeconomic variables, excluding the term spread, short-term interest rate and consumption-wealth ratio. The most-examined variables have been financial indicators instead of macroeconomic variables [15]. Following this line of investigation, this paper introduces a brand new variable which is the ratio of public debt to stock prices, namely, debt-by-price ratio. When debt-by-price ratio is combined with the end-of-year economic growth [14] in a multivariate setting accounts for a considerable fraction of the US stock returns variation confirming our initial hypothesis.  Figure 1. The United States annual debt-to-GDP ratio % growth (dark blue line), calculated using the annual gross federal debt as percent of gross domestic product [16]. The sample period is 1947-2020.
This work is inspired by the research on valuation ratios which connect stock prices to their cash flows, namely, dividend-by-price and earnings-by-price ratios, e.g., [17][18][19][20][21]; price-by-GDP ratio [22], which is the combination of valuation ratios and macroeconomic variables due to the belief that changes in aggregate dividends or earnings are linked with the aggregate output in the economy; the recent literature on the relationship between government debt and expected returns, e.g., [23]; the importance of information stored in the end-of-year economic growth [14].
It is historically popular that additional government debt can boost the economy in the short-term and might also trigger long-term growth since the extra spending helps the economy. On the contrary, debt can hamper the long-term economic growth on the path to recovery with rises in the inflation, tax and borrowing costs, e.g., [24][25][26]. In spite of these, the empirical literature on the relationship between fiscal policy and stock returns has been rather sparse until recently. For instance, a limited number of studies examine the impact of fiscal policy on stock prices in the Canadian and US stock markets [27][28][29]. Furthermore, a positive relationship between debt-to-GDP ratio and risk premiums for high R&D firms using the cross-section of stock returns has been confirmed [23]. In this research, by examining a long list of debt related covariates, debt-by-price ratio shows exceptional performance in forecasting US stock returns such that an increase in debt-by-price ratio is associated with a rise in excess stock returns.
Furthermore, it is stated that the end-of-year economic growth contains important information about stock returns that other quarters do not [14]. It is argued that establishing a relationship between stock returns and economic growth using all the quarters offset the end-of-year effect. By examining the forecasting power of both fourth to fourth quarter and third to fourth quarter economic growth rates in predicting one-year-ahead excess stock returns, the results of this paper support the argument that the third to fourth quarter economic growth contains such powerful information that cannot be detected from the fourth to fourth quarter growth rates. This finding exclusively holds for economic-growth related variables, namely, GDP, industrial production and consumption. On that account, annual frequency data for the remainder of the predictive variables would suffice.
The empirical findings of this paper demonstrate that debt-by-price ratio and endof-year GDP growth are both individually strong predictors of expected returns with out-of-sample R-squared equal to 17.3% and 15.4%, respectively. Both variables contain independent information such that when combined together, the out-of-sample R-squared reaches 28% for a one-year horizon. While there is a positive relationship between debt-byprice ratio and excess stock returns, GDP growth and excess stock returns are negatively correlated. High debt-by-price ratio and low GDP growth rate relate to high equity risk premiums. While the out-of-sample R-squared of many well-known predictive variables can hardly beat the historical average [30], as evidenced by the results in this paper, the current research adds to the stock return predictability literature [15,[31][32][33][34][35][36]. This new approach describing the impact of US economic growth and government debt on stock returns also adds new insights into recent methodological advances (see [35]).
The rest of the paper proceeds as follows. Section 2 explains the technical motivation. Section 3 illustrates the framework of prediction, the underlying financial model and the data. In Section 4, the results of the empirical application in different validated scenarios are demonstrated. Section 5 concludes.

The Technical Motivation
For the application of long-term stock returns prediction, one-year horizon models are constructed using the annual data of S&P500 index during 1947-2020. It is noteworthy that applying a data set with a small number of observations can give rise to problems related to data sparsity and overfitting, and one could suggest the use of a shorter-term frequency data to overcome this challenge. However, the bias-variance trade off is determined by the horizon and the application of prediction differs extremely for various horizons.
To avoid such problems, this paper builds on the large actuarial-predictability literature [35][36][37][38][39][40] with the intention of importing more structure in the estimation process. Initially, a validated R-squared appropriate for nonparametric prediction is introduced in an attempt to predict the Danish stock market [37]. Later on, the US stock returns in excess of risk-free rates are predicted comparing parametric, semi-parametric and nonparametric regressions [38,39]. Then, the popular forecasting variables in the literature are tested in a nonparametric setting using different benchmarks, namely, US stock returns in excess of risk-free rates; long-term interest rate; earnings-by-price ratio and inflation [36]. The extension to longer horizon prediction is subsequently studied [35]. Finally, the conditional variance of long-term stock returns for both the one and five year horizons are predicted [40]. The latter is recently mentioned in the Special Issue on "Machine Learning in Insurance" as one of the ten high-quality articles developing new theoretical or practical advances in insurance [41].
According to the literature, stock return predictability is more powerful when nonparametric models are considered, yet the linear regression is the more commonly used approach. In keeping with the studies mentioned above, a local-linear kernel smoother combined with a leave-k-out cross-validation is applied in this research. Thereafter, an out-of-sample measure of predictive power is used, as an in-sample measure such as goodness-of-fit does not prove suitable for nonparametric models. For both model and optimal bandwidth selection, the generalised version of the validated R-squared R 2 V , introduced by Nielsen and Sperlich [37], which measures the predictive power of a given model based on a leave-k-out cross-validation compared to the cross-validated historical mean, is applied. R 2 V takes negative values when the estimated prediction model preforms worse than the cross-validated historical mean. For instance, only one-and two-dimensional models are illustrated, as all attempted three-dimensional models achieve unsatisfactory predictions. The classical R 2 would clearly choose the three-dimensional covariate as it tends to choose the most complicated model. We should note that the absence of prediction is not due to an inadequate model but due to the estimation error. Considering the fact that the prediction of stock returns is rather difficult, the majority of the attempted covariates result in negative R 2 V values. For greater clarity, only the covariates with the highest predictive power, and a few for the purpose of comparison, are indicated; otherwise, many more covariates have been examined throughout this research.

The Nonparametric Model
Stock returns are defined as where D t denotes the (nominal) dividends paid during year t and P t the (nominal) stock price at the end of year t, in excess (log-scale) of a given benchmark B where A ∈ {R, L, E, C} with, respectively, R t is the short-term interest rate, L t the long-term interest rate, E t the earnings accruing to the index in year t, and CPI t the consumer price index for year t. The nonparametric regression model for the one-year horizon prediction is then given by where the conditional mean m(x) = E(Y (A) |X = x), x ∈ R q , is an unknown smooth function and ξ t a martingale difference process, i.e., has serially uncorrelated zero-mean random error terms, given the past, of an unknown conditionally heteroscedastic form σ(x). We predict one-year horizon excess stock returns by applying a local-linear kernel smoother using a quartic kernel where the optimal bandwidth, and consequently the best model, is chosen by cross-validation. Provided that a local-linear smoother is used, the model can estimate linear functions without any bias and the linear regression is embedded by design in this approach. Longer horizon predictions can also be incorporated in the analysis with the sum of annual continuously compounded returns, Z t+i . Further details regarding the nonparametric regression and its extension to longer horizon predictions are included in Kyriakou et al. [35,36].
This paper attempts to forecast one-year-ahead excess stock returns Y (A) t measured over the calendar year using the time-lagged macroeconomic variables X t−1 related to government debt and economic growth. A few of these variables are demonstrated in Table 1; otherwise, many more variables have been examined. For the purpose of comparison, year-over-year growth rates as well as popular predictive variables in the literature are also illustrated, namely, dividend-by-price ratio d t−1 = D t−1 /P t−1 ; earningsby-price ratio e t−1 = E t−1 /P t−1 ; short-term interest rate r t−1 = R t−1 /100; long-term interest rate l t−1 = L t−1 /100; inflation π t−1 = (CPI t−1 − CPI t−2 )/CPI t−2 ; term spread s t−1 = l t−1 − r t−1 ; the consumption-wealth ratio cay t−1 [8]. Table 1. Time-lagged predictive variables X t−1 related to government debt and economic growth. Q/Q is the quarter-over-quarter growth rate from the third to the fourth quarter of a year, GDP is the nominal gross domestic product, IP the industrial production index, CON nominal personal consumption expenditures and DEB the total government debt.

Predictive Variables
Growth in personal consumption expenditures (Q/Q)

Out-of-Sample Validation for Optimal Model and Bandwidth Selection
Since a nonparametric regression is applied in this research, classical in-sample measures such as R 2 or adjusted R 2 are not appropriate. Thus, the generalised version of the validated R 2 (R 2 V ) [37], a suitable out-of-sample measure, is used in order to determine the forecasting power of the nonparametric models. This method is based on a leave-k-out cross-validation and is employed for both model and bandwidth selection. Cross-validation is an advantageous measure for preventing the occurrence of overfitting as well as for finding the optimal bandwidth for non-and semi-parametric estimators, e.g., [42]. R 2 V measures the predictive power of a given model compared to the cross-validated historical mean; positive R 2 V implies that the predictor-based regression model (2) outperforms the corresponding historical average excess stock returns.
The validation criterion is defined as where leave-k-out estimators are used:m −t for the nonparametric function m andȲ Additionally, the forecasting power of nominal stock returns ln S is measured by back-transforming the prediction models of benchmarked excess stock returns in order to make the results comparable for the different benchmarks. It should be acknowledged that prediction based on different benchmarks has its own merits. For instance, the short-term interest rate is beneficial for classical market-timing strategies or the inflation benchmark for wealth and purchasing power problems of long-term investment planning see, e.g., [43].
The stock return in excess of benchmark A is Then, the back-transformed stock return prediction is defined as whereŶ is the predicted benchmarked excess return, and we redefine our original validation criterion (3) asR where ln S −t is the back-transformed stock return prediction from the original leaveone-out benchmarked return prediction and ln S −t the unconditional mean of ln S. This allows us to validate in terms of R 2 V with the observed ln S t . There is a fundamental difference between Equations (3) and (5). Equation (3) chooses the optimal bandwidth and, subsequently, the predictive model via cross-validation. However, Equation (5) is not based on cross-validation and applies the bandwidths chosen in Equation (3) only to measure the predictive power of nominal returns for different benchmarking scenarios. When the main goal of the prediction is the nominal stock returns, one would cross-validate directly.

Data
The empirical analysis in this research attempts to connect government debt and economic growth data to stock returns stressing on the long-term actuarial predictability.
The prediction models cover the sample period from 1947 to 2020 since the quarterly national accounts are only available from 1947.
Annual and quarterly frequency data of seasonally adjusted nominal gross domestic product (GDP) and nominal personal consumption expenditures are extracted from the U.S. Bureau of Economic Analysis [44]. Furthermore, the seasonally adjusted industrial production index is obtained from the Federal Reserve Economic Data (FRED) database of the Federal Reserve Bank of St. Louis [45]. The nominal total public debt (not seasonally adjusted) is downloadable from the U.S. Department of the Treasury Bureau of the Fiscal Service database [46]. It should be noted that seasonality has not been detected in the debt series and no seasonal adjustments have been done. Furthermore, the GDP, consumption expenditures, and government debt are measured in trillions of dollars. The rest of the financial data is available from Robert Shiller's website [47] which, for a given year, offers the January value of the Standard and Poor's Composite Stock Price Index, the value of the related dividends and earnings, consumer price index, and interest rate data. The short-term interest rate from Shiller's website is based on a 6-month commercial paper rate until 1997 and subsequently on the 6-month certificate of deposit rate which is no longer available after 2013. This series is substituted by an annual yield based on the six-month Treasury-bill rate from 1958 to 2020 [48], and prior to this period the estimations by Kyriakou et al. [35] using the regressions of Treasury-bill rate on the risk-free rate from Shiller's data for the overlapping period 1958-2013 have been used. Finally, the alth ratio has been downloaded from Amit Goyal's webpage [49]. Table 2 reports the mean and standard deviation for the one-year excess stock returns and the forecasting variables introduced in Table 1. In Figure 2, the most successful forecasting variables in this analysis are plotted in an attempt to detect the elements of expected returns they could demonstrate. Debt-by-price ratio, third to fourth quarter GDP growth and term spread alongside business cycles measured by the NBER [50] covering 1947-2020 are demonstrated. Term spread is historically known as a good predictor of the stock market and is widely recognised to contain information about shorter-term business cycles [4,35,36,51]. It is also one of the outstanding predictors amongst the regression models in this paper. While the debt-by-price ratio demonstrates some business cycle fluctuations with an increase near the peaks and a decrease near the troughs during almost all the recessions, its overall fluctuations seem to be persistent over the years, and contain signals about the longer-term business conditions that tend to persist. On the other hand, the GDP growth and the term spread fluctuations are mainly near business peaks and troughs. It appears that both similarly contain information about short-term fluctuations and recessions. Table 2. The table shows the summary statistics for the dependent and forecasting variables. Y (R) is the stock return in excess of risk free rate; gdp the third to fourth quarter GDP growth; ip the third to fourth quarter industrial production growth; con the third to fourth quarter consumption growth; gpr the GDP-by-price ratio ; dpr the debt-by-price ratio and dgr the debt-by-GDP ratio. The sample period is 1947-2020.

The Relationship between Stock Returns and Government Debt
The present value model which relates stock prices to their expected future cash flows based on the dynamic generalisation of the Gordon growth model states that log dividend-price ratio d t − p t can be written as discounted value of all future returns r t+j and dividend growth rates ∆d t+j discounted at the constant rate ρ see, e.g., [19,20,52]. This model is derived by taking a first-order Taylor approximation of the equation defining the log stock returns s t = log(P t where E t is the expectation operator conditional on the information at time t, ρ ≡ 1/(1 + exp(d − p)), d − p is the mean log dividend-price ratio and k is a constant. Equation (6) by ruling out the rational bubbles demonstrates the possibility of capturing the expectations of investors by examining the change in the dividend-price ratio. When the stock prices are high for the given dividends, stock market participants ought to be expecting either high dividend growth rates or low returns on the assets and vice versa. Nevertheless, the importance of valuation ratios has been less in forecasting stock returns in recent years [15,53,54]. It is demonstrated that the relationship between dividend yields and future stock returns is cancelled out with the time-varying expected dividend growth [11]. Furthermore, it is found that the dividend-by-price ratio fails to predict the fluctuations in expected returns and dividend growth due to the offsetting effect of positively correlated movements [55], originating the question whether there is another variable that can explain the movements in stock returns in a way that can also predict this offsetting component. For example, the consumption-wealth ratio and price-to-GDP ratio are introduced in an attempt to find this unknown covatiate [8,22].
Alternatively, in this research, after examining many debt-related variables, such as government debt, budget deficit, debt-to-GDP ratio and their associated growth rates, the debt-by-price ratio is proposed. This consideration is based on the fact that government debt is the main function of the current economy, and it is probably the main determinant of the variation in expected returns. It is worth noting that dividing by prices is just a practical method to detrend the government debt series and what matters is the high levels of government debt.
According to Equation (6), what could be happening is that in times of financial uncertainty and rising levels of government debt, investors consider the market risky possibly because of the anticipated increase in tax payments, lower cash flows, decline in economic growth, and generally the elevated fiscal uncertainty. Hence, they panic, sell and bring down the share prices which accordingly increases the expected returns. The government's money injection in the economy in order to overcome the hardship comes into force eventually via the positive relationship of the government debt with stock prices and dividends payout, and the investors who bought low can reap the rewards.
Government debt can positively affect stock prices and dividends payout for a few possible reasons. First, the government debt can affect firms' dividends payout as companies have become more friendly towards shareholders, returning more of their earnings in the form of dividends and stock buybacks even if they are burdened with debt. Second, the increased stock buybacks can inflate the companies' stock prices artificially. Third, the government debt can stimulate GDP growth by investing in the company and creating more jobs; also the stockholders' gains from the share buybacks can be reinvested in other companies, hence increase the stock prices. Fourth, high levels of government debt are usually associated with low interest rates, which pressure the investors to move their money from the bond to the equity market and, as a result, amplify stock prices.
On the other hand, it is possible that there is a simpler explanation beyond such asset pricing models. The authors of [56] picture the history of the last several hundred years filled with credit cycles. It is explained that some structural change leads to a change in the amount of money and a massive credit boom, followed by an asset boom in the domestic markets which encourages investors for risky investments, and they will continue investing in the markets as those investments pay off. At some point, the capital flow reaches the foreign lands from the starting markets. Finally, the expansion ends even more unexpectedly than when it started. It appears that, throughout history, credit booms-or, to put it more bluntly, pumping liquidity into the economy-have been the real reason behind the stock market booms.
Univariate Regressions of Stock Returns on Price Ratios Table 3 reports R 2 V values based on the univariate regressions of excess stock returns on dividend-by-price ratio, earnings-by-price ratio, debt-by-price ratio and GDP-by-price ratio under different benchmarks, and the correspondingR 2 V values related to the prediction of stock returns ln S t validated on the original scale without benchmark in terms of backtransformed stock return prediction ln S t . . The prediction problem is defined in (2). The predictive power (%) is measured by R 2 V as defined in (3). The benchmarks B (A) considered are based on the short-term interest rate (A ≡ R), long-term interest rate (A ≡ L), earnings-by-price ratio (A ≡ E), and consumer price index (A ≡ C). The predictive variables X t−1 used are given by dividend-by-price ratio d t−1 , earnings-by-price ratio e t−1 , debt-by-price ratio dpr t−1 and GDP-by-price ratio gpr t−1 . The predictive power (%) for the one-year stock returns ln S t validated on the original scale without benchmark is measured byR 2 V as defined in (5) in terms of back-transformed stock return prediction ln S t defined in (4). The sample period is 1947-2020. The results confirm the literature that casts doubts about the predictive power of dividend-by-price or earnings-by-price ratios. While earnings-by-price ratio (e) shows low explanatory power under all the benchmarks with R 2 V in the range of −1.9-1.6% and R 2 V in the range of −5.8-4.4%, dividend-by-price ratio (d) can at the most explain 5.1% variation of stock returns in excess of short-term rate with R 2 V in the range of 0.1-4.1% under the remaining of the benchmarks andR 2 V in the range of −2.1-6.2%. The GDP-byprice ratio (gpr) also does not look promising with R 2 V in the range of 0.17-0.5% andR 2 V in the range of −6.15-6.44%. All of these variables perform acceptably in the prediction of nominal stock returns under the earnings-by-price benchmark. This is not unanticipated, as earnings-by-price ratio is regularly used as a return measure by investors and provides an adequate understanding of an investment return. On the other hand, the debt-by-price ratio (dpr) performance is remarkable with R 2 V = 17.3% under the risk-free rate benchmark andR 2 V = 16.2% under the earnings-by-price benchmark. It also appears to be a powerful predictive variable with regards to the other benchmarks with R 2 V = 15.1%, 10.7% and 13.3% under the long-term rate, earnings-by-price and inflation benchmarks, andR 2 V = 11.9%, 12.6% and 8.8% under the short-term rate, long-term rate and inflation benchmarks. Figure 3 demonstrates the estimated functionm (dark blue line) for the one-year horizon together with a corresponding linear model (dotted red line). The figure confirms that an increase in the debt-by-price ratio corresponds to an increase in the stock returns. In general, a debt-by-price ratio of more than 0.5% forecasts a positive return of up to almost 20% for a debt-by-price ratio of 1.25%. Excluding the two outliers which belong to the post-war period of 1947-1948, using the risk-free benchmark then the functional form can be written as a simple line:
Equation (7) simply illustrates that when 2027.971% is added to the government debt divided by the stock price plus −9.874% equals the one-year ahead expected return.

Third to Fourth Quarter Economic Growth
In this section, the predictive power of both the third to fourth quarter and the fourth to fourth quarter (annual) growth rates of GDP, industrial production and consumption in forecasting stock returns are studied. Different economic growth variables are applied, establishing the fact that the results are not guided by chance. It must be pointed out that only the third to fourth and the fourth to fourth quarter growth rates are examined as the one-year ahead excess stock return measured over the calendar year is the target of the prediction.
Møller and Rangvid [14] analyze the first, second, third and fourth quarter growth rates of economic growth to predict one-year ahead excess stock returns, and indicate that except the fourth quarter economic growth rate, other quarters do not affect the expected returns significantly. They also examine the fourth to fourth versus the third to fourth quarter growth rates, and show that the third to fourth quarter growth rates contain much more information than the fourth to fourth quarter growth rates. They argue that investors are lazy [57] and make portfolio decisions about expected returns mostly at the the end of the year [58,59] because of information and transaction costs [60]. Moreover, extensive macroeconomic fluctuations around the end of the year result in large business cycle variations [61] which causes the investors to take this huge end-of-the-year economic growth into consideration [62]. This argument fairly summarizes the economic intuition behind the fact that expected returns are more affected by the third to fourth quarter growth rates. Table 4 demonstrates R 2 V values based on the univariate regressions of excess stock returns on the third to fourth quarter and the fourth to fourth quarter (annual) economic growth rates of GDP, industrial production and consumption under different benchmarks, and the correspondingR 2 V values related to the prediction of stock returns ln S t validated on the original scale without benchmark in terms of back-transformed stock return prediction ln S t .

Univariate Regressions of Stock Returns on Economic Growth
The results in Panel A illustrate a robust relationship between the third to fourth quarter economic growth variables and both excess and nominal stock returns under all the benchmarks with R 2 V in the range of 13.3-17.3% andR 2 V in the range of 8.9-20.9%. It is notable that all the third to fourth quarter economic growth rates achieve high R 2 V values. GDP (gdp) performs the best under the inflation benchmark with R 2 V = 17.2%, but also does well under the short-term rate, long-term rate and earnings-by-price benchmarks with R 2 V = 15.4%, 14.5% and 15.7%, respectively. Consumption (con) is the leading predictor under the short-term rate benchmark with R 2 V = 17.3%, and R 2 V in the range of 14.6-16.2% under the remainder of the benchmarks. R 2 V values related to industrial production (ip) remain moderately similar within the range 13.3-15.7%. The values related to nominal stock returns tell a slightly different story. While the forecasting power of GDP (gdp), industrial production (ip) and consumption (con) is reasonable under the short-term rate, long-term rate and inflation benchmarks with, respectively,R 2 V in the ranges of 9.9-13%, 8.9-12.8% and 10.2-13.8%, the case of the earnings-by-price benchmark is remarkable, leading toR 2 V in the range of 20.1-20.9%. . The prediction problem is defined in (2). The predictive power (%) is measured by R 2 V as defined in (3). The benchmarks B (A) considered are based on the short-term interest rate (A ≡ R), long-term interest rate (A ≡ L), earnings-by-price ratio (A ≡ E), and consumer price index (A ≡ C). The predictive variables X t−1 used on Panel A are given by the third to fourth quarter growth rates of GDP gdp t−1 , industrial production ip t−1 and personal consumption expenditures con t−1 . Panel B shows the fourth to fourth quarter (annual) growth rates of these corresponding variables. The predictive power (%) for the one-year stock returns ln S t validated on the original scale without benchmark is measured byR 2 V as defined in (5)  On the other hand, as demonstrated in Panel B, the fourth to fourth quarter economic growth rates are not promising to any extent. GDP (gdp) and consumption (con) result in negative R 2 V values under any benchmark, and industrial production (ip) shows minor signs of forecasting power with R 2 V in the range of 0.8-3.0%. In addition, earnings-byprice is the only benchmark that results in positiveR 2 V values in the range of 0.6-6.9% in prediction of nominal stock returns.
Overall, these findings confirm the fact that the third to fourth quarter growth rates contain such powerful information about the stock returns which, when the fourth to fourth quarter growth rates are used, this information fades away with the noise existing in the other quarters. It should be pointed out that both seasonally adjusted and not seasonally adjusted data were examined during this analysis, and it is by removing the seasonality that the excellent performance of the third to fourth quarter growth rates comes into effect. As is clear from Figure 4, there is a negative relationship between the third to fourth quarter GDP growth and the excess returns, forecasting positive excess returns for a GDP growth rate of less than 3%. This outcome is in line with the hypothesis of the variation of excess returns with business cycles; for instance, investors' unwillingness to invest during the recessions increases the expected returns [62].   Table 5 displays, for the sake of comparison, R 2 V values based on the univariate regressions of excess stock returns on a couple of well-known predictive variables in the literature such as the short-term interest rate, long-term interest rate, inflation, term spread, debt-by-GDP ratio and consumption-wealth ratio under various benchmarks, and the corre-spondingR 2 V values related to the prediction of stock returns ln S t validated on the original scale without benchmark in terms of back-transformed stock return prediction ln S t . Table 5. Predictive power for the one-year excess stock returns Y (A) t . The prediction problem is defined in (2). The predictive power (%) is measured by R 2 V as defined in (3). The benchmarks B (A) considered are based on the short-term interest rate (A ≡ R), long-term interest rate (A ≡ L), earnings-by-price ratio (A ≡ E), and consumer price index (A ≡ C). The predictive variables X t−1 used are given by the short-term interest rate r t−1 , long-term interest rate l t−1 , inflation π t−1 , term spread s t−1 , debt-by-GDP ratio dgr t−1 and consumption-wealth ratio cay t−1 . The predictive power (%) for the one-year stock returns ln S t validated on the original scale without benchmark is measured byR 2 V as defined in (5) in terms of back-transformed stock return prediction ln S t defined in (4). The sample period is 1947-2020. All in all, the term spread (s) is the most important predictor with R 2 V = 11.4% under the short-term benchmark andR 2 V = 14.3% under the earnings-by-price benchmark, but also performs reasonably under the different benchmarks with R 2 V andR 2 V in the ranges of 8.3-9.6% and 5.0-5.7%, respectively. This is consistent with the previous work of the authors of [35,36] where the term spread is the most favourable variable explaining the excess returns in the univariate setting with R 2 V in the range of 6.2-9.7% for the sample period 1872-2019. As shown in Figure 5, the term spread signals crucial information about the possibility of a bear stock market, any term spread less than −0.3% yields negative returns. The debt-by-GDP ratio (dgr) which has been mostly applied in the literature on forecasting stock returns emphasizing on government debt can at most explain 6.9% variation of excess returns under the risk-free rate benchmark, and 7.8% variation of nominal returns under the earnings-by-price benchmark. Under the long-term rate, earnings-by-price and inflation benchmarks, the performance of this covariate declines to R 2 V = 6.0%, 1.7% and 3.7%, respectively.R 2 V values also result in lower predictive power under the different benchmarks within the range of −1.3-3.3%.

Benchmark B
It comes to our attention that other covariates cannot forecast stock returns with remarkable R 2 V andR 2 V values. The well-known consumption-wealth ratio ( cay) can only explain the variation of excess returns with R 2 V = 3.0%, 3.7% and 3.8% under the short-term rate, long term rate and inflation benchmarks, respectively. The short-term interest rate (r) accounts for the variation of excess returns with R 2 V = 3.7% under the short-term rate benchmark and R 2 V = 2.6% under the long-term rate benchmark. Likewise, the long-term interest rate (l) results in R 2 V of 2.1% and 2.4% under the same benchmarks. Inflation (π) demonstrates almost no signs of prediction power with R 2 V in the range of −1.6-1.7%. Finally, the predictive power of the earnings-by-price ratio benchmark in prediction of nominal stock returns has been, once more, highlighted with, respectively,R 2 V = 5.4%, 4.6%, 4.6% and 6.1% for the covariates r, l, π and cay.
A few more interesting comments are in order. The term spread is extensively known to contain robust information about prediction of stock returns. While this holds true in this analysis, term spread also proves to be a very consistent predictive variable as it predicts excess returns with comparable R 2 V values in different sample periods. Furthermore, the reasonable predictive power of debt-by-GDP ratio confirms our findings about the importance of government debt in forecasting US stock returns. Table 6 illustrates a few of the best two-dimensional models, namely, the multivariate regressions of excess stock returns on the combinations of third to fourth quarter growth rates of GDP, industrial production and consumption with the dividend-by-price ratio, earnings-by-price ratio, term spread and debt-by-price ratio. More specifically, many more combinations with a reasonable forecasting power have been obtained throughout this research, however, only the best models are demonstrated. We can observe that the predictive power of two-dimensional models are higher compared to the one-dimensional cases. It must be underlined that only one-and two-dimensional models are the main focus of this analysis, as all the attempted three-dimensional models have resulted in unsatisfactory R 2 V values. This is anticipated, as a nonparametric model requires more observations than a linear regression to achieve reliable estimates when dealing with higher-dimensional models.

The Multivariate Setting
The case of the predictor GDP (gdp) is impressive; combining this with any of the covariates d, e, s and dpr results in the highest R 2 V values in the ranges of 18.  in a two-dimensional setting. The prediction problem is defined in (2). The predictive power (%) is measured by R 2 V as defined in (3). The benchmarks B (A) considered are based on the short-term interest rate (A ≡ R), long-term interest rate (A ≡ L), earnings-by-price ratio (A ≡ E), and consumer price index (A ≡ C). The different pairwise combinations of predictive variables, X t−1 are given by the combination of third to fourth quarter growth rates of GDP gdp t−1 , industrial production ip t−1 and personal consumption expenditures con t − 1 with dividend-by-price ratio d t−1 , earnings-by-price ratio e t−1 , term spread s t−1 and debt-by-price ratio dpr t−1 . The predictive power (%) for the one-year stock returns ln S t validated on the original scale without benchmark is measured byR 2 V as defined in (5) in terms of back-transformed stock return prediction ln S t defined in (4). The sample period is 1947-2020. The two-dimensional combinations of industrial production (ip) and consumption (con) bring about lower predictive power than the previous ranges. The best combinations (ip, dpr) and (con, dpr) although resulting in R 2 V within the range of 19.9-25.4% and 18.2-25.9%, still perform worse. We find that consumption (con) performs much better than industrial production (ip) when put together with the other covariates excluding the term spread (s). Notable contributions are (con, d), (con, e) and (ip, s).
The important aspects to notice from Table 6 are as follows: First, the third to fourth quarter GDP growth rate generate the highest R 2 V andR 2 V combined with other covariates. Second, the enhanced forecasting power of dividend-by-price and earnings-by-price ratios when combined with additional information, for instance, with R 2 V of 24.8% (gdp, d) and 22.9% (gdp, e) against maximum R 2 V of 5.1% (d) and 1.6% (e) under the long-term rate benchmark. Third, the importance of earnings-by-price ratio benchmark in forecasting nominal stock returns has once again been emphasized. To conclude, the winning pair is debt-by-price ratio and third to fourth quarter GDP growth with R 2 V = 28.6% under the risk-free rate benchmark andR 2 V = 28.5% under the earnings-by-price benchmark. This combination also performs perfectly under the inflation benchmark with R 2 V = 26.2% andR 2 V = 22.4%. This finding is extremely valuable as the focus of this research is modelling of stock returns in real terms applicable to long-term saving strategies. The significant improvement brought in the two-dimensional setting points out to the assumption that economic growth and debt-by-price ratio contain different types of information about the stock returns. As shown in Figure 2, the debt-by-price ratio signals more persistent business conditions, whereas the GDP growth signals shorter-term fluctuations. To get a better understanding, Figure 6 shows the relationship between the one year excess stock returns and the combination of third to fourth quarter GDP growth with dividend-by-price ratio, earnings-by-price ratio, term spread and debt-by-price ratio under the risk-free benchmark by the estimated nonparametric functionm (light blue surface) together with the underlying observations (green balls). It can be observed that a high debt-by-price ratio of 1.7% and a low third to fourth quarter GDP growth rate of −1.6% predict an extremely high one-year excess stock return of 39%.  Figure 6. Risk-free rate benchmark. Relation between one year excess stock returns and the combination of third to fourth quarter GDP growth with the dividend-by-price ratio, earnings-by-price ratio, term spread and the debt-by-price ratio. Estimated nonparametric functionm (light blue surface), observations (green balls). The sample period is 1947-2020.

Conclusions
In this paper, we extend the application of forecasting stock returns in Kyriakou et al. [35,36] in order to explore new predictive variables to beat the commonly discussed covariates in the literature. We predict one-year horizon stock returns in excess of the short-term rate, long-term rate, earnings-by-price ratio and inflation using the forecasting variables related to government debt and economic growth. A large number of covariates and their combinations have been examined for purposes of comparison throughout this research.
We propose a brand-new variable, debt-by-price ratio, which together with the third to fourth quarter economic growth prove to be extremely effective in the prediction of excess and nominal stock returns, individually and combined together. This superb performance outdoes the popular forecasting variables such as dividend-by-price ratio, earnings-byprice ratio, consumption-wealth-ratio, term spread and debt-by-GDP ratio by a perceptible difference, and is also valid under all the benchmarks; therefore, it can be used for various investment strategies including real value pension forecasts.
The overall conclusions are as follows. First, we find that US government debt and economic growth have a significant impact on stock prices. The most successful combination is debt-by-price ratio and third to fourth quarter GDP growth; using debtby-price ratio or third to fourth quarter GDP growth as an individual predictor and twodimensional predictors that include the third to fourth quarter GDP growth capture the best predictive performances. While the prediction of stock returns is incredibly puzzling, an appropriate choice of prediction model combined with suitable forecasting variables can achieve high-quality results. Second, this paper highlights the impact of liquidity on stock prices formulated in simple terms via government debt. There seems to be a political element of stock price variation that gives rise to worldwide distribution of wealth that is worth investigating further in the future. The limitation of the research includes lack of available data to expand the sample size and further study various predictive variables.