Investor Happiness and Predictability of the Realized Volatility of Oil Price

Bonato, Matteo; Gkillas, Konstantinos; Gupta, Rangan; Pierdzioch, Christian

doi:10.3390/su12104309

¹ Department of Economics and Econometrics, University of Johannesburg, P.O. Box 524 Auckland Park, Johannesburg, South Africa

² IPAG Business School, 184 Boulevard Saint-Germain, 75006 Paris, France

³ Department of Business Administration, University of Patras, University Campus, Rio, P.O. Box 1391, 26500 Patras, Greece

⁴ Department of Economics, University of Pretoria, Pretoria 0002, South Africa

⁵ Department of Economics, Helmut Schmidt University, Holstenhofweg 85, P.O. Box 700822, 22008 Hamburg, Germany

* Author to whom correspondence should be addressed.

Sustainability 2020, 12(10), 4309; https://doi.org/10.3390/su12104309

Submission received: 23 March 2020 / Revised: 23 April 2020 / Accepted: 5 May 2020 / Published: 25 May 2020

(This article belongs to the Special Issue Behavioral Business and Behavioral Financial Economics with Applications)

Abstract

We use the heterogeneous autoregressive realized volatility (HAR-RV) model to analyze, both in sample and out of sample, whether a measure of investor happiness predicts the daily realized volatility of oil-price returns, where we use high-frequency intraday data to measure realized volatility. Full-sample estimates reveal that realized volatility is significantly negatively linked to investor happiness at a short forecast horizon. Similarly, out-of-sample results indicate that investor happiness significantly improves the accuracy of forecasts of realized volatility at a short forecast horizon. Results for a medium and a long forecast horizon are insignificant. We argue that our results shed light on the role played by speculation in oil products and the potential function of oil-related products as a hedge against risks in traditional financial assets.

Keywords:

investor happiness; oil market; realized volatility; forecasting

JEL Classification:

G15; G17; Q02

1. Introduction

The oil market’s recent financialization has led to increased participation of hedge funds, pension funds and insurance companies in the market, thus rendering oil a profitable alternative investment in the portfolio decisions of financial institutions [1,2,3,4,5] (Bahloul et al., 2018, Bonato 2019). Hence, accurate estimates of oil-price volatility are of vital importance to oil traders. At the same time, oil-price volatility is a concern from a policy perspective, as it has been shown to negatively impact economic activity because it captures macroeconomic uncertainty [6,7] (Elder and Serletis 2010, van Eyden et al., 2019). Oil-price fluctuations also have many consequences for most non-energy-producing companies by increasing the cost of doing business. Companies always seek new ways of managing oil-price volatility, and governments are concerned about the impact of oil-price volatility on economic growth and prosperity. A better econometric understanding of oil-price volatility is vital for its effective management and could lead to a competitive advantage by reducing operating costs and business risk. According to [8] Henriques and Sadorsky (2010), the key is an increase in a firm’s environmental sustainability because it goes hand in hand with a lower energy-price exposure. However, in the short term, it is economically important to proceed with a systematic characterization of the types of events that cause oil-price volatility to fluctuate over time. In light of this, high-frequency forecasts of oil-market volatility can be used in mixed-frequency models to predict the future path of low-frequency measures of economic activity, besides, of course, some existing high-frequency measures of the latter. Indeed, the impact of oil-price shocks and oil-price volatility has received great attention in the earlier literature (see [9,10,11] Jiang et al., 2018, Zhao et al., 2019, Gkillas et al., 2020, among others).

Naturally, a large body of literature exists (see [12] Lux et al., 2016 for a detailed review) on the forecastability of daily oil-price volatility using different kinds of univariate and multivariate Generalized Autoregressive Conditional Heteroskedasticity (GARCH) models, as well as the Markov-switching multifractal (MSM) model. In general, studies in this literature find that, while univariate GARCH-type models produce more accurate forecasts than their competitors within the GARCH category, the MSM model is the preferable framework most of the time across forecasting horizons and sub-samples relative to the other models considered.

A shared characteristic of the above studies is that all of them use oil-price returns at a daily frequency, and forecast the daily conditional oil-price volatility. Nevertheless, as pointed out by [13] McAleer and Medeiros (2008), intraday data containing rich information can lead to more accurate estimates and forecasts of daily volatility. In this respect, Haugom et al. (2014), Sévi (2014), Prokopczuk et al. (2015), Degiannakis and Filis (2017), Liu et al. (2018), Chen et al. (2019), and Gkillas et al. (forthcoming) [14,15,16,17,18,19,20] make use of variations of the Heterogeneous Autoregressive (HAR) model introduced by [21] Corsi (2009) to forecast the realized volatility (RV) of oil-price returns (i.e., the sum of non-overlapping squared high-frequency oil returns observed within a day; see [22] Andersen and Bollerslev 1998). Note that [23,24] Phan et al. (2016) and Chatrath et al. (2015) also forecast realized oil-price volatility derived using intraday data, but instead of using the HAR model, they use regression and GARCH-based models. The HAR model has become increasingly popular because it is able to capture significant features of financial-market volatility, including long memory and multi-scaling behavior. In sum, except for the recent studies of [17,18,19,20] Degiannakis and Filis (2017), Liu et al. (2018), Chen et al. (2019) and Gkillas et al. (forthcoming), previous studies based on intraday data conclude that all models fail to beat the forecast accuracy of a simple HAR-RV model that uses only the information embedded in realized volatility to produce forecasts. On the other hand, Degiannakis and Filis (2017) [17] argued that the HAR-RV model can be outperformed by incorporating information on the exogenous volatilities of four different asset classes (stocks, currencies, commodities and macroeconomic policy), whereas [20] Gkillas et al. (forthcoming) claim that forecast accuracy is improved when extending the baseline linear HAR-RV model to incorporate an index of financial stress, since it explains the possible asymmetry of the loss function of a forecaster. At the same time, Liu et al. (2018) and Chen et al. (2019) [18,19] argued that the benchmark HAR-RV model can be outperformed when considering time-variation and asymmetric jumps and co-jumps with the equity (S&P 500) market.

In light of this, our study aims to extend the existing (limited) literature on forecasting realized oil-price volatility (using 5 min-interval intraday data) based on the HAR-RV model by integrating into the modeling framework a daily happiness index extracted from Twitter as a proxy for (an otherwise unobservable) investor sentiment, where the sample covers the daily period from 9 September 2008 to 26 May 2017. The happiness index has been successfully used in analyzing the predictability of returns and volatility of international equity markets (see, for example, Zhang et al., 2016, 2018, You et al., 2017, Reboredo and Ugolini 2018 [25,26,27,28]). The appeal of this index emanates from the fact that it is available at high frequency and is global in nature, given the dominance of Twitter users in countries serving as major players in the world financial system, and is therefore likely to influence a global market like oil. Intuitively, the impact of investor sentiment on RV of the oil market can be either positive or negative, with both outcomes being plausible results of the financialization of the oil market. The argument that investor sentiment can increase RV rests on the clinical and psychological evidence that sentiment influences risk tolerance and, therefore, the tendency to speculate. Specifically, as soon as investor sentiment improves, risk aversion declines, leading investors to tolerate more risk, which brings about more speculation in oil products and, hence, higher volatility due to higher trading [29,30] (Hong and Yogo 2012, Singleton 2014). At the same time, if investor sentiment weakens, with oil-related products now serving as a possible hedge against risks in traditional financial assets [31,32] (Olson et al., 2017, 2019), trading in the oil market may increase, resulting in higher volatility, and hence a negative relationship between RV and the happiness index.

To the best of our knowledge, this is the first paper to analyze the role of the happiness index in out-of-sample forecasting of the realized volatility of oil-price movements. Two papers related to our work are the studies by [33,34] Qadan and Nama (2018) and Zhang and Li (2019), who provide in-sample evidence of predictability from measures of investor sentiment for oil-market volatility at daily, weekly, and monthly frequencies, but not based on intraday data, using various linear, nonlinear, and frequency-domain (wavelet) econometric methods. Somewhat related are the analyses of [35,36] Guo and Ji (2013) and Ji and Guo (2015), who look at the role of internet searches on oil-related events for in-sample predictability of oil-market volatility. [37] Campbell (2008) highlighted that the best test of any predictive model (with regard to the econometric methods used and in terms of the predictors employed) is its out-of-sample performance. Hence, our analysis can be considered a robust extension of the works of [33,34] Qadan and Nama (2018) and Zhang and Li (2019), given that in-sample predictability does not necessarily imply out-of-sample predictability.

The remainder of the paper is organized as follows: Section 2 describes the methods used in our empirical analysis. Section 3 presents our data. Section 4 summarizes our empirical results and Section 5 concludes the paper.

2. Methods

Building on the results documented by [38] Andersen et al. (2012), we use intraday data to compute the median realized variance ($MRV$) as our jump-robust estimator of the integrated daily realized variance of oil-price returns. Since there is little risk of confusion, we use the terms realized volatility and realized variance interchangeably in this study. The median realized variance $MRV$ has the advantage that it minimizes the potential effects of market-microstructure noise and jumps in our study. More specifically, we use $MRV$ because, first, it has better theoretical properties than other tripower variation estimators. Second, $MRV$ is a jump-robust measure of integrated variance: it is less biased than other estimators in the presence of jumps. Thereby, $MRV$ helps us to keep our forecasting models parsimonious. Third, $MRV$ mitigates the effect of microstructure noise and has better sample properties than other estimators of realized volatility. Fourth, $MRV$ has better finite-sample robustness in the presence of “zero” intraday returns during a trading day. We define $MRV$ as follows:

$$MRV_{t} = \frac{\pi}{6 - 4 \sqrt{3} + \pi} \frac{T}{T - 2} \sum_{i = 2}^{T - 1} \operatorname{median}(| X_{t, i - 1} |, | X_{t, i} |, | X_{t, i + 1} |)^{2}, \tag{1}$$

where $X_{t, i}$ stands for intraday oil-price return $i$ within day $t$, $i = 1, \dots, T$, and $T$ is the number of intraday oil-price observations (or $T - 1$ oil-price returns) within a day. The scaling factors ensure that every summand on the right-hand side gives an unbiased estimate of the underlying spot variance if the corresponding block of returns is i.i.d. Gaussian (see [38] Andersen et al., 2012 for more information regarding this issue).
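As an illustration of Equation (1), the following minimal Python sketch computes the median realized variance for a single day. It is not the authors' code (their computations were carried out in R); the function name `median_realized_variance` and the convention that `returns` holds the $T$ intraday returns of one day are assumptions made for this example.

```python
import math
from statistics import median


def median_realized_variance(returns):
    """Median realized variance for one day (Equation (1)).

    `returns` is assumed to be the list of intraday returns X_{t,1..T}.
    """
    T = len(returns)
    if T < 3:
        raise ValueError("need at least three intraday returns")
    # Scaling constant pi / (6 - 4*sqrt(3) + pi) from Equation (1).
    scale = math.pi / (6.0 - 4.0 * math.sqrt(3.0) + math.pi)
    # Finite-sample correction T / (T - 2).
    correction = T / (T - 2)
    total = 0.0
    # Sum over the interior returns i = 2, ..., T - 1 (0-based: 1 .. T - 2).
    for i in range(1, T - 1):
        m = median([abs(returns[i - 1]), abs(returns[i]), abs(returns[i + 1])])
        total += m ** 2
    return scale * correction * total
```

Because each summand uses the median of three adjacent absolute returns, a single isolated large return does not enter the sum, which is the sense in which the estimator is robust to jumps.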

Variants of the HAR-RV model [21] (Corsi 2009) are employed for modeling and therefore forecasting daily realized volatility of oil-price returns. The key feature of the HAR-RV model is that it uses volatilities from different time resolutions to forecast the realized volatility of oil-price returns. The model, thereby, captures the main idea motivating the heterogeneous market hypothesis ([39] Müller et al., 1997). This hypothesis stipulates that different classes of market participants populate the oil market, where traders in the different classes differ in their sensitivity to information flows at different time horizons (that is, short-term traders versus long-term traders). Despite its simple structure, the HAR-RV model can capture various volatility properties (i.e., long memory and multi-scaling behavior). The benchmark HAR-RV model is given as follows:

$$RV_{t + h} = \beta_{0} + \beta_{d} RV_{t} + \beta_{w} RV_{w, t} + \beta_{m} RV_{m, t} + \epsilon_{t + h}, \tag{2}$$

where the index $h$ stands for the forecast horizon, the $\beta$’s represent the coefficients to be estimated, and $\epsilon_{t + h}$ represents the error term. We study a short and two longer forecast horizons: $h = 1, 5, 22$. As for the two longer forecast horizons, we follow earlier literature and use the average daily realized volatility over the forecast horizon being studied. Furthermore, $RV_{w, t}$ is the average $RV$ from day $t - 5$ to day $t - 1$, while $RV_{m, t}$ stands for the average $RV$ from day $t - 22$ to day $t - 1$. When we include investor happiness ($HA$) in the benchmark HAR-RV model, we get the following extended model:

$$RV_{t + h} = \beta_{0} + \beta_{d} RV_{t} + \beta_{w} RV_{w, t} + \beta_{m} RV_{m, t} + \theta HA_{t} + \epsilon_{t + h}. \tag{3}$$
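To make the structure of Equation (3) concrete, the sketch below builds the regressors of the extended model and estimates the coefficients by ordinary least squares. It is only an illustration under stated assumptions: the helper names `har_design` and `ols` are hypothetical, `rv` and `ha` are assumed to be equally long lists of daily values, and the least-squares step solves the normal equations directly rather than reproducing the authors' R implementation.

```python
def har_design(rv, ha, h=1):
    """Build rows (1, RV_t, RV_{w,t}, RV_{m,t}, HA_t) and targets for horizon h.

    RV_{w,t} and RV_{m,t} follow the text: averages over days t-5..t-1 and
    t-22..t-1. The target is the average of RV over days t+1..t+h, which
    reduces to RV_{t+1} for h = 1.
    """
    X, y = [], []
    for t in range(22, len(rv) - h):
        rv_w = sum(rv[t - 5:t]) / 5.0
        rv_m = sum(rv[t - 22:t]) / 22.0
        X.append([1.0, rv[t], rv_w, rv_m, ha[t]])
        y.append(sum(rv[t + 1:t + h + 1]) / h)
    return X, y


def ols(X, y):
    """Solve the normal equations (X'X) b = X'y by Gauss-Jordan elimination."""
    k = len(X[0])
    # Augmented matrix [X'X | X'y].
    A = [[sum(r[i] * r[j] for r in X) for j in range(k)]
         + [sum(r[i] * yi for r, yi in zip(X, y))] for i in range(k)]
    for c in range(k):
        # Partial pivoting for numerical stability.
        p = max(range(c, k), key=lambda r: abs(A[r][c]))
        A[c], A[p] = A[p], A[c]
        for r in range(k):
            if r != c:
                f = A[r][c] / A[c][c]
                A[r] = [a - f * b for a, b in zip(A[r], A[c])]
    # Coefficients in the order (beta_0, beta_d, beta_w, beta_m, theta).
    return [A[i][k] / A[i][i] for i in range(k)]
```

Dropping the last column of each row in `X` gives the benchmark model of Equation (2), so the same two helpers can be used to compare the models with and without $HA$.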

We also extend the benchmark HAR-RV model in several other dimensions. These other extensions make it possible to assess the role played by $HA$ for forecasting realized volatility when we take into account other predictors commonly studied in the literature on realized volatility. Specifically, we extend the benchmark HAR-RV model to feature measures of realized kurtosis ($RKU$) and realized skewness ($RSK$). In line with [40] Amaya et al. (2015), $RSK_{t} = \frac{\sqrt{T} \sum_{i = 1}^{T} X_{t, i}^{3}}{(\sum_{i = 1}^{T} X_{t, i}^{2})^{3 / 2}}$, and $RKU_{t} = \frac{T \sum_{i = 1}^{T} X_{t, i}^{4}}{(\sum_{i = 1}^{T} X_{t, i}^{2})^{2}}$, where scaling by $\sqrt{T}$ and $T$, respectively, implies that the magnitudes correspond to daily skewness and kurtosis. We consider $RSK$ as a measure of the asymmetry of the daily oil-return distribution and $RKU$ as a measure that allows us to capture extreme deviations far away from the center of the daily oil-return distribution. We also take into account jumps.
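The two realized moments defined above translate directly into code. The following minimal Python sketch assumes that `returns` holds the $T$ intraday returns of one day; the function names are hypothetical and chosen only for this example.

```python
import math


def realized_skewness(returns):
    """RSK_t = sqrt(T) * sum(X^3) / (sum(X^2))^(3/2)."""
    T = len(returns)
    s2 = sum(x ** 2 for x in returns)
    s3 = sum(x ** 3 for x in returns)
    return math.sqrt(T) * s3 / s2 ** 1.5


def realized_kurtosis(returns):
    """RKU_t = T * sum(X^4) / (sum(X^2))^2."""
    T = len(returns)
    s2 = sum(x ** 2 for x in returns)
    s4 = sum(x ** 4 for x in returns)
    return T * s4 / s2 ** 2
```

Both functions divide by powers of the sum of squared returns, so they are undefined for a day on which every intraday return is zero; a practical implementation would need to handle that case explicitly.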

In line with [41] Andersen et al. (2011), $\lim_{T \to \infty} RV_{t} = \int_{t - 1}^{t} \sigma^{2}(s) \, ds + \sum_{j = 1}^{N_{t}} \kappa_{t, j}^{2}$, where $N_{t}$ is the number of jumps within day $t$, and $\kappa_{t, j}$ is the jump size. Therefore, $RV_{t}$ is a consistent estimator of the integrated variance $\int_{t - 1}^{t} \sigma^{2}(s) \, ds$ plus the jump component. In this analysis, in order to detect jumps, we construct $RV_{t}$ as $\sum_{i = 1}^{T} X_{t, i}^{2}$. Then, following [42] Barndorff-Nielsen and Shephard (2004), $\lim_{T \to \infty} BV_{t} = \int_{t - 1}^{t} \sigma^{2}(s) \, ds$, where $BV_{t}$ is the realized bipower variation given by $BV_{t} = \mu_{1}^{-2} (\frac{T}{T - 1}) \sum_{i = 2}^{T} | X_{t, i - 1} | | X_{t, i} | = \frac{\pi}{2} (\frac{T}{T - 1}) \sum_{i = 2}^{T} | X_{t, i - 1} | | X_{t, i} |$, where $\mu_{a} = E(| Z |^{a})$, $Z \overset{D}{\sim} N(0, 1)$, $a > 0$. Thus, $BV_{t}$ is a consistent estimator of integrated variance without the jump component. We apply a formal test for detecting jumps. In line with [43] Barndorff-Nielsen and Shephard (2006), the jump test is given by $JT_{t} = \frac{RV_{t} - BV_{t}}{\sqrt{(\nu_{bb} - \nu_{qq}) \frac{1}{T} TP_{t}}}$, where $\nu_{bb} = (\frac{\pi}{2})^{2} + \pi - 3$, $\nu_{qq} = 2$, and $TP_{t}$ stands for the tri-power quarticity given by $TP_{t} = T \mu_{4 / 3}^{-3} (\frac{T}{T - 1}) \sum_{i = 3}^{T} | X_{t, i - 2} |^{4 / 3} | X_{t, i - 1} |^{4 / 3} | X_{t, i} |^{4 / 3}$, which converges to $\int_{t - 1}^{t} \sigma^{4}(s) \, ds$ even in the presence of jumps. Note that, for each $t$, $JT_{t} \overset{D}{\to} N(0, 1)$ as $T \to \infty$ under the null hypothesis of no jumps. Finally, following [44] Zhou and Zhu (2012), the jump component is defined as $J_{t} = \max(RV_{t} - BV_{t}, 0)$.
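The jump-related quantities can be sketched in a few lines of Python. The sketch rests on assumptions that should be checked against [42,43]: it uses the standard normalization $\mu_{1}^{-2} = \pi / 2$ for the bipower variation, takes the square root of the variance term in the denominator of the test statistic, and divides the tri-power quarticity by the number of intraday returns $T$. The function names `mu` and `jump_measures` are hypothetical.

```python
import math


def mu(a):
    """E|Z|^a for a standard normal Z: 2^(a/2) * Gamma((a+1)/2) / Gamma(1/2)."""
    return 2.0 ** (a / 2.0) * math.gamma((a + 1.0) / 2.0) / math.gamma(0.5)


def jump_measures(returns):
    """Return (RV, BV, TP, JT, J) for one day of intraday returns."""
    T = len(returns)
    x = [abs(r) for r in returns]
    # Realized variance: sum of squared intraday returns.
    rv = sum(r * r for r in returns)
    # Realized bipower variation with the T / (T - 1) correction.
    bv = mu(1.0) ** -2 * (T / (T - 1)) * sum(
        x[i - 1] * x[i] for i in range(1, T))
    # Tri-power quarticity, using the correction factor stated in the text.
    tp = T * mu(4.0 / 3.0) ** -3 * (T / (T - 1)) * sum(
        (x[i - 2] * x[i - 1] * x[i]) ** (4.0 / 3.0) for i in range(2, T))
    nu_bb = (math.pi / 2.0) ** 2 + math.pi - 3.0
    nu_qq = 2.0
    # Test statistic (assumed form with square root, see lead-in).
    jt = (rv - bv) / math.sqrt((nu_bb - nu_qq) * tp / T)
    # Jump component truncated at zero.
    jump = max(rv - bv, 0.0)
    return rv, bv, tp, jt, jump
```

Large positive values of the statistic point towards the presence of jumps on that day, while the truncation at zero keeps the jump measure non-negative when the bipower variation happens to exceed the realized variance in finite samples.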

When we study the out-of-sample predictability of $RV$, we use a fixed-length daily rolling-estimation window. We use as our benchmark a rolling-estimation window that comprises 1200 daily observations (which corresponds to approximately half the sample size), but we also study a somewhat shorter (1000 daily observations) and a somewhat longer (1400 daily observations) rolling-estimation window. In order to compare the out-of-sample accuracy of the different HAR-RV models (that is, the models without and with $HA$ included in the vector of regressors), we use the modified [45] Diebold and Mariano (1995) test proposed by [46] Harvey, Leybourne and Newbold (1997). In doing so, we use the relative forecast errors to take into account the impact of heteroskedasticity on our results (e.g., [47] Bollerslev and Ghysels 1996). All computations are carried out using the R programming environment ([48] R Core Team 2019). Results for the Diebold–Mariano test are computed using the R package “forecast” ([49,50] Hyndman 2017, Hyndman and Khandakar 2008).
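For readers who want to see the mechanics of the forecast comparison, the following Python sketch computes a Diebold–Mariano statistic with the small-sample correction of Harvey, Leybourne and Newbold. It is a stand-in for illustration, not the R routine used in the paper. The function name `modified_dm_stat` is hypothetical, `e1` and `e2` are assumed to be equally long lists of (relative) forecast errors from the two competing models, and `power` selects the linear (1) or quadratic (2) loss.

```python
import math


def modified_dm_stat(e1, e2, h=1, power=2):
    """Diebold-Mariano statistic with the Harvey-Leybourne-Newbold correction.

    The loss is |e|**power. A positive value means that model 1 has the
    larger average loss, i.e., model 2 is the more accurate one.
    """
    d = [abs(a) ** power - abs(b) ** power for a, b in zip(e1, e2)]
    n = len(d)
    d_bar = sum(d) / n

    def gamma(k):
        # Sample autocovariance of the loss differential at lag k.
        return sum((d[t] - d_bar) * (d[t - k] - d_bar)
                   for t in range(k, n)) / n

    # Long-run variance using autocovariances up to lag h - 1.
    long_run_var = gamma(0) + 2.0 * sum(gamma(k) for k in range(1, h))
    if long_run_var <= 0:
        raise ValueError("non-positive long-run variance estimate")
    dm = d_bar / math.sqrt(long_run_var / n)
    # Small-sample correction factor.
    correction = math.sqrt((n + 1 - 2 * h + h * (h - 1) / n) / n)
    return correction * dm
```

The corrected statistic is compared with critical values of a Student-t distribution with $n - 1$ degrees of freedom, where $n$ is the number of out-of-sample forecasts; the sketch returns only the statistic and leaves the p-value to a statistics library.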

3. Data

We employ intraday data on West Texas Intermediate (WTI) oil futures traded on NYMEX over a 24 h trading day (pit and electronic) to calculate daily measures of realized oil-price volatility as well as realized skewness and kurtosis. The data (in continuous format) came from www.disktrading.com and www.kibot.com. When the expiration of a contract approaches, we roll over the position of the contract to the next available one, given that there is an increase in activity. We define daily oil returns in terms of the end-of-day (New York time) price difference (close to close). As for intraday returns, we construct 5-min prices via last-tick interpolation, and we construct 5-min returns by taking the log-differences of these prices, which we then use to calculate the realized skewness and kurtosis. Following [51] Liu et al. (2015), a 5-min sampling frequency is adequate for liquid assets, such as WTI futures. In other words, the sampling frequency used in this study is neither so low that information in the intraday data is lost nor so high that market frictions give rise to spurious jumps.
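The construction of 5-min returns by last-tick interpolation can be sketched as follows. The example is illustrative only: the function name `five_minute_log_returns` is hypothetical, tick times are assumed to be sorted and measured in seconds, and grid points that precede the first tick are simply skipped.

```python
import math
from bisect import bisect_right


def five_minute_log_returns(tick_times, tick_prices, start, end, step=300):
    """Sample the last observed price on a regular grid and take log-differences.

    `tick_times` must be sorted in ascending order (seconds); `step` = 300
    seconds corresponds to the 5-min sampling frequency used in the text.
    """
    grid_prices = []
    t = start
    while t <= end:
        # Number of ticks observed at or before grid time t.
        k = bisect_right(tick_times, t)
        if k > 0:
            # Last-tick interpolation: carry the most recent price forward.
            grid_prices.append(tick_prices[k - 1])
        t += step
    return [math.log(b / a) for a, b in zip(grid_prices, grid_prices[1:])]
```

The resulting list of returns for one trading day is the kind of input that the daily realized measures are computed from.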

It is clear that investor sentiment is not directly measurable or observable. Traditionally, two paths have been taken to measure investor sentiment ([52,53] Bathia and Bredin 2013, Bathia et al. 2016). Taking the first path means that investor sentiment, as proposed by [54,55] Baker and Wurgler (2006, 2007), is identified by several market-based measures that are used as proxies for investor sentiment, while survey-based indices comprise the second path. More recently, a third approach has originated, building on the research by [56] Da et al. (2015), who constructed an investor-sentiment index employing daily Internet search data coming from millions of households in the U.S. by emphasizing specific ‘economic’ keywords that mirror investors’ sentiment towards economic developments. The idea motivating this third approach is to extract metrics of investor sentiment from news and contents of social media (for example, see [57] Garcia 2013). Da et al. (2015) [56] argued that their method, and in general the third approach associated with internet-based measures of investor sentiment, is more transparent than the two competing market-based and survey-based approaches. This is because market-based measures have the disadvantage of being the equilibrium outcome of many economic forces other than investor sentiment, while survey-based measures are more likely to be beleaguered by measurement errors because they inquire about attitudes. Furthermore, both traditional approaches tend to produce metrics of investor sentiment at lower (monthly or quarterly) frequencies.

Keeping these points in mind, our proxy for investor sentiment corresponds to the daily happiness index derived from the website https://hedonometer.org/api.html. The raw daily happiness scores are extracted by means of a natural language processing technique based on a random sampling of about 10% (50 million) of all messages posted in Twitter’s Gardenhose feed. In order to quantify the happiness of the atoms of language, Hedonometer.org merged the 5000 most frequent words from a collection of four corpora: Google Books, New York Times articles, Music Lyrics, and Twitter messages. The result is a composite collection of approximately 10,000 unique words. Then, using Amazon’s Mechanical Turk service, Hedonometer.org had each of these words scored on a nine point scale of happiness, with 1 corresponding to “sad” and 9 to “happy”. Words in messages written in English (containing about 100 million words per day) are assigned a happiness score based on the average happiness score of the words contained in the messages.
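The averaging step described above can be illustrated with a small Python sketch. It is a simplified illustration rather than the Hedonometer.org procedure itself: the function name `message_happiness` is hypothetical, the word scores in the usage note are made up for the example, and words without a score are simply ignored.

```python
def message_happiness(words, word_scores):
    """Average happiness score (1 = sad, 9 = happy) of the scored words.

    `word_scores` maps a word to its score on the nine-point scale;
    words that are not in the lexicon are skipped.
    """
    scored = [word_scores[w] for w in words if w in word_scores]
    if not scored:
        return None  # no scored word, so no score can be assigned
    return sum(scored) / len(scored)
```

With the made-up lexicon `{"happy": 8.0, "sad": 2.0, "oil": 5.0}`, the word list `["happy", "oil", "xyz"]` averages the two scored words and ignores the unscored one.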

Our analysis spans the period from 9 September 2008 to 26 May 2017 on a daily basis, while the start and end dates of the sample used are solely restricted by the availability of the happiness index and the intraday data on oil prices, respectively. Basic statistics of the data used are given in Table 1.

4. Empirical Results

Table 2 summarizes our in-sample results for the full sample of data. The estimated coefficients of $MRV$, $MRV_{w}$ and $MRV_{M}$ are always significant at conventional levels of significance for all three forecast horizons under consideration. The estimated coefficients are positive. The estimated coefficients of RKU and RSK are not significant. [58] Mei et al. (2017) have observed significantly negative coefficients of realized skewness and realized kurtosis in the case of realized stock-market volatility.

As already mentioned in Section 1, the link between investor happiness and realized volatility of oil-price returns can be either positive or negative. Our in-sample results for the full sample of data demonstrate that, when we use the HAR-RV model to capture the implications of the heterogeneous market hypothesis for the dynamics of realized volatility, the estimated coefficient of investor happiness, $HA$, has a negative sign and is highly significant for $h = 1$, while the coefficient becomes insignificant and turns positive for $h = 5$ and $h = 22$. The results for the HAR-RV model for $h = 1$, hence, can be interpreted as indicating that, with oil acting as a hedge against risks in traditional financial assets, trading in the oil market and, as a result, volatility increase in times of lower investor happiness and, thereby, weaker investor sentiment. The results for the two longer forecast horizons, in contrast, suggest that investor happiness has no clear effect on the dynamics of the oil price, perhaps reflecting that the price pressure on oil is driven by a mixture of (i) the economic cycle (when investor happiness is high, the price of oil goes up in good market conditions due to an increase in industrial production), and (ii) oil-market related shocks (when investor happiness is low). Any interpretation, however, should not be stretched too far given that the estimated coefficient of investor happiness for $h = 5$ and $h = 22$ is not significantly different from zero.

Table 3 reports our main out-of-sample results. We report results for three different lengths of the rolling-estimation window, three different forecast horizons, and two loss functions (linear and quadratic). The results compare the accuracy of forecasts computed by means of the HAR-RV model with the accuracy of forecasts extracted from the HAR-RV-HA model. The results are consistent across the linear and quadratic loss functions and strongly suggest that investor happiness improves out-of-sample forecast accuracy for the short forecast horizon (that is, for $h = 1$). Results for the two longer forecast horizons ($h = 5$ and $h = 22$) are not significant, which is consistent with the in-sample results.

We next assess the robustness of our results. To this end, we document in Table 4 results for three alternative HAR-RV models. In one model, we add realized kurtosis (RKU) to the vector of standard HAR-RV regressors. In another model, we add realized skewness (RSK) to the benchmark HAR-RV model. In yet another model, we consider a measure of jumps as an additional regressor. Results are consistent across the three models: investor happiness improves forecast accuracy at the short but not at the two longer forecast horizons. As another robustness check, we used the fluctuation test developed by [59] Giacomini and Rossi (2010) to compare the HAR-RV with the HAR-RV-HA model. Corroborating the results we report in Table 3, the fluctuation test indicates a superior performance of the HAR-RV-HA model at the short forecast horizon. Results of the fluctuation test are available from the authors upon request.

As a final extension, we estimate the HAR-RV model separately for a measure of downside and upside realized semivariances. Barndorff-Nielsen et al. (2010) [60], among others, proposed and further studied the concept of downside and upside realized semivariances ($RV^{-}$ and $RV^{+}$) as measures based entirely on downward or upward movements of intraday returns. $RV_{t}^{-}$ and $RV_{t}^{+}$ are computed as $RV_{t}^{-} = \sum_{i = 1}^{T} X_{t, i}^{2} I_{[X_{t, i} < 0]}$ and $RV_{t}^{+} = \sum_{i = 1}^{T} X_{t, i}^{2} I_{[X_{t, i} > 0]}$, where $I_{[\cdot]}$ is the indicator function. Downside and upside realized semivariances allow us to capture the sign asymmetry of the price process, which is crucial for portfolio risk assessment and management. Again, we observe significant test results, this time for downside as well as upside realized semivariance, for the short forecast horizon. The test results for the two longer forecast horizons are insignificant (Table 5).
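The decomposition into downside and upside semivariances can be written in a few lines. The following Python sketch assumes that `returns` holds the intraday returns of one day; the function name `realized_semivariances` is hypothetical.

```python
def realized_semivariances(returns):
    """Return (RV^-, RV^+): squared returns summed by sign of the return."""
    rv_minus = sum(x * x for x in returns if x < 0)
    rv_plus = sum(x * x for x in returns if x > 0)
    return rv_minus, rv_plus
```

Zero returns contribute to neither component, and the two components add up to the sum of squared intraday returns of that day.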

5. Concluding Remarks

We have estimated various HAR-RV models to assess whether a recently developed Twitter-based measure of investor happiness predicts the daily realized volatility of oil-price returns, where we have estimated realized volatility from high-frequency intraday data. We have reported results of both in-sample and out-of-sample analyses. Our main finding is that, when we use the HAR-RV model to capture the implications of the heterogeneous market hypothesis, investor happiness is significantly negatively linked to realized volatility at a short forecast horizon as far as the in-sample analysis is concerned. In a similar vein, investor happiness improves the accuracy of short-term forecasts of realized volatility in our out-of-sample analysis. In sum, our empirical results are consistent with the view that (i) trading in the oil market and, thereby, realized volatility increase in times of lower investor happiness because oil acts as a hedge against risks in traditional financial assets, and (ii) this hedging property helps to improve the accuracy of short-term out-of-sample forecasts of realized volatility of oil-price returns.

In a recent paper, [61] Deeney et al. (2015) developed sentiment indices directly related to the WTI and Brent crude oil markets using a suite of financial proxies similar to those used in equity research, though at a lower (monthly) frequency. As part of future research, it would be interesting to develop such indices at a daily frequency and use them in our forecasting experiment. This would allow us to compare the relative roles of sentiment proxies that are directly related to the oil market with those that measure the general mood of investors associated with the overall financial market. Finally, it would be particularly interesting to expand our study to see whether investor happiness predicts the realized volatility of other energy commodities.

Author Contributions

Authors have equal contributions. All authors have read and agreed to the published version of the manuscript.

Acknowledgments

We thank two anonymous reviewers for helpful comments. The research of C. Pierdzioch was supported by the German Science Foundation (Project: Exploring the experience-expectation nexus in macroeconomic forecasting using computational text analysis and machine learning; Project number: 275693836). The usual disclaimer applies.

Conflicts of Interest

The authors declare no conflict of interest.

References

1. Bahloul, W.; Balcilar, M.; Cunado, J.; Gupta, R. The role of economic and financial uncertainties in predicting commodity futures returns and volatility: Evidence from a nonparametric causality-in-quantiles test. J. Multinatl. Financ. Manag. 2018, 45, 52–71.
2. Bonato, M. Realized correlations, betas and volatility spillover in the agricultural commodity market: What has changed? J. Int. Financ. Mark. Inst. Money 2019, 62, 184–202.
3. Asai, M.; Gupta, R.; McAleer, M. Forecasting Volatility and co-volatility of crude oil and gold futures: Effects of leverage, jumps, spillovers, and geopolitical risks. Int. J. Forecast. 2020.
4. Asai, M.; Gupta, R.; McAleer, M. The Impact of Jumps and Leverage in Forecasting the Co-Volatility of Oil and Gold Futures. Energies 2019, 12, 3379.
5. Demirer, R.; Gupta, R.; Suleman, T.; Wohar, M.E. Time-varying rare disaster risks, oil returns and volatility. Energy Econ. 2018, 75, 239–248.
6. Elder, J.; Serletis, A. Oil price uncertainty. J. Money Credit Bank. 2010, 42, 1137–1159.
7. Van Eyden, R.; Difeto, M.; Gupta, R.; Wohar, M.E. Oil price volatility and economic growth: Evidence from advanced OECD countries using over one century of data. Appl. Energy 2019, 233, 612–621.
8. Henriques, I.; Sadorsky, P. Can environmental sustainability be used to manage energy price risk? Energy Econ. 2010, 32, 1131–1138.
9. Jiang, Y.; Ma, C.Q.; Yang, X.G.; Ren, Y.S. Time-Varying Volatility Feedback of Energy Prices: Evidence from Crude Oil, Petroleum Products, and Natural Gas Using a TVP-SVM Model. Sustainability 2018, 10, 4705.
10. Zhao, L.T.; Liu, L.N.; Wang, Z.J.; He, L.Y. Forecasting Oil Price Volatility in the Era of Big Data: A Text Mining for VaR Approach. Sustainability 2019, 11, 3892.
11. Gkillas, K.; Gupta, R.; Wohar, M.E. Oil shocks and volatility jumps. Rev. Quant. Financ. Account. 2020, 54, 247–272.
12. Lux, T.; Segnon, M.; Gupta, R. Forecasting crude oil price volatility and value-at-risk: Evidence from historical and recent data. Energy Econ. 2016, 56, 117–133.
13. McAleer, M.; Medeiros, M.C. Realized volatility: A review. Econom. Rev. 2008, 27, 10–45.
14. Haugom, E.; Langeland, H.; Molnár, P.; Westgaard, S. Forecasting volatility of the US oil market. J. Bank. Financ. 2014, 47, 1–14.
15. Sévi, B. Forecasting the volatility of crude oil futures using intraday data. Eur. J. Oper. Res. 2014, 235, 643–659.
16. Prokopczuk, M.; Symeonidis, L.; Wese Simen, C. Do jumps matter for volatility forecasting? Evidence from energy markets. J. Futur. Mark. 2015, 36, 758–792.
17. Degiannakis, S.; Filis, G. Forecasting oil price realized volatility using information channels from other asset classes. J. Int. Money Financ. 2017, 76, 28–49.
18. Liu, J.; Ma, F.; Yang, K.; Zhang, Y. Forecasting the oil futures price volatility: Large jumps and small jumps. Energy Econ. 2018, 72, 321–330.
19. Chen, Y.; Ma, F.; Zhang, Y. Good, bad cojumps and volatility forecasting: New evidence from crude oil and the U.S. stock markets. Energy Econ. 2019, 81, 52–62.
20. Gkillas, K.; Gupta, R.; Pierdzioch, C. Forecasting realized oil-price volatility: The Role of financial stress and asymmetric loss. J. Int. Money Financ. 2020, 104, 102137.
21. Corsi, F. A simple approximate long-memory model of realized volatility. J. Financ. Econ. 2009, 7, 174–196.
22. Andersen, T.G.; Bollerslev, T. Answering the skeptics: Yes, standard volatility models do provide accurate forecasts. Int. Econ. Rev. 1998, 39, 885–905.
23. Phan, D.H.B.; Sharma, S.S.; Narayan, P.K. Intraday volatility interaction between the crude oil and equity markets. J. Int. Financ. Mark. Inst. Money 2016, 40, 1–13.
24. Chatrath, A.; Miao, H.; Ramchander, S.; Wang, T. The forecasting efficacy of risk-neutral moments for crude oil volatility. J. Forecast. 2015, 34, 177–190.
25. Zhang, W.; Li, X.; Shen, D.; Teglio, A. Daily happiness and stock returns: Some international evidence. Phys. A 2016, 460, 201–209.
26. Zhang, W.; Wang, P.; Li, X.; Shen, D. Twitter’s daily happiness sentiment and international stock returns: Evidence from linear and nonlinear causality tests. J. Behav. Exp. Financ. 2018, 18, 50–53.
You, W.; Guo, Y.; Cheng, P. Twitter’s daily happiness sentiment and the predictability of stock returns. Financ. Res. Lett. 2017, 23, 58–64. [Google Scholar] [CrossRef]
Reboredo, J.C.; Ugolini, A. The impact of Twitter sentiment on renewable energy stocks. Energy Econ. 2018, 76, 153–169. [Google Scholar] [CrossRef]
Hong, H.; Yogo, M. What does futures market interest tell us about the macroeconomy and asset prices? J. Financ. Econ. 2012, 105, 473–490. [Google Scholar] [CrossRef]
Singleton, K.J. Investor flows and the 2008 boom/bust in oil prices. Manag. Sci. 2014, 60, 300–318. [Google Scholar] [CrossRef]
Olson, E.; Vivian, A.J.; Wohar, M.E. Do commodities make effective hedges for equity investors? Res. Int. Bus. Financ. 2017, 1274–1288. [Google Scholar] [CrossRef]
Olson, E.; Vivian, A.J.; Wohar, M.E. What is a better cross-hedge for energy: Equities or other commodities? Glob. Financ. J. 2019, 42, 100417. [Google Scholar] [CrossRef]
Qadan, M.; Nama, H. Investor sentiment and the price of oil. Energy Econ. 2018, 69, 42–58. [Google Scholar] [CrossRef]
Zhang, Y.-J.; Li, S.-H. The impact of investor sentiment on crude oil market risks: Evidence from the wavelet approach. Quant. Financ. 2019, 19, 1357–1371. [Google Scholar] [CrossRef]
Guo, J.-F.; Ji, Q. How does market concern derived from the Internet affect oil prices? Appl. Energy 2013, 112, 1536–1543. [Google Scholar] [CrossRef]
Ji, Q.; Guo, J.-F. Oil price volatility and oil-related events: An Internet concern study perspective. Appl. Energy 2015, 137, 256–264. [Google Scholar] [CrossRef]
Campbell, J.Y. Viewpoint: Estimating the equity premium. Can. J. Econ. 2008, 41, 1–21. [Google Scholar] [CrossRef]
Andersen, T.G.; Dobrev, D.; Schaumburg, E. Jump-robust volatility estimation using nearest neighbor truncation. J. Econom. 2012, 169, 75–93. [Google Scholar] [CrossRef]
Müller, U.A.; Dacorogna, M.M.; Davé, R.D.; Olsen, R.B.; Pictet, O.V. Volatilities of different time resolutions—Analyzing the dynamics of market components. J. Empir. Financ. 1997, 4, 213–239. [Google Scholar] [CrossRef]
Amaya, D.; Christoffersen, P.; Jacobs, K.; Vasquez, A. Does realized skewness predict the cross-section of equity returns? J. Financ. Econ. 2015, 118, 135–167. [Google Scholar] [CrossRef]
Andersen, T.G.; Bollerslev, T.; Huang, X. A reduced form framework for modeling volatility of speculative prices based on realized variation measures. J. Econom. 2011, 160, 176–189. [Google Scholar] [CrossRef]
Barndorff-Nielsen, O.E.; Shephard, N. Power and bipower variation with stochastic volatility and jumps. J. Financ. Econom. 2004, 2, 1–37.
Barndorff-Nielsen, O.E.; Shephard, N. Econometrics of Testing for Jumps in Financial Economics using Bipower Variation. J. Financ. Econom. 2006, 4, 1–30.
Zhou, H.; Zhu, J.Q. An empirical examination of jump risk in asset pricing and volatility forecasting in China’s equity and bond markets. Pac. Basin Financ. J. 2012, 20, 857–880.
Diebold, F.X.; Mariano, R.S. Comparing predictive accuracy. J. Bus. Econ. Stat. 1995, 13, 253–263.
Harvey, D.; Leybourne, S.; Newbold, P. Testing the equality of prediction mean squared errors. Int. J. Forecast. 1997, 13, 281–291.
Bollerslev, T.; Ghysels, E. Periodic autoregressive conditional heteroscedasticity. J. Bus. Econ. Stat. 1996, 14, 139–151.
R Core Team. R: A Language and Environment for Statistical Computing, R version 3.3.3; R Foundation for Statistical Computing: Vienna, Austria, 2019; Available online: http://www.R-project.org/ (accessed on 20 March 2020).
Hyndman, R.J. Forecast: Forecasting Functions for Time Series and Linear Models; R Package Version 8.0; 2017; Available online: http://github.com/robjhyndman/forecast (accessed on 20 March 2020).
Hyndman, R.J.; Khandakar, Y. Automatic time series forecasting: The forecast package for R. J. Stat. Softw. 2008, 26, 1–22.
Liu, L.Y.; Patton, A.J.; Sheppard, K. Does anything beat 5-minute RV? A comparison of realized measures across multiple asset classes. J. Econom. 2015, 187, 293–311.
Bathia, D.; Bredin, D. An examination of investor sentiment effect in G7 stock market returns. Eur. J. Financ. 2013, 19, 909–937.
Bathia, D.; Bredin, D.; Nitzsche, D. International sentiment spillovers in equity returns. Int. J. Financ. Econ. 2016, 21, 332–359.
Baker, M.; Wurgler, J. Investor sentiment and the cross-section of stock returns. J. Financ. 2006, 61, 1645–1680.
Baker, M.; Wurgler, J. Investor sentiment in the stock market. J. Econ. Perspect. 2007, 21, 129–152.
Da, Z.; Engelberg, J.; Gao, P. The Sum of All FEARS Investor Sentiment and Asset Prices. Rev. Financ. Stud. 2015, 28, 1–32.
García, D. Sentiment during recessions. J. Financ. 2013, 68, 1267–1300.
Mei, D.; Liu, J.; Ma, F.; Chen, W. Forecasting stock market volatility: Do realized skewness and kurtosis help? Phys. A 2017, 481, 153–159.
Giacomini, R.; Rossi, B. Forecast comparisons in unstable environments. J. Appl. Econom. 2010, 25, 595–620.
Barndorff-Nielsen, O.E.; Kinnebrock, S.; Shephard, N. Measuring downside risk: Realised semivariance. In Volatility and Time Series Econometrics: Essays in Honor of Robert F. Engle; Bollerslev, T., Russell, J., Watson, M., Eds.; Oxford University Press: Oxford, UK, 2010; pp. 117–136.
Deeney, P.; Cummins, M.; Dowling, M.; Bermingham, A. Sentiment in oil markets. Int. Rev. Financ. Anal. 2015, 39, 179–185.

Table 1. Summary Statistics.

Statistic	MRV	HA
Min	0.001	5.840
Mean	0.424	6.026
Median	0.222	6.033
Max	4.997	6.357

Note: MRV was multiplied by the factor $10^{3}$. Number of observations = 2466.

Table 2. In-Sample Results.

Model	Intercept	MRV	MRV$_{w}$	MRV$_{m}$	HA	RKU	RSK	Adj. R2
	$h = 1$
HAR-RV	2.8153	4.0303	8.8586	1.7208	–	–	–	0.6354
p-value	0.0049	0.0001	0.0000	0.0853	–	–	–	–
HAR-RV-HA	4.2709	3.7583	8.9359	1.9765	−4.2584	–	–	0.6390
p-value	0.0000	0.0002	0.0000	0.0481	0.0000	–	–	–
HAR-RV-HA-RKU	4.5456	3.9123	8.5785	1.8141	−4.5257	−1.3242	–	0.6390
p-value	0.0000	0.0001	0.0000	0.0697	0.0000	0.1854	–	–
HAR-RV-HA-RSK	4.2172	3.7246	8.9613	1.9992	−4.2049	–	−1.4846	0.6391
p-value	0.0000	0.0002	0.0000	0.0456	0.0000	–	0.1377	–
HAR-RV-HA-RKU-RSK	4.4645	3.8698	8.6267	1.8390	−4.4448	−1.0451	−1.2514	0.6391
p-value	0.0000	0.0001	0.0000	0.0659	0.0000	0.2960	0.2108	–
	$h = 5$
HAR-RV	1.4702	3.9532	5.4185	2.8944	–	–	–	0.8431
p-value	0.1415	0.0001	0.0000	0.0038	–	–	–	–
HAR-RV-HA	−0.2349	3.8698	5.4262	2.7933	0.2532	–	–	0.8431
p-value	0.8143	0.0001	0.0000	0.0052	0.8001	–	–	–
HAR-RV-HA-RKU	−0.2244	4.0214	4.8399	2.6085	0.2416	−0.1102	–	0.8430
p-value	0.8225	0.0001	0.0000	0.0091	0.8091	0.9122	–	–
HAR-RV-HA-RSK	−0.2348	3.8914	5.4489	2.8141	0.253	–	−0.0847	0.8430
p-value	0.8144	0.0001	0.0000	0.0049	0.8003	–	0.9325	–
HAR-RV-HA-RKU-RSK	−0.2235	4.0578	4.8533	2.6302	0.2406	−0.0956	−0.0679	0.8429
p-value	0.8231	0.0000	0.0000	0.0085	0.8099	0.9239	0.9459	–
	$h = 22$
HAR-RV	1.2423	4.9368	2.7946	1.9409	–	–	–	0.8410
p-value	0.2141	0.0000	0.0052	0.0523	–	–	–	–
HAR-RV-HA	−1.0653	4.9981	3.0358	2.0031	1.0739	–	–	0.8416
p-value	0.2868	0.0000	0.0024	0.0452	0.2829	–	–	–
HAR-RV-HA-RKU	−1.1839	4.8468	2.6076	1.8103	1.1898	0.9923	–	0.8415
p-value	0.2365	0.0000	0.0091	0.0702	0.2341	0.3210	–	–
HAR-RV-HA-RSK	−1.0820	4.9983	3.0343	2.0029	1.0908	–	−1.0739	0.8416
p-value	0.2793	0.0000	0.0024	0.0452	0.2753	–	0.2829	–
HAR-RV-HA-RKU-RSK	−1.1341	4.8989	2.6945	1.8347	1.1397	1.2809	−1.2352	0.8416
p-value	0.2567	0.0000	0.0071	0.0666	0.2544	0.2002	0.2167	–

Note: p-values were computed based on Newey–West robust standard errors. The reported entries are estimated coefficients scaled by their estimated standard errors (that is, t-ratios). Adj. R2 = adjusted coefficient of determination.
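
As a rough illustration of how entries of the kind reported in Table 2 can be obtained, the following Python sketch builds HAR-RV regressors and computes coefficients scaled by Newey–West standard errors. It is not the authors' code (the reference list cites R and the forecast package). The function names `har_design` and `ols_newey_west`, the 5-day and 22-day averaging windows, the use of the average realized volatility over the next h days as the target, and the Bartlett-kernel lag choice are all assumptions made for illustration.

```python
import numpy as np


def har_design(rv, h=1):
    """Build HAR-type regressors and an h-day-ahead target (illustrative).

    Columns: intercept, daily RV, 5-day average RV, 22-day average RV.
    Target: average RV over days t+1, ..., t+h (an assumption of this sketch).
    """
    rv = np.asarray(rv, dtype=float)
    n = len(rv)
    rows, target = [], []
    for t in range(21, n - h):
        daily = rv[t]
        weekly = rv[t - 4:t + 1].mean()    # last 5 observations
        monthly = rv[t - 21:t + 1].mean()  # last 22 observations
        rows.append([1.0, daily, weekly, monthly])
        target.append(rv[t + 1:t + h + 1].mean())
    return np.array(rows), np.array(target)


def ols_newey_west(X, y, lags):
    """OLS coefficients and t-ratios based on Newey-West (Bartlett) errors."""
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    resid = y - X @ beta
    xu = X * resid[:, None]                # score contributions x_t * u_t
    S = xu.T @ xu                          # lag-0 term
    for lag in range(1, lags + 1):
        weight = 1.0 - lag / (lags + 1.0)  # Bartlett kernel weight
        G = xu[lag:].T @ xu[:-lag]
        S += weight * (G + G.T)
    XtX_inv = np.linalg.inv(X.T @ X)
    cov = XtX_inv @ S @ XtX_inv            # sandwich covariance estimate
    se = np.sqrt(np.diag(cov))
    return beta, beta / se


# Usage on simulated data (placeholder series, not the oil-price data).
rng = np.random.default_rng(42)
rv_sim = np.abs(rng.normal(size=500)) + 0.1
X_sim, y_sim = har_design(rv_sim, h=5)
beta_sim, tratio_sim = ols_newey_west(X_sim, y_sim, lags=5)
```

Adding a predictor such as HA would amount to appending one more column to the design matrix; how that predictor is timed relative to the target is a modeling choice that the sketch leaves open.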

Table 3. Out-of-Sample Results.

Rolling Window	$h = 1$	$h = 5$	$h = 22$
		L1 loss
1000	0.0269	0.5714	0.2693
1200	0.0007	0.4707	0.3105
1400	0.0000	0.9985	0.9274
		L2 loss
1000	0.0327	0.7654	0.6027
1200	0.0049	0.8196	0.6977
1400	0.0015	0.9641	0.9762

Note: p-values of the modified Diebold–Mariano test under the assumption of a linear (L1) and a quadratic (L2) loss function. Null hypothesis: the forecasts from the HAR-RV and the HAR-RV-HA models are equally accurate. Alternative hypothesis: the forecasts from the HAR-RV-HA model are more accurate.
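
The test behind these p-values can be sketched as follows. The code computes the Diebold–Mariano statistic with the small-sample correction of Harvey, Leybourne and Newbold for a one-sided alternative that the extended model is more accurate. It is a minimal sketch under stated assumptions, not the authors' implementation: the function name `modified_dm_test` is hypothetical, the long-run variance uses autocovariances up to lag h − 1, and the p-value uses a normal approximation, whereas the modified test compares the statistic with a Student-t distribution with T − 1 degrees of freedom (the two are close for samples of the size used here).

```python
import math

import numpy as np


def modified_dm_test(e_bench, e_ext, h=1, power=2):
    """One-sided modified Diebold-Mariano test (illustrative sketch).

    e_bench, e_ext: forecast errors of the benchmark and extended model.
    power=1 gives absolute (L1) loss, power=2 gives squared (L2) loss.
    Alternative hypothesis: the extended model is more accurate.
    """
    e_bench = np.asarray(e_bench, dtype=float)
    e_ext = np.asarray(e_ext, dtype=float)
    d = np.abs(e_bench) ** power - np.abs(e_ext) ** power  # loss differential
    T = d.size
    d_bar = d.mean()
    dc = d - d_bar
    lrv = dc @ dc / T                       # autocovariance at lag 0
    for k in range(1, h):
        lrv += 2.0 * (dc[k:] @ dc[:-k]) / T  # autocovariances up to lag h-1
    if lrv <= 0:
        raise ValueError("non-positive long-run variance estimate")
    dm = d_bar / math.sqrt(lrv / T)
    # Harvey-Leybourne-Newbold small-sample correction factor.
    correction = math.sqrt((T + 1 - 2 * h + h * (h - 1) / T) / T)
    stat = correction * dm
    # Normal approximation to the upper-tail p-value (assumption of this sketch).
    p_value = 0.5 * math.erfc(stat / math.sqrt(2.0))
    return stat, p_value


# Usage on simulated forecast errors (placeholders, not the paper's forecasts).
rng = np.random.default_rng(7)
errors_bench = 1.2 * rng.normal(size=1000)
errors_ext = rng.normal(size=1000)
stat_l2, p_l2 = modified_dm_test(errors_bench, errors_ext, h=1, power=2)
```

A small p-value, as in the $h = 1$ column, indicates that the loss of the benchmark exceeds the loss of the extended model by more than sampling noise would explain.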

Table 4. Robustness checks.

Specification	$h = 1$	$h = 5$	$h = 22$
HAR-RV-RKU vs. HAR-RV-RKU-HA	0.0055	0.8292	0.7168
HAR-RV-RSK vs. HAR-RV-RSK-HA	0.0045	0.8188	0.6888
HAR-RV-JUMP vs. HAR-RV-JUMP-HA	0.0055	0.8171	0.6962

Note: p-values of the modified Diebold–Mariano test under the assumption of a quadratic (L2) loss function. Null hypothesis: the forecasts from each variant of the HAR-RV model and from the same variant extended to include HA are equally accurate. Alternative hypothesis: the forecasts from the variant extended to include HA are more accurate. Length of the rolling-estimation window: 1200 observations.

Table 5. Good and bad realized volatility.

Rolling Window	$h = 1$	$h = 5$	$h = 22$
		RVG
1000	0.0711	0.7816	0.4886
1200	0.0015	0.8577	0.6647
1400	0.0005	0.9646	0.9708
		RVB
1000	0.0615	0.7825	0.5795
1200	0.0519	0.8274	0.6431
1400	0.0095	0.9687	0.9663

Note: p-values of the modified Diebold–Mariano test under the assumption of a quadratic (L2) loss function. Null hypothesis: the forecasts from the HAR-RVG (HAR-RVB) and the HAR-RVG-HA (HAR-RVB-HA) models are equally accurate. Alternative hypothesis: the forecasts from the model extended to include HA are more accurate. RVG: good realized volatility. RVB: bad realized volatility.
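
For readers unfamiliar with the good/bad decomposition, a common construction is the realized semivariance, which sums squared intraday returns separately by sign. The sketch below shows that construction only. The function name `realized_semivariances` is hypothetical, and whether the paper's RVG and RVB series use exactly this estimator (rather than a jump-robust variant) is an assumption.

```python
import numpy as np


def realized_semivariances(intraday_returns):
    """Split one day's realized variance into upside and downside parts.

    Returns (rv_good, rv_bad): sums of squared positive and squared
    negative intraday returns, respectively.
    """
    r = np.asarray(intraday_returns, dtype=float)
    rv_good = float(np.sum(np.where(r > 0, r, 0.0) ** 2))
    rv_bad = float(np.sum(np.where(r < 0, r, 0.0) ** 2))
    return rv_good, rv_bad


# Usage with made-up intraday returns for a single day.
rv_good_demo, rv_bad_demo = realized_semivariances([0.01, -0.02, 0.0, 0.03])
```

By construction the two parts add up to the sum of squared intraday returns for the day.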

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
