1. Introduction
The pandemic has highlighted social and global inequality and spiked interests in environmental, social, and governance (ESG) investing. ESG assets reached
$35.3 trillion in 2020 from around
$30.7 trillion in 2018, reaching a third of current total global assets under management, according to the Global Sustainable Investment Association (
http://www.gsi-alliance.org/, accessed on 18 November 2021).
According to the 2020 Trends Report, investors are considering ESG factors across USD 17 trillion of professionally managed assets, a 42% increase since 2018. Such continued growth is expected over the long term, too. Since 1995, the value of US sustainable investment assets (USD 639 billion) has increased more than 25-fold (USD 16.6 trillion in 2020), at a compound annual growth rate of 14% (
US SIF 2020).
A few terms are used interchangeably to describe environmental, social, and governance investments, e.g., socially responsible investing (SRI), responsible investing, sustainable investing, and impact investing. In this paper, we understand ESG investing in terms of the ESG factors, which enhance traditional financial analysis, making it more complete.
There are some ESG factors that are helpful in the evaluation of investment performance. Investments with high ESG scores can increase rates of return, while those with poor ESG scores may inhibit these rates. One may find energy consumption, pollution, climate change, waste production, natural resource preservation (deforestation, carbon emission reduction), and animal welfare among the environmental factors. The social factors include human rights, child and forced labor, community engagement, health and safety, stakeholder relations and employee relations, customer success, data hygiene, and security. The governance factors contain the following: quality of management, board independence, conflict of interest, executive compensation, hiring and onboarding best practices, transparency, and disclosure. In this paper, we use the term ESG to refer to these indices, which include the companies that disclose these factors.
Morgan Stanley’s research on nearly 11,000 mutual funds between 2004 and 2018 indicates there is no financial trade-off in returns of sustainable funds compared to traditional ones. Moreover, sustainable funds showed lower downside risk. The number of ESG focused funds has been growing, since 2004 by 144% (Morgan Stanley, Institute for Sustainable Investing, Sustainable Reality. Analyzing Risk and Returns of Sustainable Funds, 2019.
www.morganstanley.com, accessed on 18 November 2021).
A growing number of investors not only focus on the profitability of investment strategies but also look for their social value. ESG investing fulfills this goal. ESG looks at the company’s environmental, social, and governance practices, as well as traditional ones. ESG investors believe that investments in companies employing ESG practices may have a material impact on their investments’ profitability and risk. Moreover, they accept a lower return in the short term and even slightly higher risk because of the additional future value of their investment.
Considering ESG benefits in investing for economic value is not a new concept. The report “Who Cares, Wins—Connecting Financial Markets to a Changing World”, which provided guidelines for companies to incorporate ESG into their operations, was published in 2004. The first ESG index, the Domini 400 Social Index (now known as the MSCI KLD 400 Social Index), was launched in May 1990. Since then, the ESG indices have evolved to meet investors’ unique needs in investing.
ESG equity indices are used as benchmarks for ESG investment, as the underlying assets of passive ESG investment tools (such as exchange-traded funds), and as related risk-management tools (ESG index futures). ESG equity indices are usually constructed based on parent equity indices incorporating ESG investment styles. The construction process may comprise screening out the companies with negative ESG impacts and including the companies with positive ESG impacts by adjusting their relative weights. Therefore, the risk-return performances of these indices may be different from those of their parent indices with traditional investment strategies (
HKEX 2020).
Making investment decisions, including portfolio construction, requires knowledge of volatility and dependence between financial time series. Determination of the dependence structure is essential for portfolio management, and the misinterpretation of the strength of this dependence can lead to wrong investment decisions. Pearson’s linear correlation coefficient is the most common and widely used correlation measure. Because of its linearity, it is a tool appropriate only for measuring the dependence between variables of elliptic distributions. In situations where empirical data are characterized by, for example, asymmetric and non-elliptic distributions, with high kurtosis or skewness, the use of linear correlation coefficient is not advisable. In such instances, copula functions are better tools for investigating the dependence structure between the time series (
Messaoud and Aloui 2015).
The objective of the paper is to evaluate the attractiveness of ESG investments as a potential diversifier for conventional investments. In this study, we examine the following hypothesis:
Hypothesis 1 (H1). ESG investments outperform conventional investments in terms of risk.
Hypothesis 2 (H2). Asymptotic dependence increases during the crisis on the market (declines of stock market indices), stabilizing during the non-crisis periods.
There are many ESG indices; in this paper, we select five ESG indices (see
Table A1 for details). Conventional stock indices are represented by the Dow Jones Industry Average (DJI) and the S&P 500 (GSPC).
Because little is known about the dependence structure between ESG and conventional investments, the contribution of this paper is to fill this research gap. The pioneering character of this research—application of tail dependence to ESG investments, is highly contributing both to the knowledge of the field as well as to the practice of assets selection in portfolio construction. The paper also contributes to improving the understanding of volatility and the (tail) dependence structure between ESG and conventional investments. By quantifying the overall and lower tail risk between these two assets, we indicate that these dependencies exist, can be quantified, and are not negligible, especially in times of crisis.
The research results confirm the higher dependence of extreme values in the crisis period (declines of indices); however, this is also due to increased volatility during this period.
The paper is structured as follows: the introduction, literature review, research methodology, data description, empirical results, discussion, conclusions, and references.
Literature Review
Managi et al. (
2012) report no statistical difference in means and volatilities generated from the SRI indices and conventional indices in neither of the studied regions (the US, the UK, and Japan). Furthermore, they found strong comovements between the two indices in both regimes (bear and bull). In contrast,
Ortas et al. (
2014) found that in the period of the global financial crisis of 2008, social and responsible investment strategies were less risky in comparison to conventional investments.
There is no consensus in the literature as to whether ESG investments are characterized by very high returns and very low risks compared to conventional ones.
Weber and Ang (
2016) analyzed the performance of an emerging market SRI index concerning its financial performance compared to conventional indices. Their results indicate that the SRI index outperformed in terms of mean return the majority of the conventional emerging market portfolios. Similarly,
Verheyden et al. (
2016) found that both global and developed-markets portfolios (a 10% best-in-class ESG screening approach) show higher returns, lower (tail) risk, and no significant reduction of diversification potential.
On the other hand,
Giese and Lee (
2019) reported no clear consensus on whether ESG criteria have enhanced risk-adjusted returns. The empirical findings in the report on ESG indices performance (
HKEX 2020), indicate that many ESG indices tended to have similar, if not better, risk-return performances then their parent indices. This implies that investment in ESG indices may provide equally good or even better returns while pursuing an ethics-focused investment strategy.
Ouchen (
2021) empirically verified whether the series of returns of an ESG index was less volatile than that of a conventional stock index. He concluded that the ESG index was relatively less turbulent than the stock index.
Jain et al. (
2019) reported there is no significant difference in the performance between sustainable indices and conventional indices.
Plastun et al. (
2022) investigated returns on ESG and conventional indices. They showed no significant differences between ESG and conventional indices. The types of price effects detected by them were the same for the cases of ESG and conventional indices (but their power was different in some cases).
Charles et al. (
2016) compared the risk-adjusted performance between ESG and conventional indices, as well as within the ESG indices, examining it based on standard and tail risk measures. They showed that the ESG screens for equities lead neither to a significant outperformance nor an underperformance compared to the benchmarks. They also indicated that the weights used to construct these indices (sustainability-score weights vs. market cap-weights) seemed to impact their risk and performance.
Apergis et al. (
2015) employed a standard cointegration methodology and a novel time-varying quantile cointegration approach to investigate whether the US Dow Jones Sustainability Index and its conventional parent index are integrated. The results confirmed the presence of an asymmetric long-run relationship between these indices that is not detected by the standard methodology of cointegration.
Classical Markowitz portfolio theory does not consider the role currently played by the ESG investments on the market. It applies risk and return as single criteria, assuming that investors are rational and seek the highest return at the lowest level of risk, and their utility functions are convex (
Markowitz 1952). Incorporating the ESG-based factor into the portfolio selection problem,
Pedersen et al. (
2021) proposed a hypothesis that explained how the increasingly widespread adoption of ESG affected portfolio choice and equilibrium asset prices.
In the case of dependency modeling for classical portfolio theory, linear correlation coefficients were used (assuming elliptic distributions of returns). The problem appears because the returns are not correlated strongly when they are around zero; however, the correlation increases in the tails. Then an appropriate tool for dependency modeling is the copula function (
Sklar 1959).
Empirical research indicates that stock returns also display an asymmetric dependence in growing and declining markets, i.e., this dependence may be stronger in bearish markets than in bullish markets and tends to increase in the periods of violent fluctuations of prices (
Ang and Bekaert 2002;
Jondeau 2016;
Longin and Solnik 2001). Because of the detected asymptotic dependence of random variables in tails (
Patton 2006), the authors apply copula functions. This approach allows investigating the dependence in variance and in tails, which is not possible using standard dependence measures.
While more than 2000 empirical studies have been conducted analyzing the ESG factors and financial performance, still little is known about the dependence structure and the associated risks (
Friede 2019). This is especially important as ESG scores are often related to investment risk (
Bax et al. 2021).
Due to the imperfections of financial time series (they are not normally distributed) and correlation coefficients (which measure linear relationship and are constant in time), we applied GARCH family models and copula functions accompanied with some heavy-tailed marginal distributions.
2. Data and Methods
2.1. Research Methodology
The ability to forecast volatility of assets is vital for portfolio selection and asset management, as well as for the pricing of primary and derivative assets (
Engle and Ng 1993).
Early studies point to volatility clustering, leptokurtosis, and the leverage effect in stock-returns time series (
Mandelbrot 1963) and (
Fama and Fama 1965). The additional features of financial time series observed across different financial assets (stocks, stock indices, exchange rates) are as follows: stationarity, fat tails, asymmetry, aggregational Gaussianity, quasi-long-range dependence, and seasonality (e.g.,
Rydberg 2000;
Taylor 1986). In the GARCH model, the variance is influenced by the square of the lagged innovation. However in the equity returns the leverage effect (higher impact of negative shocks on volatility) is observed, the simple GARCH model fails to describe it. GARCH (1,1) with a generalized residuals distribution can capture more volatility assessment than other models. On the other hand, the impact of asymmetry on stock market volatility and return analysis is beyond the descriptive power of the asymmetric GARCH models, which could capture more details.
There are several limitations to GARCH models. The most important one is the inability to capture the asymmetric performance. For that reason, EGARCH, GJR-GARCH, and APGARCH models were proposed. Furthermore, the asymmetric GARCH models can measure the effect of positive or negative shocks on stock market returns and volatility incompletely, and the GARCH (1,1) comparatively fails to accomplish this. The GJR-GARCH model performs better in the face of asymmetry, producing a predictable conditional variance during the period of high volatility. In addition, among the asymmetric GARCH models, the performance of the EGARCH model appeared to be superior.
Based on the properties of the studied time series: volatility clustering, leptokurtosis, asymmetry, leverage effects, mean-reversion, and stationarity—we apply the following models from the GARCH family: GARCH, EGARCH, GJR-GARCH, APGARCH, and AVGARCH in the study. We selected the GARCH model using the Akaike (AIC) and Bayesian (BIC) information criteria.
Table A2,
Table A3,
Table A4 and
Table A5 present only the results of estimation for the best-fitted models according to these criteria (with the minimum criteria values).
The autoregressive conditional heteroskedasticity (ARCH) models were introduced by
Engle (
1982) and their generalization, the GARCH models, by
Bollerslev (
1986). The standard GARCH(
q,
p) model (
Bollerslev 1986) may be written as:
where
is the conditional variance,
the intercept, and
the residuals from the ARMA model.
Some researchers pointed out limitations of the GARCH model. The most important one is that GARCH cannot capture asymmetric performance. Later, for improving this problem, EGARCH, GJR-GARCH, and APGARCH were proposed.
The exponential GARCH (EGARCH) is designed to model the logarithm of the variance rather than the level, and this model accounts for an asymmetric response to a shock. The exponential GARCH model of
Nelson (
1991) is defined as:
where the coefficient
captures the sign effect and
the size effect.
is the expected value of the absolute standardized innovation
.
The Glosten–Jagannathan–Runkle GARCH (GJR-GARCH) as GARCH model captures features of financial time series like leptokurtic returns and volatility clustering.However, the GJR-GARCH model of
Glosten et al. (
1993) models positive and negative shocks on the conditional variance asymmetrically by the use of the indicator function
I:
where
now represents the ’leverage’ term. The indicator function
takes on value of
for
and zero otherwise.
The asymmetric power ARCH (APARCH) model of
Ding et al. (
1993) allows for both leverage and the Taylor effect, named after
Taylor (
1986) who observed that the sample autocorrelation of absolute returns was usually larger than that of squared returns:
where
, being a Box–Cox transformation of
, and
the coefficient in the leverage term.
The absolute value GARCH (AVGARCH) model of
Taylor (
1986) and
Schwert (
1990):
where
and
are rotations and shifts parameters respectively.
This paper examines the structure of interdependence between ESG and conventional indices. In order to achieve this goal, we fit different theoretical distributions to the series of returns. Then, we assumed the best-fitted distribution describing the process of returns as a marginal distribution applied for our copula estimation. We used the two-stage maximum likelihood method to estimate the parameters of the considered two-dimensional copulas. In addition to testing the goodness of fit of alternative copulas, we also verified a set of hypotheses relating to the correlation matrix.
The concept of copula, was first introduced by
Sklar (
1959). The theoretical background for copulas is provided by Sklar’s theorem. There exists a copula
C such that:
where
is two-dimensional joint distribution with the marginal distributions
,
of random variables
. If
are continuous, the copula
is unique:
where
,
for
.
One may use a wide range of parametric copula families to capture the different structures of dependence (e.g., Gaussian, Archimedean). In the paper, we applied the following copulas: Student’s t, Joe–Clayton (a combination of the Joe copula and the Clayton copula), Gumbel–Clayton (a combination of the Clayton copula and the Gumbel copula), and survival. The Gaussian copula does not capture tail dependence, Student’s t-copula has symmetric tail dependence in both lower and upper tails, and the Clayton and Gumbel copulas have only lower and upper tail dependence, respectively. Survival copulas correspond to rotation by 180 degrees.
For two dimensions following copulas are defined as (
Patton 2006):
- (1)
Gaussian/normal (
N) copula
where
N is the normal joint distribution and
is the quantile of the univariate normal distribution;
- (2)
Student’s t/t (
t) copula
where
,
is the joint Student’s t distribution and
is the univariate Student’s t distribution with
ν degrees of freedom;
- (3)
Clayton–Gumbel (
BB1) copula
where
,
;
- (4)
Joe–Clayton (
BB7) copula
where
,
,
,
.
Survival copula is the copula of (1 − u1) and (1 − u2) instead of u1 and u2, respectively. Its function can measure the asymmetric dependence on the opposite side of the distribution as compared to the original function.
Our focus was on the extreme downside market risk, so we investigated the lower tail dependence in detail. The tail-dependence coefficients (
Patton 2006) are:
Lower tail-dependence coefficient:
Upper tail-dependence coefficient:
in the case that the limit exists,
and (
), dependence is present.
The bivariate normal distribution is tail independent, the bivariate Student’s t-distribution exhibits the same upper and lower tail dependence, the bivariate Joe–Clayton and Gumbel–Clayton distributions have both lower and upper tail dependence. The concept of tail dependence is embedded within the copula theory.
Instead of Pearson’s correlation coefficient in the copula theory, we use the Kendall’s τ coefficient (
Nelsen 2006;
Patton 2006). For each pair, the Kendall’s τ is estimated and the
p-values of the independence test based on the Kendall’s τ were determined in the study.
In this paper, we used functions from the VineCopula library in R (
Stoeber et al. 2018). We made the following assumptions during the estimation process: we took 39 copulas into consideration, we applied the Maximum Likelihood Estimation (MLE) method, and we used the AIC selection criterion to select the best-fitted copula. To measure the discrepancy between a hypothesized model and the empirical model, we used the goodness-of-fit (GoF) statistics based on the Kendall’s process as proposed by
Wang and Wells (
2000). For computation of
p-values, the parametric bootstrap described by
Genest et al. (
2006) was used.
2.2. Data Description
In the study, we used five selected ESG indices (presented in
Table A1) and two stock indices—Dow Jones Industry and the S&P 500. Daily logarithmic rates of return for them were calculated, as a difference between logarithms of two consecutive closing prices multiplied by 100 (they can be interpreted as percentage changes). The data set was retrieved from the Thomson Reuters database. In
Table 1, Table 4, Table 7 and Table 10 descriptive statistics for rates of return of selected indices were given. The period of analysis covered 3 July 2007 until 31 December 2019. We considered, based on important market events (e.g., global financial crisis, debt crisis in the EU, fall in oil prices) and data requirements from the models’ perspectives, the following four subperiods:
3 July 2007–30 January 2009 (388 observations)—the global financial crisis period;
2 December 2009–30 July 2014 (1140 observations)—the period of debt crisis in the EU;
3 June 2014–31 January 2017 (633 observations)—the Russian financial crises, fall in oil prices;
3 January 2017–31 December 2019 (746 observations)—stabilization period.
4. Discussion
Our study does not confirm the outperformance of the ESG indices compared to conventional ones in terms of risk in the considered subperiods. However, generalization of the results is limited by the selection of a market index as a proxy of an investment portfolio. Other empirical studies demonstrate a strong correlation between the lower risk related to sustainability and better financial performance (
Whelan et al. 2021). During the 2008 global financial crisis
Fernández et al. (
2019) found that German green mutual funds had risk-adjusted returns slightly better than their peers (in the noncrisis period, they were equal to conventional funds but better than the SRI funds). Similarly, ESG stock indices performed better and recovered faster after the 2008 global financial crisis (
Wu et al. 2017). Other results confirm these findings, as in economic downturns, the high-rated mutual funds outperformed the low-rated funds, based on the Sharpe ratio (
Das et al. 2018a,
2018b;
Khajenouri and Schmidt 2020). In line are research by
Abate et al. (
2021) that mutual funds investing in high ESG stocks perform better than investing in low ESG score stocks.
Gil-Bazo et al. (
2010) confirm that SRI funds perform better than their conventional counterparts.
In the study we evaluate whether conventional stock portfolios, including ESG companies, can effectively decrease portfolio risk, especially in times of financial distress on the market. Generally, we observe in all subperiods almost-weak to high lower-tail dependence between the ESG and conventional indices. Our findings indicate that there is low or symmetric tail dependence between ESG and conventional indices, meaning that if the ESG index decreases, the conventional one will also decrease accordingly. In the two first subperiods (economic downturn periods) lower tail dependence coefficients are higher comparing to the next two periods (‘stabilization’ periods). We conclude that dependencies exist, can be quantified, and are not negligible, especially in times of crisis.
In risk management of an asset portfolio, we are interested whether the decline of one (or more) assets may influence the behavior of the other assets in a portfolio. Especially, the occurrence of simultaneous extreme events on the market implies that risk diversification breaks down just when it is crucial. In case of extreme events, classical dependence analysis (e.g., linear correlation) fails, a copula approach is used. Some researchers argue that considering ESG practices when creating an equity portfolio (selecting companies with high ESG scores) can act as protection against left-tail risk; therefore, reducing ex-ante expectations of a left-tail event (
Shafer and Szado 2020;
De and Clayman 2015;
Djoutsa Wamba et al. 2020).
The measurement of the tail dependence between ESG and conventional investment based on the copula approach, allows to monitor extreme risks between them. Understanding dependencies and risks is important for setting up adequate risk management as well as construction portfolios—ESG-diversified and resilient to crises.
Bax et al. (
2021) use the R-vine copula ESG risk model. They estimate all the conditional dependencies among assets as well as specify their interactions as modeled by different copulas families, but they also introduce three ESG risk measures that capture ESG risk, market risk conditionally on the ESG class, as well as an idiosyncratic risk component.
5. Conclusions
As interest in ESG investments grows, it is crucial to better understand the various risks and return tradeoffs between ESG and conventional stocks and the dependence structures between them. We used GARCH family models to estimate conditional volatilities and the copula approach, in particular the (tail) dependence structure between ESG and conventional investments to quantify the overall and lower tail risk between these two investments.
Hypothesis 1, that ESG investments outperform conventional ones in terms of risk was negatively verified—there are indices that underperform the conventional indices in selected subperiods. Hypothesis 2, that asymptotic dependence increases during the crisis on the market (declines of stock market indices), and it stabilizes during non-crisis periods was positively verified.
The findings indicate that there is no significant difference in daily returns of ESG indices and conventional ones.
The parameters are significant among all the estimations and comparisons, showing that GARCH family models may appropriately model the ESG and conventional index data. In most subperiods, the data for the ESG and conventional indices were negatively skewed. The EGARCH models this type of behavior, but in this study, also AVGARCH model was applied. Surprisingly, GJR-GARCH was not often used.
We found relations between the selected ESG indices. A1GSI is related in construction to S&P 500, while three other ESG indices—SXWESGP, SEESGSEP, and SGESGSEP are related to each other. These relations were visible in similar behavior of the rates of return. Volatility clustering observed at S&P 500, DJI, and A1SGI was different from that in other ESG indices (but the scale of these differences depends on the subperiod). In the first subperiod, one cluster was observed for all the indices (there is no difference visible). In the second and third subperiod, two clusters were found for all indices, but the difference between these two groups of indices was present. The most visible differences were observed in the last subperiod. Only in the second period, results of model estimation are consistent for all the indices (ARMA-EGARCH models with innovations sged and sstd).
We cannot confirm that the volatility of conventional indices should be modeled using different models than the volatility of the ESG—this depends on the time period and the market events. There are periods when volatility of all the indices may be modeled by using the same models, when there is a difference between the modeling of conventional and ESG indices, and when there is a difference between the modeling of ESG indices.
The characteristics of the time series indicated the need to apply to the dependence analysis copula approach. To capture tail dependence, Student’s t-copula and the combination of the Clayton and Gumbel copulas were fitted best in this study. The choice of an appropriate copula function is crucial. Two features are important regarding the copula selection. The general structure of the chosen copula should coincide with the dependence structure of the real data. If the data show tail dependence than we must apply a copula which comprises tail dependence.
In the periods of economic downturn (declines of stock market indices), the dependencies measured by Kendell’s tau coefficients were higher than in the less turbulent periods.
The lower tail dependence and symmetric dependence between ESG and conventional investments were detected. High values of low tail-dependence coefficients were observed in the economic downturn periods; low in stabilization periods. This signifies higher dependence of extreme values in the economic downturn periods and low dependence of extreme values in stabilization periods.
We conclude that when selecting the right model, the preliminary analysis of data is necessary, and the selection of the volatility model should be carried out for subperiods regarding different market-event characteristics. Results show, as in cases of systemic risk, that analysis of volatility and dependence structure should be carried out separately—in the periods of economic downturn and in less turbulent times.
To extend this research for the future, a more detailed analysis, including ESG and non-ESG companies from different markets (developed and developing) would be beneficial. In addition, application of a time-varying model (time-varying copula) would give insights into dependence structure.