Robust Inference in the Capital Asset Pricing Model Using the Multivariate t-distribution

Galea, Manuel; Cademartori, David; Curci, Roberto; Molina, Alonso

doi:10.3390/jrfm13060123

Open AccessArticle

Robust Inference in the Capital Asset Pricing Model Using the Multivariate t-distribution

¹

Departamento de Estadística, Pontificia Universidad Católica de Chile, Avenida Vicuña Mackenna 4860, Santiago 7820436, Chile

²

Escuela de Comercio, Pontificia Universidad Católica de Valparaíso, Avenida Brasil 2830, Valparaíso 2340031, Chile

³

Brennan School of Business, Dominican University, River Forest, IL 60305, USA

^*

Author to whom correspondence should be addressed.

J. Risk Financial Manag. 2020, 13(6), 123; https://doi.org/10.3390/jrfm13060123

Submission received: 1 May 2020 / Revised: 9 June 2020 / Accepted: 9 June 2020 / Published: 13 June 2020

(This article belongs to the Special Issue Financial Statistics and Data Analytics)

Download

Browse Figures

Versions Notes

Abstract

In this paper, we consider asset pricing models under the multivariate t-distribution with finite second moment. Such a distribution, which contains the normal distribution, offers a more flexible framework for modeling asset returns. The main objective of this work is to develop statistical inference tools, such as parameter estimation and linear hypothesis tests in asset pricing models, with an emphasis on the Capital Asset Pricing Model (CAPM). An extension of the CAPM, the Multifactor Asset Pricing Model (MAPM), is also discussed. A simple algorithm to estimate the model parameters, including the kurtosis parameter, is implemented. Analytical expressions for the Score function and Fisher information matrix are provided. For linear hypothesis tests, the four most widely used tests (likelihood-ratio, Wald, score, and gradient statistics) are considered. In order to test the mean-variance efficiency, explicit expressions for these four statistical tests are also presented. The results are illustrated using two real data sets: the Chilean Stock Market data set and another from the New York Stock Exchange. The asset pricing model under the multivariate t-distribution presents a good fit, clearly better than the asset pricing model under the assumption of normality, in both data sets.

Keywords:

capital asset pricing model; estimation of systematic risk; tests of mean-variance efficiency; t-distribution; generalized method of moments; multifactor asset pricing model

1. Introduction

The Capital Asset Pricing Model (CAPM) is one of the most important asset pricing models in financial economics. It is widely used in estimating the cost of capital for companies and measuring portfolio (or investment fund) performance, among others applications; see, for instance, Campbell et al. (1997), Amenc and Le Sourd (2003), Broquet et al. (2004), Levy (2012) and Ejara et al. (2019).

The CAPM framework provides financial practitioners with a measure of

b e t a

(or systematic risk) for entire stock markets, industry sub-sectors, and individual equities (Pereiro 2010).

The literature on the CAPM based on the multivariate normal distribution is vast, as seen, for instance, in the works published by Elton and Gruber (1995), Campbell et al. (1997), Broquet et al. (2004), Francis and Kim (2013), Johnson (2014), Brandimarte (2018) and Mazzoni (2018). However, multivariate normality is not required to ensure the validity of the CAPM. In fact, it is well known that the CAPM is still valid within the class of elliptical distributions, of which multivariate normal and multivariate t-distributions are special cases (see Chamberlain 1983; Hamada and Valdez 2008; Ingersoll 1987; Owen and Rabinovitch 1983). It is also well known that in practice, excess returns are not normally distributed. Most financial assets exhibit excess kurtosis, that is to say, returns having distributions whose tails are heavier than those of the normal distribution and present some degree of skewness; see Fama (1965), Blattberg and Gonedes (1974), Zhou (1993), Campbell et al. (1997), Bekaert and Wu (2000), Chen et al. (2001), Hodgson et al. (2002) and Vorknik (2003). Recently Bao et al. (2018) discuss estimation in the univariate CAPM with asymmetric power distributed errors. In this paper, the multivariate version of the CAPM is considered, primarily focusing on modeling non-normal returns due to excess kurtosis.

Within the class of elliptical distributions, the multivariate t-distribution has been widely used to model data with heavy tails. For instance, Lange et al. (1989) discuss the use of the t-distribution in regression and in problems related to multivariate analysis. Sutradhar (1993) has considered a score test aimed at testing if the covariance matrix is equal to some specified covariance matrix using the t-distribution; Bolfarine and Galea (1996) used the t-distribution in structural comparative calibration models, while Pinheiro et al. (2001) used the multivariate t-distribution for robust estimation in linear mixed-effects models. Cademartori et al. (2003), Fiorentini et al. (2003), Galea et al. (2008), Galea et al. (2010) and Kan and Zhou (2017) provide empirical evidence of the usefulness of t-distribution to model stock returns. In addition, statistical inference based on the t-distribution is simple to implement, and the computational cost is considerably low.

Following Kan and Zhou (2017), there are three main reasons for using the t-distribution in modeling returns of financial instruments. (i) empirical evidence shows that this distribution is appropriate for modeling non-normal returns in many situations, (ii) with the algorithms implemented in this paper, the t-distribution has become almost as tractable as the normal one, and (iii) the CAPM is still valid under t-distribution. It is clear that the t-distribution does not describe all the features of the return data. For instance, the volatility variation over time is one of them, for which the GARCH models are very useful. However, according to our experience (see Cademartori et al. 2003; Galea et al. 2008, 2010; Galea and Giménez 2019), and as mentioned by Kan and Zhou (2017), there is little evidence of GARCH effects on the monthly data that are typically used for asset pricing and corporate studies. In addition, when we have a moderate number of assets, for example more than 10, the fit of the GARCH models requires an important computational effort, which limits its application to real data sets. For more details see Harvey and Zhou (1993) and Kan and Zhou (2017).

Thus, the main goal of this paper is to develop statistical inference tools, such as parameter estimation and hypothesis tests, in asset pricing models, with an emphasis on the CAPM, using the multivariate t-distribution. An extension of the CAPM, the multifactor asset pricing model (MAPM), is also discussed. The t-distribution incorporates an additional parameter, which allows modeling returns with high kurtosis. We consider a reparameterization of the multivariate t-distribution with a finite second moment. This enables a more direct comparison with the normal distribution (see Bolfarine and Galea 1996; Sutradhar 1993). Based on Fiorentini et al. (2003), who use the reparameterization of degrees of freedom suggested by Lange et al. (1989) to model financial data, this version of the multivariate t-distribution is adopted to test hypotheses of interest, such as the hypothesis of mean-variance efficiency. The three most widely used tests based on the likelihood function are considered; Wald tests, likelihood-ratio tests, and score tests (also known as Lagrange multiplier tests). Under the assumption of normality, these tests have been discussed in the literature, see for instance Campbell et al. (1997) and Chou and Lin (2002). Recently Kan and Zhou (2017) discuss the likelihood-ratio tests in the CAPM assuming that the excess returns follow a multivariate t-distribution. In this paper, the modeling of the asset returns conditional on market portfolios and the three most widely used tests are considered. Additionally, a fourth test statistic is considered, based on the likelihood proposed by Terrell (2002), the gradient test. To our knowledge this test has not been applied to test hypothesis in asset pricing models.

The article is developed as follows. In Section 2, the CAPM under the multivariate t-distribution, estimation of parameter and tests of mean-variance efficiency are briefly reviewed, and the Generalized Method of Moments is summarized for comparative purposes. In Section 3, the methodology developed in this paper is applied to two real data sets: the Chilean Stock Market data set and another from the New York Stock Exchange, USA. In Section 4, multifactor asset pricing models under the t-distribution are discussed. In Section 5, a conclusion and final comments are included. The appendices contain technical details.

2. Methodology

2.1. The CAPM under the t-Distribution

First, a set of

p \geq 1

assets of interest is considered, and let

R_{i}

denote the return for asset i, with

i = 1, \dots, p

. CAPM specifies that the stock’s expected return is equal to the risk-free rate return plus a risk premium; i.e.,

E (R_{i}) = R_{f} + β_{i} {E (R_{m}) - R_{f}}, i = 1, \dots, p,

(1)

where

R_{f}

is the risk-free interest rate,

β_{i}

is the systematic risk of the asset i, and

R_{m}

is the market return. This model was independently derived by Sharpe (1964), Lintner (1965) and Mossin (1966). For these p assets, the excess returns can be described using the following multivariate linear regression model; Gibbons et al. (1989), MacKinlay and Richardson (1991) and Campbell et al. (1997),

y_{t} = α + β x_{t} + ϵ_{t}, t = 1, \dots, n,

(2)

where

y_{t} = {(y_{1 t}, \dots, y_{p t})}^{T}

is a

p \times 1

vector representing excess returns of the set of p assets of interest in period t such that,

y_{i t} = R_{i t} - R_{f t}

denotes the excess return of asset i during period t,

α = {(α_{1}, \dots, α_{p})}^{T}

is the intercept vector,

β = {(β_{1}, \dots, β_{p})}^{T}

is the slope vector that corresponds to the sensitivity of the portfolio return to changes in this benchmark return;

x_{t} = R_{m t} - R_{f t}

represents the excess return of the market portfolio during period t and finally

ϵ_{t}

is the errors vector during period t, with mean zero and variance-covariance matrix

Σ

, independent of t, for

t = 1, \dots, n

. If the CAPM holds for this set of assets and the benchmark portfolio is mean-variance efficient, the following restriction on the parameters of model (2) should hold

E (y_{t}) = β x_{t}

, for

t = 1, \dots, n

. Hence, this restriction implies a testable hypothesis:

H_{α} : α = 0 .

(3)

Much of the theory of the CAPM is based on the assumption that excess returns follow a multivariate normal distribution; see for instance Campbell et al. (1997), Broquet et al. (2004), Johnson (2014), Brandimarte (2018), Mazzoni (2018) and Galea and Giménez (2019). However, it has been shown that although the assumption of normality is sufficient to generate the model (1), it is not necessary. Chamberlain (1983), Owen and Rabinovitch (1983), Ingersoll (1987), Berk (1997) and most recently Hamada and Valdez (2008) show that (1) can be obtained under the assumption of elliptically symmetric return distributions. In particular, Berk (1997) showed that when agents maximize the expected utility, elliptical symmetry is both necessary and sufficient for the CAPM.

In this paper, we are interested in develop statistical inference tools, estimation and hypothesis tests in asset pricing models supposing that

ϵ_{t}

, the random errors vector following a multivariate t-distribution, has a mean zero and a covariance matrix

Σ

. In effect, we supposed that the density function of

ϵ_{t}

is given by

f (ϵ) = {| Σ |}^{- 1 / 2} g (δ), δ \geq 0,

(4)

where

g (δ) = k_{p} (η) {(1 + c (η) δ)}^{- \frac{1}{2 η} (1 + η p)},

with

δ = ϵ^{T} Σ^{- 1} ϵ

,

k_{p} (η) = {(c (η) / π)}^{p / 2} {Γ ((1 + η p) / 2 η) / Γ (1 / 2 η)}

and

c (η) = η / (1 - 2 η)

,

0 < η < 1 / 2

. In this case we wrote

ϵ_{t} \sim T_{p} (0, Σ, η)

. From properties of the t-distribution (see Appendix A), we have, given

x_{t}

, that

y_{t} \sim T_{p} (α + β x_{t}, Σ, η)

independently,

t = 1, \dots, n

. The t-distribution offers a more flexible framework for modeling asset returns. In this distribution

η

is a shape parameter that can be used for adjusting the kurtosis distribution and for providing more robust procedures than the ones that use the normal distribution, with moderate additional computational effort.

Following Campbell et al. (1997), we consider the joint distribution of the excess returns given the excess return market. Specifically, we assume that the excess returns

y_{1}, \dots, y_{n}

, given the excess return market, are independent random vectors with a multivariate t-distribution and common covariance matrix. Then, the probability density function of

y_{t}

takes the form of

f (y_{t} | θ) = {| Σ |}^{- 1 / 2} g (δ_{t}),

(5)

where,

δ_{t} = {(y_{t} - α - β x_{t})}^{T} Σ^{- 1} (y_{t} - α - β x_{t})

, for

t = 1, \dots, n

. Therefore, the density for a sample of n periods is given by

f (Y | θ) = \prod_{t = 1}^{n} f (y_{t} | θ) = \prod_{t = 1}^{n} {| Σ |}^{- 1 / 2} g (δ_{t}),

(6)

with

Y = (y_{1}, \dots, y_{n})

and

θ = {(α^{T}, β^{T}, σ^{T}, η)}^{T}

, where

σ = vech (Σ)

is the

p (p + 1) / 2

vector obtained from

vec (Σ)

by deleting from it all of the elements that are above the diagonal of

Σ

.

2.2. Maximum Likelihood Estimation

The logarithm of the likelihood function for the model (6) is given by

L (θ) = \sum_{t = 1}^{n} L_{t} (θ),

(7)

where

L_{t} (θ) = - \frac{1}{2} log | Σ | + log {g (δ_{t})} = log k_{p} (η) - \frac{1}{2} log | Σ | - \frac{1}{2 η} (1 + η p) log (1 + c (η) δ_{t})

is the contribution from the tth return to the likelihood,

t = 1, 2, \dots, n

.

From (7), the score function is given by

U (θ) = \sum_{t = 1}^{n} U_{t} (θ),

(8)

where

U_{t} (θ) = {(U_{t α}^{T}, U_{t β}^{T}, U_{t σ}^{T}, U_{t η})}^{T}

with

\begin{matrix} U_{t α} & = & ω_{t} Σ^{- 1} ϵ_{t}, \\ U_{t β} & = & x_{t} U_{t α}, \\ U_{t σ} & = & - \frac{1}{2} D_{p}^{T} vec (Σ^{- 1} - ω_{t} Σ^{- 1} ϵ_{t} ϵ_{t}^{T} Σ^{- 1}), and \\ U_{t η} & = & \frac{1}{2 η^{2}} {c (η) p - β (η) - ω_{t} c (η) δ_{t} + log (1 + c (η) δ_{t})}, \end{matrix}

where

ω_{t} = (\frac{1 + η p}{η}) (\frac{c (η)}{1 + c (η) δ_{t}})

, for

t = 1, \dots, n

;

β (η) = ψ (\frac{1}{2 η} (1 + η p)) - ψ (\frac{1}{2 η})

;

ψ (x)

is the digamma function and

D_{p}

is the duplication matrix; see Magnus and Neudecker (2007). It is difficult to obtain the maximum likelihood (ML) estimators from

U (θ) = 0

. The EM algorithm has been suggested frequently to obtain ML estimators in statistical models under the t-distribution, mainly because it leads to a simple implementation of an iteratively weighted estimation procedure. As is well known, the t-distribution is a scale mixture of a normal distribution (see Lange et al. 1989), which facilitates the implementation of the EM algorithm considerably. Then, based on the complete-data log-likelihood function we obtained the expressions of ML estimates (see Liu and Rubin 1995; Shoham 2002; Xie et al. 2007). Thus, in our case, as shown in Appendix B, the ML estimates of

α

,

β

,

Σ

and

η

are obtained as solution of the following equations:

\hat{α} = {\bar{y}}_{ω} - \hat{β} {\bar{x}}_{ω}, \hat{β} = \frac{\sum_{t = 1}^{n} ω_{t} (x_{t} - {\bar{x}}_{ω}) (y_{t} - {\bar{y}}_{ω})}{\sum_{t = 1}^{n} ω_{t} {(x_{t} - {\bar{x}}_{ω})}^{2}}, \hat{Σ} = \frac{1}{n} \sum_{t = 1}^{n} ω_{t} {\hat{ϵ}}_{t} {\hat{ϵ}}_{t}^{T},

(9)

and

{\hat{η}}^{- 1} = \frac{2}{a + log a - 1} + 0.0416 \{1 + e r f (0.6594 log (\frac{2.1971}{a + log a - 1}))\},

where

{\hat{ϵ}}_{t} = y_{t} - \hat{α} - \hat{β} x_{t}

;

{\bar{y}}_{ω} = \sum_{t = 1}^{n} ω_{t} y_{t} / \sum_{t = 1}^{n} ω_{t}

;

{\bar{x}}_{ω} = \sum_{t = 1}^{n} ω_{t} x_{t} / \sum_{t = 1}^{n} ω_{t}

;

e r f (x) = \frac{2}{\sqrt{π}} \int_{0}^{x} \exp (- u^{2}) d u

,

a = - (1 / n) \sum_{t = 1}^{n} (v_{t 2} - v_{t 1})

, with

v_{t 1} = (1 + p η) / (1 + c (η) δ_{t})

and

v_{t 2} = ψ (\frac{1 + p η}{2 η}) - log (\frac{1 + c (η) δ_{t}}{2 η})

, for

t = 1, \dots, n

. The iterative process given by Equation (9) was implemented in

R

language. Note that the normal model

ω_{t} = 1, t = 1, \dots, n

and the ML estimators of

α

,

β

and

Σ

correspond to the normal case. Under the

t

-model, the exact marginal distribution of

\hat{α}

,

\hat{β}

and

\hat{Σ}

are particularly difficult to obtain, but under normal distribution, the estimators of

α

,

β

and

Σ

have exact marginal distributions (see Campbell et al. 1997).

2.3. Asymptotic Standard Errors

The standard errors of the ML estimators can be estimated using the expected information matrix. For a multivariate elliptically symmetric distribution, Lange et al. (1989) indicated how to compute the expected information matrix. See also Mitchell (1989). In our case, by using score function (8), the Fisher information matrix for

θ

in the log-likelihood function defined in (7) assumes the form

J = E {U (θ) U^{T} (θ)} = (\begin{matrix} J_{11} & 0 & 0 \\ 0 & J_{22} & J_{23} \\ 0 & J_{23}^{T} & J_{33} \end{matrix}),

(10)

where

J_{11}

,

J_{22}

,

J_{23}

and

J_{33}

denote information concerning

(α, β)

,

σ

,

(σ, η)

and

η

, respectively, and are given by

\begin{matrix} J_{11} & = & c_{α} (η) (X^{T} X) \otimes Σ^{- 1}, \\ J_{22} & = & \frac{n}{4} D_{p}^{T} {2 c_{σ} (η) (Σ^{- 1} \otimes Σ^{- 1}) N_{p} + (c_{σ} (η) - 1) (vec Σ^{- 1}) {(vec Σ^{- 1})}^{T}} D_{p}, \\ J_{23} & = & - \frac{n c (η) c_{σ} (η) (p + 2)}{{(1 + p η)}^{2}} D_{p}^{T} vec Σ^{- 1}, \\ J_{33} & = & - \frac{n}{2 η^{2}} \{(\frac{p}{{(1 - 2 η)}^{2}}) (\frac{1 + η p (1 - 4 η) - 8 η^{2}}{(1 + η p) (1 + η (p + 2))}) - β^{'} (η)\}, \end{matrix}

with

N_{p} = \frac{1}{2} (I_{p^{2}} + K_{p})

where

K_{p}

is the commutation matrix of order

p^{2} \times p^{2}

(Magnus and Neudecker 2007);

c_{α} (η) = c_{σ} (η) / (1 - 2 η)

,

c_{σ} (η) = (1 + p η) / (1 + (p + 2) η)

; and

β^{'} (η) = - \frac{1}{2 η^{2}} \{ψ^{'} (\frac{1 + p η}{2 η}) - ψ^{'} (\frac{1}{2 η})\},

where

ψ^{'} (z)

denotes the trigamma function. Note that

c_{α} (η) = c_{σ} (η) = 1

when

η = 0

and

N_{p} D_{p} = D_{p}

(see for instance Magnus and Neudecker 2007), we have to recover the expressions corresponding to the normal case. Here, X is an

n \times 2

matrix such

X^{T} = (\begin{matrix} 1 & \cdot & \cdot & \cdot & 1 \\ x_{1} & \cdot & \cdot & \cdot & x_{n} \end{matrix})

and ⊗ denotes the Kronecker product. The asymptotic sampling distribution of the ML estimator

\hat{θ}

is given by

\begin{matrix} \sqrt{n} (\hat{θ} - θ) \overset{D}{\mapsto} N_{r} (0, V^{- 1}), \end{matrix}

where

V = {lim}_{n \to \infty} (1 / n) J

and

r = {p (p + 5) + 2} / 2

is the dimension of

θ

. To estimate V, we use

\hat{V} = J (\hat{θ}) / n

.

2.4. Test of Mean-Variance Efficiency

To test

H_{α} : α = 0

the three classic tests based on the likelihood function are considered, including the Wald test, likelihood-ratio test, and score test; see for instance Boos and Stefanski (2013). Let

θ = {(α^{T}, θ_{2}^{T})}^{T}

, with

θ_{2} = {(β^{T}, σ^{T}, η)}^{T}

and

U (θ) = {(U_{α}^{T} (θ), U_{2}^{T} (θ))}^{T}

the score function (8) partitioned following the partition of

θ

. In this case, after some algebraic manipulations, the test statistics are given by

\begin{matrix} L r & = n log {| \tilde{Σ} | / | \hat{Σ} |} + 2 \sum_{t = 1}^{n} log {g ({\hat{δ}}_{t}) / g ({\tilde{δ}}_{t})}, \\ W a & = n c_{α} (\hat{η}) {(1 + {\bar{x}}^{2} / s^{2})}^{- 1} {\hat{α}}^{T} {\hat{Σ}}^{- 1} \hat{α}, \\ S c & = \frac{1}{n c_{α}^{- 1} (\tilde{η})} (1 + {\bar{x}}^{2} / s^{2}) d^{T} {\tilde{Σ}}^{- 1} d, \end{matrix}

where

\bar{x} = (1 / n) \sum_{t = 1}^{n} x_{t}

,

s^{2} = (1 / n) \sum_{t = 1}^{n} {(x_{t} - \bar{x})}^{2}

,

{\hat{δ}}_{t} = {\hat{ϵ}}_{t}^{T} {\hat{Σ}}^{- 1} {\hat{ϵ}}_{t}

,

{\tilde{δ}}_{t} = {(y_{t} - \tilde{β} x_{t})}^{T} {\tilde{Σ}}^{- 1} (y_{t} - \tilde{β} x_{t})

;

\hat{α}

,

\hat{β}

,

\hat{Σ}

and

\hat{η}

are the ML estimators in the model (6);

\tilde{β}

,

\tilde{Σ}

and

\tilde{η}

are the ML estimators of

β

,

Σ

and

η

under

H_{α}

,

d = \sum_{t = 1}^{n} {\tilde{ω}}_{t} (y_{t} - \tilde{β} x_{t})

,

\tilde{θ}

and

\hat{θ}

are the restricted and unrestricted ML estimators of

θ

, respectively. Under

H_{α}

, the asymptotic distribution of each of these test statistics is

χ^{2} (p)

. Note that

c_{α} (η) = 1

when

η = 0

and

W a = n {(1 + {\bar{x}}^{2} / s^{2})}^{- 1} {\hat{α}}^{T} {\hat{Σ}}^{- 1} \hat{α}

, which corresponds to the Wald test under normality; see, for instance, Campbell et al. (1997). In addition, under the assumption of multivariate normality, the likelihood-ratio test is given by

L r = n log (1 + W a / n)

, and the score test takes the form

S c = W a / (1 + W a / n)

. The gradient test Terrell (2002) is also discussed, defined as

G a = U^{T} (\tilde{θ}) (\hat{θ} - \tilde{θ}) .

(11)

Since

U_{2} (\tilde{θ}) = 0

, the gradient statistic in (11) can be written as

G a = U_{α}^{T} (\tilde{θ}) \hat{α} = d^{T} {\tilde{Σ}}^{- 1} \hat{α}

. Note that

G a

is attractive, as it is simple to compute and does not involve knowledge of the Fisher information matrix (10), unlike

W a

and

S c

. Asymptotically,

G a

has a chi-square distribution with p degrees of freedom under

H_{α}

. For more details and applications of this test, see Terrell (2002) and Lemonte (2016). However, in this case, under the normality assumption, the gradient statistic does not offer an alternative to test the hypothesis of mean-variance efficiency since

S c = G a

, see Appendix D.

To calculate the values of the statistics

L r, S c

and

G a

, it is necessary to estimate

θ

under

H_{α}

. The EM algorithm leads to the following equations to obtain the ML estimates of

β

,

Σ

and

η

under

H_{α}

:

\tilde{β} = \frac{\sum_{t = 1}^{n} ω_{t} x_{t} y_{t}}{\sum_{t = 1}^{n} ω_{t} x_{t}^{2}}, \tilde{Σ} = \frac{1}{n} \sum_{t = 1}^{n} ω_{t} (y_{t} - \tilde{β} x_{t}) {(y_{t} - \tilde{β} x_{t})}^{T}

and

{\tilde{η}}^{- 1} = \frac{2}{a + log a - 1} + 0.0416 \{1 + e r f (0.6594 log (\frac{2.1971}{a + log a - 1}))\},

where

a = - (1 / n) \sum_{t = 1}^{n} (v_{t 2} - v_{t 1})

, with

v_{t 1} = (1 + p η) / (1 + c (η) {\tilde{δ}}_{t})

and

v_{t 2} = ψ (\frac{1 + p η}{2 η}) - log (\frac{1 + c (η) {\tilde{δ}}_{t}}{2 η})

, for

t = 1, \dots, n

.

2.5. Model Assessment and Outlier Detection

Any statistical analysis should include a critical analysis of the model assumptions. Following Lange et al. (1989), in this work the Mahalanobis distance is used to assess the fit of the CAPM. In effect, the random variables

F_{t} = (\frac{1}{1 - 2 η}) \frac{δ_{t}}{p} \sim F (p, 1 / η)

for

t = 1, \dots, n

. Substituting the ML estimators yields

{\hat{F}}_{t} = F_{t} (\hat{θ})

, which has asymptotically the same F distribution as

F_{t}

,

t = 1, \dots, n

. Using the Wilson-Hilferty approximation,

z_{t} = \frac{(1 - \frac{2 η}{9}) {\hat{F}}_{t}^{1 / 3} - (1 - \frac{2}{9 p})}{\sqrt{\frac{2 η}{9} {\hat{F}}_{t}^{2 / 3} + \frac{2}{9 p}}}

(12)

for

t = 1, \dots, n

, with an approximately standard normal distribution. Thus, a

Q Q

-plot of the transformed distances

{z_{1}, \dots, z_{n}}

can be used to evaluate the fit of the CAPM under the multivariate t-distribution. For

η = 0

, the transformed distances are simplified to

z_{t} = {{\hat{F}}_{t}^{1 / 3} - (1 - 2 / 9 p)} / \sqrt{2 / 9 p}

and can be used to assess of fit of the CAPM under the assumption of normality. Additionally, the Mahalanobis distance can be used for multivariate outlier detection. In effect, larger than expected values of the Mahalanobis distance,

{\hat{F}}_{t}

,

t = 1, \dots, n

, identify outlying cases (see Lange et al. 1989).

2.6. Generalized Method of Moments Tests

If the iid multivariate t assumption of random errors is violated, hypothesis (3) can be tested using the Generalized Method of Moments (GMM) (see Hansen 1982). No distributional assumptions are needed other than the data being stationary and ergodic. With the GMM framework, the random errors can be both serially dependent and conditionally heteroskedastic. From (2), we have

ϵ_{t} = y_{t} - α - β x_{t}

, for

t = 1, \dots, n

. The idea of the GMM approach (see Hansen 1982), is to use sample moments conditions to replace the population moment conditions of the model restrictions. The relevant population moment conditions are

E (ϵ_{t}) = 0

and

E (x_{t} ϵ_{t}) = 0

, for

t = 1, \dots, n

.

Define

2 p \times 1

vectors

f_{t} (ψ)

and

g_{n} (ψ)

as follows,

f_{t} (ψ) = {(ϵ_{t 1}, x_{t} ϵ_{t 1}, \dots, ϵ_{t j}, x_{t} ϵ_{t j}, \dots, ϵ_{t p}, x_{t} ϵ_{t p})}^{T},

and

g_{n} (ψ) = \frac{1}{n} \sum_{t = 1}^{n} f_{t} (ψ) = \frac{1}{n} \sum_{t = 1}^{n} (ϵ_{t} \otimes x_{t}),

where,

ψ = {(α_{1}, β_{1}, \dots, α_{p}, β_{p})}^{T}

, whose dimension is

2 p \times 1

, where p is the number of assets and

x_{t} = {(1, x_{t})}^{T}

for

t = 1, \dots, n

. The GMM estimator is obtained by minimizing the quadratic form

Q (ψ) = g_{n}^{T} (ψ) W_{n} g_{n} (ψ),

where

W_{n}

,

2 p \times 2 p

, is the weighting matrix. As noted by MacKinlay and Richardson (1991), in this model the GMM estimator is independent of the weighting matrix and always coincides with the OLS estimators, which are given by

\begin{matrix} \hat{α} = \bar{y} - \bar{x} \hat{β}, and \hat{β} = \frac{\sum_{t = 1}^{n} (x_{t} - \bar{x}) (y_{t} - \bar{y})}{\sum_{t = 1}^{n} {(x_{t} - \bar{x})}^{2}}, \end{matrix}

(13)

where

\bar{y} = \frac{1}{n} \sum_{t = 1}^{n} y_{t}

and

\bar{x} = \frac{1}{n} \sum_{t = 1}^{n} x_{t}

.

As we know, there are several versions of the GMM test, for simplicity, in this work the Wald-type GMM test is considered (see MacKinlay and Richardson 1991). It is well known that the GMM estimators (13) are normally distributed asymptotically. In effect, from MacKinlay and Richardson (1991) it follows that the asymptotic sampling distribution of the estimator

\hat{ψ}

is given by

\begin{matrix} \sqrt{n} (\hat{ψ} - ψ) \overset{D}{\mapsto} N_{2 p} (0, Ψ), \end{matrix}

and a consistent estimator of

Ψ

is

\hat{Ψ} = {(D_{n}^{T} S_{n}^{- 1} D_{n})}^{- 1}

with

D_{n} = (1 / n) \sum_{t = 1}^{n} (I_{p} \otimes x_{t} x_{t}^{T}) = (1 / n) (I_{p} \otimes X^{T} X)

and

S_{n} = (1 / n) \sum_{t = 1}^{n} ({\hat{ϵ}}_{t} {\hat{ϵ}}_{t}^{T} \otimes x_{t} x_{t}^{T})

. The Wald-type GMM test is given by

W a = n {\hat{α}}^{T} {(C \hat{Ψ} C^{T})}^{- 1} \hat{α},

where

C = I_{p} \otimes (1, 0)

such that

C \hat{ψ} = \hat{α}

. This test has an asymptotic

χ^{2} (p)

, a

χ^{2}

distribution with degrees of freedom p. See MacKinlay and Richardson (1991) for more details.

For the development of the methodology proposed in this paper, the classical approach is used. Optionally, the Bayesian approach can be used. Two recent references are Barillas and Shanken (2018) and Borup (2019), who propose a Bayesian framework for asset pricing models. For applications of Bayesian inference using the Markov Chain Monte Carlo (MCMC) approach in Capital Asset Pricing Models, see Glabadanidis (2014).

3. Applications

3.1. The Chilean Stock Market Data Set

As an application of the methodology presented in this paper, monthly returns of shares from the Chilean Stock Market were analyzed. The data corresponded to the period from September, 2002 to March, 2020 and five companies: BSantander (one of the biggest banks in the country), ENEL (an electrical distribution company), Falabella (a retail company), LTM (an airline), and SQMB (a company from the chemical industry). The Selective Index of Share Prices (IPSA) was used as the return for the market and the 10-Years Bonds in UFs (BCU, BTU) of the Central Bank of Chile, was used as the risk-free rate, both monthly. This risk free rate is a long-rate, the trend of which is a smoother curve than that of a short-rate.

The means, standard deviations (SDs), Sharpe ratios (Sharpe), coefficients of skewness and kurtosis, and the Jarque-Bera test (JB Test) for normality of the monthly returns are presented in Table 1. SQMB had the highest mean return (1.19% per month), while ENEL had the lowest average return (0.13%). In addition, LTM had the highest volatility (11.21%), while BSantander had the lowest volatility (5.56%) among the five assets. Furthermore, the IPSA index had a standard deviation of 4.60%, which was less than the volatility of the five assets considered. With the exception of SQMB, a lithium-producing company, Sharpe ratios were close to zero.

Except for BSantander, which has a moderate positive asymmetry, the returns of the remaining assets had a moderately negative skew. Except for BSantander, the returns of the remaining assets had a highest kurtosis. LTM showed the highest kurtosis with an estimated coefficient of 22.7946, and BSantander showed the lowest kurtosis. These coefficients provide us initial evidence of the absence of normality in the monthly returns. In fact, with the exception of BSantander, the normality hypothesis was rejected in the other assets using the Jarque-Bera test (JB Test). In brief, the descriptive statistics summary reported in Table 1, confirmed the presence of low levels of skewness and high levels of kurtosis.

Figure 1 shows scatter plots and estimated lines for the five assets, using the normal and t distributions. From this figure, it is possible to observe linear relationships between asset returns and IPSA returns, and potential outliers. The IPSA index returns explained between 28% and 51% of the variability of the five assets returns. The IPSA index explained 50.94% of the variability in the returns of the Falabella and 27.82% of the returns variability of SQMB.

We also performed an analysis of the heteroscedasticity and autocorrelation of the returns of the five assets. Using the White test (see Lee 1991; Waldman 1983; White 1980) Falabella and LTM showed evidence of heteroscedasticity, while the Durbin-Watson test indicated evidence of first order autocorrelation only in the returns of LTM. Then, considering the significant departure from normality, high kurtosis (evidence that the returns had fat-tailed distributions), the moderate skewness, and the results of the tests for heteroscedasticity and autocorrelation of errors, it is assumed in this study that the random vectors

{ϵ_{t}, t = 1, \dots, n}

were iid as a multivariate t-distribution, with zero mean and covariance matrix

Σ

and density given by (4). For illustrative purposes, the normal distribution and tests based on weaker distributional assumptions were used, such as the GMM tests, for testing hypothesis (3).

Table 2 presents the ML estimate for parameters of the CAPM using the normal and t distributions. The standards errors were estimated using the expected information matrix. The results in Table 2 show that the estimates of the coefficients

α

and

β

were very similar using both models (NCAPM and TCAPM), especially the systematic risk estimators (

\hat{β}

) of the assets considered.

The following hypothesis test

H_{0} : η = 0 against H_{1} : η > 0

was considered. In this case, it was found that

\hat{η} = 0.134926

. The asymptotic distribution of the LR test for the previous hypothesis corresponded to a 50:50 mixture of chi-squares with zero and one degree of freedom, whose critical value was 2.7055 at a significance level of 5% (see, for instance Song et al. 2007). In this case, the maximum log-likelihood for the NCAPM was 1481.39 and for the TCAPM, the maximum log-likelihood was 1524.06, which corresponded to a likelihood-ratio statistic of 85.34. This indicates that the TCAPM fit the data significantly better than the NCAPM. As suggested by a referee, for address the impact of finite samples on the p-values and on the standard errors of the parameter estimates, we also used a nonparametric bootstrap procedure (Chou and Zhou 2006; Efron and Tibshirani 1993). Nevertheless, the results were very similar to those obtained using the normal and t-distribution; therefore, they are not shown here. Figure 2 displays the transformed distance plots for the normal and t distributions, see Equation (12). These graphics confirm that the TCAPM presented the best fit.

Table 3 presents tests results for hypothesis (3) based on the Wald, likelihood-ratio, score, and gradient tests. The results in Table 3 show that the mean-variance efficiency of the IPSA index is not rejected

(p - values > 0.59)

, with any of the tests used for the three scenarios (normal, t-distributions and the GMM).

Figure 3 shows the Mahalanobis distances for the normal and t-distribution, for both data sets; the Chilean data set and the New York Stock Exchange data set. For the Chilean data set, under normality Figure 3a, we observe that the returns for 2008/Jun, 2009/Jul and 2020/Mar are possible outliers. For instance, in 2009/Jul, economic activity fell by 3.5%, when Chile entered a recession due to the global financial crisis, while in 2020/Mar, the fall in economic activity (initial) was due to the pandemic caused by Covid-19. Already in Figure 3b, we see as expected, that the TCAPM reduced the possible effect of these returns.

3.2. The New York Stock Exchange Data Set

We considered monthly returns of shares from five companies whose common stock shares trade on the New York Stock exchange, NYSE: Bank of America, Boeing, Ford Motor Company, General Electric Company and Microsoft. The S&P500 was taken as the market price. The 10-year bond yield was used as the risk-free returns; it is a long risk-free rate, similar to the one used for the Chilean data set. The excess returns were the returns minus the risk-free rate. The data corresponded to the period from January, 2000 to March, 2020.

The means, standard deviations (SD), Sharpe ratios (Sharpe), coefficients of skewness and kurtosis, and the Jarque-Bera test (JB Test) for normality of the monthly returns are presented in Table 4. Boeing had the highest mean return (0.53% per month), while General Electric Company had the lowest average return (−0.77%). In addition, Ford had the highest volatility (13.10%), while Microsoft had the lowest volatility (8.34%) among the five assets. Furthermore, the S&P500 index had a standard deviation of 4.34%, which was less than the volatility of the five assets considered. With the exception of General Electric Company, with a negative Sharpe ratio, the other Sharpe ratios were close to zero.

The assets had a moderate (positive and negative) asymmetry. Again, the returns of the assets had a highest kurtosis. The normality hypothesis was rejected in all the assets using the Jarque-Bera test (JB Test). In brief, the descriptive statistics summary reported in Table 4, confirmed the presence of low levels of skewness and high levels of kurtosis.

Figure 4 shows scatter plots and estimated lines of the five assets, using the normal and t distributions. From this figure, it is possible to observe linear relationships between asset returns and S&P500 returns, and potentials outliers. In this case, the S&P500 index returns explained between 27% and 45% of the variability of the five assets returns, similar to the Chilean Data Set.

The Whites test show evidence of heteroscedasticity in Bank of America, Boeing and Ford Motor Company. The Durbin-Watson test indicates no evidence of first order autocorrelation.

Table 5 presents the ML estimate for parameters of the CAPM using the normal and t distributions. The standards errors were estimated using the expected information matrix. Similar results to the Chilean data set were observed. For this data set, it was found that

\hat{η} = 0.276870,

indicating a greater departure from normality than the Chilean data set. The likelihood-ratio statistic value, for the hypothesis

H_{0} : η = 0 against H_{1} : η > 0

was 285.8, which was highly significant. In this case, the maximum log-likelihood for the NCAPM was 1361.2 and for the TCAPM was 1504.1. Again, this indicates that the TCAPM fit the data significantly better than the NCAPM. This was also confirmed by the transformed distance plots displayed in Figure 5.

Table 6 presents tests results for hypothesis (3). The results in Table 6 show that the mean-variance efficiency of the S&P500 index could not be rejected

(p - values > 0.1277)

, with any of the tests, if we used the NCAPM (or GMM test). However, if we used TCAPM, the hypothesis (3) was rejected

(0.0014 < p - values < 0.0030)

with any of the four tests. That is to say, there was a change in statistical inference.

As suggested by a referee, it may also be of interest to test the hypotheses

H_{β} : β = 1

and

H_{α β} : α = 0, β = 1

. The four tests were implemented to verify these hypotheses using the t-distribution, see Appendix C. However, with the four tests statistics, we strongly rejected the null hypotheses, and therefore they are not shown in the present study. More details about this interesting topic can be found at Glabadanidis (2009, 2014, 2019).

Figure 3 show the Mahalanobis distances for the NCAPM and TCAPM models, showing results similar to the Chilean data set, except that the return corresponding to 2009/Apr for TCAPM (see Figure 3d) was a possible outlier. In April 2009, the five assets had high returns, highlighting Ford Motor Company, with a 127.4%. However, when deleting these returns, there were no changes in statistical inference. Once again, this suggests that the TCAPM provided an appropriate way for achieving robust inference.

3.3. Robustness

Aspects of the robustness of the TCAPM with respect to the NCAPM can be illustrated perturbing some observations in the original data. Changes in the ML estimates of

β

can be evaluated using the following procedure. First, an observation can be perturbed to create an outlier by

y_{n} \leftarrow y_{n} + Δ 1_{p}

, for

Δ = - 0.20, - 0.10, 0, 0.10, 0.20

. Then, we re-calculate the ML estimates of

β

under the TCAPM and under the NCAPM. Note that, for the NCAPM,

{\hat{β}}_{Δ j} = {\hat{β}}_{j} + (x_{n} - \bar{x}) Δ / \sum_{t = 1}^{n} {(x_{t} - \bar{x})}^{2},

(14)

for

j = 1, \dots, p

. Finally, a graph of

{\hat{β}}_{Δ j}

,

j = 1, \dots, p

versus

Δ

for each of the p assets, is useful to visualize changes in the estimators. Figure 6 and Figure 7 show the curves of the estimates of

{\hat{β}}_{Δ j}

,

j = 1, \dots, 5

versus

Δ

for each of the 5 assets included in the two data sets considered in this paper. For this perturbation scheme, it can be observed that the influence on parameter estimation was unbounded in the NCAPM, see Equation (14), whereas it was obviously bounded in the TCAPM. This suggests that TCAPM provided an appropriate way for achieving robust statistical inference.

4. Multifactor Asset Pricing Models under the $t$ -Distribution

As suggested by a referee, in some cases it is necessary to use more than one factor to estimate the expected returns of the assets of interest. In this Section we briefly discuss an extension of the TCAPM that includes more than one factor. More details on estimation, hypothesis testing, and applications will be discussed in a separate paper. The multifactor model (MAPM) is a multivariate linear regression model with excess returns on p assets, as follows:

y_{t} = α + β_{1} x_{t 1} + \dots + β_{q} x_{t q} + ϵ_{t}, t = 1, \dots, n,

(15)

where

(x_{1}, \dots, x_{q})

denotes the excess returns of q factors (benchmark assets),

β_{j}

is a

p \times 1

parameter vector,

j = 1, \dots, q

. The above regression model can be expressed as

\begin{matrix} y_{t} & = & α + B_{1} x_{t}^{f} + ϵ_{t}, \\ y_{t} & = & B x_{t} + ϵ_{t}, \end{matrix}

(16)

where

B = (α, B_{1})

, denotes the matrix

(p \times q + 1)

of regression coefficients,

B_{1} = (β_{1}, \dots, β_{q})

,

x_{t} = {(1, x_{t}^{f})}^{T}

and

x_{t}^{f} = {(x_{t 1}, \dots, x_{t q})}^{T}

,

t = 1, \dots, n .

As in Section 2.1, we assume that the excess returns

y_{t}

follows an multivariate t-distribution with mean vector

μ_{t}

and variance-covariance matrix

Σ

, named

y_{t} \sim T_{p} (μ_{t}, Σ, η)

, independent,

t = 1, \dots, n

, whose density function takes the form,

f (y_{t} | θ) = {| Σ |}^{- 1 / 2} g (δ_{t}),

(17)

where,

δ_{t} = {(y_{t} - μ_{t})}^{T} Σ^{- 1} (y_{t} - μ_{t})

is the square of the Mahalanobis distance, with

μ_{t} = B x_{t}

,

x_{t} = {(1, x_{t 1}, \dots, x_{t q})}^{T}

denoting the

t

-th row of the matrix

(n \times q + 1)

X = {(x_{1}, \dots, x_{n})}^{T}

, for

t = 1, \dots, n

.

Thus, in this case, the ML estimates of B,

Σ

and

η

are obtained as solution of the following equations (see Equation (9)):

{\hat{B}}^{T} = {(X^{T} W X)}^{- 1} X^{T} W Y, \hat{Σ} = \frac{1}{n} \sum_{t = 1}^{n} ω_{t} {\hat{ϵ}}_{t} {\hat{ϵ}}_{t}^{T},

(18)

and

{\hat{η}}^{- 1} = \frac{2}{a + log a - 1} + 0.416 \{1 + e r f (0.6594 log (\frac{2.1971}{a + log a - 1}))\},

where

{\hat{ϵ}}_{t} = y_{t} - {\hat{B}}^{T} x_{t}

and

W = diag (ω_{1}, \dots, ω_{n})

an

n \times n

diagonal matrix with elements

ω_{t} = (\frac{1 + η p}{η}) (\frac{c (η)}{1 + c (η) δ_{t}})

, for

t = 1, \dots, n

, and the matrix

(n \times p)

Y = {(y_{1}, \dots, y_{n})}^{T}

.

As in Section 2.3, the standard errors of the ML estimators

\hat{B}

,

\hat{Σ}

and

\hat{η}

can be estimated using the expected information matrix. In this case, the Fisher information matrix for

θ = (B, Σ, η)

assumes the same form of the matrix J given in the Equation (10), but the information concerning to B is now

J_{11} = c_{α} (η) (X^{T} X) \otimes Σ^{- 1}

, with

c_{α}

as defined in Equation (10). To test linear hypotheses of interest, such as

H_{α}

, we can use the same four tests discussed in Section 2.4.

As a illustration we consider the NYSE data set. We fit the following three-factor model,

y_{t} = α + β_{1} x_{t} + β_{2} {SMB}_{t} + β_{3} {HML}_{t} + ϵ_{t}, k = 1, \dots, n,

(19)

where, x is the excess return of the S&P500 index, used in the NYSE data set, while SMB and HML used in the Fama-French model were from the website of Prof. Kenneth French. For details on risk factors, SMB and HML see Fama and French (1995).

In this case, the three risk factors explained between 32% and 50% of the variability of the five assets’ returns, which corresponded to an increase of approximately 5% with respect to the CAPM. From Table 7, wecan see that the values in

\hat{α}

were very similar to those obtained using CAPM as presented in Table 5, while estimates in

{\hat{β}}_{1}

tended to be lower than estimates in

\hat{β}

, in the case of CAPM (see Table 5).

Figure 8 displays the transformed distance plots for the normal and t distributions. Here NMAPM denotes the three risk factors under normality and TMAPM denotes the three risk factors under multivariate t-distribution. These graphics show clear evidence that the TMAPM had a better fit than the NMAPM. Using the likelihood-ratio test, the hypothesis

H_{α}

could not be rejected

(p - value = 0.1277)

, if we used the NMAPM. However, if we used TMAPM, the hypothesis (3) was rejected

(p - value = 0.0011)

. Once again, there was a change in statistical inference.

5. Conclusions

Since the pioneering work by Lange et al. (1989), the t-distribution has proved to be a versatile and robust modeling approach in many regression models. In fact, the t-distribution has a parameter (

η

) modeling kurtosis, which brings more flexibility than the normal distribution.

In this paper, robust methods for statistical inference in asset pricing models with emphasis on CAPM were developed. In effect, assuming a multivariate t-distribution for the stock returns, ML equations for parameters were derived and statistics were proposed to test linear hypotheses of interest, in particular the hypothesis of mean-variance efficiency. Simple expressions were provided in this study for the likelihood-ratio, Wald, score and gradient statistics and for the score function and Fisher information matrix. The proposed statistics generalized results from the literature, which considered tests for mean-variance efficiency under the assumption of multivariate normality (Brandimarte 2018; Campbell et al. 1997; Chou and Lin 2002; Gibbons et al. 1989; Mazzoni 2018). In addition, statistical inference based on the t-distribution is simple to implement, and the computational cost is considerably low.

A simple graphical device for checking the model was implemented, and the methodology developed in this paper was illustrated with two real data sets: the Chilean Stock Market data set (a developing country), and another from the New York Stock Exchange, USA (a developed country). In both data sets, the CAPM under the t-distribution clearly presents a better fit than under the normal distribution. Additionally, in the application of the multifactor asset pricing model to the NYSE data set, the multivariate t-distribution presents a better fit than the normal distribution.

This empirical study provides new evidence for the useful application of the t-distribution in modeling stock returns (Kan and Zhou 2017). As we have pointed out, the log-returns frequently present some degree of skewness. We are currently working on statistical inference in the asset pricing models under the multivariate skew-elliptical distributions. We understand that a skewed t-distribution of Branco and Dey (2001) may be useful for returns with high levels of skewness. For previous applications of the skew-elliptical distributions in finance and actuarial science, see Harvey et al. (2010) and Adcock et al. (2015). See also Paula et al. (2011).

Author Contributions

Conceptualization and investigation, M.G., D.C. and R.C.; methodology, M.G. and A.M.; software and validation, M.G., D.C. and A.M.; formal analysis, resources, data curation, M.G., D.C., R.C. and A.M.; writing—original draft preparation, M.G., D.C., and A.M.; writing—review and editing, M.G., D.C., R.C. and A.M.; visualization, M.G., D.C., R.C. and A.M.; supervision, M.G., D.C., R.C. and A.M.; project administration, M.G. and A.M. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Acknowledgments

The first author acknowledges the partial financial support from Project Puente 001/2019, Dirección de Investigación de la Vicerrectoría de Investigación de la Pontificia Universidad Catlica de Chile, Chile. The authors are grateful to the editor and two reviewers for their helpful comments and suggestions.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. The Multivariate t-Distribution

For completeness, we present some properties of the multivariate t-distribution, with finite second moment, based on the parameterization given in (5).

Property A1.

Let

y \sim T_{p} (μ, Σ, η)

, with

η < 1 / 2

.

(i): Suppose that $y | u \sim N_{p} (μ, u^{- 1} Σ), and that u \sim G (1 / 2 η, 1 / 2 c (η))$ , then $y \sim T_{p} (μ, Σ, η)$
(ii): $E (y) = μ$ and $Cov (y) = Σ$
(iii): $u | y \sim G ((1 / η + p) / 2, (1 / c (η) + δ) / 2)$
(iv): The random variable

$F = (\frac{1}{1 - 2 η}) \frac{δ}{p} \sim F (p, 1 / η)$
(v): Let $A (q \times p)$ , $c \in R^{q}$ and $q \leq p$ , then $A y + c \sim T_{q} (A μ + c, A Σ A^{T}, η)$
(vi): Let $y = {(y_{1}, y_{2})}^{T}$ a partition of y with $y_{1}$ of dimension $p_{1} \leq p$ , and let

$μ = (\begin{matrix} μ_{1} \\ μ_{2} \end{matrix}) and Σ = (\begin{matrix} Σ_{11} & Σ_{12} \\ Σ_{21} & Σ_{22} \end{matrix}),$

then $y_{1} \sim T_{p_{1}} (μ_{1}, Σ_{11}, η)$
(vii): Using the same previous partition, the conditional distribution of $y_{1}$ given $y_{2}$ , $(y_{1} | y_{2}) \sim T_{p_{1}} (μ_{1} - Σ_{11}^{- 1} Σ_{12} (μ_{2} - y_{2}), q (y_{2}, η) (Σ_{11} - Σ_{12} Σ_{22}^{- 1} Σ_{21}), η)$ ,

where

δ = {(y - μ)}^{T} Σ^{- 1} (y - μ)

,

q (y_{2}, η) = {c^{- 1} (η) + {(y_{2} - μ_{2})}^{T} Σ_{22}^{- 1} (y_{2} - μ_{2})} / (η^{- 1} + p_{2})

, and

G (a, b)

denotes the gamma distribution with probability density function

f (x) = b^{a} x^{a - 1} \exp {- b x} / Γ (a)

, for

x, a, b > 0

.

Appendix B. ML Estimation Using the EM Algorithm

To obtain the ML estimate using the EM algorithm, we augmented the observed data, Y, by incorporating latent variables to obtain

Y_{com} = {(y_{1}^{T}, u_{1}), \dots, (y_{n}^{T}, u_{n})}

. Thus, based on property

i)

we can consider the following hierarchical model

y_{t} | u_{t} \overset{ind}{\sim} N_{p} (α + β x_{t}, Σ / u_{t}), u_{t} \overset{ind}{\sim} G (1 / 2 η, 1 / 2 c (η))

, for

t = 1, \dots, n

. The log-likelihood function of complete data, is denoted by

L_{c} (θ) = log f (Y_{com} | θ)

. By using Property A1, the conditional expectation of the complete-data log-likelihood function can be expressed as

E {L_{c} (θ) | Y, θ^{(*)}} = Q (θ | θ^{(*)}) = Q_{1} (τ | θ^{(*)}) + Q_{2} (η | θ^{(*)}),

(A1)

where

τ = {(α^{T}, β^{T}, σ^{T})}^{T}

, and

\begin{matrix} Q_{1} (τ | θ^{(*)}) & = - \frac{n}{2} log | Σ | - \frac{1}{2} \sum_{t = 1}^{n} ω_{t}^{(*)} {(y_{t} - α - β x_{t})}^{T} Σ^{- 1} (y_{t} - α - β x_{t}), \\ Q_{2} (η | θ^{(*)}) & = n {\frac{1}{2 η} log (\frac{1}{2 c (η)}) - log Γ (\frac{1}{2 η}) + \frac{1}{2 c (η)} [ψ (\frac{1 / η^{(*)} + p}{2}) \\ - log (\frac{1 / η^{(*)} + p}{2}) + \frac{1}{n} \sum_{t = 1}^{n} (log ω_{t}^{(*)} - ω_{t}^{(*)})]}, \end{matrix}

where

ω_{t}^{(*)} = E {u_{t} | y_{t}, θ^{(*)}}

are the weights

ω_{t}

defined in (8) and evaluated at

θ = θ^{(*)}

, for

t = 1, \dots, n

. Maximizing the Q-function (A1), we obtain the iterative process defined in Equation (9).

Appendix C. ML Estimation under H_β and H_αβ

In this case

H_{β} : β = 1

. To calculate the values of the statistics

L r, S c

and

G a

, we need to estimate

θ

under

H_{β}

. The EM algorithm leads to the following equations to obtain the ML estimates of

α

,

Σ

and

η

under

H_{β}

:

\tilde{α} = \frac{\sum_{t = 1}^{n} {\tilde{ω}}_{t} (y_{t} - 1_{p} x_{t})}{\sum_{t = 1}^{n} {\tilde{ω}}_{t}}, \tilde{Σ} = \frac{1}{n} \sum_{t = 1}^{n} {\tilde{ω}}_{t} (y_{t} - \tilde{α} - 1_{p} x_{t}) {(y_{t} - \tilde{α} - 1_{p} x_{t})}^{T}

(A2)

and

{\tilde{η}}^{- 1} = \frac{2}{a + log a - 1} + 0.0416 \{1 + e r f (0.6594 log (\frac{2.1971}{a + log a - 1}))\},

where

a = - (1 / n) \sum_{t = 1}^{n} (v_{t 2} - v_{t 1})

, with

v_{t 1} = (1 + p η) / (1 + c (η) {\tilde{δ}}_{t})

and

v_{t 2} = ψ (\frac{1 + p η}{2 η}) - log (\frac{1 + c (η) {\tilde{δ}}_{t}}{2 η})

,

{\tilde{δ}}_{t} = {(y_{t} - \tilde{α} - 1_{p} x_{t})}^{T} {\tilde{Σ}}^{- 1} (y_{t} - \tilde{α} - 1_{p} x_{t})

,

{\tilde{ω}}_{t} = (\frac{1 + η p}{η}) (\frac{c (η)}{1 + c (η) {\tilde{δ}}_{t}})

, for

t = 1, \dots, n

.

It may also be of interest to test the joint hypothesis

H_{α β} : α = 0, β = 1

. Similarly, to calculate the values of the statistics

L r, S c

and

G a

, we need to estimate

Σ

and

η

under

H_{α β}

. The EM algorithm leads to the following equations to obtain the ML estimates of

Σ

and

η

under

H_{α β}

:

\tilde{Σ} = \frac{1}{n} \sum_{t = 1}^{n} {\tilde{ω}}_{t} (y_{t} - 1_{p} x_{t}) {(y_{t} - 1_{p} x_{t})}^{T}

(A3)

and

{\tilde{η}}^{- 1} = \frac{2}{a + log a - 1} + 0.0416 \{1 + e r f (0.6594 log (\frac{2.1971}{a + log a - 1}))\},

where

a = - (1 / n) \sum_{t = 1}^{n} (v_{t 2} - v_{t 1})

, with

v_{t 1} = (1 + p η) / (1 + c (η) {\tilde{δ}}_{t})

and

v_{t 2} = ψ (\frac{1 + p η}{2 η}) - log (\frac{1 + c (η) {\tilde{δ}}_{t}}{2 η})

,

{\tilde{δ}}_{t} = {(y_{t} - 1_{p} x_{t})}^{T} {\tilde{Σ}}^{- 1} (y_{t} - 1_{p} x_{t})

,

{\tilde{ω}}_{t} = (\frac{1 + η p}{η}) (\frac{c (η)}{1 + c (η) {\tilde{δ}}_{t}})

, for

t = 1, \dots, n

.

Appendix D. Equality of the Score and Gradient Tests under Normality

First, let us remember that the score tests, under normality, test for mean-variance efficiency can be written as (Campbell et al. 1997)

S c = \frac{J_{0}}{1 + J_{0} / n}

, where

J_{0} = n b {\hat{α}}^{T} {\hat{Σ}}^{- 1} \hat{α}

, with

b = s^{2} / ({\bar{x}}^{2} + s^{2})

, which is the corresponding Wald tests. In this case, the Gradient test takes the form of

G a = U_{α}^{T} (\tilde{θ}) \hat{α}

, where

U_{α} (\tilde{θ}) = {\tilde{Σ}}^{- 1} \sum_{t = 1}^{n} (y_{t} - \tilde{β} x_{t}),

(A4)

\tilde{β} = \hat{β} + c \hat{α}

,

\tilde{Σ} = \hat{Σ} + b \hat{α} {\hat{α}}^{T}

and

c = \bar{x} / ({\bar{x}}^{2} + s^{2})

; see Campbell et al. (1997) for details on these results. Then, by replacing

\sum_{t = 1}^{n} (y_{t} - \tilde{β} x_{t}) = n b \hat{α}

in (A4) and using the Sherman-Morrison formula to invert the matrix

\tilde{Σ}

, we obtain

\begin{matrix} G a & = n b {\hat{α}}^{T} {\tilde{Σ}}^{- 1} \hat{α} \\ = n b {\hat{α}}^{T} {(\hat{Σ} + b \hat{α} {\hat{α}}^{T})}^{- 1} \hat{α} \\ = \frac{J_{0}}{1 + J_{0} / n} \\ = S c . \end{matrix}

References

Adcock, Christopher, Martin Eling, and Nicola Loperfido. 2015. Skewed distributions in finance and actuarial science: A review. The European Journal of Finance 21: 1253–81. [Google Scholar] [CrossRef]
Amenc, Noel, and Veronique Le Sourd. 2003. Portfolio Theory and Performance Analysis. New York: John Wiley. [Google Scholar]
Barillas, Francisco, and Jay Shanken. 2018. Comparing Asset Pricing Models. The Journal of Finance 73: 715–54. [Google Scholar] [CrossRef]
Bao, Te, Ceer Diks, and Hao Li. 2018. A generalized CAPM model with asymmetric power distributed errors with an application to portfolio construction. Economic Modelling 68: 611–21. [Google Scholar] [CrossRef]
Bekaert, Geer, and Guojun Wu. 2000. Asymmetric Volatility and Risk in Equity Markets. The Review of Financial Studies 13: 1–42. [Google Scholar] [CrossRef]
Berk, Jonathan. 1997. Necessary conditions for the CAPM. Journal of Economic Theory 73: 245–57. [Google Scholar] [CrossRef][Green Version]
Blattberg, Robert C., and Nicholas J. Gonedes. 1974. A comparison of the stable and student distribution as statistical models for stock prices. Journal of Business 47: 244–80. [Google Scholar] [CrossRef]
Bolfarine, Heleno, and Manuel Galea. 1996. On structural Comparative Calibration under a t-model. Computational Statistics 11: 63–85. [Google Scholar]
Boos, Dannis D., and Leonard A. Stefanski. 2013. Essential Statistical Inference, Theory and Methods. New York: Springer. [Google Scholar]
Borup, Daniel. 2019. Asset pricing model uncertainty. Journal of Empirical Finance 54: 166–89. [Google Scholar] [CrossRef]
Branco, Marcia D., and Dipak K. Dey. 2001. A general class of multivariate skew-elliptical distributions. Journal of Multivariate Analysis 79: 99–113. [Google Scholar] [CrossRef]
Brandimarte, Paolo. 2018. An Introduction to Financial Markets: A Quantitative Approach. Hoboken: John Wiley. [Google Scholar]
Broquet, Claude, Robert Cobbaut, Roland Gillet, and Andre van den Berg. 2004. Gestion de Portefeuille, 4th ed. Louvain-la-Neuve: De Boeck Université. Bruxelles. [Google Scholar]
Cademartori, David, Cecilia Romo, Richardo Campos, and Manuel Galea. 2003. Robust estimation of systematic risk using the t-distribution in the Chilean Stock Markets. Applied Economics Letters 10: 447–53. [Google Scholar] [CrossRef]
Campbell, John, Andrew Lo, and A. Craig MacKinlay. 1997. Econometrics of Financial Markets. Princeton: Princeton University Press. [Google Scholar]
Chamberlain, Gary. 1983. A characterization of the distributions that imply mean-variance utility functions. Journal of Economic Theory 29: 185–201. [Google Scholar] [CrossRef]
Chen, Joseph, Harrison Hong, and Jeremy C. Stein. 2001. Forecasting crashes: Trading volume, past returns, and conditional skewness in stock prices. Journal of Financial Economics 61: 345–81. [Google Scholar] [CrossRef]
Chou, Pin-Huang, and Mei-Chen Lin. 2002. Tests of international asset pricing model with and without a riskless asset. Applied Financial Economics 12: 873–83. [Google Scholar] [CrossRef]
Chou, Ping-Huang, and Guofu Zhou. 2006. Using Bootstrap to Test Portfolio Efficiency. Annals of Economics and Finance 2: 217–49. [Google Scholar]
Efron, Bradley, and Robert J. Tibshirani. 1993. An Introduction to the Bootstrap. New York: Chapman and Hall. [Google Scholar]
Ejara, Demissew, Alain Krapl, Thomas J. O’Brien, and Santiago Ruiz de Vargas. 2019. Local, Global, and International CAPM: For Which Countries Does Model Choice Matter? Available online: https://ssrn.com/abstract=3023501 (accessed on 24 January 2020).
Elton, Edwin, and Martin Gruber. 1995. Modern Portfolio Theory and Investment Analysis. New York: John Wiley. [Google Scholar]
Fama, Eugene. 1965. The behavior of stock market prices. Journal of Business 38: 34–105. [Google Scholar] [CrossRef]
Fama, Eegene, and Kenneth R. French. 1995. Size and book-to-market factors in earnings and returns. Journal of Finance 50: 131–55. [Google Scholar] [CrossRef]
Fiorentini, Gabriele, Enriques Sentana, and Giorgio Calzolari. 2003. Maximum likelihood estimation and inference in multivariate conditionally heteroscedastic dynamic regression models with Student t innovations. Journal of Business & Economic Statistics 21: 532–46. [Google Scholar]
Francis, Jack Clark, and Dongcheol Kim. 2013. Modern Portfolio Theory: Foundation, Analysis, and New Dvelopments. Hoboken: John Wiley. [Google Scholar]
Galea, Manuel, Jose A. Díaz-Garcá, and Filidor Vilca. 2008. Influence diagnostics in the capital asset pricing model under elliptical distributions. Journal of Applied Statistics 35: 179–92. [Google Scholar] [CrossRef]
Galea, Manuel, David Cademartori, and Filidor Vilca. 2010. The structural sharpe model under t-distributions. Journal of Applied Statistics 37: 1979–90. [Google Scholar] [CrossRef]
Galea, Manuel, and Patricia Giménez. 2019. Local influence diagnostics for the test of mean-variance efficiency and systematic risks in the Capital Asset Pricing Model. Statistical Papers 60: 293–312. [Google Scholar] [CrossRef]
Gibbons, Michael, Stephen Ross, and Jay Shanken. 1989. A test of the efficiency of a given portfolio. Econometrica 57: 1121–53. [Google Scholar] [CrossRef]
Glabadanidis, Paskalis. 2009. Measuring the economic significance of mean-variance spanning. The Quarterly Review of Economics and Finance 49: 596–616. [Google Scholar] [CrossRef]
Glabadanidis, Paskalis. 2014. What Difference Fat Tails Make: A Bayesian MCMC Estimation of Empirical Asset Pricing Models. In Bayesian Inference in the Social Sciences. Edited by Ivan Jeliazkov and Xin-She Yang. Hoboken: John Wiley. [Google Scholar]
Glabadanidis, Paskalis. 2019. An exact test of the improvement of the minimum variance portfolio. International Review of Finance 19: 45–82. [Google Scholar] [CrossRef]
Hamada, Mahmoud, and Emiliano A. Valdez. 2008. CAPM and option pricing with elliptically contoured distributions. The Journal of Risk and Insurance 75: 387–409. [Google Scholar] [CrossRef]
Hansen, Lars Peter. 1982. Large Sample Properties of Generalized Method of Moments Estimators. Econometrica 50: 1029–54. [Google Scholar] [CrossRef]
Harvey, Campbell, and Guofu Zhou. 1993. International asset pricing with alternative distributional specifications. Journal of Empirical Finance 1: 107–31. [Google Scholar] [CrossRef]
Harvey, Campbell R., John Liechty, M. Liechty, and P. M. Müller. 2010. Portfolio selection with higher moments. Quantitative Finance 10: 469–85. [Google Scholar] [CrossRef]
Hodgson, Douglas, Oliver Linton, and Keith Vorkink. 2002. Testing the capital asset pricing model efficiently under elliptical symmetry: A semiparametric approach. Journal of Applied Econometrics 17: 617–39. [Google Scholar] [CrossRef]
Ingersoll, Jonathan. 1987. Theory of Financial Decision Making. Lanham: Rowman and Littlefield. [Google Scholar]
Johnson, R. Stafford. 2014. Equity Markets and Portfolio Analysis. Hoboken: John Wiley. [Google Scholar]
Kan, Raymond, and Guofu Zhou. 2017. Modeling Non-normality Using Multivariate t: Implications for Asset Pricing. China Finance Review International 7: 2–32. [Google Scholar] [CrossRef]
Lange, Kenneth L., Roderick J. Little, and Jeremy Taylor. 1989. Robust statistical modelling using the t-distribution. Journal of the American Statistical Association 84: 881–96. [Google Scholar]
Lee, John. 1991. A Lagrange multiplier test for GARCH models. Economics Letters 37: 265–71. [Google Scholar] [CrossRef]
Lemonte, Artur. 2016. The Gradient Test: Another Likelihood-Based Test. London: Academic Press. [Google Scholar]
Levy, Haim. 2012. The Capital Asset Pricing Model in the 21st Century: Analytical, Empirical, and Behavioral Perspectives. New York: Cambridge University Press. [Google Scholar]
Lintner, John. 1965. The valuation of risk assets and the selection of risky investments in stock portfolios and capital budgets. Review of Economics and Statistics 41: 13–37. [Google Scholar] [CrossRef]
Liu, Chuanhai, and Donald B. Rubin. 1995. ML estimation of the t distribution using EM and its extensions, ECM and ECME. Statistica Sinica 5: 19–39. [Google Scholar]
MacKinlay, A. Craig, and Mathew P. Richardson. 1991. Using Generalized Method of Moments to Test Mean-Variance Efficiency. The Journal of Finance 46: 511–27. [Google Scholar] [CrossRef]
Magnus, Jan R., and Heinz Neudecker. 2007. Matrix Differential Calculus with Applications in Statistics and Econometrics, 3rd ed. New York: John Wiley. [Google Scholar]
Mazzoni, Thomas. 2018. A First Course in Quantitative Finance. New York: Cambridge University Press. [Google Scholar]
Mitchell, Ann. 1989. The information matrix, skewness tensor and α-connections for the general multivariate elliptic distributions. Annals of the Institute of Statistical Mathematics 41: 289–304. [Google Scholar] [CrossRef]
Mossin, Jan. 1966. Equilibrium in capital asset market. Econometrica 35: 768–83. [Google Scholar] [CrossRef]
Owen, Joel, and Ramon Rabinovitch. 1983. On the class of elliptical distributions and their applications to the theory of portfolio. The Journal of Finance 38: 745–52. [Google Scholar] [CrossRef]
Paula, Gilberto, Victor Leiva, Michelli Barros, and Shuangzhe Liu. 2011. Robust statistical modeling using the Birnbaum-Saunders-t distribution applied to insurance. Applied Stochastic Models in Business and Industry 28: 16–34. [Google Scholar] [CrossRef]
Pereiro, Luis E. 2010. The beta dilemma in emerging markets. Journal of Applied Corporate Finance 22: 110–22. [Google Scholar] [CrossRef]
Pinheiro, Jose C., Chuanhai Liu, and Ying Nian Wu. 2001. Efficient algorithms for robust estimation in linear mixed-effects models using the multivariate t distribution. Journal of Computational and Graphical Statistics 10: 249–76. [Google Scholar] [CrossRef]
Sharpe, William. 1964. Capital asset prices: A theory of markets equilibrium under conditions of risk. Journal of Finance 19: 425–42. [Google Scholar]
Shoham, Shy. 2002. Robust clustering by deterministic agglomeration EM of mixtures of multivariate t distribution. Pattern Recognition 35: 1127–42. [Google Scholar] [CrossRef]
Song, Peter. X. -K., Peng Zhang, and Annie Qu. 2007. Maximum likelihood inference in robust linear mixed-effects models using the multivariate t distributions. Statistica Sinica 17: 929–43. [Google Scholar]
Sutradhar, Brajendra C. 1993. Score test for the covariance matrix of elliptical t-distribution. Journal of Multivariate Analysis 46: 1–12. [Google Scholar] [CrossRef][Green Version]
Terrell, George. 2002. The Gradient Statistic. Computing Science and Statistics 34: 206–15. [Google Scholar]
Vorkink, Keith. 2003. Return distributions and improved tests of asset pricing models. The Review of Financial Studies 16: 845–74. [Google Scholar] [CrossRef]
Waldman, Donald M. 1983. A note on algebraic equivalence of White’s test and a variation of the Godfrey/Breusch-Pagan test for heteroscedasticity. Economics Letters 13: 197–200. [Google Scholar] [CrossRef]
White, Halbert. 1980. A Heteroskedasticity-Consistent Covariance Matrix Estimator and a Direct Test for Heteroskedasticity. Econometrica 48: 817–38. [Google Scholar] [CrossRef]
Xie, Feng-Chang, Bo-Cheng Wei, and Jin-Guan Lin. 2007. Case-deletion Influence Measures for the Data from Multivariate t Distributions. Journal of Applied Statistics 34: 907–21. [Google Scholar] [CrossRef]
Zhou, Guofu. 1993. Asset-pricing test under alternative distributions. The Journal of Finance 48: 1927–42. [Google Scholar] [CrossRef]

Figure 1. Scatter plots and estimated lines for five assets, using the normal and t distributions, for the Chilean Stock Market data set.

Figure 2. QQ-plot of transformed distances for the NCAPM (a) and TCAPM (b), for the Chilean Stock Market data set.

Figure 3. Mahalanobis distances (

{\hat{F}}_{t}

) for the NCAPM (a) and TCAPM (b), for the Chilean Stock Market data set, and NCAPM (c) and TCAPM (d) for the NYSE data set.

Figure 3. Mahalanobis distances (

{\hat{F}}_{t}

) for the NCAPM (a) and TCAPM (b), for the Chilean Stock Market data set, and NCAPM (c) and TCAPM (d) for the NYSE data set.

Figure 4. Scatter plots and estimated lines for five assets, using the normal and t distributions, for the NYSE data set.

Figure 5. Plots of transformed distances for the NCAPM (a) and TCAPM (b), for the NYSE data set.

Figure 6. Perturbed ML estimates of

β

under the NCAPM (red line) and TCAPM (blue line), for the Chilean Stock Market data set.

Figure 6. Perturbed ML estimates of

β

under the NCAPM (red line) and TCAPM (blue line), for the Chilean Stock Market data set.

Figure 7. Perturbed maximum likelihood (ML) estimates of

β

under the NCAPM (red line) and TCAPM (blue line), for the NYSE data set.

Figure 7. Perturbed maximum likelihood (ML) estimates of

β

under the NCAPM (red line) and TCAPM (blue line), for the NYSE data set.

Figure 8. Plots of transformed distances for the NMAPM (a) and TMAPM (b), for the NYSE data set.

Table 1. Summary statistics for monthly log-returns of five assets and IPSA index, from the Chilean Stock Market data set. p-values are in parentheses.

Asset	Mean (%)	SD (%)	Sharpe	Skewness	Kurtosis	JB Test
BSantander	0.4590	5.5594	0.0480	0.1441	2.9212	0.7846
						(0.6755)
ENEL	0.1311	7.4571	−0.0082	−1.3978	9.2396	410.9952
						(0.0000)
Falabella	0.6296	7.3003	0.0601	−0.3574	5.9289	79.9115
						(0.0000)
LTM	0.5687	11.2062	0.0336	−2.4017	22.7946	3647.6426
						(0.0000)
SQMB	1.1889	9.4381	0.1057	−0.0603	5.5979	59.4642
						(0.0000)
IPSA	0.5800	4.6000	0.0850	−0.1324	3.8769	7.3770
						(0.0250)

Table 2. Adjustment results of Capital Asset Pricing Model (CAPM) using the multivariate normal and t distributions, standard errors are in parentheses, for the Chilean Stock Market data set.

Model	Asset	$\hat{α}$	$\hat{β}$			$\hat{Σ}$
Normal	BSantander	0.0007 (0.0032)	0.7105 (0.0685)	0.0021	−0.0001	−0.0003	−0.0001	−0.0003
	ENEL	−0.0028 (0.0038)	0.9901 (0.0830)		0.0030	−0.0003	0.0004	−0.0005
	Falabella	0.0014 (0.0035)	1.1298 (0.0767)			0.0026	−0.0001	−0.0007
	LTM	0.0019 (0.0052)	1.5608 (0.1124)				0.0056	−0.0004
	SQMB	0.0091 (0.0057)	1.1047 (0.1231)					0.0067
$t (\hat{η} = 0.135)$	BSantander	−0.0011 (0.0031)	0.7090 (0.0675)	0.0024	0.0000	−0.0003	−0.0001	−0.0001
	ENEL	−0.0010 (0.0032)	1.0039 (0.0700)		0.0026	−0.0004	0.0002	−0.0006
	Falabella	0.0026 (0.0034)	1.0623 (0.0727)			0.0028	−0.0004	−0.0003
	LTM	0.0014 (0.0047)	1.3873 (0.1011)				0.0054	−0.0002
	SQMB	0.0070 (0.0050)	1.1039 (0.1072)					0.0060

Table 3. Test of the mean-variance efficiency; p-values in parentheses, for the Chilean Stock Market data set.

Test	Normal Fit	Multivariate t Fit	GMM Fit
Wald	3.7243	3.0368	3.6917
	(0.5898)	(0.6943)	(0.5946)
Likelihood-ratio	3.6918	2.9712	-
	(0.5946)	(0.7044)	-
Score	3.6597	2.9508	-
	(0.5994)	(0.7076)	-
Gradient	-	2.9698	-
	-	(0.7046)	-

Table 4. Summary statistics for monthly log-returns of five assets and S&P500 index from the NYSE data set. p-values are in parentheses.

Asset	Mean (%)	SD (%)	Sharpe	Skewness	Kurtosis	JB Test
Bank of America	−0.0691	11.9324	−0.0281	−1.3041	12.6330	1008.4271
						(0.0000)
Boeing	0.5269	9.2462	0.0282	−1.8571	12.0454	968.1001
						(0.0000)
Ford	−0.7582	13.0995	−0.0782	0.0119	16.1538	1751.8490
						(0.0000)
Gelectric	−0.7728	8.5741	−0.1212	−0.6836	5.9354	106.1649
						(0.0000)
Microsoft	0.4093	8.3394	0.0171	−0.3767	6.7068	144.8702
						(0.0000)
S&P500	0.2300	4.3400	−0.0077	−0.8143	4.4698	48.7262
						(0.0000)

Table 5. Adjustment results of CAPM using the multivariate normal and t distributions, standard errors are in parentheses, for the NYSE data set.

Model	Assets	$\hat{α}$	$\hat{β}$			$\hat{Σ}$
Normal	Bank of Am	0.0025 (0.0060)	1.5633 (0.1407)	0.0087	0.0002	0.0019	0.0013	−0.0004
	Boeing	0.0060 (0.0046)	1.1405 (0.1080)		0.0051	0.0007	0.0005	−0.0012
	Ford	−0.0026 (0.0077)	1.7125 (0.1802)			0.0143	0.0006	−0.0005
	Gelectric	−0.0076 (0.0040)	1.3070 (0.0931)				0.0038	−0.0003
	Microsoft	0.0042 (0.0043)	1.1518 (0.1013)					0.0045
$t (\hat{η} = 0.277)$	Bank of Am	−0.0017 (0.0044)	1.2387 (0.1036)	0.0087	0.0002	0.0008	0.0007	−0.0004
	Boeing	0.0065 (0.0039)	1.0935 (0.0908)		0.0067	0.0010	0.0004	−0.0011
	Ford	−0.0147 (0.0050)	1.4773 (0.1174)			0.0111	−0.0002	−0.0005
	Gelectric	−0.0074 (0.0032)	1.2406 (0.0747)				0.0045	0.0002
	Microsoft	0.0007 (0.0034)	1.1487 (0.0797)					0.0051

Table 6. Test of the mean-variance efficiency; p-values in parentheses, for the NYSE data set.

Test	Normal Fit	Multivariate t Fit	GMM Fit
Wald	8.5507	19.7321	8.5664
	(0.1284)	(0.0014)	(0.1277)
Likelihood-ratio	8.4037	18.7816	-
	(0.1353)	(0.0021)	-
Score	8.2601	18.1012	-
	(0.1425)	(0.0028)	-
Gradient	-	17.9527	-
	-	(0.0030)	-

Table 7. Adjustment results of Multifactor Asset Pricing Model (MAPM) using the multivariate normal and t distributions from the NYSE data set and Fama-French data set. Standard errors are in parentheses.

Model	Asset	$\hat{α}$	${\hat{β}}_{1}$	${\hat{β}}_{2}$	${\hat{β}}_{3}$
Normal	Bank of Am	0.0006 (0.0052)	1.4698 (0.1237)	0.2059 (0.1656)	1.4310 (0.1624)
	Boeing	0.0052 (0.0044)	1.1015 (0.1042)	0.0557 (0.1395)	0.6612 (0.1367)
	Ford	−0.0041 (0.0075)	1.6341 (0.1765)	0.3050 (0.2363)	0.9135 (0.2317)
	Gelectric	−0.0081 (0.0038)	1.2836 (0.0910)	−0.0050 (0.1218)	0.4815 (0.1194)
	Microsoft	0.0051 (0.0041)	1.1967 (0.0970)	−0.1058 (0.1298)	−0.6713 (0.1273)
$t (\hat{η} = 0.27)$	Bank of Am	−0.0017 (0.0027)	1.1916 (0.0634)	0.1535 (0.0849)	1.2474 (0.0832)
	Boeing	0.0056 (0.0026)	1.1063 (0.0609)	−0.0279 (0.0815)	0.5455 (0.0799)
	Ford	−0.0149 (0.0033)	1.4170 (0.0780)	0.2127 (0.1045)	0.6901 (0.1024)
	Gelectric	−0.0081 (0.0021)	1.2611 (0.0497)	−0.1479 (0.0665)	0.3387 (0.0652)
	Microsoft	0.0017 (0.0022)	1.1636 (0.0524)	−0.3077 (0.0701)	−0.4498 (0.0688)

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Galea, M.; Cademartori, D.; Curci, R.; Molina, A. Robust Inference in the Capital Asset Pricing Model Using the Multivariate t-distribution. J. Risk Financial Manag. 2020, 13, 123. https://doi.org/10.3390/jrfm13060123

AMA Style

Galea M, Cademartori D, Curci R, Molina A. Robust Inference in the Capital Asset Pricing Model Using the Multivariate t-distribution. Journal of Risk and Financial Management. 2020; 13(6):123. https://doi.org/10.3390/jrfm13060123

Chicago/Turabian Style

Galea, Manuel, David Cademartori, Roberto Curci, and Alonso Molina. 2020. "Robust Inference in the Capital Asset Pricing Model Using the Multivariate t-distribution" Journal of Risk and Financial Management 13, no. 6: 123. https://doi.org/10.3390/jrfm13060123

APA Style

Galea, M., Cademartori, D., Curci, R., & Molina, A. (2020). Robust Inference in the Capital Asset Pricing Model Using the Multivariate t-distribution. Journal of Risk and Financial Management, 13(6), 123. https://doi.org/10.3390/jrfm13060123

Article Menu

Robust Inference in the Capital Asset Pricing Model Using the Multivariate t-distribution

Abstract

1. Introduction

2. Methodology

2.1. The CAPM under the t-Distribution

2.2. Maximum Likelihood Estimation

2.3. Asymptotic Standard Errors

2.4. Test of Mean-Variance Efficiency

2.5. Model Assessment and Outlier Detection

2.6. Generalized Method of Moments Tests

3. Applications

3.1. The Chilean Stock Market Data Set

3.2. The New York Stock Exchange Data Set

3.3. Robustness

4. Multifactor Asset Pricing Models under the $t$ -Distribution

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A. The Multivariate t-Distribution

Appendix B. ML Estimation Using the EM Algorithm

Appendix C. ML Estimation under H_β and H_αβ

Appendix D. Equality of the Score and Gradient Tests under Normality

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

Robust Inference in the Capital Asset Pricing Model Using the Multivariate t-distribution

Abstract

1. Introduction

2. Methodology

2.1. The CAPM under the t-Distribution

2.2. Maximum Likelihood Estimation

2.3. Asymptotic Standard Errors

2.4. Test of Mean-Variance Efficiency

2.5. Model Assessment and Outlier Detection

2.6. Generalized Method of Moments Tests

3. Applications

3.1. The Chilean Stock Market Data Set

3.2. The New York Stock Exchange Data Set

3.3. Robustness

4. Multifactor Asset Pricing Models under the t -Distribution

5. Conclusions

Author Contributions

Funding

Acknowledgments

Conflicts of Interest

Appendix A. The Multivariate t-Distribution

Appendix B. ML Estimation Using the EM Algorithm

Appendix C. ML Estimation under Hβ and Hαβ

Appendix D. Equality of the Score and Gradient Tests under Normality

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

4. Multifactor Asset Pricing Models under the $t$ -Distribution

Appendix C. ML Estimation under H_β and H_αβ