The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed

Kwon, Oh Kang; Satchell, Stephen

doi:10.3390/jrfm13020027

Open AccessArticle

The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed

by

Oh Kang Kwon

¹ and

Stephen Satchell

^2,*

¹

Discipline of Finance, Codrington Building (H69), The University of Sydney, Sydney NSW 2006, Australia

²

Trinity College, University of Cambridge, Cambridge CB2 1TQ, UK

^*

Author to whom correspondence should be addressed.

J. Risk Financial Manag. 2020, 13(2), 27; https://doi.org/10.3390/jrfm13020027

Submission received: 13 January 2020 / Revised: 27 January 2020 / Accepted: 31 January 2020 / Published: 5 February 2020

(This article belongs to the Special Issue Modern Portfolio Theory)

Download

Browse Figures

Versions Notes

Abstract

:

In Kwon and Satchell (2018), a theoretical framework was introduced to investigate the distributional properties of the cross-sectional momentum returns under the assumption that the vector of asset returns over the ranking and holding periods were multivariate normal. In this paper, the framework is extended to derive the corresponding results when the asset returns are multivariate Student’s t. In particular, we derive the probability density function and the moments of the cross-sectional momentum returns and examine in detail the special case of two underlying assets to demonstrate that many of the salient features reported in the empirical literature are consistent with the theoretical implications.

Keywords:

cross sectional momentum; student’s t distribution; investment strategy

1. Introduction

Empirical investigation of the observed patterns in asset returns has been an active area of research in finance, with momentum, or persistence, in asset returns being one of the more popular examples of this line of research. Of these, perhaps the most prominent is cross-sectional momentum (CSM), which refers to the observation that the set of assets that outperform relative to another set over a prior period tend to continue to outperform over a subsequent period. The existence of CSM is usually tested empirically by sorting the assets according to their returns over a prior “ranking” period, and constructing a portfolio over a subsequent “holding” period by taking a long position in the “winners” and a short position in the “losers”. Statistically significant excess returns from following such a strategy would then support the existence of cross-sectional momentum.

Cross sectional momentum strategies are popular with practitioners since they tend to generate positive returns, while they are popular with academics due to the fact their existence would run contrary to an implication of the efficient market hypothesis that there does not exist any discernible patterns in asset returns. There is an extensive academic literature investigating the properties of CSM returns covering various asset classes, markets, and jurisdictions. The most notable findings are that CSM returns are generally slightly positive, but become highly negative during times of market uncertainty, and that losses during such periods tend to cancel out, or at least significantly reduce, the prior gains.

Various authors, including Fama and French (1992), Jegadeesh and Titman (1993, 2001), Asness (1994), and Israel and Moskowitz (2013), found that momentum strategies are profitable in US equities markets over different time periods dating back to 1927. Analogous results were found for country equity indices by Richards (1997), Asness et al. (1997), Chan et al. (2000), and Hameed and Yuanto (2002), for emerging markets by Rouwenhorst (1998), for exchange rate markets in Okunev and White (2003) and Menkhoff et al. (2012), for commodities by Erb and Harvey (2006), for futures contracts in Moskowitz et al. (2012), and in industries by Sefton and Scowcroft (2004). Similar results were also found by Asness et al. (2013) and Daniel and Moskowitz (2016) for markets in the European Union, Japan, the United Kingdom, and the United States, and across asset classes including fixed income, commodities, foreign exchange, and equity from 1972 through 2013.

Despite the extensive literature on the empirical properties of momentum based returns, there are relatively few that consider the distributional properties of these returns from a theoretical viewpoint, with Kwon and Satchell (2018) being a notable exception that addresses the CSM returns as defined in this paper. Most of the known theoretical results, obtained for example by Lo and MacKinlay (1990), Jegadeesh and Titman (1993), Lewellen (2002), and Moskowitz et al. (2012), are concerned only with the expected values and first order autocorrelations of returns from the so-called weighted relative strength strategy in which the portfolio over the holding period is constructed from all underlying assets weighted, essentially, in proportion to their absolute or relative returns over the ranking period. The reason why we wish to calculate the distribution of CSM returns is that we can then calculate percentiles, quantiles, and related quantities. We can deduce the degree to which moments of returns exist and their precise form. Such information can be used, for example, to assess the fatness of the tails of the distribution and this is valuable for risk management calculations as well as understanding the benefits and limitations of portfolio construction.

By assuming that underlying asset returns are Gaussian, the distribution and the moments of the CSM returns were derived in Kwon and Satchell (2018). In this paper, we extend their results to the case where the underlying asset returns are Student’s t to derive the probability density function and the moments of the CSM returns. The t distribution arises naturally, for example, in a framework where asset volatility is stochastic, and conventional mean-variance analysis will create returns which are very similar to t-distributed returns. The important distinction between Student’s t returns and normal returns is that the distribution of Student’s t has an additional parameter which governs the fatness of the tails of the distribution and can be used to assess tail risk. There is a trade-off between realism and complexity; we would like to use a more complex distribution such as the skewed Student’s t considered in Theodossiou (1998) and Hansen et al. (2010), but the analytical complexity that results becomes prohibitive.

Although the individual asset returns do not exhibit skewness under the generalization to Student’s t, they can be leptokurtic which is a well-established feature in the empirical literature. Moreover, the CSM returns can, and do, exhibit skewness that depends on the statistical properties of the underlying assets. A detailed analysis of the special case of two underlying assets reveals that many of the salient features of the CSM returns reported in the empirical literature are consistent with the theoretical implications from this framework. This analysis is of interest because Kwon and Satchell (2018) were able to show that non-normality was a consequence of the momentum structure, even when the underlying returns were normal. We therefore wish to assess what the impact of assuming non-normality in the underlying returns will have on CSM returns. For example, will it exacerbate non-normality or make very little difference? Answers to this question will shed light on applying CSM to universes of assets which are fundamentally non-normal, such as emerging markets.

It should be pointed out that since we work under the assumption that asset returns over the ranking and holding periods are jointly t-distributed, there are limitations in the properties of momentum returns that can be addressed in the theoretical framework of this paper. For example, it is not possible to adequately address properties that depend on certain firm specific, economic, or financial factors such as liquidity, credit spread, market sentiment, business cycle, and information asymmetry since these factors cannot easily be captured in the distributional assumption on asset returns. Theoretical investigation of such properties would require an extension with the ability to incorporate such factors.

Finally, it may be asked what the connection is between our analysis and the extensive linear factor modelling that dominates the asset pricing literature. This literature essentially says that the time t mean of an asset, say the first, is a linear function of factor returns. In the framework of this paper, we can accommodate such modelling by interpreting the asset mean to be conditional on factor returns.

The remainder of this paper is organized as follows: Section 2 introduces the notation and the key results on multivariate normal distributions, and Section 3 provides a mathematically precise definition of CSM returns. Although the expressions for the CSM return density and the associated moments are quite complex in general, they simplify considerably in the case of two assets with one winner and one loser, and this special case is examined in detail in Section 4, along with implications to the empirically observed features reported in the literature, and the paper concludes with Section 5.

2. Notation and Preliminaries

For the convenience of the reader, we introduce in this section the notation that will be used throughout the paper, and present some known results that will be relied upon in subsequent sections.

2.1. Notation

For any

x \in R^{n}

, we will write

x_{i}

for the i-th coordinate of

x

, and given

y \in R^{n}

write

x ≺ y

if and only if

x_{i} < y_{i}

for all

1 \leq i \leq n

. Similarly, given a matrix

M \in R^{m \times k}

, we will write

M_{i, j}

for the

(i, j)

-th entry of M, and the transpose of a vector or a matrix will be denoted by the superscript

^{'}

. The vector in

R^{n}

with all entries equal to 1 will be denoted

1_{n}

, and given a subset

A \in R^{n}

, we will denote by

I_{A}

the indicator function on A.

Given a random vector,

X

, with values in a region

D_{X} \subset R^{n}

, we will write

f_{X} (x)

and

F_{X} (x)

for the probability density and the cumulative density functions of

X

, respectively. Moreover, given another random vector

Y

, with values in

D_{Y} \subset R^{m}

, we will denote by

f_{X | Y} (x | y)

and

F_{X | Y} (x | y)

the conditional probability density and conditional cumulative density functions of

X

given

Y = y

, respectively.

For any

n \in N

, let

[n] = {1, 2, \dots, n}

and let

S_{n}

be the set of permutations of

[n]

. We will denote the permutation that maps

1 \mapsto i_{1}, 2 \mapsto i_{2}, \dots, n \mapsto i_{n}

by a sequence

(i_{1}, i_{2}, \dots, i_{n})

, and given any

τ \in S_{n}

write

τ (i)

for the image of i under

τ

so that if

τ = (1, 3, 2)

, for example, then

τ (1) = 1

,

τ (2) = 3

, and

τ (3) = 2

. Given a permutation

τ \in S_{n}

, we will denote by

P_{τ} \in R^{n \times n}

the permutation matrix corresponding to

τ

and denote by

D_{n} \in R^{(n - 1) \times n}

the matrix

\begin{matrix} D_{n} = [\begin{matrix} - 1 1 0 0 \dots 0 0 0 \\ 0 - 1 1 0 \dots 0 0 0 \\ \dots \dots \dots \dots \dots \dots \\ 0 0 0 0 \dots 0 - 1 1 \end{matrix}] . \end{matrix}

(1)

The elements of the permutation group

S_{n}

act naturally on the set of polynomials,

R [x_{1}, \dots, x_{n}]

by the rule

\begin{matrix} τ p (x_{1}, \dots, x_{n}) = p (x_{τ_{1}}, \dots, x_{τ_{n}}) \end{matrix}

for any polynomial

p \in R [x_{1}, \dots, x_{n}]

and

τ \in S_{n}

. For any

n_{1}, n_{2} \in N

, let

p_{n_{1}, 2 n_{2}} \in R [x_{1}, \dots, x_{n_{1} + 2 n_{2}}]

be the polynomial

\begin{matrix} p_{n_{1}, 2 n_{2}} (x_{1}, \dots, x_{n_{1} + 2 n_{2}}) = (\prod_{i = 1}^{n_{1}} x_{i}) (\prod_{i = 1}^{n_{2}} {(x_{n_{1} + 2 i} - x_{n_{1} + 2 i - 1})}^{2}) . \end{matrix}

(2)

Denote by

Z (n_{1}, 2 n_{2})

the stabilizer of

p_{n_{1}, 2 n_{2}}

under the action of

S_{n_{1} + 2 n_{2}}

so that

\begin{matrix} Z (n_{1}, 2 n_{2}) = {τ \in S_{n_{1} + 2 n_{2}} ∣ τ p_{n_{1}, 2 n_{2}} = p_{n_{1}, 2 n_{2}}}, \end{matrix}

(3)

and let

Q (n_{1}, 2 n_{2}) = S_{n_{1} + 2 n_{2}} / Z (n_{1}, 2 n_{2})

be the quotient group,1 with elements of

Q (n_{1}, 2 n_{2})

identified with their coset representatives

τ \in S_{n_{1} + 2 n_{2}}

. Finally, define

\begin{matrix} {[n]}^{m} = \prod_{i = 1}^{m} {1, 2, \dots, n} = {(i_{1}, i_{2}, \dots, i_{m}) ∣ 1 \leq i_{j} \leq n, 1 \leq j \leq m} \end{matrix}

(4)

as the m-fold Cartesian product of

[n] = {1, 2, \dots, n}

.

2.2. Multivariate Normal Distributions

The density of an n-dimensional normal distribution with mean

μ

and covariance

Σ

at

x \in R^{n}

will be denoted

ϕ_{n} (x; μ, Σ)

, and the corresponding cumulative density function will be denoted

Φ_{n} (x; μ, Σ)

. In general, given random variables

X_{1}, \dots, X_{n}

, their joint probability density function will be denoted

f_{X_{1}, \dots, X_{n}}

, and we will write

F_{X_{1}, \dots, X_{n}}

for the cumulative density function.

Theorem 1.

Let

n_{1}, n_{2} \in N

and suppose

X \sim N_{n_{1} + n_{2}} (μ, Σ)

, where

\begin{matrix} X = [\begin{matrix} X_{1} \\ X_{2} \end{matrix}], μ = [\begin{matrix} μ_{1} \\ μ_{2} \end{matrix}], Σ = [\begin{matrix} Σ_{1, 1} & Σ_{1, 2} \\ Σ_{2, 1} & Σ_{2, 2} \end{matrix}], \end{matrix}

(5)

with

X_{i}, μ_{i} \in R^{n_{i}}

and

Σ_{i, j} \in R^{n_{i} \times n_{j}}

for

1 \leq i, j \leq 2

, and Σ positive definite. Then, the conditional distribution of

X_{1}

given

X_{2}

is normal with mean and covariance

\begin{matrix} μ_{X_{1} ∣ X_{2}} & = μ_{1} + Σ_{1, 2} Σ_{2, 2}^{- 1} (X_{2} - μ_{2}), \end{matrix}

(6)

\begin{matrix} Σ_{X_{1} ∣ X_{2}} & = Σ_{1, 1} - Σ_{1, 2} Σ_{2, 2}^{- 1} Σ_{2, 1}, \end{matrix}

(7)

respectively, and

ϕ_{n_{1} + n_{2}} (x; μ, Σ)

decomposes as

\begin{matrix} ϕ_{n_{1} + n_{2}} (x; μ, Σ) = ϕ_{n_{1}} (x_{1}; μ_{X_{1} ∣ X_{2}}, Σ_{X_{1} ∣ X_{2}}) ϕ_{n_{2}} (x_{2}; μ_{2}, Σ_{2, 2}) . \end{matrix}

(8)

Proof.

Refer to Muirhead (1982) Theorem 1.2.11. □

Given an n-dimensional random vector

X

and

p = (p_{1}, \dots, p_{m}) \in {1, 2, \dots, n}^{m}

, we will denote by

μ_{p} (X)

the

p

-th moment of

X

so that

\begin{matrix} μ_{p} (X) = E [\prod_{i = 1}^{m} X_{p_{i}}] . \end{matrix}

(9)

Note that the subscripts,

p_{i}

, in (9) may be repeated so that the above definition is equivalent to the more familiar definition of moments in which the powers of the components of

X

appear inside the expectation on the right-hand side, viz.

E [\prod_{i = 1}^{n} X_{i}^{k_{i}}]

with

0 \leq k_{i} \in N

for

1 \leq i \leq n

. For example, we have

E [X_{1}^{2} X_{2}^{3}] = E [X_{1} X_{1} X_{2} X_{2} X_{2}]

, and so in the notation of (9) we have

m = 5

,

d = (d_{1}, d_{2}, d_{3}, d_{4}, d_{5}) = (1, 1, 2, 2, 2)

, and

μ_{(1, 1, 2, 2, 2)} (X) = E [X_{1}^{2} X_{2}^{3}]

.

Theorem 2.

Let

X \sim N_{n} (μ, Σ)

, where

n \in N

,

μ \in R^{n}

and

Σ \in R^{n \times n}

is positive definite. Then, for any

p = (p_{1}, \dots, p_{m}) \in {1, 2, \dots, n}^{m}

, we have

\begin{matrix} μ_{p} (X) = \sum_{\begin{matrix} k, l \in N \\ k + 2 l = m \end{matrix}} \sum_{τ \in Q (k, 2 l)} (\prod_{i = 1}^{k} μ_{p_{τ_{i}}}) (\prod_{i = 1}^{l} Σ_{p_{τ_{k + 2 i - 1}}, p_{τ_{k + 2 i}}}) . \end{matrix}

(10)

In particular, if

μ = 0

and m is even, then

\begin{matrix} μ_{p} (X) = \sum_{τ \in Q (0, m)} \prod_{i = 1}^{\frac{1}{2} m} Σ_{p_{τ_{2 i - 1}}, p_{τ_{2 i}}} . \end{matrix}

(11)

Proof.

Refer to Withers (1985) Theorem 1.1. □

An alternative expression for

μ_{p} (X)

, where

X

is multivarite normal, is given in Kan (2008) Proposition 2.

Corollary 1.

Let

X \sim N_{1} (μ, σ^{2})

. Then, for any

m \in N

, the m-th moment of X is given by

\begin{matrix} μ_{m} (X) = \sum_{\begin{matrix} i = 0 \\ i even \end{matrix}}^{m} (\binom{m}{i}) (i - 1)!! σ^{i} μ^{m - i}, \end{matrix}

(12)

where

k!! = \prod_{i = 0}^{⌊ \frac{k}{2} ⌋} (k - 2 i)

is the double factorial of

k \in N

. In the special case where

μ = 0

and m is even, we have

μ_{m} (X) = (m - 1)!! σ^{m}

.

Proof.

Follows from Theorem 2, since the inner sum in (10), for which

2 l = i

, consists of

(\binom{m}{i}) (i - 1)!!

identical terms that are all equal to

σ^{i} μ^{m - i}

. □

2.3. Multivariate Student’s t Distribution

Asset returns are often assumed to be normally distributed in the academic literature for theoretical convenience, in which case they are completely determined by the location and the scale parameters. However, it is widely reported in the empirical literature that the observed asset returns exhibit excess kurtosis. Student’s t-distribution is a distribution from the elliptical family with an additional parameter

ν

, viz. number of degrees of freedom that controls the kurtosis. The Student’s t distribution reduces to the normal in the limit as

ν \to \infty

, and hence provides a convenient framework under which to investigate the impact of excess kurtosis in the underlying asset returns on the distributional properties of the CSM return. This subsection provides a brief summary of the key properties of the Student’s t distribution that will be required in the remainder of this paper.

The t distribution has a long history in mathematical statistics. The univariate probability density function (pdf), t (

μ

,

σ^{2}

,

ν

), of a t-variate with mean

μ

, scale parameter

σ

, and degrees of freedom

ν

is given by

\begin{matrix} t (μ, σ^{2}, ν) = \frac{Γ (\frac{ν + 1}{2})}{σ \sqrt{ν π} Γ (\frac{ν}{2})} {(1 + \frac{1}{ν} {(\frac{x - μ}{σ})}^{2})}^{- \frac{1}{2} (ν + 1)} . \end{matrix}

(13)

From the origins of the t-test in mathematical finance, it is clear that we can write the corresponding random variable as

x = μ + z / Y

, where z and Y are independent,

z \sim N (0, σ^{2})

,

Y = \sqrt{g / ν}

, and

g \sim χ^{2} (ν)

.

In extending the definition of the t distribution to the multivariate case, we are faced with a choice. Although the choice

z \sim N_{n} (0, Σ)

is clear, Y can be defined in various ways. For example:

each component, $z_{i}$ , of $z$ is normalized by independent $Y_{i} = \sqrt{g_{i} / ν_{i}}$ with same or differing $ν_{i}$ ,
$Y_{i}$ could be jointly dependent,
single common Y.

All three of these choices have a stochastic volatility interpretation corresponding to

idiosyncratic shocks,
common factor shocks,
economy-wide or market-wide shock.

Although we have chosen the final characterization, cross sectional momentum could also be analyzed under other characterizations.

Throughout this paper, the probability density function of n-dimensional Student’s t distribution,

{St}_{n_{1} + n_{2}} (μ, Σ, ν)

, with

ν

degrees of freedom, location

μ

, and shape matrix

Σ

at

x \in R^{n}

will be denoted

t_{n} (x; μ, Σ, ν)

, and we will write

T_{n} (x; μ, Σ, ν)

for the corresponding cumulative density function. The next theorem shows that multivariate Student’s t distribution is closed under conditioning, in the sense that the conditional density of a subset given its complement is again Student’s t. This property will be crucial in the investigation of the CSM returns in later sections.

Theorem 3.

Let

n_{1}, n_{2} \in N

and suppose

X \sim {St}_{n_{1} + n_{2}} (μ, Σ, ν)

, where

ν \in R_{+}

,

\begin{matrix} X = [\begin{matrix} X_{1} \\ X_{2} \end{matrix}], μ = [\begin{matrix} μ_{1} \\ μ_{2} \end{matrix}], Σ = [\begin{matrix} Σ_{1, 1} & Σ_{1, 2} \\ Σ_{2, 1} & Σ_{2, 2} \end{matrix}], \end{matrix}

(14)

with

X_{i}, μ_{i} \in R^{n_{i}}

and

Σ_{i, j} \in R^{n_{i} \times n_{j}}

for

1 \leq i, j \leq 2

, and Σ positive definite. Then, the conditional distribution of

X_{1}

given

X_{2}

is Student’s t with degrees of freedom

ν_{X_{1} ∣ X_{2}} = ν + n_{2}

, and location and shape matrix

\begin{matrix} μ_{X_{1} ∣ X_{2}} & = μ_{1} + Σ_{1, 2} Σ_{2, 2}^{- 1} (X_{2} - μ_{2}), \end{matrix}

(15)

\begin{matrix} Σ_{X_{1} ∣ X_{2}} & = \frac{ν + {(X_{2} - μ_{2})}^{'} Σ_{2, 2}^{- 1} (X_{2} - μ_{2})}{ν + n_{2}} (Σ_{1, 1} - Σ_{1, 2} Σ_{2, 2}^{- 1}, Σ_{2, 1}), \end{matrix}

(16)

respectively, and

t_{n_{1} + n_{2}} (x; μ, Σ, ν)

decomposes as

\begin{matrix} t_{n_{1} + n_{2}} (x; μ, Σ, ν) = t_{n_{1}} (x_{1}; μ_{X_{1} ∣ X_{2}}, Σ_{X_{1} ∣ X_{2}}, ν_{X_{1} ∣ X_{2}}) t_{n_{2}} (x_{2}; μ_{2}, Σ_{2, 2}, ν) . \end{matrix}

(17)

Proof.

Refer to Roth (2013) Appendix A.6, or Muirhead (1982) Problems 1.29 and 1.30. □

Although a Student’s t distribution does not have finite moments of all orders, the next theorem provides an explicit expression for those that do exist.

Theorem 4.

Let

X \sim {St}_{n} (μ, Σ, ν)

, where

n \in N

,

ν \in R_{+}

,

μ \in R^{n}

and

Σ \in R^{n \times n}

is positive definite. Moreover, for any

m \in N

, denote by

P_{m}

the set of subsets of the set

{1, 2, \dots, m}

of size k such that

m - k

is even, where

0 \leq k \leq m

. Then, for any

p = (p_{1}, \dots, p_{m}) \in {1, 2, \dots, n}^{m}

such that

m < ν

, we have

\begin{matrix} μ_{p} (X) & = \sum_{P \in P_{m}} \frac{Γ (\frac{ν}{2} - \frac{1}{2} | P^{c} |)}{Γ (\frac{ν}{2})} {(\frac{ν}{2})}^{\frac{1}{2} | P^{c} |} \prod_{k \in P} μ_{p_{i_{k}}} \sum_{τ \in Q (0, | P^{c} |)} (\prod_{j = 1}^{\frac{1}{2} | P^{c} |} Σ_{p_{i_{τ_{2 j - 1}}}, p_{i_{τ_{2 j}}}}), \end{matrix}

(18)

where

P^{c} = {i_{1}, i_{2}, \dots, i_{k + 2 l}}

in the final sum.

Proof.

Refer to Appendix A. □

Setting

n = 1

gives the moments of the one-dimensional Student’s t distribution.

Corollary 2.

Let

X \sim {St}_{1} (μ, σ^{2}, ν)

. Then, for any

m \in N

such that

m < ν

, the m-th moment of X is given by

\begin{matrix} μ_{m} (X) = \sum_{\begin{matrix} k = 0 \\ k even \end{matrix}}^{m} (\binom{m}{k}) (k - 1)!! \frac{Γ (\frac{1}{2} (ν - k))}{Γ (\frac{ν}{2})} {(\frac{ν}{2})}^{\frac{k}{2}} μ^{m - k} σ^{k}, \end{matrix}

(19)

where

k!!

is the double factorial defined in Corollary 1.

Proof.

Follows from similar arguments to Theorem 4 noting that the indices

p_{i_{j}}

are all equal in this case. □

For an alternative derivation of the moments of Student’s t distribution, refer to Kirkby et al. (2019).

2.4. Unified Skew t Family of Distributions

Multivariate skew-normal (SN) distributions were introduced in Azzalini and Valle (1985) to generalize normal distributions to those that allow non-zero skewness, and the seemingly disparate distributions related to the multivariate SN distributions were brought together under the umbrella of the so-called unified skew-normal (SUN) family of distributions in Arellano-Valle and Azzalini (2006), where it was shown that the SUN family contains many of these skew-normal variants as special cases. The extension of the normal family to those with non-zero skewness was then extended to the elliptical family of distributions in Arellano-Valle and Genton (2010). In what follows, we only summarize the results on the extension for the multivariate Student’s t distributions that will be required in this paper, and refer the reader to Arellano-Valle and Genton (2010) and Jamalizadeh and Balakrishnan (2012) for the details.

Given

ν \in R_{+}

,

n_{i} \in N

,

μ_{i} \in R^{n_{i}}

,

Σ_{i, j} \in R^{n_{1} \times n_{2}}

for

1 \leq i \leq j

, where

Σ_{2, 1}^{'} = Σ_{1, 2}

and

Σ_{i, i}

are positive definite, let

\begin{matrix} [\begin{matrix} X_{1} \\ X_{2} \end{matrix}] \sim {St}_{n_{1} + n_{2}} ([\begin{matrix} μ_{1} \\ μ_{2} \end{matrix}], [\begin{matrix} Σ_{1, 1} & Σ_{1, 2} \\ Σ_{1, 2}^{'} & Σ_{2, 2} \end{matrix}], ν) . \end{matrix}

(20)

Then, the probability density function,

f_{U} (u)

, of an

n_{1}

-dimensional unified skew t (SUT) distributed random variable,

U

, associated with

{(X_{1}^{'}, X_{2}^{'})}^{'}

given by

\begin{matrix} f_{U} (u) = \frac{t_{n_{2}} (u; μ_{2}, Σ_{2, 2}, ν)}{T_{n_{1}} (0; - μ_{1}, Σ_{1, 1}, ν)} T_{n_{1}} (0; - μ_{1} - Σ_{1, 2} Σ_{2, 2}^{- 1} (u - μ_{2}), Σ_{1, 1; 2} (u), ν + n_{2}), \end{matrix}

(21)

where

\begin{matrix} Σ_{1, 1; 2} (u) = \frac{ν + {(u - μ_{2})}^{'} Σ_{2, 2}^{- 1} (u - μ_{2})}{ν + n_{2}} (Σ_{1, 1} - Σ_{1, 2} Σ_{2, 2}^{- 1} Σ_{2, 1}) . \end{matrix}

(22)

The key characteristic of

f_{U} (u)

is that it is a product of an

n_{2}

-dimensional Student’s t density and an

n_{1}

-dimensional cumulative Student’s t density with the variable

u

appearing as the main variable in the former and in the mean and variance parameters of the latter. As will be seen, the densities of cross sectional momentum returns will be a weighted sum of these SUT distributions.

3. Cross-Sectional Momentum Returns with Student’s $t$ Distributed Asset Returns

In this section, we derive the distributional properties of the cross sectional momentum (CSM) returns under the assumption that the underlying asset returns are multivariate Student’s t. We begin by recalling the mathematically precise definition of the CSM return from Kwon and Satchell (2018).

Let

0 < m_{+}, m_{-}, n \in N

such that

m_{+} + m_{-} \leq n

, and for each

1 \leq i \leq n

denote by

r_{i, t}

the return on asset i at time t. Moreover, let

\begin{matrix} r_{t} & = {(r_{1, t}, \dots, r_{n, t})}^{'} \in R^{n}, \end{matrix}

(23)

and for any

τ \in S_{n}

, define

\begin{matrix} r_{τ, m_{\pm}, t} & = \frac{1}{m_{+}} \sum_{i = 1}^{m_{+}} r_{τ_{i}, t} - \frac{1}{m_{-}} \sum_{i = 1}^{m_{-}} r_{τ_{n - m_{-} + i}, t} \in R, \end{matrix}

(24)

\begin{matrix} x_{τ_{i}, t} & = r_{τ_{i + 1}, t} - r_{τ_{i}, t} \in R, 1 \leq i \leq n - 1, \end{matrix}

(25)

\begin{matrix} x_{τ, t} & = {(x_{τ_{1}, t}, \dots, x_{τ_{n - 1}, t})}^{'} \in R^{n - 1}, \end{matrix}

(26)

\begin{matrix} z_{τ, t} & = {(r_{τ, m_{\pm}, t + 1}, x_{τ, t}^{'})}^{'} \in R^{n} . \end{matrix}

(27)

Note that any given

τ \in S_{n}

defines an ordering,

r_{t, τ_{1}} > r_{t, τ_{2}} > \dots > r_{t, τ_{n}}

, of the components of

r_{t}

. Thus,

r_{τ, m_{\pm}, t}

represents the return on a portfolio where the top

m_{+}

ranked assets are equally weighted and held long while the bottom

m_{-}

assets are equally weighted and held short. The assumption of equal weighting is for notational simplicity only, and not crucial for the general theoretical results. Note also that

x_{τ, t}

is defined to allow the ranking of the components of

r_{t}

corresponding to

τ \in S_{n}

to be written succinctly as

x_{τ, t} ≺ 0_{n - 1}

.

Definition 1.

The

(m_{+}, m_{-})

-cross sectional momentum return,

r_{m_{\pm}, t + 1}

, is defined by

\begin{matrix} r_{m_{\pm}, t + 1} = \sum_{τ \in S_{n}} I_{{x_{τ, t} ≺ 0_{n - 1}}} r_{τ, m_{\pm}, t + 1}, \end{matrix}

(28)

where

I_{A}

, for any subset

A \subset R^{n}

, denotes the indicator function on the set A.

For intuition behind the definition of

r_{m_{\pm}, t + 1}

, note that the components of

r_{t}

, representing asset returns over the ranking period, can be arranged in any of the

n!

orderings corresponding to the permutations

τ \in S_{n}

. For each such ranking

r_{t, τ_{1}} > r_{t, τ_{2}} > \dots > r_{t, τ_{n}}

, the

m_{+}

winner returns over the holding period are

r_{t + 1, τ_{1}}, \dots, r_{t + 1, τ_{m_{+}}}

while the

m_{-}

loser returns are

r_{t + 1, τ_{n - m_{+} 1}}, \dots, r_{t + 1, τ_{n}}

. Equally weighting the returns in the winner and the loser portfolios gives

r_{τ, m_{\pm}, t + 1}

, and since the ranking of components of

r_{t}

determined by

τ \in S_{n}

is equivalent to the condition

x_{τ, t} ≺ 0_{n - 1}

, summing over all possible

r_{τ, m_{\pm}, t + 1}

and prefixing by the matching indicator function gives the expression for

r_{m_{\pm}, t + 1}

in (28).

For the remainder of this paper, we make the following assumption on the distribution of

{(r_{t}^{'}, r_{t + 1})}^{'}

.

Assumption 1.

The vector of returns,

{(r_{t}^{'}, r_{t + 1}^{'})}^{'}

, is multivariate Student’s t distributed so that

\begin{matrix} [\begin{matrix} r_{t + 1} \\ r_{t} \end{matrix}] \sim {St}_{2 n} ([\begin{matrix} μ_{t + 1} \\ μ_{t} \end{matrix}], [\begin{matrix} Σ_{t + 1, t + 1} & Σ_{t + 1, t} \\ Σ_{t, t + 1} & Σ_{t, t} \end{matrix}], ν), \end{matrix}

(29)

with

ν \in R_{+}

,

μ_{u} \in R^{n}

, and

Σ_{u, v} \in R^{n \times n}

, where

u, v \in {t, t + 1}

.

Since the t-distribution is symmetric, there are limitations on the properties of asset returns that can be captured adequately by the above assumption as already discussed in Section 1. Nevertheless, the assumption is sufficiently general to accommodate linear factor models and econometric models such as vector autoregressive moving average models where the factors and noise terms, respectively, are t-distributed. Moreover, the framework also allows consideration of more general cases where

μ_{t}

and

μ_{t + 1}

are conditional means linear in factors without requiring the factors themselves to be multivariate t. However, since the analysis in later sections will show that momentum returns are nonlinear in the underlying asset returns, the common practise of regressing momentum returns on the various factors must be interpreted as a best linear prediction rather than a conditional expectation in such cases. We now derive the probability density function,

f_{m_{\pm}, t + 1} (r)

, of the CSM return

r_{m_{\pm}, t + 1}

.

Theorem 5.

Suppose that

{(r_{t}^{'}, r_{t + 1}^{'})}^{'}

satisfies Assumption 1. Then, the probability density function,

f_{m_{\pm}, t + 1} (r)

, of the cross sectional momentum return,

r_{m_{\pm}, t + 1}

, is given by

\begin{matrix} f_{m_{\pm}, t + 1} (r) = \sum_{τ \in S_{n}} t_{1} (r; ι^{'} P_{τ} μ_{t + 1}, ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι, ν) T_{n - 1} (0; λ_{τ} (r), Λ_{τ} (r), ν + 1), \end{matrix}

(30)

where

ι = {(1_{m_{+}}^{'}, 0_{n - m_{+} - m_{-}}^{'}, - 1_{n_{-}}^{'})}^{'}

,

\begin{matrix} λ_{τ} (r) & = D_{n} P_{τ} μ_{t} + \frac{(r - ι^{'} P_{τ} μ_{t + 1})}{ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι^{'}} D P_{τ} Σ_{t, t + 1} P_{τ}^{'} ι, \end{matrix}

(31)

\begin{matrix} \begin{matrix} Λ_{τ} (r) & = \frac{ν + {(ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι^{'})}^{- 1} {(r - ι^{'} P_{τ} μ_{t + 1})}^{2}}{ν + 1} \\ (D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'} - \frac{D_{n} P_{τ} Σ_{t, t + 1} P_{τ}^{'} ι ι^{'} P_{τ} Σ_{t + 1, t} P_{τ}^{'} D_{n}^{'}}{ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι^{'}}), \end{matrix} \end{matrix}

(32)

and we define

T_{0} \equiv 1

. Alternatively,

\begin{matrix} \begin{matrix} f_{m_{\pm}, t + 1} (r) & = \sum_{τ \in S_{n}} \int_{R_{-}^{n - 1}} t_{1} (r; γ_{τ} (x), Υ_{τ} (x), ν + n - 1) t_{n - 1} (x; D_{n} P_{τ} μ_{t}, D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'}, ν) d x, \end{matrix} \end{matrix}

(33)

where

\begin{matrix} γ_{τ} (x) & = ι^{'} P_{τ} μ_{t + 1} + ι^{'} P_{τ} Σ_{t + 1, t} P_{τ}^{'} D_{n}^{'} {(D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'})}^{- 1} (x - D_{n} P_{τ} μ_{t}), \end{matrix}

(34)

\begin{matrix} \begin{matrix} Υ_{τ} (x) & = \frac{ν + {(x - D_{n} P_{τ} μ_{t})}^{'} {(D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'})}^{- 1} (x - D_{n} P_{τ} μ_{t})}{ν + n - 1} \\ (ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι - ι^{'} P_{τ} Σ_{t + 1, t} P_{τ}^{'} D_{n}^{'} {(D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'})}^{- 1} D_{n} P_{τ} Σ_{t, t + 1} P_{τ}^{'} ι) . \end{matrix} \end{matrix}

(35)

Proof.

Refer to Appendix B. □

Note that the summands that appear in the pdf of the CSM return in (30) have the characteristic form of the SUT densities given in (21) other than for the omission of the normalization factor2 that appears in the denominator of (21). It follows that pdf of the CSM return is a weighted sum of the SUT densities. The next result gives the pdf in the special case where

r_{t}

and

r_{t + 1}

are independent, which can be considered as the case where the market is efficient.

Corollary 3.

If

{(r_{t}^{'}, r_{t + 1}^{'})}^{'}

satisfies Assumption 1, and

r_{t}

and

r_{t + 1}

are independent, then

\begin{matrix} f_{m_{\pm}, t + 1} (r) = \sum_{τ \in S_{n}} t_{1} (r; ι^{'} P_{τ} μ_{t + 1}, ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι, ν) T_{n - 1} (0; D_{n} P_{τ} μ_{t}, Λ_{τ}^{\circ} (r), ν + 1), \end{matrix}

(36)

where

\begin{matrix} Λ_{τ}^{\circ} (r) & = \frac{ν + {(ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι^{'})}^{- 1} {(r - ι^{'} P_{τ} μ_{t + 1})}^{2}}{ν + 1} D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'}, \end{matrix}

(37)

Proof.

Follows from Theorem 5 since

Σ_{t, t + 1} = O_{n \times n} = Σ_{t + 1, t}

in this case. □

We next derive the expressions for the non-central moments of the CSM returns. Since t distributions do not have moments of all orders as noted in Theorem 4, the moments of CSM returns will also only exist up to a certain order.

Theorem 6.

Suppose

{(r_{t}^{'}, r_{t + 1}^{'})}^{'}

satisfies Assumption 1, and let

m \in N

such that

m < ν

. Then, the m-th non-central moment of

r_{m_{\pm}, t + 1}

is given by

\begin{matrix} \begin{matrix} μ_{m} (r_{m_{\pm}, t + 1}) & = \sum_{τ \in S_{n}} \sum_{\begin{matrix} k = 0 \\ k even \end{matrix}}^{m} (\binom{m}{k}) (k - 1)!! \frac{Γ (\frac{1}{2} (ν + n - 1 - k))}{Γ (\frac{1}{2} (ν + n - 1))} {(\frac{ν + n - 1}{2})}^{\frac{k}{2}} \\ \int_{R_{-}^{n - 1}} γ_{τ}^{m - k} (x) Υ_{τ}^{\frac{k}{2}} (x) t_{n - 1} (x; D_{n} P_{τ} μ_{t}, D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'}, ν) d x, \end{matrix} \end{matrix}

(38)

where

γ_{τ} (x)

and

Υ_{τ} (x)

are as defined in (34) and (35), respectively.

Proof.

Refer to Appendix C. □

4. Special Case of Two Assets

In this section, we examine in detail the special case of two assets, and begin by computing the partial moments of one-dimensional Student’s t distributions that will be required. To reduce notational burden, we define for

η \in R

,

ς \in R_{+}

, and

ν \in R_{+}

\begin{matrix} c (η, ς^{2}, ν) = \frac{Γ (\frac{ν + 1}{2})}{ς \sqrt{ν π} Γ (\frac{ν}{2})}, \end{matrix}

(39)

so that from (13) we have

\begin{matrix} t_{1} (x; η, ς^{2}, ν) = c (η, ς^{2}, ν) {(1 + \frac{1}{ν} {(\frac{x - η}{ς})}^{2})}^{- \frac{1}{2} (ν + 1)} . \end{matrix}

(40)

Lemma 1.

Let

η \in R

,

ς \in R_{+}

and

2 < ν \in R_{+}

. Then, for

m \in N_{+}

, we have

\begin{matrix} \begin{matrix} {(\frac{x - η}{ς})}^{m} t_{1} (x; η, ς^{2}, ν) & = - ς \sqrt{\frac{ν}{ν - 2}} {(\frac{x - η}{ς})}^{m - 1} \frac{\partial t_{1} (x; η, ς^{2} ν / (ν - 2), ν - 2)}{\partial x} . \end{matrix} \end{matrix}

(41)

Proof.

Refer to Appendix D. □

The next theorem will play a key role in the derivation of the non-central moments of the CSM returns.

Theorem 7.

For any

η \in R

,

ς \in R_{+}

,

ν \in R_{+}

, and

m \in N

such that

m < ν

, let

\begin{matrix} κ_{m} (η, ς^{2}, ν) = \int_{- \infty}^{0} {(\frac{x - η}{ς})}^{m} t_{1} (x; η, ς^{2}, ν) d x . \end{matrix}

(42)

Then,

κ_{0} (η, ς^{2}, ν) = T_{1} (0; η, ς^{2}, ν)

,

κ_{1} (η, ς^{2}, ν) = - ς \sqrt{ν / (ν - 2)} t_{1} (0; η, ς^{2} ν / (ν - 2), ν - 2)

, and

\begin{matrix} κ_{m} (η, ς^{2}, ν) = (m - 1) {(\frac{ν}{ν - 2})}^{\frac{1}{2} (m - 1)} κ_{m - 2} (η, \frac{ς^{2} ν}{ν - 2}, ν - 2) \end{matrix}

(43)

for

2 \leq m < ν

. More explicitly, if m is even

\begin{matrix} κ_{m} (η, ς^{2}, ν) = \frac{(m - 1)!! ν^{\frac{1}{2} (m - 1)}}{(ν - 2) (ν - 4) \dots (ν - m + 2) \sqrt{ν - m}} T_{1} (0; η, \frac{ς^{2} ν}{ν - m}, ν - m), \end{matrix}

(44)

and if m is odd

\begin{matrix} \begin{matrix} κ_{m} (η, ς^{2}, ν) \\ = \frac{- ς (m - 1)!! ν^{\frac{1}{2} (m - 1)}}{(ν - 2) (ν - 4) \dots (ν - (m - 1)) \sqrt{ν - (m + 1)}} t_{1} (0; η, \frac{ς^{2} ν}{ν - (m + 1)}, ν - (m + 1)) . \end{matrix} \end{matrix}

(45)

Proof.

Firstly, the expression for

κ_{0} (η, ς^{2}, ν)

follows directly from the definition of

κ_{m} (η, ς^{2}, ν)

, and, for

κ_{1} (η, ς^{2}, ν)

, we obtain on setting

m = 1

in Lemma 1 that

\begin{matrix} κ_{1} (η, ς^{2}, ν) & = - ς \sqrt{\frac{ν}{ν - 2}} \int_{- \infty}^{0} \frac{\partial t_{1} (x; η, ς^{2} ν / (ν - 2), ν - 2)}{\partial x} d x \\ = - ς \sqrt{\frac{ν}{ν - 2}} t_{1} (0; η, \frac{ς^{2} ν}{ν - 2}, ν - 2) . \end{matrix}

Next, for (43), using Lemma 1 and applying integration by parts gives

\begin{matrix} κ_{m} (η, ς^{2}, ν) & = - ς \sqrt{\frac{ν}{ν - 2}} \int_{x \in R_{-}} {(\frac{x - η}{ς})}^{m - 1} \frac{\partial t_{1} (x; η, ς^{2} ν / (ν - 2), ν - 2)}{\partial x} d x \\ = (m - 1) \sqrt{\frac{ν}{ν - 2}} \int_{x \in R_{-}} {(\frac{x - η}{ς})}^{m - 2} t_{1} (x; η, \frac{ς^{2} ν}{ν - 2}, ν - 2) d x \\ = (m - 1) {(\frac{ν}{ν - 2})}^{\frac{1}{2} (m - 1)} \int_{x \in R_{-}} {(\frac{x - η}{ς \sqrt{ν / (ν - 2)}})}^{m - 2} t_{1} (x; η, \frac{ς^{2} ν}{ν - 2}, ν - 2) d x \\ = (m - 1) {(\frac{ν}{ν - 2})}^{\frac{1}{2} (m - 1)} κ_{m - 2} (η, \frac{ς^{2} ν}{ν - 2}, ν - 2), \end{matrix}

which is (43). The explicit expressions for the odd and even cases follow by induction. □

As it will be seen, the quantities that play a key role in the two asset case are the spreads,

r_{t, 2} - r_{t, 1}

and

r_{t + 1, 2} - r_{t + 1, 1}

, and so we define

\begin{matrix} η_{u} & = μ_{u, 2} - μ_{u, 1}, σ_{u, i}^{2} = var (r_{u, i}), \\ ρ_{u} & = \frac{cov (r_{u, 1}, r_{u, 2})}{σ_{u, 1} σ_{u, 2}}, ρ_{i, j} = \frac{cov (r_{t, i}, r_{t + 1, j})}{σ_{t, i} σ_{t + 1, j}}, \\ ς_{u}^{2} & = σ_{u, 1}^{2} + σ_{u, 2}^{2} - 2 ρ_{u} σ_{u, 1} σ_{u, 2}, \\ ς_{t, t + 1} & = ρ_{1, 1} σ_{t, 1} σ_{t + 1, 1} + ρ_{2, 2} σ_{t, 2} σ_{t + 1, 2} - ρ_{1, 2} σ_{t, 1} σ_{t + 1, 2} - ρ_{2, 1} σ_{t, 2} σ_{t + 1, 1}, \\ ϱ_{t, t + 1} & = \frac{ς_{t, t + 1}}{ς_{t} ς_{t + 1}}, \end{matrix}

where

1 \leq i, j \leq 2

and

u \in {t, t + 1}

. Note that

ς_{u}^{2}

is the variance of the spread

r_{u, 2} - r_{u, 1}

, and

ϱ_{t, t + 1}

is the correlation between

r_{t, 2} - r_{t, 1}

and

r_{t + 1, 2} - r_{t + 1, 1}

. Next, we compute the terms

γ_{τ} (x)

and

Υ_{τ} (x)

that appear in the expression (33) for the pdf of the CSM return. For

u, v \in {t, t + 1}

and

τ \in S_{2}

, we have

\begin{matrix} ι^{'} P_{(1, 2)} μ_{u} & = - η_{u} = - ι^{'} P_{(2, 1)} μ_{u}, \\ D_{2} P_{(1, 2)} μ_{u} & = η_{u} = - D_{2} P_{(2, 1)} μ_{u}, \\ D_{2} P_{τ} Σ_{u, u} P_{τ}^{'} D_{2}^{'} & = ι^{'} P_{τ} Σ_{u, u} P_{τ}^{'} ι = ς_{u}^{2}, \\ ι^{'} P_{τ} Σ_{u, v} P_{τ}^{'} D_{2}^{'} & = - ς_{u, v}, \end{matrix}

and so

\begin{matrix} γ_{(1, 2)} (x) & = - η_{t + 1} - \frac{ς_{t, t + 1}}{ς_{t}^{2}} (x - η_{t}), \\ γ_{(2, 1)} (x) & = η_{t + 1} - \frac{ς_{t, t + 1}}{ς_{t}^{2}} (x + η_{t}), \\ Υ_{(1, 2)} (x) & = \frac{ν + ς_{t}^{- 2} {(x - η_{t})}^{2}}{ν + 1} (ς_{t + 1}^{2} - \frac{ς_{t, t + 1}^{2}}{ς_{t}^{2}}) = \frac{ς_{t + 1}^{2} (1 - ϱ_{t, t + 1}^{2})}{ν + 1} (ν + {(\frac{x - η_{t}}{ς_{t}})}^{2}), \\ Υ_{(2, 1)} (x) & = \frac{ν + ς_{t}^{- 2} {(x + η_{t})}^{2}}{ν + 1} (ς_{t + 1}^{2} - \frac{ς_{t, t + 1}^{2}}{ς_{t}^{2}}) = \frac{ς_{t + 1}^{2} (1 - ϱ_{t, t + 1}^{2})}{ν + 1} (ν + {(\frac{x + η_{t}}{ς_{t}})}^{2}) . \end{matrix}

If we define the sign of permutations in

S_{2}

by

ε ((1, 2)) = 1

and

ε ((2, 1)) = - 1

, then the expressions for

γ_{τ} (x)

and

Υ_{τ} (x)

can be written succinctly as

\begin{matrix} γ_{τ} (x) & = - ε (τ) η_{t + 1} - ϱ_{t, t + t} ς_{t + 1} (\frac{x - ε (τ) η_{t}}{ς_{t}}), \end{matrix}

(46)

\begin{matrix} Υ_{τ} (x) & = \frac{ς_{t + 1}^{2} (1 - ϱ_{t, t + 1}^{2})}{ν + 1} (ν + {(\frac{x - ε (τ) η_{t}}{ς_{t}})}^{2}) . \end{matrix}

(47)

The next lemma will provide the building blocks for the non-central moments of CSM returns.

Lemma 2.

Let

γ_{τ} (x)

and

Υ_{τ} (x)

be as defined in (46) and (47) respectively, where

τ \in S_{2}

. Then, for

α, β \in N

such that

α + 2 β < ν

, we have

\begin{matrix} \begin{matrix} \int_{- \infty}^{0} γ_{τ}^{α} (x) Υ_{τ}^{β} (x) t_{1} (x; ε (τ) η_{t}, ς_{t}^{2}, ν) d x \\ = \frac{{(- 1)}^{α} ς_{t + 1}^{2 β} {(1 - ρ_{t, t + 1}^{2})}^{β}}{{(ν + 1)}^{β}} \sum_{i = 0}^{α} \sum_{j = 0}^{β} (\binom{α}{i}) (\binom{β}{j}) ε^{α - i} (τ) η_{t + 1}^{α - i} ϱ_{t, t + 1}^{i} ς_{t + 1}^{i} ν^{β - j} κ_{i + 2 j} (ε (τ) η_{t}, ς_{t}^{2}, ν), \end{matrix} \end{matrix}

(48)

where

κ_{m} (η, ς^{2}, ν)

is as defined in (42).

Proof.

Follows from using the binomial formula to expand the powers of

γ_{τ} (x)

and

Υ_{τ} (x)

, and applying the definition of

κ_{m} (η, ς^{2}, ν)

. □

For notational convenience, given any

τ \in S_{2}

,

ν \in R_{+}

and

α, β \in N

such that

α + 2 β < ν

, we define

\begin{matrix} κ_{α, β, τ} (η, ς, ν) = \int_{- \infty}^{0} γ_{τ}^{α} (x) Υ_{τ}^{β} (x) t_{1} (x; ε (τ) η, ς^{2}, ν) d x, \end{matrix}

(49)

and note that

κ_{α, β, τ} (η, ς, ν)

can be computed explicitly using (48). We now present the non-central moments of the CSM return

r_{1_{\pm}, t + 1}

.

Theorem 8.

Let

η \in R

,

ς \in R_{+}

and

ν \in R_{+}

. Then, for

m \in N_{+}

such that

m < ν

, the m-th non-central moment,

μ_{m} (r_{1_{\pm}, t + 1})

, of

r_{1_{\pm}, t + 1}

is given as follows:

\begin{matrix} μ_{m} (r_{1_{\pm}, t + 1}) = \sum_{τ \in S_{2}} \sum_{\begin{matrix} k = 0 \\ k even \end{matrix}}^{m} (\binom{m}{k}) (k - 1)!! \frac{Γ (\frac{1}{2} (ν - k + 1))}{Γ (\frac{1}{2} (ν + 1))} {(\frac{ν + 1}{2})}^{\frac{k}{2}} κ_{m - k, \frac{k}{2}, τ} (η_{t}, ς_{t}, ν), \end{matrix}

(50)

where

κ_{α, β, τ} (η, ς, ν)

is as defined in (49). In particular, the first four non-central moments are

\begin{matrix} μ_{1} (r_{1_{\pm}, t + 1}) & = \sum_{τ \in S_{2}} κ_{1, 0, τ} (η_{t}, ς_{t}, ν), \end{matrix}

(51)

\begin{matrix} μ_{2} (r_{1_{\pm}, t + 1}) & = \sum_{τ \in S_{2}} (κ_{2, 0, τ} (η_{t}, ς_{t}, ν) + \frac{(ν + 1) κ_{0, 1, τ} (η_{t}, ς_{t}, ν)}{ν - 1}), \end{matrix}

(52)

\begin{matrix} μ_{3} (r_{1_{\pm}, t + 1}) & = \sum_{τ \in S_{2}} (κ_{3, 0, τ} (η_{t}, ς_{t}, ν) + \frac{3 (ν + 1) κ_{1, 1, τ} (η_{t}, ς_{t}, ν)}{ν - 1}), \end{matrix}

(53)

\begin{matrix} μ_{4} (r_{1_{\pm}, t + 1}) & = \sum_{τ \in S_{2}} (κ_{4, 0, τ} (η_{t}, ς_{t}, ν) + \frac{6 (ν + 1) κ_{2, 1, τ} (η_{t}, ς_{t}, ν)}{ν - 1} + \frac{3 {(ν + 1)}^{2} κ_{0, 2, τ} (η_{t}, ς_{t}, ν)}{(ν - 1) (ν - 3)}) . \end{matrix}

(54)

Proof.

Follows from the general expression (38) for the moments of

r_{1_{\pm}, t + 1}

and the definition of

κ_{α, β, τ} (η, ς, ν)

. □

We remark that the non-central moments of

r_{1_{\pm}, t + 1}

given in Theorem 8 are sums indexed by

S_{2}

that consists of two elements, and that each term that appears in these sums can be computed recursively using (43), (48), and (49). Since the right-hand side of (48) consists of a finite number of terms and (43) is equivalent to the explicit expressions (44) or (45) depending on the index m, these moments of

r_{1_{\pm}, t + 1}

can be computed without having to make any simplifying approximations. For example, the first moment is given explicitly by

\begin{matrix} μ_{1} (r_{\pm 1, t + 1}) & = η_{t + 1} (2 T_{1} (0; - η_{t}, ς_{t}^{2}, ν) - 1) + 2 ρ_{t, t + 1} ς_{t + 1} \sqrt{\frac{ν}{ν - 2}} t_{1} (0; η_{t}, \frac{ν ς_{t}^{2}}{ν - 2}, ν), \end{matrix}

which is reassuring since it has the same functional form as the following expression3

\begin{matrix} μ_{1}^{🟉} (r_{\pm 1, t + 1}) = η_{t + 1} (2 Φ (\frac{η_{t}}{ς_{t}}) - 1) + 2 ϱ_{t, t + t} ς_{t + 1} ϕ (\frac{η_{t}}{ς_{t}}) \end{matrix}

obtained for the normally distributed asset return case in Kwon and Satchell (2018) except for the distribution functions being Student’s t rather than normal. The numerical calculations in this paper were performed using code written in C++ that relied on the boost library4 to compute the functions

t_{1}

and

T_{1}

.

Returning briefly to the linear factor structure discussed in Section 1, we could consider

μ_{t, 1}

and

μ_{t, 2}

to be a linear combinations of factors, which in a Carhart (1997) model context would consist of size, market, value, and momentum. Thus, if we were to go long asset 1 and short asset 2 in our CSM momentum portfolio, we might expect a larger exposure to the momentum factor for asset 1 and a smaller exposure for asset 2. We could carry out further detailed analysis to accommodate these features but leave this for further research.

If we denote by

μ_{1_{\pm}, t + 1}

the mean,

σ_{1_{\pm}, t + 1}^{2}

the variance,

γ_{1_{\pm}, t + 1}

the skewness, and

κ_{1_{\pm}, t + 1}

the excess kurtosis of the CSM return, then these quantities are easily computed from the non-central moments given in Theorem 8. It should be noted that the quantities corresponding to the odd moments, viz.

μ_{1_{\pm}, t + 1}

and

γ_{1_{\pm}, t + 1}

are approximately odd as functions of

ϱ_{t, t + 1}

, and those associated with the even moments, viz.

σ_{1_{\pm}, t + 1}^{2}

and

κ_{1_{\pm}, t + 1}

, are approximately even as functions of

ϱ_{t, t + 1}

. This is because the return from a portfolio formed by taking a long position in the loser and a short position in the winner when

ϱ_{t, t + 1} < 0

would have the same distributional properties as the return from taking the opposite positions when

ϱ_{t, t + 1} > 0

.

In the analysis that follows, we assume that the asset returns are stationary in order to reduce the number of parameters. Moreover, we have set

μ_{t, 1} = 6 %

,

μ_{t, 2} = 4 %

,

σ_{t, 1}^{2} = 0.18 (ν - 2) / ν

and

σ_{t, 2}^{2} = 0.14 (ν - 2) / ν

, where the asset variances have been scaled by a factor dependent on

ν

to ensure that they are independent of the number of degrees of freedom. It should be noted that the cross-sectional correlation,

ρ_{t}

, then determines the variance,

ς_{t}^{2}

, of the spread,

r_{t, 2} - r_{t, 1}

.

The mean of the CSM return is shown in Figure 1 as a function of the degrees of freedom

ν

, and the spread autocorrelation,

ϱ_{t, t + 1}

, for

ρ_{t} = - 0.9

,

ρ_{t} = 0.0

, and

ρ_{t} = 0.9

. As expected,

μ_{1_{\pm}, t + 1}

is an increasing function of

ϱ_{t, t + 1}

. For

ϱ_{t, t + 1} > 0

, the mean decreases slightly with

ν

, while the opposite is the case when

ϱ_{t, t + 1} < 0

. In the region

ϱ_{t, t + 1} \approx 0

, the mean is slightly positive. Since this is the region corresponding to small autocorrelations in the underlying asset returns, and the situation most commonly observed in practice, it is reassuring that the small positive CSM returns implied by the model is consistent with the findings reported in the empirical literature. Moreover, we see that the degrees of freedom parameter,

ν

, that controls the kurtosis of asset returns has very little impact on

r_{1_{p m}, t + 1}

. Interpreting small

ν

as representative of assets from emerging markets, this is consistent with findings from Rouwenhorst (1999) and Bekaert et al. (1997) that, although there is evidence of momentum in emerging markets, it is not significantly different to those observed in developed markets, despite the assets from the respective markets having different distributional properties. The surface flattens out as the cross-sectional correlation increases from

- 0.9

to

0.9

. Since an increase in

ρ_{t}

corresponds to a decrease in the variance,

ς_{t}^{2}

, of the spread

r_{t, 2} - r_{t, 1}

, this behavior is consistent with the positive relationship usually associated with risk and return in finance.

The standard deviation of the CSM return as a function of

ν

and

ϱ_{t, t + 1}

is shown in Figure 2. Although not clearly evident from the figure,

σ_{1_{\pm}, t + 1}

is a decreasing function of

ν

, and for a fixed value of

ν

the standard deviation is convex in

ϱ_{t, t + 1}

and takes the maximum value at

ϱ_{t, t + 1} = 0

. Finally, since the variance of the spread decreases as

ρ_{t}

increases,

σ_{1_{\pm}, t + 1}

likewise decreases with increasing

ρ_{t}

.

The skewness of the CSM return in Figure 3 shows that

γ_{1_{\pm}, t + 1}

is negative when

ϱ_{t, t + 1} < 0

and positive otherwise. In fact, although it is not clearly evident from the figure,

γ_{1_{\pm}, t + 1}

is negative even for small positive values of

ϱ_{t, t + 1}

. As discussed above, the autocorrelations in the asset returns tend to be small in practice, and hence

ϱ_{t, t + 1}

will also be small. The corresponding model implied skewness in the CSM return will then be slightly negative, which is consistent with the observations in the empirical literature.

In contrast to the surfaces for other quantities that flatten to a large extent as

ρ_{t}

increases from

- 0.9

to

0.9

, the surface for

κ_{1_{\pm}, t + 1}

in Figure 4 remains relatively unchanged. The excess kurtosis of CSM returns is generally positive, in line with the findings reported in the literature, and increases significantly when

ν

is small and

|ϱ_{t, t + 1}|

is high. It should be noted that

κ_{1_{\pm}, t + 1}

is largest when

ν

is small. Since the deviation of the Student’s t distribution from the normal is greatest when

ν

is small, it follows that the extension considered in this paper will be useful in situations where the observed kurtosis in the CSM returns is higher than the value implied under the assumption of normal asset returns. This would be the case, for example, when considering emerging markets.

5. Conclusions

In this paper, the theoretical framework introduced in Kwon and Satchell (2018) was extended to investigate the distributional properties of cross-sectional momentum (CSM) returns under the assumption that the vector of asset returns were multivariate Student’s t. The probability density function and the moments of the CSM returns were derived, and investigated in detail for the special case of two assets.

It was found that, in situations where the assets return, and hence the return spread, autocorrelations are small, and the CSM return has a small positive mean, negative skewness, and excess kurtosis. These are all consistent with the findings reported in the empirical literature. Moreover, the skewness and the kurtosis both become more pronounced as the number of degrees of freedom in the Student’s t distribution decreases and the corresponding asset returns become less normal.

In modeling asset returns that deviate significantly from being normal, such as those for emerging markets, the extension to the Student’s t considered in this paper would address some of the limitations of assuming normality. Since the Student’s t distribution approaches the normal in the limit as the number of degrees of freedom approaches infinity, the extension also provides a framework under which to analyze the implication and the limitations of the assumption of normality in asset returns to CSM returns.

Author Contributions

Conceptualization, O.K.K. and S.S.; methodology, O.K.K. and S.S; software, O.K.; validation, O.K.K. and S.S.; investigation, O.K.K. and S.S.; writing–original draft preparation, O.K.K. and S.S.; writing–review and editing, O.K.K. and S.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 4

From the discussion in Section 2.3, see also Roth (2013) Appendix A.4, we have that

X \sim {St}_{n} (μ, Σ, ν)

can be decomposed as

\begin{matrix} X \overset{d}{\sim} μ + \frac{1}{\sqrt{U}} V, \end{matrix}

where

U \sim Gam (\frac{ν}{2}, \frac{ν}{2})

,

V \sim N_{n} (0, Σ)

, and U and

V

are independent. Now, from (9), we obtain

\begin{matrix} μ_{p} (X) = \sum_{P \in P_{m}} \prod_{k \in P} μ_{p_{k}} E [U^{- \frac{1}{2} | P^{c} |} \prod_{l \in P^{c}} V_{p_{l}}], \end{matrix}

where

P^{c} = {1, 2, \dots, m} \ P

denotes the complement of P, and since U and

X

are independent,

\begin{matrix} μ_{p} (X) = \sum_{P \in P_{m}} \prod_{k \in P} μ_{p_{k}} E [U^{- \frac{1}{2} | P^{c} |}] E [\prod_{l \in P^{c}} V_{p_{l}}] . \end{matrix}

The second expectation term can be computed using (10), and so it remains to compute the first expectation term. However, from the properties of Gamma distributions, we have

\begin{matrix} E [U^{- \frac{1}{2} | P^{c} |}] = {(\frac{ν}{2})}^{\frac{1}{2} | P^{c} |} \frac{Γ (\frac{ν}{2} - \frac{1}{2} | P^{c} |)}{Γ (\frac{ν}{2})} \end{matrix}

provided

| P^{c} | < ν

, and so substituting into the expression for

μ_{p} (X)

gives (18).

Appendix B. Proof of Theorem 5

For any

k \in N

, let

1_{k} = (1, 1, \dots, 1) \in R^{k}

, and define

ι \in R^{n}

by

ι = {(1_{m_{+}}^{'}, 0_{n - n_{+} - n_{-}}^{'}, - 1_{n_{-}}^{'})}^{'}

, so that, for any

τ \in S_{n}

, we have that

r_{τ, m_{\pm}, t + 1} = ι^{'} P_{τ} r_{t + 1}

. Then, since

\begin{matrix} [\begin{matrix} r_{τ, m_{\pm}, t + 1} \\ x_{τ, t} \end{matrix}] = [\begin{matrix} ι^{'} P_{τ} & O_{1 \times n} \\ O_{(n - 1) \times n} & D_{n} P_{τ} \end{matrix}] [\begin{matrix} r_{t + 1} \\ r_{t} \end{matrix}], \end{matrix}

we have that

{(r_{τ, m_{\pm}, t + 1}, x_{τ, t}^{'})}^{'}

is a linear transformation of

{(r_{t + 1}^{'}, r_{t}^{'})}^{'}

, and so it follows from Roth (2013) Equation (4.1) that

\begin{matrix} [\begin{matrix} r_{τ, m_{\pm}, t + 1} \\ x_{τ, t} \end{matrix}] \sim {St}_{n} ([\begin{matrix} ι^{'} P_{τ} μ_{t + 1} \\ D_{n} P_{τ} r_{t} \end{matrix}], [\begin{matrix} ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι & ι^{'} P_{τ} Σ_{t + 1, t} P_{τ}^{'} D_{n}^{'} \\ D_{n} P_{τ} Σ_{t, t + 1} P_{τ}^{'} ι & D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'} \end{matrix}], ν) . \end{matrix}

Now, the joint pdf,

f_{r_{τ, m_{\pm}, t + 1}, x_{τ, t}} (r, x)

, is given by

\begin{matrix} f_{r_{τ, m_{\pm}, t + 1}, x_{τ, t}} (t, x) = f_{x_{τ, t} ∣ r_{τ, m_{\pm}, t + 1}} (x | r) f_{r_{τ, m_{\pm}, t + 1}} (r) . \end{matrix}

However, from Theorem 3, we have

f_{x_{τ, t} ∣ r_{τ, m_{\pm}, t + 1}} (x | r) = t_{n - 1} (x; λ_{τ} (r), Λ_{τ} (r), ν + 1)

, and so

\begin{matrix} f_{r_{τ, m_{\pm}, t + 1}, x_{τ, t}} (t, x) = t_{n - 1} (x; λ_{τ} (r), Λ_{τ} (r), ν + 1) t_{1} (r; ι^{'} P_{τ} μ_{t + 1}, ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι, ν) . \end{matrix}

If we define

D_{τ} = {x_{τ} ≺ 0}

, then

f_{m_{\pm}, t + 1 ∣ D_{τ}} = I_{D_{τ}} f_{r_{τ, m_{\pm}, t + 1}}

, and so

\begin{matrix} f_{m_{\pm}, t + 1 ∣ D_{τ}} (r) & = \int_{x \in R^{n - 1}} I_{D_{τ}} (x) f_{r_{τ, m_{\pm}, t + 1}, x_{τ, t}} (t, x) d x \\ = \int_{x ≺ 0} t_{n - 1} (x; λ_{τ} (r), Λ_{τ} (r), ν + 1) t_{1} (r; ι^{'} P_{τ} μ_{t + 1}, ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι, ν) d x \\ = t_{1} (r; ι^{'} P_{τ} μ_{t + 1}, ι^{'} P_{τ} Σ_{t + 1, t + 1} P_{τ}^{'} ι, ν) T_{n - 1} (0; λ_{τ, r}, Λ_{τ, r}, ν + 1), \end{matrix}

and summing over

τ \in S_{n}

gives (30). The alternative expression (33) for

f_{m_{\pm}, t + 1} (r)

follows from a similar argument using the decomposition

f_{r_{τ, m_{\pm}, t + 1}, x_{τ, t}} (t, x) = f_{r_{τ, m_{\pm}, t + 1} ∣ x_{τ, t}} (r | x) f_{x_{τ, t}} (x)

.

Appendix C. Proof of Theorem 6

In view of the expression for

f_{m_{\pm}, t + 1} (r)

in (33), we need to compute terms of the form

\begin{matrix} I_{m, τ} = \int_{R} \int_{R_{-}^{n - 1}} r^{m} t_{1} (r; γ_{τ} (x), Υ_{τ} (x), ν + n - 1) t_{n - 1} (x; D_{n} P_{τ} μ_{t}, D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'}, ν) d x d r \end{matrix}

for

τ \in S_{n}

. Now, since a one-dimensional Student’s t distribution with

ν - n - 1

degrees of freedom has finite moments for orders less than the number of degrees of freedom, and by assumption

m < ν \leq ν + n - 1

, the m-th moment of

t_{1} (r; λ_{τ} (x), Υ_{τ} (x), ν + n - 1)

exists. Next, interchanging the order of integration gives

\begin{matrix} I_{m, τ} = \int_{R_{-}^{n - 1}} t_{n - 1} (x; D_{n} P_{τ} μ_{t}, D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'}, ν) \int_{R} r^{m} t_{1} (r; γ_{τ} (x), Υ_{τ} (x), ν + n - 1) d r d x, \end{matrix}

and applying Corollary 2 to the inner integral gives

\begin{matrix} \begin{matrix} I_{m, τ} & = \sum_{\begin{matrix} k = 0 \\ k even \end{matrix}}^{m} (\binom{m}{k}) (k - 1)!! \frac{Γ (\frac{1}{2} (ν + n - 1 - k))}{Γ (\frac{1}{2} (ν + n - 1))} {(\frac{ν + n - 1}{2})}^{\frac{k}{2}} \\ \int_{R_{-}^{n - 1}} γ_{τ}^{m - k} (x) Υ_{τ}^{\frac{k}{2}} (x) t_{n - 1} (x; D_{n} P_{τ} μ_{t}, D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'}, ν) d x . \end{matrix} \end{matrix}

Since

γ_{τ} (x)

and

Υ_{τ} (x)

are of orders 1 and 2 respectively in

x

, we have

γ_{τ}^{m - k} (x) Γ_{τ}^{\frac{k}{2}} (x)

is of order m in

x

, and since

t_{n - 1} (x; D_{n} P_{τ} μ_{t}, D_{n} P_{τ} Σ_{t, t} P_{τ}^{'} D_{n}^{'}, ν)

has finite moments of order up to

ν

by Theorem 4, it follows that the integral in

I_{m, τ}

is well-defined. Summing the

I_{m, τ}

over

τ \in S_{n}

gives (38).

Appendix D. Proof of Lemma 1

Setting the scale parameter to

ς \sqrt{ν / (ν - 2)}

and the degrees of freedom to

ν - 2

in (40) gives

\begin{matrix} t_{1} (x; η, ς^{2} ν / (ν - 2), ν - 2) = c (η, ς^{2} ν / (ν - 2), ν - 2) {(1 + \frac{1}{ν} {(\frac{x - η}{ς})}^{2})}^{- \frac{1}{2} (ν - 1)}, \end{matrix}

and differentiating both sides with respect to x gives

\begin{matrix} \frac{\partial t_{1} (x; η, ς^{2} ν / (ν - 2), ν - 2)}{\partial x} & = - c (η, \frac{ς^{2} ν}{ν - 2}, ν - 2) \frac{(ν - 1)}{ν ς} (\frac{x - η}{ς}) {(1 + \frac{1}{ν} {(\frac{x - η}{ς})}^{2})}^{- \frac{1}{2} (ν + 1)} \\ = - \frac{c (η, ς^{2} ν / (ν - 2), ν - 2) (ν - 1)}{c (η, ς^{2}, ν) ν ς} (\frac{x - η}{ς}) t_{1} (x; η, ς^{2}, ν) . \end{matrix}

(A1)

Now, using the definition of

c (η, ς, ν)

, we have

\begin{matrix} \frac{c (η, ς^{2} ν / (ν - 2), ν - 2) (ν - 1)}{c (η, ς^{2}, ν) ν ς} & = \frac{Γ (\frac{ν - 1}{2})}{ς \sqrt{(ν - 2) π} Γ (\frac{ν - 2}{2})} \cdot \frac{ς \sqrt{ν π} Γ (\frac{ν}{2})}{Γ (\frac{ν + 1}{2})} \cdot \frac{(ν - 1)}{ν ς}, \end{matrix}

and using the property

Γ (z + 1) = z Γ (z)

, we obtain

\begin{matrix} \frac{c (η, ς^{2} ν / (ν - 2), ν - 2) (ν - 1)}{c (η, ς^{2}, ν) ν ς} = \sqrt{\frac{ν}{ν - 2}} \cdot \frac{\frac{1}{2} (ν - 2)}{\frac{1}{2} (ν - 1)} \cdot \frac{(ν - 1)}{ν ς} = \frac{1}{ς} \sqrt{\frac{ν - 2}{ν}} . \end{matrix}

Substituting back into (A1) and rearranging gives (41) for

m = 1

, and the general case follows from multiplying both sides of the identity for the

m = 1

case by powers of

(x - η) / ς

.

References

Arellano-Valle, Reinaldo B., and Adelchi Azzalini. 2006. On the Unification of Families of Skew-normal Distributions. Scandinavian Journal of Statistics 33: 561–74. [Google Scholar] [CrossRef]
Arellano-Valle, Reinaldo B., and Marc G. Genton. 2010. Multivariate unified skew-elliptical distributions. Chilean Journal of Statistics 1: 17–33. [Google Scholar]
Asness, Clifford S. 1994. Variables that Explain Stock Returns. Ph.D. dissertation, University of Chicago, Chicago, IL, USA. [Google Scholar]
Asness, Clifford S., John M. Liew, and Ross L. Stevens. 1997. Parallels between the cross-sectional predictability of stock and country returns. Journal of Portfolio Management 23: 79–87. [Google Scholar] [CrossRef] [Green Version]
Asness, Clifford S., Tobias J. Moskowitz, and Lasse H. Pedersen. 2013. Value and momentum everywhere. The Journal of Finance 58: 929–85. [Google Scholar] [CrossRef] [Green Version]
Azzalini, Adelchi, and Alessandra Dalla Valle. 1985. The multivariate skew-normal distribution. Biometrika 83: 715–26. [Google Scholar] [CrossRef]
Bekaert, Geert, Claude B. Erb, Campbell R. Harvey, and Tadas E. Viskanta. 1997. What Matters for Emerging Equity Market Investments. Emerging Markets Quarterly 1: 17–46. [Google Scholar]
Carhart, Mark M. 1997. On Persistence in Mutual Fund Performance. Journal of Finance 52: 57–82. [Google Scholar] [CrossRef]
Chan, Kalok, Allaudeen Hameed, and Wilson Tong. 2000. Profitability of Momentum Strategies in the International Equity Markets. Journal of Financial and Quantitative Analysis 35: 153–72. [Google Scholar] [CrossRef]
Daniel, Kent, and Tobias J. Moskowitz. 2016. Momentum Crashes. Journal of Financial Economics 122: 221–47. [Google Scholar] [CrossRef] [Green Version]
Erb, Claude B., and Campbell R. Harvey. 2006. The strategic and tactical value of commodity futures. Financial Analysts Journal 62: 69–97. [Google Scholar] [CrossRef]
Fama, Eugene, and Kenneth French. 1992. The Cross-Section of Expected Stock Returns. The Journal of Finance 47: 427–65. [Google Scholar] [CrossRef]
Hameed, Allaudeen, and Kusnadi Yuanto. 2002. Momentum Strategies: Evidence from the Pacific Basin Stock Markets. Journal of Financial Research 25: 383–97. [Google Scholar] [CrossRef]
Hansen, Christian, James B. McDonald, and Whitney K. Newey. 2010. Instrumental Variables Estimation with Flexible Distributions. Journal of Business and Economic Statistics 28: 13–25. [Google Scholar] [CrossRef]
Israel, Ronen, and Tobias J. Moskowitz. 2013. The Role of Shorting, Firm Size, and Time on Market Anomalies. Journal of Financial Economics 108: 275–301. [Google Scholar] [CrossRef]
Jamalizadeh, Ahad, and Narayanaswamy Balakrishnan. 2012. Concomitants of order statistics from multivariate elliptical distributions. Journal of Statistical Planning and Inference 142: 397–409. [Google Scholar] [CrossRef]
Jegadeesh, Narasimhan, and Sheridan Titman. 1993. Returns to buying winners and selling losers: Implications for stock market efficiency. The Journal of Finance 48: 65–91. [Google Scholar] [CrossRef]
Jegadeesh, Narasimhan, and Sheridan Titman. 2001. Profitability and Momentum Strategies: An Evaluation of Alternative Explanations. Journal of Finance 56: 699–720. [Google Scholar] [CrossRef] [Green Version]
Kan, Raymond. 2008. From moements of sum to moments of product. Journal of Multivariate Analysis 99: 542–54. [Google Scholar] [CrossRef] [Green Version]
Kirkby, Justin L., Dang Nguyen, and Duy Nguyen. 2019. Moments of Student’s t-distribution: A Unified Approach. arXiv arXiv:1912.01607. [Google Scholar] [CrossRef] [Green Version]
Kwon, Oh Kang, and Stephen Satchell. 2018. The distribution of cross sectional momentum returns. Journal of Economic Dynamics and Control 94: 225–41. [Google Scholar] [CrossRef]
Lewellen, Jonathan. 2002. Momentum and autocorrelation in Stock Returns. Review of Financial Studies 15: 533–63. [Google Scholar] [CrossRef]
Lo, Andrew, and Craig MacKinlay. 1990. When are Contrarian Profits due to Stock Marjket Overreaction? Review of Financial Studies 3: 175–205. [Google Scholar] [CrossRef]
Menkhoff, Lucas, Lucio Sarno, Maik Schmeling, and Andreas Schrimpf. 2012. Currency momentum strategies. Journal of Financial Economics 106: 660–84. [Google Scholar] [CrossRef] [Green Version]
Moskowitz, Tobias J., Yao Hua Ooi, and Lasse H. Pedersen. 2012. Time Series Momentum. Journal of Financial Economics 104: 228–50. [Google Scholar] [CrossRef] [Green Version]
Muirhead, Robb J. 1982. Aspects of Multivariate Statistical Theory. New Jersey: John Wiley & Sons. [Google Scholar]
Okunev, John U., and Derek White. 2003. Do momentum-based strategies still work in foreign currency markets? Journal of Financial and Quantitative Analysis 38: 425–47. [Google Scholar] [CrossRef]
Richards, Anthony J. 1997. Winner-Loser Reversals in National Stock Market Indices: Can They be Explained? Journal of Finance 52: 2129–44. [Google Scholar] [CrossRef]
Roth, Michael. 2013. On the Multivariate t-Distribution. Technical Report. Linköping: Department of Electrial Engineering, Linköpings Universitet. [Google Scholar]
Rotman, Joseph J. 1995. An Introduction to the Theory of Groups. New York: Springer. [Google Scholar]
Rouwenhorst, Geert K. 1998. International Momentum Strategies. Journal of Finance 53: 267–84. [Google Scholar] [CrossRef] [Green Version]
Rouwenhorst, Geert K. 1999. Local Return Factors and Turnover in Emerging Stock Markets. Journal of Finance 54: 1439–64. [Google Scholar] [CrossRef]
Sefton, James, and Alan Scowcroft. 2004. A Decomposition of Portfolio Momentum Returns. Discussion Paper TBS/DP04/9. London: Tanaka Business School, Imperial College London. [Google Scholar]
Theodossiou, Panayiotis. 1998. Financial Data and the Skewed Generalized T Distribution. Management Science 44: 1650–61. [Google Scholar] [CrossRef]
Withers, Christopher S. 1985. The Moments of the Multivariate Normal. Bulletin of Australian Mathematics Society 32: 103–7. [Google Scholar] [CrossRef] [Green Version]

1.	Refer to Rotman (1995) Chapter 2 for the details on quotient groups.
2.	The normalization factor is, in fact, the probability of the event $0 ≺ X_{1}$ , where $X_{1}$ is as defined in (20).
3.	Rewritten in the notation of this paper.
4.	Refer to www.boost.org/doc/libs/1_72_0/boost/math/distributions/students_t.hpp.

Figure 1. Expected CSM return,

μ_{1_{\pm}, t + 1}

, as a function of

ν

and

ϱ_{t, t + 1}

.

Figure 1. Expected CSM return,

μ_{1_{\pm}, t + 1}

, as a function of

ν

and

ϱ_{t, t + 1}

.

Figure 2. Standard deviation,

σ_{1_{\pm}, t + 1}

, of CSM return as a function of

ν

and

ϱ_{t, t + 1}

.

Figure 2. Standard deviation,

σ_{1_{\pm}, t + 1}

, of CSM return as a function of

ν

and

ϱ_{t, t + 1}

.

Figure 3. Skewness,

γ_{1_{\pm}, t + 1}

, of CSM return as a function of

ν

and

ϱ_{t, t + 1}

.

Figure 3. Skewness,

γ_{1_{\pm}, t + 1}

, of CSM return as a function of

ν

and

ϱ_{t, t + 1}

.

Figure 4. Excess kurtosis,

κ_{1_{\pm}, t + 1}

, of CSM return as a function of

ν

and

ϱ_{t, t + 1}

.

Figure 4. Excess kurtosis,

κ_{1_{\pm}, t + 1}

, of CSM return as a function of

ν

and

ϱ_{t, t + 1}

.

© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).

Share and Cite

MDPI and ACS Style

Kwon, O.K.; Satchell, S. The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed. J. Risk Financial Manag. 2020, 13, 27. https://doi.org/10.3390/jrfm13020027

AMA Style

Kwon OK, Satchell S. The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed. Journal of Risk and Financial Management. 2020; 13(2):27. https://doi.org/10.3390/jrfm13020027

Chicago/Turabian Style

Kwon, Oh Kang, and Stephen Satchell. 2020. "The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed" Journal of Risk and Financial Management 13, no. 2: 27. https://doi.org/10.3390/jrfm13020027

Article Menu

The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed

Abstract

1. Introduction

2. Notation and Preliminaries

2.1. Notation

2.2. Multivariate Normal Distributions

2.3. Multivariate Student’s t Distribution

2.4. Unified Skew t Family of Distributions

3. Cross-Sectional Momentum Returns with Student’s $t$ Distributed Asset Returns

4. Special Case of Two Assets

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A. Proof of Theorem 4

Appendix B. Proof of Theorem 5

Appendix C. Proof of Theorem 6

Appendix D. Proof of Lemma 1

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

Article Menu

The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed

Abstract

1. Introduction

2. Notation and Preliminaries

2.1. Notation

2.2. Multivariate Normal Distributions

2.3. Multivariate Student’s t Distribution

2.4. Unified Skew t Family of Distributions

3. Cross-Sectional Momentum Returns with Student’s t Distributed Asset Returns

4. Special Case of Two Assets

5. Conclusions

Author Contributions

Funding

Conflicts of Interest

Appendix A. Proof of Theorem 4

Appendix B. Proof of Theorem 5

Appendix C. Proof of Theorem 6

Appendix D. Proof of Lemma 1

References

Share and Cite

Article Metrics

Article Access Statistics

Further Information

Guidelines

MDPI Initiatives

Follow MDPI

3. Cross-Sectional Momentum Returns with Student’s $t$ Distributed Asset Returns