Next Article in Journal
Exploring Compliance of AAOIFI Shariah Standard on Ijarah Financing: Analysis on the Practices of Islamic Banks in Malaysia
Next Article in Special Issue
A Wavelet-Based Analysis of the Co-Movement between Sukuk Bonds and Shariah Stock Indices in the GCC Region: Implications for Risk Diversification
Previous Article in Journal
Banking Finance Experts Consensus on Compliance in US Bank Holding Companies: An e-Delphi Study
Previous Article in Special Issue
The Equity Curve and Its Relation to Future Stock Returns
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed

1
Discipline of Finance, Codrington Building (H69), The University of Sydney, Sydney NSW 2006, Australia
2
Trinity College, University of Cambridge, Cambridge CB2 1TQ, UK
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2020, 13(2), 27; https://doi.org/10.3390/jrfm13020027
Submission received: 13 January 2020 / Revised: 27 January 2020 / Accepted: 31 January 2020 / Published: 5 February 2020
(This article belongs to the Special Issue Modern Portfolio Theory)

Abstract

:
In Kwon and Satchell (2018), a theoretical framework was introduced to investigate the distributional properties of the cross-sectional momentum returns under the assumption that the vector of asset returns over the ranking and holding periods were multivariate normal. In this paper, the framework is extended to derive the corresponding results when the asset returns are multivariate Student’s t. In particular, we derive the probability density function and the moments of the cross-sectional momentum returns and examine in detail the special case of two underlying assets to demonstrate that many of the salient features reported in the empirical literature are consistent with the theoretical implications.

1. Introduction

Empirical investigation of the observed patterns in asset returns has been an active area of research in finance, with momentum, or persistence, in asset returns being one of the more popular examples of this line of research. Of these, perhaps the most prominent is cross-sectional momentum (CSM), which refers to the observation that the set of assets that outperform relative to another set over a prior period tend to continue to outperform over a subsequent period. The existence of CSM is usually tested empirically by sorting the assets according to their returns over a prior “ranking” period, and constructing a portfolio over a subsequent “holding” period by taking a long position in the “winners” and a short position in the “losers”. Statistically significant excess returns from following such a strategy would then support the existence of cross-sectional momentum.
Cross sectional momentum strategies are popular with practitioners since they tend to generate positive returns, while they are popular with academics due to the fact their existence would run contrary to an implication of the efficient market hypothesis that there does not exist any discernible patterns in asset returns. There is an extensive academic literature investigating the properties of CSM returns covering various asset classes, markets, and jurisdictions. The most notable findings are that CSM returns are generally slightly positive, but become highly negative during times of market uncertainty, and that losses during such periods tend to cancel out, or at least significantly reduce, the prior gains.
Various authors, including Fama and French (1992), Jegadeesh and Titman (1993, 2001), Asness (1994), and Israel and Moskowitz (2013), found that momentum strategies are profitable in US equities markets over different time periods dating back to 1927. Analogous results were found for country equity indices by Richards (1997), Asness et al. (1997), Chan et al. (2000), and Hameed and Yuanto (2002), for emerging markets by Rouwenhorst (1998), for exchange rate markets in Okunev and White (2003) and Menkhoff et al. (2012), for commodities by Erb and Harvey (2006), for futures contracts in Moskowitz et al. (2012), and in industries by Sefton and Scowcroft (2004). Similar results were also found by Asness et al. (2013) and Daniel and Moskowitz (2016) for markets in the European Union, Japan, the United Kingdom, and the United States, and across asset classes including fixed income, commodities, foreign exchange, and equity from 1972 through 2013.
Despite the extensive literature on the empirical properties of momentum based returns, there are relatively few that consider the distributional properties of these returns from a theoretical viewpoint, with Kwon and Satchell (2018) being a notable exception that addresses the CSM returns as defined in this paper. Most of the known theoretical results, obtained for example by Lo and MacKinlay (1990), Jegadeesh and Titman (1993), Lewellen (2002), and Moskowitz et al. (2012), are concerned only with the expected values and first order autocorrelations of returns from the so-called weighted relative strength strategy in which the portfolio over the holding period is constructed from all underlying assets weighted, essentially, in proportion to their absolute or relative returns over the ranking period. The reason why we wish to calculate the distribution of CSM returns is that we can then calculate percentiles, quantiles, and related quantities. We can deduce the degree to which moments of returns exist and their precise form. Such information can be used, for example, to assess the fatness of the tails of the distribution and this is valuable for risk management calculations as well as understanding the benefits and limitations of portfolio construction.
By assuming that underlying asset returns are Gaussian, the distribution and the moments of the CSM returns were derived in Kwon and Satchell (2018). In this paper, we extend their results to the case where the underlying asset returns are Student’s t to derive the probability density function and the moments of the CSM returns. The t distribution arises naturally, for example, in a framework where asset volatility is stochastic, and conventional mean-variance analysis will create returns which are very similar to t-distributed returns. The important distinction between Student’s t returns and normal returns is that the distribution of Student’s t has an additional parameter which governs the fatness of the tails of the distribution and can be used to assess tail risk. There is a trade-off between realism and complexity; we would like to use a more complex distribution such as the skewed Student’s t considered in Theodossiou (1998) and Hansen et al. (2010), but the analytical complexity that results becomes prohibitive.
Although the individual asset returns do not exhibit skewness under the generalization to Student’s t, they can be leptokurtic which is a well-established feature in the empirical literature. Moreover, the CSM returns can, and do, exhibit skewness that depends on the statistical properties of the underlying assets. A detailed analysis of the special case of two underlying assets reveals that many of the salient features of the CSM returns reported in the empirical literature are consistent with the theoretical implications from this framework. This analysis is of interest because Kwon and Satchell (2018) were able to show that non-normality was a consequence of the momentum structure, even when the underlying returns were normal. We therefore wish to assess what the impact of assuming non-normality in the underlying returns will have on CSM returns. For example, will it exacerbate non-normality or make very little difference? Answers to this question will shed light on applying CSM to universes of assets which are fundamentally non-normal, such as emerging markets.
It should be pointed out that since we work under the assumption that asset returns over the ranking and holding periods are jointly t-distributed, there are limitations in the properties of momentum returns that can be addressed in the theoretical framework of this paper. For example, it is not possible to adequately address properties that depend on certain firm specific, economic, or financial factors such as liquidity, credit spread, market sentiment, business cycle, and information asymmetry since these factors cannot easily be captured in the distributional assumption on asset returns. Theoretical investigation of such properties would require an extension with the ability to incorporate such factors.
Finally, it may be asked what the connection is between our analysis and the extensive linear factor modelling that dominates the asset pricing literature. This literature essentially says that the time t mean of an asset, say the first, is a linear function of factor returns. In the framework of this paper, we can accommodate such modelling by interpreting the asset mean to be conditional on factor returns.
The remainder of this paper is organized as follows: Section 2 introduces the notation and the key results on multivariate normal distributions, and Section 3 provides a mathematically precise definition of CSM returns. Although the expressions for the CSM return density and the associated moments are quite complex in general, they simplify considerably in the case of two assets with one winner and one loser, and this special case is examined in detail in Section 4, along with implications to the empirically observed features reported in the literature, and the paper concludes with Section 5.

2. Notation and Preliminaries

For the convenience of the reader, we introduce in this section the notation that will be used throughout the paper, and present some known results that will be relied upon in subsequent sections.

2.1. Notation

For any x R n , we will write x i for the i-th coordinate of x , and given y R n write x y if and only if x i < y i for all 1 i n . Similarly, given a matrix M R m × k , we will write M i , j for the ( i , j ) -th entry of M, and the transpose of a vector or a matrix will be denoted by the superscript . The vector in R n with all entries equal to 1 will be denoted 1 n , and given a subset A R n , we will denote by I A the indicator function on A.
Given a random vector, X , with values in a region D X R n , we will write f X ( x ) and F X ( x ) for the probability density and the cumulative density functions of X , respectively. Moreover, given another random vector Y , with values in D Y R m , we will denote by f X | Y ( x | y ) and F X | Y ( x | y ) the conditional probability density and conditional cumulative density functions of X given Y = y , respectively.
For any n N , let [ n ] = { 1 , 2 , , n } and let S n be the set of permutations of [ n ] . We will denote the permutation that maps 1 i 1 , 2 i 2 , , n i n by a sequence ( i 1 , i 2 , , i n ) , and given any τ S n write τ ( i ) for the image of i under τ so that if τ = ( 1 , 3 , 2 ) , for example, then τ ( 1 ) = 1 , τ ( 2 ) = 3 , and τ ( 3 ) = 2 . Given a permutation τ S n , we will denote by P τ R n × n the permutation matrix corresponding to τ and denote by D n R ( n 1 ) × n the matrix
D n = 1 1 0 0 0 0 0 0 1 1 0 0 0 0 0 0 0 0 0 1 1 .
The elements of the permutation group S n act naturally on the set of polynomials, R [ x 1 , , x n ] by the rule
τ p ( x 1 , , x n ) = p ( x τ 1 , , x τ n )
for any polynomial p R [ x 1 , , x n ] and τ S n . For any n 1 , n 2 N , let p n 1 , 2 n 2 R [ x 1 , , x n 1 + 2 n 2 ] be the polynomial
p n 1 , 2 n 2 ( x 1 , , x n 1 + 2 n 2 ) = i = 1 n 1 x i i = 1 n 2 ( x n 1 + 2 i x n 1 + 2 i 1 ) 2 .
Denote by Z ( n 1 , 2 n 2 ) the stabilizer of p n 1 , 2 n 2 under the action of S n 1 + 2 n 2 so that
Z ( n 1 , 2 n 2 ) = { τ S n 1 + 2 n 2 τ p n 1 , 2 n 2 = p n 1 , 2 n 2 } ,
and let Q ( n 1 , 2 n 2 ) = S n 1 + 2 n 2 / Z ( n 1 , 2 n 2 ) be the quotient group,1 with elements of Q ( n 1 , 2 n 2 ) identified with their coset representatives τ S n 1 + 2 n 2 . Finally, define
[ n ] m = i = 1 m { 1 , 2 , , n } = { ( i 1 , i 2 , , i m ) 1 i j n , 1 j m }
as the m-fold Cartesian product of [ n ] = { 1 , 2 , , n } .

2.2. Multivariate Normal Distributions

The density of an n-dimensional normal distribution with mean μ and covariance Σ at x R n will be denoted ϕ n ( x ; μ , Σ ) , and the corresponding cumulative density function will be denoted Φ n ( x ; μ , Σ ) . In general, given random variables X 1 , , X n , their joint probability density function will be denoted f X 1 , , X n , and we will write F X 1 , , X n for the cumulative density function.
Theorem 1.
Let n 1 , n 2 N and suppose X N n 1 + n 2 μ , Σ , where
X = X 1 X 2 , μ = μ 1 μ 2 , Σ = Σ 1 , 1 Σ 1 , 2 Σ 2 , 1 Σ 2 , 2 ,
with X i , μ i R n i and Σ i , j R n i × n j for 1 i , j 2 , and Σ positive definite. Then, the conditional distribution of X 1 given X 2 is normal with mean and covariance
μ X 1 X 2 = μ 1 + Σ 1 , 2 Σ 2 , 2 1 ( X 2 μ 2 ) ,
Σ X 1 X 2 = Σ 1 , 1 Σ 1 , 2 Σ 2 , 2 1 Σ 2 , 1 ,
respectively, and ϕ n 1 + n 2 ( x ; μ , Σ ) decomposes as
ϕ n 1 + n 2 ( x ; μ , Σ ) = ϕ n 1 x 1 ; μ X 1 X 2 , Σ X 1 X 2 ϕ n 2 ( x 2 ; μ 2 , Σ 2 , 2 ) .
Proof. 
Refer to Muirhead (1982) Theorem 1.2.11.  □
Given an n-dimensional random vector X and p = ( p 1 , , p m ) { 1 , 2 , , n } m , we will denote by μ p X the p -th moment of X so that
μ p X = E i = 1 m X p i .
Note that the subscripts, p i , in (9) may be repeated so that the above definition is equivalent to the more familiar definition of moments in which the powers of the components of X appear inside the expectation on the right-hand side, viz. E i = 1 n X i k i with 0 k i N for 1 i n . For example, we have E X 1 2 X 2 3 = E X 1 X 1 X 2 X 2 X 2 , and so in the notation of (9) we have m = 5 , d = ( d 1 , d 2 , d 3 , d 4 , d 5 ) = ( 1 , 1 , 2 , 2 , 2 ) , and μ ( 1 , 1 , 2 , 2 , 2 ) X = E X 1 2 X 2 3 .
Theorem 2.
Let X N n ( μ , Σ ) , where n N , μ R n and Σ R n × n is positive definite. Then, for any p = ( p 1 , , p m ) { 1 , 2 , , n } m , we have
μ p X = k , l N k + 2 l = m τ Q ( k , 2 l ) i = 1 k μ p τ i i = 1 l Σ p τ k + 2 i 1 , p τ k + 2 i .
In particular, if μ = 0 and m is even, then
μ p X = τ Q ( 0 , m ) i = 1 1 2 m Σ p τ 2 i 1 , p τ 2 i .
Proof. 
Refer to Withers (1985) Theorem 1.1.  □
An alternative expression for μ p ( X ) , where X is multivarite normal, is given in Kan (2008) Proposition 2.
Corollary 1.
Let X N 1 ( μ , σ 2 ) . Then, for any m N , the m-th moment of X is given by
μ m ( X ) = i = 0 i even m m i ( i 1 ) ! ! σ i μ m i ,
where k ! ! = i = 0 k 2 ( k 2 i ) is the double factorial of k N . In the special case where μ = 0 and m is even, we have μ m ( X ) = ( m 1 ) ! ! σ m .
Proof. 
Follows from Theorem 2, since the inner sum in (10), for which 2 l = i , consists of m i ( i 1 ) ! ! identical terms that are all equal to σ i μ m i .  □

2.3. Multivariate Student’s t Distribution

Asset returns are often assumed to be normally distributed in the academic literature for theoretical convenience, in which case they are completely determined by the location and the scale parameters. However, it is widely reported in the empirical literature that the observed asset returns exhibit excess kurtosis. Student’s t-distribution is a distribution from the elliptical family with an additional parameter ν , viz. number of degrees of freedom that controls the kurtosis. The Student’s t distribution reduces to the normal in the limit as ν , and hence provides a convenient framework under which to investigate the impact of excess kurtosis in the underlying asset returns on the distributional properties of the CSM return. This subsection provides a brief summary of the key properties of the Student’s t distribution that will be required in the remainder of this paper.
The t distribution has a long history in mathematical statistics. The univariate probability density function (pdf), t ( μ , σ 2 , ν ), of a t-variate with mean μ , scale parameter σ , and degrees of freedom ν is given by
t ( μ , σ 2 , ν ) = Γ ν + 1 2 σ ν π Γ ν 2 1 + 1 ν x μ σ 2 1 2 ( ν + 1 ) .
From the origins of the t-test in mathematical finance, it is clear that we can write the corresponding random variable as x = μ + z / Y , where z and Y are independent, z N ( 0 , σ 2 ) , Y = g / ν , and g χ 2 ( ν ) .
In extending the definition of the t distribution to the multivariate case, we are faced with a choice. Although the choice z N n 0 , Σ is clear, Y can be defined in various ways. For example:
  • each component, z i , of z is normalized by independent Y i = g i / ν i with same or differing ν i ,
  • Y i could be jointly dependent,
  • single common Y.
All three of these choices have a stochastic volatility interpretation corresponding to
  • idiosyncratic shocks,
  • common factor shocks,
  • economy-wide or market-wide shock.
Although we have chosen the final characterization, cross sectional momentum could also be analyzed under other characterizations.
Throughout this paper, the probability density function of n-dimensional Student’s t distribution, St n 1 + n 2 μ , Σ , ν , with ν degrees of freedom, location μ , and shape matrix Σ at x R n will be denoted t n ( x ; μ , Σ , ν ) , and we will write T n ( x ; μ , Σ , ν ) for the corresponding cumulative density function. The next theorem shows that multivariate Student’s t distribution is closed under conditioning, in the sense that the conditional density of a subset given its complement is again Student’s t. This property will be crucial in the investigation of the CSM returns in later sections.
Theorem 3.
Let n 1 , n 2 N and suppose X St n 1 + n 2 μ , Σ , ν , where ν R + ,
X = X 1 X 2 , μ = μ 1 μ 2 , Σ = Σ 1 , 1 Σ 1 , 2 Σ 2 , 1 Σ 2 , 2 ,
with X i , μ i R n i and Σ i , j R n i × n j for 1 i , j 2 , and Σ positive definite. Then, the conditional distribution of X 1 given X 2 is Student’s t with degrees of freedom ν X 1 X 2 = ν + n 2 , and location and shape matrix
μ X 1 X 2 = μ 1 + Σ 1 , 2 Σ 2 , 2 1 ( X 2 μ 2 ) ,
Σ X 1 X 2 = ν + X 2 μ 2 Σ 2 , 2 1 X 2 μ 2 ν + n 2 Σ 1 , 1 Σ 1 , 2 Σ 2 , 2 1 , Σ 2 , 1 ,
respectively, and t n 1 + n 2 ( x ; μ , Σ , ν ) decomposes as
t n 1 + n 2 ( x ; μ , Σ , ν ) = t n 1 x 1 ; μ X 1 X 2 , Σ X 1 X 2 , ν X 1 X 2 t n 2 ( x 2 ; μ 2 , Σ 2 , 2 , ν ) .
Proof. 
Refer to Roth (2013) Appendix A.6, or Muirhead (1982) Problems 1.29 and 1.30.  □
Although a Student’s t distribution does not have finite moments of all orders, the next theorem provides an explicit expression for those that do exist.
Theorem 4.
Let X St n ( μ , Σ , ν ) , where n N , ν R + , μ R n and Σ R n × n is positive definite. Moreover, for any m N , denote by P m the set of subsets of the set { 1 , 2 , , m } of size k such that m k is even, where 0 k m . Then, for any p = ( p 1 , , p m ) { 1 , 2 , , n } m such that m < ν , we have
μ p X = P P m Γ ν 2 1 2 | P c | Γ ν 2 ν 2 1 2 | P c | k P μ p i k τ Q ( 0 , | P c | ) j = 1 1 2 | P c | Σ p i τ 2 j 1 , p i τ 2 j ,
where P c = { i 1 , i 2 , , i k + 2 l } in the final sum.
Proof. 
Refer to Appendix A.  □
Setting n = 1 gives the moments of the one-dimensional Student’s t distribution.
Corollary 2.
Let X St 1 ( μ , σ 2 , ν ) . Then, for any m N such that m < ν , the m-th moment of X is given by
μ m ( X ) = k = 0 k even m m k ( k 1 ) ! ! Γ 1 2 ( ν k ) Γ ν 2 ν 2 k 2 μ m k σ k ,
where k ! ! is the double factorial defined in Corollary 1.
Proof. 
Follows from similar arguments to Theorem 4 noting that the indices p i j are all equal in this case.  □
For an alternative derivation of the moments of Student’s t distribution, refer to Kirkby et al. (2019).

2.4. Unified Skew t Family of Distributions

Multivariate skew-normal (SN) distributions were introduced in Azzalini and Valle (1985) to generalize normal distributions to those that allow non-zero skewness, and the seemingly disparate distributions related to the multivariate SN distributions were brought together under the umbrella of the so-called unified skew-normal (SUN) family of distributions in Arellano-Valle and Azzalini (2006), where it was shown that the SUN family contains many of these skew-normal variants as special cases. The extension of the normal family to those with non-zero skewness was then extended to the elliptical family of distributions in Arellano-Valle and Genton (2010). In what follows, we only summarize the results on the extension for the multivariate Student’s t distributions that will be required in this paper, and refer the reader to Arellano-Valle and Genton (2010) and Jamalizadeh and Balakrishnan (2012) for the details.
Given ν R + , n i N , μ i R n i , Σ i , j R n 1 × n 2 for 1 i j , where Σ 2 , 1 = Σ 1 , 2 and Σ i , i are positive definite, let
X 1 X 2 St n 1 + n 2 μ 1 μ 2 , Σ 1 , 1 Σ 1 , 2 Σ 1 , 2 Σ 2 , 2 , ν .
Then, the probability density function, f U ( u ) , of an n 1 -dimensional unified skew t (SUT) distributed random variable, U , associated with ( X 1 , X 2 ) given by
f U ( u ) = t n 2 ( u ; μ 2 , Σ 2 , 2 , ν ) T n 1 0 ; μ 1 , Σ 1 , 1 , ν T n 1 0 ; μ 1 Σ 1 , 2 Σ 2 , 2 1 ( u μ 2 ) , Σ 1 , 1 ; 2 ( u ) , ν + n 2 ,
where
Σ 1 , 1 ; 2 ( u ) = ν + u μ 2 Σ 2 , 2 1 u μ 2 ν + n 2 Σ 1 , 1 Σ 1 , 2 Σ 2 , 2 1 Σ 2 , 1 .
The key characteristic of f U ( u ) is that it is a product of an n 2 -dimensional Student’s t density and an n 1 -dimensional cumulative Student’s t density with the variable u appearing as the main variable in the former and in the mean and variance parameters of the latter. As will be seen, the densities of cross sectional momentum returns will be a weighted sum of these SUT distributions.

3. Cross-Sectional Momentum Returns with Student’s t Distributed Asset Returns

In this section, we derive the distributional properties of the cross sectional momentum (CSM) returns under the assumption that the underlying asset returns are multivariate Student’s t. We begin by recalling the mathematically precise definition of the CSM return from Kwon and Satchell (2018).
Let 0 < m + , m , n N such that m + + m n , and for each 1 i n denote by r i , t the return on asset i at time t. Moreover, let
r t = ( r 1 , t , , r n , t ) R n ,
and for any τ S n , define
r τ , m ± , t = 1 m + i = 1 m + r τ i , t 1 m i = 1 m r τ n m + i , t R ,
x τ i , t = r τ i + 1 , t r τ i , t R , 1 i n 1 ,
x τ , t = ( x τ 1 , t , , x τ n 1 , t ) R n 1 ,
z τ , t = r τ , m ± , t + 1 , x τ , t R n .
Note that any given τ S n defines an ordering, r t , τ 1 > r t , τ 2 > > r t , τ n , of the components of r t . Thus, r τ , m ± , t represents the return on a portfolio where the top m + ranked assets are equally weighted and held long while the bottom m assets are equally weighted and held short. The assumption of equal weighting is for notational simplicity only, and not crucial for the general theoretical results. Note also that x τ , t is defined to allow the ranking of the components of r t corresponding to τ S n to be written succinctly as x τ , t 0 n 1 .
Definition 1.
The ( m + , m ) -cross sectional momentum return, r m ± , t + 1 , is defined by
r m ± , t + 1 = τ S n I { x τ , t 0 n 1 } r τ , m ± , t + 1 ,
where I A , for any subset A R n , denotes the indicator function on the set A.
For intuition behind the definition of r m ± , t + 1 , note that the components of r t , representing asset returns over the ranking period, can be arranged in any of the n ! orderings corresponding to the permutations τ S n . For each such ranking r t , τ 1 > r t , τ 2 > > r t , τ n , the m + winner returns over the holding period are r t + 1 , τ 1 , , r t + 1 , τ m + while the m loser returns are r t + 1 , τ n m + 1 , , r t + 1 , τ n . Equally weighting the returns in the winner and the loser portfolios gives r τ , m ± , t + 1 , and since the ranking of components of r t determined by τ S n is equivalent to the condition x τ , t 0 n 1 , summing over all possible r τ , m ± , t + 1 and prefixing by the matching indicator function gives the expression for r m ± , t + 1 in (28).
For the remainder of this paper, we make the following assumption on the distribution of r t , r t + 1 .
Assumption 1.
The vector of returns, ( r t , r t + 1 ) , is multivariate Student’s t distributed so that
r t + 1 r t St 2 n μ t + 1 μ t , Σ t + 1 , t + 1 Σ t + 1 , t Σ t , t + 1 Σ t , t , ν ,
with ν R + , μ u R n , and Σ u , v R n × n , where u , v { t , t + 1 } .
Since the t-distribution is symmetric, there are limitations on the properties of asset returns that can be captured adequately by the above assumption as already discussed in Section 1. Nevertheless, the assumption is sufficiently general to accommodate linear factor models and econometric models such as vector autoregressive moving average models where the factors and noise terms, respectively, are t-distributed. Moreover, the framework also allows consideration of more general cases where μ t and μ t + 1 are conditional means linear in factors without requiring the factors themselves to be multivariate t. However, since the analysis in later sections will show that momentum returns are nonlinear in the underlying asset returns, the common practise of regressing momentum returns on the various factors must be interpreted as a best linear prediction rather than a conditional expectation in such cases. We now derive the probability density function, f m ± , t + 1 ( r ) , of the CSM return r m ± , t + 1 .
Theorem 5.
Suppose that ( r t , r t + 1 ) satisfies Assumption 1. Then, the probability density function, f m ± , t + 1 ( r ) , of the cross sectional momentum return, r m ± , t + 1 , is given by
f m ± , t + 1 ( r ) = τ S n t 1 r ; ι P τ μ t + 1 , ι P τ Σ t + 1 , t + 1 P τ ι , ν T n 1 0 ; λ τ ( r ) , Λ τ ( r ) , ν + 1 ,
where ι = 1 m + , 0 n m + m , 1 n ,
λ τ ( r ) = D n P τ μ t + r ι P τ μ t + 1 ι P τ Σ t + 1 , t + 1 P τ ι D P τ Σ t , t + 1 P τ ι ,
Λ τ ( r ) = ν + ι P τ Σ t + 1 , t + 1 P τ ι 1 r ι P τ μ t + 1 2 ν + 1 D n P τ Σ t , t P τ D n D n P τ Σ t , t + 1 P τ ι ι P τ Σ t + 1 , t P τ D n ι P τ Σ t + 1 , t + 1 P τ ι ,
and we define T 0 1 . Alternatively,
f m ± , t + 1 ( r ) = τ S n R n 1 t 1 r ; γ τ ( x ) , Υ τ ( x ) , ν + n 1 t n 1 x ; D n P τ μ t , D n P τ Σ t , t P τ D n , ν d x ,
where
γ τ ( x ) = ι P τ μ t + 1 + ι P τ Σ t + 1 , t P τ D n D n P τ Σ t , t P τ D n 1 x D n P τ μ t ,
Υ τ ( x ) = ν + x D n P τ μ t D n P τ Σ t , t P τ D n 1 x D n P τ μ t ν + n 1 ι P τ Σ t + 1 , t + 1 P τ ι ι P τ Σ t + 1 , t P τ D n D n P τ Σ t , t P τ D n 1 D n P τ Σ t , t + 1 P τ ι .
Proof. 
Refer to Appendix B.  □
Note that the summands that appear in the pdf of the CSM return in (30) have the characteristic form of the SUT densities given in (21) other than for the omission of the normalization factor2 that appears in the denominator of (21). It follows that pdf of the CSM return is a weighted sum of the SUT densities. The next result gives the pdf in the special case where r t and r t + 1 are independent, which can be considered as the case where the market is efficient.
Corollary 3.
If r t , r t + 1 satisfies Assumption 1, and r t and r t + 1 are independent, then
f m ± , t + 1 ( r ) = τ S n t 1 r ; ι P τ μ t + 1 , ι P τ Σ t + 1 , t + 1 P τ ι , ν T n 1 0 ; D n P τ μ t , Λ τ ( r ) , ν + 1 ,
where
Λ τ ( r ) = ν + ι P τ Σ t + 1 , t + 1 P τ ι 1 r ι P τ μ t + 1 2 ν + 1 D n P τ Σ t , t P τ D n ,
Proof. 
Follows from Theorem 5 since Σ t , t + 1 = O n × n = Σ t + 1 , t in this case.  □
We next derive the expressions for the non-central moments of the CSM returns. Since t distributions do not have moments of all orders as noted in Theorem 4, the moments of CSM returns will also only exist up to a certain order.
Theorem 6.
Suppose ( r t , r t + 1 ) satisfies Assumption 1, and let m N such that m < ν . Then, the m-th non-central moment of r m ± , t + 1 is given by
μ m r m ± , t + 1 = τ S n k = 0 k even m m k ( k 1 ) ! ! Γ 1 2 ν + n 1 k Γ 1 2 ( ν + n 1 ) ν + n 1 2 k 2 R n 1 γ τ m k ( x ) Υ τ k 2 ( x ) t n 1 x ; D n P τ μ t , D n P τ Σ t , t P τ D n , ν d x ,
where γ τ ( x ) and Υ τ ( x ) are as defined in (34) and (35), respectively.
Proof. 
Refer to Appendix C.  □

4. Special Case of Two Assets

In this section, we examine in detail the special case of two assets, and begin by computing the partial moments of one-dimensional Student’s t distributions that will be required. To reduce notational burden, we define for η R , ς R + , and ν R +
c η , ς 2 , ν = Γ ν + 1 2 ς ν π Γ ν 2 ,
so that from (13) we have
t 1 x ; η , ς 2 , ν = c η , ς 2 , ν 1 + 1 ν x η ς 2 1 2 ( ν + 1 ) .
Lemma 1.
Let η R , ς R + and 2 < ν R + . Then, for m N + , we have
x η ς m t 1 x ; η , ς 2 , ν = ς ν ν 2 x η ς m 1 t 1 x ; η , ς 2 ν / ( ν 2 ) , ν 2 x .
Proof. 
Refer to Appendix D.  □
The next theorem will play a key role in the derivation of the non-central moments of the CSM returns.
Theorem 7.
For any η R , ς R + , ν R + , and m N such that m < ν , let
κ m ( η , ς 2 , ν ) = 0 x η ς m t 1 ( x ; η , ς 2 , ν ) d x .
Then, κ 0 ( η , ς 2 , ν ) = T 1 ( 0 ; η , ς 2 , ν ) , κ 1 ( η , ς 2 , ν ) = ς ν / ( ν 2 ) t 1 0 ; η , ς 2 ν / ( ν 2 ) , ν 2 , and
κ m η , ς 2 , ν = ( m 1 ) ν ν 2 1 2 ( m 1 ) κ m 2 η , ς 2 ν ν 2 , ν 2
for 2 m < ν . More explicitly, if m is even
κ m η , ς 2 , ν = ( m 1 ) ! ! ν 1 2 ( m 1 ) ( ν 2 ) ( ν 4 ) ν m + 2 ν m T 1 0 ; η , ς 2 ν ν m , ν m ,
and if m is odd
κ m η , ς 2 , ν = ς ( m 1 ) ! ! ν 1 2 ( m 1 ) ( ν 2 ) ( ν 4 ) ( ν ( m 1 ) ) ν ( m + 1 ) t 1 0 ; η , ς 2 ν ν ( m + 1 ) , ν ( m + 1 ) .
Proof. 
Firstly, the expression for κ 0 ( η , ς 2 , ν ) follows directly from the definition of κ m ( η , ς 2 , ν ) , and, for κ 1 ( η , ς 2 , ν ) , we obtain on setting m = 1 in Lemma 1 that
κ 1 ( η , ς 2 , ν ) = ς ν ν 2 0 t 1 x ; η , ς 2 ν / ( ν 2 ) , ν 2 x d x = ς ν ν 2 t 1 0 ; η , ς 2 ν ν 2 , ν 2 .
Next, for (43), using Lemma 1 and applying integration by parts gives
κ m ( η , ς 2 , ν ) = ς ν ν 2 x R x η ς m 1 t 1 x ; η , ς 2 ν / ( ν 2 ) , ν 2 x d x = ( m 1 ) ν ν 2 x R x η ς m 2 t 1 x ; η , ς 2 ν ν 2 , ν 2 d x = ( m 1 ) ν ν 2 1 2 ( m 1 ) x R x η ς ν / ( ν 2 ) m 2 t 1 x ; η , ς 2 ν ν 2 , ν 2 d x = ( m 1 ) ν ν 2 1 2 ( m 1 ) κ m 2 η , ς 2 ν ν 2 , ν 2 ,
which is (43). The explicit expressions for the odd and even cases follow by induction.  □
As it will be seen, the quantities that play a key role in the two asset case are the spreads, r t , 2 r t , 1 and r t + 1 , 2 r t + 1 , 1 , and so we define
η u = μ u , 2 μ u , 1 , σ u , i 2 = var r u , i , ρ u = cov r u , 1 , r u , 2 σ u , 1 σ u , 2 , ρ i , j = cov r t , i , r t + 1 , j σ t , i σ t + 1 , j , ς u 2 = σ u , 1 2 + σ u , 2 2 2 ρ u σ u , 1 σ u , 2 , ς t , t + 1 = ρ 1 , 1 σ t , 1 σ t + 1 , 1 + ρ 2 , 2 σ t , 2 σ t + 1 , 2 ρ 1 , 2 σ t , 1 σ t + 1 , 2 ρ 2 , 1 σ t , 2 σ t + 1 , 1 , ϱ t , t + 1 = ς t , t + 1 ς t ς t + 1 ,
where 1 i , j 2 and u { t , t + 1 } . Note that ς u 2 is the variance of the spread r u , 2 r u , 1 , and ϱ t , t + 1 is the correlation between r t , 2 r t , 1 and r t + 1 , 2 r t + 1 , 1 . Next, we compute the terms γ τ ( x ) and Υ τ ( x ) that appear in the expression (33) for the pdf of the CSM return. For u , v { t , t + 1 } and τ S 2 , we have
ι P ( 1 , 2 ) μ u = η u = ι P ( 2 , 1 ) μ u , D 2 P ( 1 , 2 ) μ u = η u = D 2 P ( 2 , 1 ) μ u , D 2 P τ Σ u , u P τ D 2 = ι P τ Σ u , u P τ ι = ς u 2 , ι P τ Σ u , v P τ D 2 = ς u , v ,
and so
γ ( 1 , 2 ) ( x ) = η t + 1 ς t , t + 1 ς t 2 x η t , γ ( 2 , 1 ) ( x ) = η t + 1 ς t , t + 1 ς t 2 x + η t , Υ ( 1 , 2 ) ( x ) = ν + ς t 2 x η t 2 ν + 1 ς t + 1 2 ς t , t + 1 2 ς t 2 = ς t + 1 2 1 ϱ t , t + 1 2 ν + 1 ν + x η t ς t 2 , Υ ( 2 , 1 ) ( x ) = ν + ς t 2 x + η t 2 ν + 1 ς t + 1 2 ς t , t + 1 2 ς t 2 = ς t + 1 2 1 ϱ t , t + 1 2 ν + 1 ν + x + η t ς t 2 .
If we define the sign of permutations in S 2 by ε ( ( 1 , 2 ) ) = 1 and ε ( ( 2 , 1 ) ) = 1 , then the expressions for γ τ ( x ) and Υ τ ( x ) can be written succinctly as
γ τ ( x ) = ε ( τ ) η t + 1 ϱ t , t + t ς t + 1 x ε ( τ ) η t ς t ,
Υ τ ( x ) = ς t + 1 2 1 ϱ t , t + 1 2 ν + 1 ν + x ε ( τ ) η t ς t 2 .
The next lemma will provide the building blocks for the non-central moments of CSM returns.
Lemma 2.
Let γ τ ( x ) and Υ τ ( x ) be as defined in (46) and (47) respectively, where τ S 2 . Then, for α , β N such that α + 2 β < ν , we have
0 γ τ α ( x ) Υ τ β ( x ) t 1 x ; ε ( τ ) η t , ς t 2 , ν d x = ( 1 ) α ς t + 1 2 β 1 ρ t , t + 1 2 β ( ν + 1 ) β i = 0 α j = 0 β α i β j ε α i ( τ ) η t + 1 α i ϱ t , t + 1 i ς t + 1 i ν β j κ i + 2 j ε ( τ ) η t , ς t 2 , ν ,
where κ m η , ς 2 , ν is as defined in (42).
Proof. 
Follows from using the binomial formula to expand the powers of γ τ ( x ) and Υ τ ( x ) , and applying the definition of κ m η , ς 2 , ν .  □
For notational convenience, given any τ S 2 , ν R + and α , β N such that α + 2 β < ν , we define
κ α , β , τ ( η , ς , ν ) = 0 γ τ α ( x ) Υ τ β ( x ) t 1 x ; ε ( τ ) η , ς 2 , ν d x ,
and note that κ α , β , τ ( η , ς , ν ) can be computed explicitly using (48). We now present the non-central moments of the CSM return r 1 ± , t + 1 .
Theorem 8.
Let η R , ς R + and ν R + . Then, for m N + such that m < ν , the m-th non-central moment, μ m r 1 ± , t + 1 , of r 1 ± , t + 1 is given as follows:
μ m r 1 ± , t + 1 = τ S 2 k = 0 k even m m k ( k 1 ) ! ! Γ 1 2 ( ν k + 1 ) Γ 1 2 ( ν + 1 ) ν + 1 2 k 2 κ m k , k 2 , τ ( η t , ς t , ν ) ,
where κ α , β , τ ( η , ς , ν ) is as defined in (49). In particular, the first four non-central moments are
μ 1 r 1 ± , t + 1 = τ S 2 κ 1 , 0 , τ ( η t , ς t , ν ) ,
μ 2 r 1 ± , t + 1 = τ S 2 κ 2 , 0 , τ ( η t , ς t , ν ) + ( ν + 1 ) κ 0 , 1 , τ ( η t , ς t , ν ) ν 1 ,
μ 3 r 1 ± , t + 1 = τ S 2 κ 3 , 0 , τ ( η t , ς t , ν ) + 3 ( ν + 1 ) κ 1 , 1 , τ ( η t , ς t , ν ) ν 1 ,
μ 4 r 1 ± , t + 1 = τ S 2 κ 4 , 0 , τ ( η t , ς t , ν ) + 6 ( ν + 1 ) κ 2 , 1 , τ ( η t , ς t , ν ) ν 1 + 3 ( ν + 1 ) 2 κ 0 , 2 , τ ( η t , ς t , ν ) ( ν 1 ) ( ν 3 ) .
Proof. 
Follows from the general expression (38) for the moments of r 1 ± , t + 1 and the definition of κ α , β , τ ( η , ς , ν ) .  □
We remark that the non-central moments of r 1 ± , t + 1 given in Theorem 8 are sums indexed by S 2 that consists of two elements, and that each term that appears in these sums can be computed recursively using (43), (48), and (49). Since the right-hand side of (48) consists of a finite number of terms and (43) is equivalent to the explicit expressions (44) or (45) depending on the index m, these moments of r 1 ± , t + 1 can be computed without having to make any simplifying approximations. For example, the first moment is given explicitly by
μ 1 r ± 1 , t + 1 = η t + 1 2 T 1 ( 0 ; η t , ς t 2 , ν ) 1 + 2 ρ t , t + 1 ς t + 1 ν ν 2 t 1 0 ; η t , ν ς t 2 ν 2 , ν ,
which is reassuring since it has the same functional form as the following expression3
μ 1 🟉 r ± 1 , t + 1 = η t + 1 2 Φ η t ς t 1 + 2 ϱ t , t + t ς t + 1 ϕ η t ς t
obtained for the normally distributed asset return case in Kwon and Satchell (2018) except for the distribution functions being Student’s t rather than normal. The numerical calculations in this paper were performed using code written in C++ that relied on the boost library4 to compute the functions t 1 and T 1 .
Returning briefly to the linear factor structure discussed in Section 1, we could consider μ t , 1 and μ t , 2 to be a linear combinations of factors, which in a Carhart (1997) model context would consist of size, market, value, and momentum. Thus, if we were to go long asset 1 and short asset 2 in our CSM momentum portfolio, we might expect a larger exposure to the momentum factor for asset 1 and a smaller exposure for asset 2. We could carry out further detailed analysis to accommodate these features but leave this for further research.
If we denote by μ 1 ± , t + 1 the mean, σ 1 ± , t + 1 2 the variance, γ 1 ± , t + 1 the skewness, and κ 1 ± , t + 1 the excess kurtosis of the CSM return, then these quantities are easily computed from the non-central moments given in Theorem 8. It should be noted that the quantities corresponding to the odd moments, viz. μ 1 ± , t + 1 and γ 1 ± , t + 1 are approximately odd as functions of ϱ t , t + 1 , and those associated with the even moments, viz. σ 1 ± , t + 1 2 and κ 1 ± , t + 1 , are approximately even as functions of ϱ t , t + 1 . This is because the return from a portfolio formed by taking a long position in the loser and a short position in the winner when ϱ t , t + 1 < 0 would have the same distributional properties as the return from taking the opposite positions when ϱ t , t + 1 > 0 .
In the analysis that follows, we assume that the asset returns are stationary in order to reduce the number of parameters. Moreover, we have set μ t , 1 = 6 % , μ t , 2 = 4 % , σ t , 1 2 = 0.18 ( ν 2 ) / ν and σ t , 2 2 = 0.14 ( ν 2 ) / ν , where the asset variances have been scaled by a factor dependent on ν to ensure that they are independent of the number of degrees of freedom. It should be noted that the cross-sectional correlation, ρ t , then determines the variance, ς t 2 , of the spread, r t , 2 r t , 1 .
The mean of the CSM return is shown in Figure 1 as a function of the degrees of freedom ν , and the spread autocorrelation, ϱ t , t + 1 , for ρ t = 0.9 , ρ t = 0.0 , and ρ t = 0.9 . As expected, μ 1 ± , t + 1 is an increasing function of ϱ t , t + 1 . For ϱ t , t + 1 > 0 , the mean decreases slightly with ν , while the opposite is the case when ϱ t , t + 1 < 0 . In the region ϱ t , t + 1 0 , the mean is slightly positive. Since this is the region corresponding to small autocorrelations in the underlying asset returns, and the situation most commonly observed in practice, it is reassuring that the small positive CSM returns implied by the model is consistent with the findings reported in the empirical literature. Moreover, we see that the degrees of freedom parameter, ν , that controls the kurtosis of asset returns has very little impact on r 1 p m , t + 1 . Interpreting small ν as representative of assets from emerging markets, this is consistent with findings from Rouwenhorst (1999) and Bekaert et al. (1997) that, although there is evidence of momentum in emerging markets, it is not significantly different to those observed in developed markets, despite the assets from the respective markets having different distributional properties. The surface flattens out as the cross-sectional correlation increases from 0.9 to 0.9 . Since an increase in ρ t corresponds to a decrease in the variance, ς t 2 , of the spread r t , 2 r t , 1 , this behavior is consistent with the positive relationship usually associated with risk and return in finance.
The standard deviation of the CSM return as a function of ν and ϱ t , t + 1 is shown in Figure 2. Although not clearly evident from the figure, σ 1 ± , t + 1 is a decreasing function of ν , and for a fixed value of ν the standard deviation is convex in ϱ t , t + 1 and takes the maximum value at ϱ t , t + 1 = 0 . Finally, since the variance of the spread decreases as ρ t increases, σ 1 ± , t + 1 likewise decreases with increasing ρ t .
The skewness of the CSM return in Figure 3 shows that γ 1 ± , t + 1 is negative when ϱ t , t + 1 < 0 and positive otherwise. In fact, although it is not clearly evident from the figure, γ 1 ± , t + 1 is negative even for small positive values of ϱ t , t + 1 . As discussed above, the autocorrelations in the asset returns tend to be small in practice, and hence ϱ t , t + 1 will also be small. The corresponding model implied skewness in the CSM return will then be slightly negative, which is consistent with the observations in the empirical literature.
In contrast to the surfaces for other quantities that flatten to a large extent as ρ t increases from 0.9 to 0.9 , the surface for κ 1 ± , t + 1 in Figure 4 remains relatively unchanged. The excess kurtosis of CSM returns is generally positive, in line with the findings reported in the literature, and increases significantly when ν is small and ϱ t , t + 1 is high. It should be noted that κ 1 ± , t + 1 is largest when ν is small. Since the deviation of the Student’s t distribution from the normal is greatest when ν is small, it follows that the extension considered in this paper will be useful in situations where the observed kurtosis in the CSM returns is higher than the value implied under the assumption of normal asset returns. This would be the case, for example, when considering emerging markets.

5. Conclusions

In this paper, the theoretical framework introduced in Kwon and Satchell (2018) was extended to investigate the distributional properties of cross-sectional momentum (CSM) returns under the assumption that the vector of asset returns were multivariate Student’s t. The probability density function and the moments of the CSM returns were derived, and investigated in detail for the special case of two assets.
It was found that, in situations where the assets return, and hence the return spread, autocorrelations are small, and the CSM return has a small positive mean, negative skewness, and excess kurtosis. These are all consistent with the findings reported in the empirical literature. Moreover, the skewness and the kurtosis both become more pronounced as the number of degrees of freedom in the Student’s t distribution decreases and the corresponding asset returns become less normal.
In modeling asset returns that deviate significantly from being normal, such as those for emerging markets, the extension to the Student’s t considered in this paper would address some of the limitations of assuming normality. Since the Student’s t distribution approaches the normal in the limit as the number of degrees of freedom approaches infinity, the extension also provides a framework under which to analyze the implication and the limitations of the assumption of normality in asset returns to CSM returns.

Author Contributions

Conceptualization, O.K.K. and S.S.; methodology, O.K.K. and S.S; software, O.K.; validation, O.K.K. and S.S.; investigation, O.K.K. and S.S.; writing–original draft preparation, O.K.K. and S.S.; writing–review and editing, O.K.K. and S.S. All authors have read and agreed to the published version of the manuscript.

Funding

This research received no external funding.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Proof of Theorem 4

From the discussion in Section 2.3, see also Roth (2013) Appendix A.4, we have that X St n ( μ , Σ , ν ) can be decomposed as
X d μ + 1 U V ,
where U Gam ν 2 , ν 2 , V N n 0 , Σ , and U and V are independent. Now, from (9), we obtain
μ p ( X ) = P P m k P μ p k E U 1 2 | P c | l P c V p l ,
where P c = { 1 , 2 , , m } \ P denotes the complement of P, and since U and X are independent,
μ p ( X ) = P P m k P μ p k E U 1 2 | P c | E l P c V p l .
The second expectation term can be computed using (10), and so it remains to compute the first expectation term. However, from the properties of Gamma distributions, we have
E U 1 2 | P c | = ν 2 1 2 | P c | Γ ν 2 1 2 | P c | Γ ν 2
provided | P c | < ν , and so substituting into the expression for μ p ( X ) gives (18).

Appendix B. Proof of Theorem 5

For any k N , let 1 k = ( 1 , 1 , , 1 ) R k , and define ι R n by ι = 1 m + , 0 n n + n , 1 n , so that, for any τ S n , we have that r τ , m ± , t + 1 = ι P τ r t + 1 . Then, since
r τ , m ± , t + 1 x τ , t = ι P τ O 1 × n O ( n 1 ) × n D n P τ r t + 1 r t ,
we have that r τ , m ± , t + 1 , x τ , t is a linear transformation of r t + 1 , r t , and so it follows from Roth (2013) Equation (4.1) that
r τ , m ± , t + 1 x τ , t St n ι P τ μ t + 1 D n P τ r t , ι P τ Σ t + 1 , t + 1 P τ ι ι P τ Σ t + 1 , t P τ D n D n P τ Σ t , t + 1 P τ ι D n P τ Σ t , t P τ D n , ν .
Now, the joint pdf, f r τ , m ± , t + 1 , x τ , t ( r , x ) , is given by
f r τ , m ± , t + 1 , x τ , t ( t , x ) = f x τ , t r τ , m ± , t + 1 ( x | r ) f r τ , m ± , t + 1 ( r ) .
However, from Theorem 3, we have f x τ , t r τ , m ± , t + 1 ( x | r ) = t n 1 x ; λ τ ( r ) , Λ τ ( r ) , ν + 1 , and so
f r τ , m ± , t + 1 , x τ , t ( t , x ) = t n 1 x ; λ τ ( r ) , Λ τ ( r ) , ν + 1 t 1 r ; ι P τ μ t + 1 , ι P τ Σ t + 1 , t + 1 P τ ι , ν .
If we define D τ = { x τ 0 } , then f m ± , t + 1 D τ = I D τ f r τ , m ± , t + 1 , and so
f m ± , t + 1 D τ ( r ) = x R n 1 I D τ ( x ) f r τ , m ± , t + 1 , x τ , t ( t , x ) d x = x 0 t n 1 x ; λ τ ( r ) , Λ τ ( r ) , ν + 1 t 1 r ; ι P τ μ t + 1 , ι P τ Σ t + 1 , t + 1 P τ ι , ν d x = t 1 r ; ι P τ μ t + 1 , ι P τ Σ t + 1 , t + 1 P τ ι , ν T n 1 0 ; λ τ , r , Λ τ , r , ν + 1 ,
and summing over τ S n gives (30). The alternative expression (33) for f m ± , t + 1 ( r ) follows from a similar argument using the decomposition f r τ , m ± , t + 1 , x τ , t ( t , x ) = f r τ , m ± , t + 1 x τ , t ( r | x ) f x τ , t ( x ) .

Appendix C. Proof of Theorem 6

In view of the expression for f m ± , t + 1 ( r ) in (33), we need to compute terms of the form
I m , τ = R R n 1 r m t 1 r ; γ τ ( x ) , Υ τ ( x ) , ν + n 1 t n 1 x ; D n P τ μ t , D n P τ Σ t , t P τ D n , ν d x d r
for τ S n . Now, since a one-dimensional Student’s t distribution with ν n 1 degrees of freedom has finite moments for orders less than the number of degrees of freedom, and by assumption m < ν ν + n 1 , the m-th moment of t 1 r ; λ τ ( x ) , Υ τ ( x ) , ν + n 1 exists. Next, interchanging the order of integration gives
I m , τ = R n 1 t n 1 x ; D n P τ μ t , D n P τ Σ t , t P τ D n , ν R r m t 1 r ; γ τ ( x ) , Υ τ ( x ) , ν + n 1 d r d x ,
and applying Corollary 2 to the inner integral gives
I m , τ = k = 0 k even m m k ( k 1 ) ! ! Γ 1 2 ν + n 1 k Γ 1 2 ( ν + n 1 ) ν + n 1 2 k 2 R n 1 γ τ m k ( x ) Υ τ k 2 ( x ) t n 1 x ; D n P τ μ t , D n P τ Σ t , t P τ D n , ν d x .
Since γ τ ( x ) and Υ τ ( x ) are of orders 1 and 2 respectively in x , we have γ τ m k ( x ) Γ τ k 2 ( x ) is of order m in x , and since t n 1 x ; D n P τ μ t , D n P τ Σ t , t P τ D n , ν has finite moments of order up to ν by Theorem 4, it follows that the integral in I m , τ is well-defined. Summing the I m , τ over τ S n gives (38).

Appendix D. Proof of Lemma 1

Setting the scale parameter to ς ν / ( ν 2 ) and the degrees of freedom to ν 2 in (40) gives
t 1 x ; η , ς 2 ν / ( ν 2 ) , ν 2 = c η , ς 2 ν / ( ν 2 ) , ν 2 1 + 1 ν x η ς 2 1 2 ( ν 1 ) ,
and differentiating both sides with respect to x gives
t 1 x ; η , ς 2 ν / ( ν 2 ) , ν 2 x = c η , ς 2 ν ν 2 , ν 2 ( ν 1 ) ν ς x η ς 1 + 1 ν x η ς 2 1 2 ( ν + 1 ) = c η , ς 2 ν / ( ν 2 ) , ν 2 ( ν 1 ) c η , ς 2 , ν ν ς x η ς t 1 x ; η , ς 2 , ν .
Now, using the definition of c ( η , ς , ν ) , we have
c η , ς 2 ν / ( ν 2 ) , ν 2 ( ν 1 ) c η , ς 2 , ν ν ς = Γ ν 1 2 ς ( ν 2 ) π Γ ν 2 2 · ς ν π Γ ν 2 Γ ν + 1 2 · ( ν 1 ) ν ς ,
and using the property Γ ( z + 1 ) = z Γ ( z ) , we obtain
c η , ς 2 ν / ( ν 2 ) , ν 2 ( ν 1 ) c η , ς 2 , ν ν ς = ν ν 2 · 1 2 ( ν 2 ) 1 2 ( ν 1 ) · ( ν 1 ) ν ς = 1 ς ν 2 ν .
Substituting back into (A1) and rearranging gives (41) for m = 1 , and the general case follows from multiplying both sides of the identity for the m = 1 case by powers of ( x η ) / ς .

References

  1. Arellano-Valle, Reinaldo B., and Adelchi Azzalini. 2006. On the Unification of Families of Skew-normal Distributions. Scandinavian Journal of Statistics 33: 561–74. [Google Scholar] [CrossRef]
  2. Arellano-Valle, Reinaldo B., and Marc G. Genton. 2010. Multivariate unified skew-elliptical distributions. Chilean Journal of Statistics 1: 17–33. [Google Scholar]
  3. Asness, Clifford S. 1994. Variables that Explain Stock Returns. Ph.D. dissertation, University of Chicago, Chicago, IL, USA. [Google Scholar]
  4. Asness, Clifford S., John M. Liew, and Ross L. Stevens. 1997. Parallels between the cross-sectional predictability of stock and country returns. Journal of Portfolio Management 23: 79–87. [Google Scholar] [CrossRef] [Green Version]
  5. Asness, Clifford S., Tobias J. Moskowitz, and Lasse H. Pedersen. 2013. Value and momentum everywhere. The Journal of Finance 58: 929–85. [Google Scholar] [CrossRef] [Green Version]
  6. Azzalini, Adelchi, and Alessandra Dalla Valle. 1985. The multivariate skew-normal distribution. Biometrika 83: 715–26. [Google Scholar] [CrossRef]
  7. Bekaert, Geert, Claude B. Erb, Campbell R. Harvey, and Tadas E. Viskanta. 1997. What Matters for Emerging Equity Market Investments. Emerging Markets Quarterly 1: 17–46. [Google Scholar]
  8. Carhart, Mark M. 1997. On Persistence in Mutual Fund Performance. Journal of Finance 52: 57–82. [Google Scholar] [CrossRef]
  9. Chan, Kalok, Allaudeen Hameed, and Wilson Tong. 2000. Profitability of Momentum Strategies in the International Equity Markets. Journal of Financial and Quantitative Analysis 35: 153–72. [Google Scholar] [CrossRef]
  10. Daniel, Kent, and Tobias J. Moskowitz. 2016. Momentum Crashes. Journal of Financial Economics 122: 221–47. [Google Scholar] [CrossRef] [Green Version]
  11. Erb, Claude B., and Campbell R. Harvey. 2006. The strategic and tactical value of commodity futures. Financial Analysts Journal 62: 69–97. [Google Scholar] [CrossRef]
  12. Fama, Eugene, and Kenneth French. 1992. The Cross-Section of Expected Stock Returns. The Journal of Finance 47: 427–65. [Google Scholar] [CrossRef]
  13. Hameed, Allaudeen, and Kusnadi Yuanto. 2002. Momentum Strategies: Evidence from the Pacific Basin Stock Markets. Journal of Financial Research 25: 383–97. [Google Scholar] [CrossRef]
  14. Hansen, Christian, James B. McDonald, and Whitney K. Newey. 2010. Instrumental Variables Estimation with Flexible Distributions. Journal of Business and Economic Statistics 28: 13–25. [Google Scholar] [CrossRef]
  15. Israel, Ronen, and Tobias J. Moskowitz. 2013. The Role of Shorting, Firm Size, and Time on Market Anomalies. Journal of Financial Economics 108: 275–301. [Google Scholar] [CrossRef]
  16. Jamalizadeh, Ahad, and Narayanaswamy Balakrishnan. 2012. Concomitants of order statistics from multivariate elliptical distributions. Journal of Statistical Planning and Inference 142: 397–409. [Google Scholar] [CrossRef]
  17. Jegadeesh, Narasimhan, and Sheridan Titman. 1993. Returns to buying winners and selling losers: Implications for stock market efficiency. The Journal of Finance 48: 65–91. [Google Scholar] [CrossRef]
  18. Jegadeesh, Narasimhan, and Sheridan Titman. 2001. Profitability and Momentum Strategies: An Evaluation of Alternative Explanations. Journal of Finance 56: 699–720. [Google Scholar] [CrossRef] [Green Version]
  19. Kan, Raymond. 2008. From moements of sum to moments of product. Journal of Multivariate Analysis 99: 542–54. [Google Scholar] [CrossRef] [Green Version]
  20. Kirkby, Justin L., Dang Nguyen, and Duy Nguyen. 2019. Moments of Student’s t-distribution: A Unified Approach. arXiv arXiv:1912.01607. [Google Scholar] [CrossRef] [Green Version]
  21. Kwon, Oh Kang, and Stephen Satchell. 2018. The distribution of cross sectional momentum returns. Journal of Economic Dynamics and Control 94: 225–41. [Google Scholar] [CrossRef]
  22. Lewellen, Jonathan. 2002. Momentum and autocorrelation in Stock Returns. Review of Financial Studies 15: 533–63. [Google Scholar] [CrossRef]
  23. Lo, Andrew, and Craig MacKinlay. 1990. When are Contrarian Profits due to Stock Marjket Overreaction? Review of Financial Studies 3: 175–205. [Google Scholar] [CrossRef]
  24. Menkhoff, Lucas, Lucio Sarno, Maik Schmeling, and Andreas Schrimpf. 2012. Currency momentum strategies. Journal of Financial Economics 106: 660–84. [Google Scholar] [CrossRef] [Green Version]
  25. Moskowitz, Tobias J., Yao Hua Ooi, and Lasse H. Pedersen. 2012. Time Series Momentum. Journal of Financial Economics 104: 228–50. [Google Scholar] [CrossRef] [Green Version]
  26. Muirhead, Robb J. 1982. Aspects of Multivariate Statistical Theory. New Jersey: John Wiley & Sons. [Google Scholar]
  27. Okunev, John U., and Derek White. 2003. Do momentum-based strategies still work in foreign currency markets? Journal of Financial and Quantitative Analysis 38: 425–47. [Google Scholar] [CrossRef]
  28. Richards, Anthony J. 1997. Winner-Loser Reversals in National Stock Market Indices: Can They be Explained? Journal of Finance 52: 2129–44. [Google Scholar] [CrossRef]
  29. Roth, Michael. 2013. On the Multivariate t-Distribution. Technical Report. Linköping: Department of Electrial Engineering, Linköpings Universitet. [Google Scholar]
  30. Rotman, Joseph J. 1995. An Introduction to the Theory of Groups. New York: Springer. [Google Scholar]
  31. Rouwenhorst, Geert K. 1998. International Momentum Strategies. Journal of Finance 53: 267–84. [Google Scholar] [CrossRef] [Green Version]
  32. Rouwenhorst, Geert K. 1999. Local Return Factors and Turnover in Emerging Stock Markets. Journal of Finance 54: 1439–64. [Google Scholar] [CrossRef]
  33. Sefton, James, and Alan Scowcroft. 2004. A Decomposition of Portfolio Momentum Returns. Discussion Paper TBS/DP04/9. London: Tanaka Business School, Imperial College London. [Google Scholar]
  34. Theodossiou, Panayiotis. 1998. Financial Data and the Skewed Generalized T Distribution. Management Science 44: 1650–61. [Google Scholar] [CrossRef]
  35. Withers, Christopher S. 1985. The Moments of the Multivariate Normal. Bulletin of Australian Mathematics Society 32: 103–7. [Google Scholar] [CrossRef] [Green Version]
1.
Refer to Rotman (1995) Chapter 2 for the details on quotient groups.
2.
The normalization factor is, in fact, the probability of the event 0 X 1 , where X 1 is as defined in (20).
3.
Rewritten in the notation of this paper.
4.
Figure 1. Expected CSM return, μ 1 ± , t + 1 , as a function of ν and ϱ t , t + 1 .
Figure 1. Expected CSM return, μ 1 ± , t + 1 , as a function of ν and ϱ t , t + 1 .
Jrfm 13 00027 g001
Figure 2. Standard deviation, σ 1 ± , t + 1 , of CSM return as a function of ν and ϱ t , t + 1 .
Figure 2. Standard deviation, σ 1 ± , t + 1 , of CSM return as a function of ν and ϱ t , t + 1 .
Jrfm 13 00027 g002
Figure 3. Skewness, γ 1 ± , t + 1 , of CSM return as a function of ν and ϱ t , t + 1 .
Figure 3. Skewness, γ 1 ± , t + 1 , of CSM return as a function of ν and ϱ t , t + 1 .
Jrfm 13 00027 g003
Figure 4. Excess kurtosis, κ 1 ± , t + 1 , of CSM return as a function of ν and ϱ t , t + 1 .
Figure 4. Excess kurtosis, κ 1 ± , t + 1 , of CSM return as a function of ν and ϱ t , t + 1 .
Jrfm 13 00027 g004

Share and Cite

MDPI and ACS Style

Kwon, O.K.; Satchell, S. The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed. J. Risk Financial Manag. 2020, 13, 27. https://doi.org/10.3390/jrfm13020027

AMA Style

Kwon OK, Satchell S. The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed. Journal of Risk and Financial Management. 2020; 13(2):27. https://doi.org/10.3390/jrfm13020027

Chicago/Turabian Style

Kwon, Oh Kang, and Stephen Satchell. 2020. "The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed" Journal of Risk and Financial Management 13, no. 2: 27. https://doi.org/10.3390/jrfm13020027

APA Style

Kwon, O. K., & Satchell, S. (2020). The Distribution of Cross Sectional Momentum Returns When Underlying Asset Returns Are Student’s t Distributed. Journal of Risk and Financial Management, 13(2), 27. https://doi.org/10.3390/jrfm13020027

Article Metrics

Back to TopTop