1. Introduction
Efficient markets has been and still is a cornerstone of asset pricing theory. Empirical work in this regard is largely concerned with the adjustment of security prices to relevant information.
Fama (
1970,
1991) refine relevant information into three hierarchical subsets of weak form, semi-strong form, and strong form
Fama (
1970), or equivalently, return predictability, event studies, and private information
Fama (
1991). Event studies investigate the effect of unexpected economic events on asset prices. Therefore, event studies can give the most direct evidence on market efficiency (c.f.
Fama 1991, p. 1577). For this purpose, asset price data available from financial markets can be used with appropriate statistical testing methodology, reliability of which is central in inferences. In order to foster this, the current paper aims to fill the gap in existing (non-parametric) statistical testing by proposing non-parametric rank tests that are robust to cross-sectional dependency of asset returns in more general circumstances than the existing ones. Otherwise, refer to (
Campbell et al. 1997, chp. 4) as an excellent introduction to event studies and related statistical methods.
Regarding methodology, standardizing returns by their respective standard deviations homogenizes data and has proven to improve testing performance. Because of this improvement, standardized return based tests by
Patell (
1976) and
Boehmer et al. (
1991) (BMP) have gained popularity over conventional non-standardized tests in testing event effects on mean security price performance.
Harrington and Shrider (
2007) found that in a short-horizon testing of abnormal returns (i.e., systematic deviation from expected behavior), one should always use methods that are robust to cross-sectional variation in the true abnormal returns.
1 They found that BMP is a good candidate for robust, parametric tests in conventional event studies.
2A major problem in statistical tests of returns is that the returns are not normally distributed (
Fama 1976). Not surprisingly, non-parametric rank tests introduced by
Corrado (
1989,
2011);
Corrado and Zivney (
1992);
Campbell and Wasley (
1993) and
Kolari and Pynnonen (
2011), among others, dominate parametric tests both in terms of better size and power (e.g., see
Campbell and Wasley 1993;
Corrado 1989;
Corrado and Zivney 1992;
Kolari and Pynnonen 2011;
Kolari and Pynnönen 2010;
Luoma 2011). Furthermore, rank tests by
Corrado and Zivney (
1992) and
Kolari and Pynnonen (
2011) that utilize event period re-standardized returns have proven to be robust to event-induced volatility (
Kolari and Pynnönen (
2010);
Kolari and Pynnonen (
2011)), cross-correlation due to event day clusterings (
Kolari and Pynnönen 2010), and autocorrelation (
Kolari and Pynnonen 2011). These studies are consistent with the view stated in the epilogue of
Lehmann (
2006): “Rank tests apply often to relatively simple solutions, such as one-, two-, and
s-sample problems, and testing for independence and randomness, but for these situations they are often the method of choice”. (
Lehmann 2006, p. v). In addition, the results of rank tests are invariant to monotone transformations of the underlying returns; that is, whether the returns are defined as simple, continuously compounded log returns.
Existing rank based tests, however, are not robust to cross-sectional correlation if the event days are partially overlapping in calendar time. This
partial clustering occurs when events are in calendar time scattered within an event window more or less randomly rather than clustered on the same calendar day (i.e.,
complete clustering as in
Kolari and Pynnönen 2010).
Figure 1 illustrates the various degrees of clustering in terms of three stocks. Panel A depict the non-clustered case, Panel B the partial clustering, and Panel C the complete clustering. In the complete clustering the event days are the same in calendar time, while in the partial clustering the event days may or may not be the same in calendar time but the event windows are more or less overlapping. In the non-clustered case the event windows are completely separate in calendar time. In this case all event effects can be investigated utilizing cross-sectional independence assumption of returns. In complete clustering cross-sectional correlation of returns must be fully accounted for. In the partial case the correlation can bias the results depending on the degree of overlapping. For example in the case of Panel C if the interest is only on the event day effect, as all the event days are different, there is no biasing effect by the correlation. On the other hand, if cumulative return effect over the whole event window is of interest, correlation of returns on the overlapping affects the joint behavior of the cumulative returns.
Jaffe (
1974) is probably the first paper in event study testing to address the potential biasing effect of cross-sectional correlation due to clustered events. Table 2 of
Kolari and Pynnönen (
2010) explicitly addresses the issue by showing that already a virtually trivial cross-sectional correlation, such as 0.05, can severely bias testing for event effects towards material over-rejection. The present paper seeks to fill this gap of accounting for cross-sectional correlation in non-parametric even study testing also with partially clustered event days.
The paper is organized as follows.
Section 2 reviews some related key literature.
Section 3 defines the main concepts and derives some distributional properties of rank statistics.
Section 4 introduces the new transformed rank test.
Section 5 reports simulation results, and
Section 6 concludes.
2. Review of Related Literature
Patell and BMP parametric tests are straightforward tests of cumulative abnormal returns (CARs) over multiple day windows. With the correction suggested by
Kolari and Pynnönen (
2010), these tests are useful in the case of completely clustered event days, and with the correction suggested by
Kolari et al. (
2018) when event days are either completely or partially clustered. By construction, the
Corrado (
1989) non-parametric rank test applies for testing single day event returns. Testing for CARs with the same logic implies the need of defining multiple-day returns that match the number of days in the CARs, (see (
Corrado 1989, p. 395); (
Campbell and Wasley 1993, footnote 4)). In practice this approach is carried out by dividing the estimation period and event period into intervals matching the number of days in the CAR. Unfortunately, this procedure is not useful for a number of reasons. Foremost among these is that it does not necessarily lead to a unique testing procedure. In addition, the abnormal return model should be re-estimated for each multiple-day CAR definition. Furthermore, for a fixed estimation period, as the number of days accumulated in a CAR increases, the number of multiple-day estimation period observations reduces quickly impractically low and thus would weaken the abnormal return model estimation (c.f.,
Kolari and Pynnönen 2010).
Kolari and Pynnonen (
2011) solve these issues in their generalized rank test approach.
On the other hand,
Campbell and Wasley (
1993) recommend using the
Corrado (
1989) rank test to test cumulative abnormal returns by simply accumulating the respective ranks to constitute cumulative ranks (see also
Hagnäs and Pynnonen 2014). This practice is adopted in the Eventus
software
Cowan (
2007) and is probably, for the time being, the most popular procedure for multiple day applications of rank tests. An advantage is that this proceure implicitly accounts the cross-sectional correlation in the case of the complete clustering.
In spite of these attractive properties, the cumulative ranks test does not account for cross-sectional correlation due to calendar time partially overlapping event windows, i.e., the case of partial clustering. As referred above, even a small (positive) correlation biases the standard errors downwards leading to over-rejection of the null hypothesis of no event effect. Contributing to the event study literature, this paper proposes an adjustment for the standard errors that corrects the bias in non-parametric testing.
3. Distributional Properties of Ranks
We begin by fixing some notations and an underlying assumption to facilitate our theoretical discussion.
Assumption A1. Stock returns for firm i are weak white noise continuous random variables and are cross-sectionally independent over non-overlapping calendar days, or, It is a stylized fact that the variances of the returns are time varying and that there is mild autocorrelation. The time varying volatility problem can be partially captured in terms of GARCH-modeling. However, typical GARCH-processes satisfy Assumption A1.
Let
denote the abnormal return of security
i on day
t, and following commonly used notations (e.g.,
Brown and Warner 1985, p. 6), let day
indicate the event day. Days from
to
represent the estimation period relative to the event day, and days from
to
represent the event window. The cumulative abnormal return (CAR) from
to
with
, is defined as
The time period from
to
is called in the following as a CAR window or CAR period.
Standardized abnormal returns are defined as
where
Furthermore, for the purpose of accounting the possible event induced volatility, the re-standardized abnormal returns are defined as in
Boehmer et al. (
1991) (see also,
Corrado and Zivney 1992), or
where
is the time
t cross-sectional standard deviation of
,
, and
n is the number of stocks in the portfolio. In addition, let
denote the rank numbers of abnormal returns, where
,
,
, and
.
If the available observations in the estimation period vary from one series to another, it is convenient to use
standardized ranks with zero mean and unit variance. To do this, we compile the known results of rank statistics (e.g.,
Lehmann 2006, Appendix, Section 1) as described below.
Result 1. Let denote the rank numbers as defined above, then Definition 1. Standardized ranks are defined as By Result 1, we obtain:
Next, we define cumulative standardized ranks for individual stocks.
Definition 2. The cumulative standardized ranks of a stock i over the event days window form to are defined aswhere . From Result 2 and utilizing the variance-of-the-sum formula, , we obtain:
Result 3. where , , and . Rather than investigating individual (cumulative) returns, the practice in event studies is to aggregate individual returns into equally-weighted portfolios such that:
Definition 3. The average cumulative standardized ranks are defined as the equally weighted portfolio of individual cumulative standardized ranks defined in (
14)
, i.e.,where andis the time t average of standardized ranks. The expected value of
is the same as that of the cumulative ranks of individual securities, or
If the event days are not clustered the cross-sectional correlations of the return series are zero (or at least negligible). Under the cross-sectional independence and by Equation (
16), the variance of
is
Then by the central limit theorem
The situation is more complicated if the event days are partially overlapping in calendar time which implies cross-sectional correlation. Recalling that the variances of
given in Equation (
16) are constants (independent of
i), we can write the cross-sectional covariance of
and
as
where
is the cross-sectional correlation of
, and
,
. Utilizing this result and the variance-of-the-sum formula, the variance of
in (
17) becomes:
Result 4. whereis the average cross-sectional correlation of cumulated ranks. Cross-sectional dependence affects the asymptotic distribution of the statistic in Equation (
20). However, as discussed in (
Lehmann 1999, Scttion 2.8), it is frequently true that the asymptotic normality holds provided that the average correlation,
, tends to zero rapidly enough such that
where
is some finite constant. Under this condition the limiting distribution of
Z-statistic in (
20) becomes
.
Otherwise, from practical point of view, the crucial result of Formula (
22) is that the only unknown parameter to be estimated is the average cross-sectional correlation
.
Hagnäs and Pynnonen (
2014) discuss approaches to account implicitly for this average correlation in cumulated ranks tests when all events share the same calendar day, i.e., the case of complete clustering. These implicit approaches, however, do not work in the case of partial clustering. Therefore, by utilizing the procedure developed in
Kolari et al. (
2018), this paper proposes a method to estimate explicitly the cross-sectional correlation,
, and thereby solve the cross-sectional correlation problem in the case of the partial clustering.
4. Correlation Robust Test for Cumulated Ranks
Following
Kolari et al. (
2018), let
,
denote the number of calendar days stocks
i and
j share in common within the event windows. By independence in Assumption A1, the correlation,
, of the standardized ranks
and
is zero whenever the underlying calendar days of the relative event days,
u and
v, differ and can be non-zero when the calendar days are the same. Denoting these non-zero correlations (which are also covariances) by
, we get
Combining this with (
21), we obtain
We can assume that the overlapping window lengths,
, and the cross-sectional correlations,
, are not dependent on each other so that
, where
is the average number of overlapping calendar days, and
is the average cross-sectional correlation of
and
.
3 Consequently, we can rewrite (
22) as
where
adjusts the average correlation by the fraction of overlapping calendar days within the event window.
It is notable that, even though the average cross-sectional correlation,
, in Equation (
26) is based on
correlations, it can be computed without estimating individual correlations by utilizing the method introduced by
Edgerton and Toops (
1928). Instead of
individual correlations, it turns out that one needs to compute only
variances, which is a computational problem of order
n viz-a-viz of order
with averaging the correlations. To illustrate the idea, consider
n random variables
,
and define the standardized variables
. Next let
denote the average of the variables. Then because
and
, variance of
becomes
, we obtain
Hence, to estimate the average cross-sectional correlation, all we need are estimates of
n standard deviations of the
x-variables and the variance of
. Finally, for large
n, Equation (
27) shows that
.
Because in our case the calendar days of different stocks are only partially overlapping, we estimate the variance of the average utilizing the clustering robust estimation technique (e.g., see
Cameron et al. 2011) suggested in
Kolari et al. (
2018).
Following
Kolari et al. (
2018), denote the calendar days of the returns in the combined estimation and event window as
, which implies that
L becomes the number of clusters equaling the number of separate calendar days on which returns are available in the combined estimation and event windows. Let
denote the number of stocks having returns on calendar day
t and define
Then
so that
Summing these up over the calendar days in the combined estimation and event window, we have
Taking the average, we get an estimator for the average correlation
where
is the number of the cross-product terms. It is notable that days for which there is available only one return drop automatically out (if
for all
t, then
). The potentially tedious computation over all cross-products can be materially simplified by utilizing the right-hand- side of Equation (
31). By Result 2 the variances of standardized ranks are all equal to one and means equal zero. Therefore, arranging the terms of the rightmost sum of Equation (
31) to correspond to variance representations, the (double) sum becomes equal to
, i.e., the total number of observations.
4 Thus, the only component we need to compute is the first sum of squares on the right-hand-side of (
31). Therefore, similar to the illustration of computing the average correlation above, the computational effort of computing the average correlation is again of order
n (rather than
). Finally, we get:
Result 5. A computationally efficient form of the average correlation in (
32)
iswhere is the total number of returns, M is given by (
33)
, andwith given in Equation (
28)
. Variance, , is a clustering robust variance estimator of standardized ranks in the presence of intra-cluster correlation (cf. e.g., Cameron et al. 2011). As noted earlier, if all .
Given the estimator of the average cross-sectional correlation,
, we can define an appropriate cross-sectional correlation robust test for the null hypothesis of zero cumulative abnormal returns
The test can be defined in terms of the cumulated ranks using the
z-ratio
where
is the square root of Equation (
19), i.e., the variance
of
for completely non-overlapping event windows in calendar time [i.e., when
in Equation (
26)], and
is the length of the window of cumulated abnormal returns.
In event studies, the combined length, T, of the estimation and event period remains fixed, while the number of event firms, n, defines the sample size, thereby being the dimension increased when dealing with the asymptotic distribution of associated test statistics.
Given that the condition in Equation (
24) holds for
, the null distribution of
is asymptotically normal with zero mean and unit variance.
Kolari and Pynnonen (
2011) propose replacing the cumulative ranks in Definition 2 by a single rank number which is based on standardized cumulative abnormal returns (SCARs)
in which
is the standard deviation of
(for details, see
Kolari and Pynnonen 2011). Their approach again accounts implicitly for cross-sectional correlation due to completely overlapping event days. Here, we extend the approach to cover the partial overlapping case. Rather than using the scaled ranks defined in
Kolari and Pynnonen (
2011), we use the standardized ranks of Definition 1. Subsequently, denoting the standardized rank of
by
, we can base the rank test for testing the null hypothesis of zero cumulative abnormal returns in Equation (
36) on the average ranks
If the event periods are completely non-overlapping,
s are independent with zero mean and unit variance (see Definition 1), in which case the null distribution of
has zero mean and variance
. However, if the event days are partially overlapping, the components of
absorb the cross-sectional correlation over the CAR-window. The correlation that inflates the variance is inherited from the cross-sectional correlations of
s.
Kolari et al. (
2018) show that the variance inflation factor is of the form
as in Equation (
26) with the exception that
is replaced by
, the ratio of the average number of overlapping calendar days within the CAR-window to the window length. With this correction the variance of
becomes
. We can estimate the average cross-sectional correlation,
, as in Equation (
32) utilizing only the estimation period in computing
. For this approach, the standardized ranks in Definition 1 are redefined for the estimation period abnormal returns. Alternatively one can estimate the cross-sectional correlation exactly as in Result 5. Both approaches will produce essentially the same result in most cases. With the estimated average correlation, we get a cross-sectional correlation robust generalized rank test statistic
where
. Again, given that the condition in Equation (
24) holds for
, the null distribution of
is asymptotically normal with zero mean and unit variance.
5. Simulation Results
We generate artificial returns utilizing the
Fama and French (
2015) five-factor model (FF5),
where
is the market excess return over the risk-free rate
, SMB, HML, RMW, and CMW are common market factors proposed by Fama and French. We utilize daily data from 2 January 1990 through 30 October 2020 (7770 daily returns) to generate 20,000 initial daily return series for this sample period. The regression coefficients for each stock are generated from multivariate normal distribution with mean vector
and covariance matrix
, in which
is the variance of the error term
. The stock specific
values are generated by drawing
s, the standard deviations, independently from a uniform distribution
. This corresponds to a range of annual volatilities roughly from 10 percent to 48 percent. The
matrix is the cross-product matrix of the Fama-French 5-factor regression model.
5 The (7770) error terms
for stock
i is generated independently from the normal distribution
.
In the simulations we define the abnormal returns with respect to the market model as
where
and
are ordinary least squares (OLS) estimates. Therefore, missing common factors introduce cross-sectional correlation between the abnormal returns. The event period is
trading days around the event day
, and the estimation period consists of 250 days prior the event periods, i.e., relative days
.
In forthcoming experiments we focus on the effect of cross-sectional correlation on the size of the test. Other issues, such as event induced volatility are well documented for example by
Kolari and Pynnonen (
2011);
Kolari and Pynnönen (
2010). Utilizing the base design initiated by
Brown and Warner (
1985), we generate 1000 samples of randomly selected 50 stocks (the returns of which are generated by the FF5 model in Equation (
41)) with four over-lapping event days scenarios. In the first case of non-overlapping event days, the returns are cross-sectionally independent. In the second case of completely overlapping events, all firms share the same event day (calendar time), and in the third and fourth scenarios the event days are randomly scattered across 5 and 10 concecutive calendar days, i.e., one and two weeks of trading days, respectively.
We report two-tailed rejection rates for the null hypothesis of no event-effect across different event windows of
,
,
, and
around the event day, i.e., window lengths
, and 21 days. In addition to statistic
in Equation (
37) we report results for the more traditional rank based test proposed by (
Campbell and Wasley 1993, p. 85):
where
with
and
Furthermore, we report results for traditional parametric (cross-sectional correlation non-robust)
t-statistics popular in event studies (e.g., see (
Campbell et al. 1997, chp. 4)),
where
is the sample average of
defined in (
2), and
is the related standard error. Under independence, the null distribution of
is asymptotically standard normal.
Table 1 summarizes the test statistics and their major features.
Table 2 reports the simulation results of the two-tailed rejection rates of the null hypothesis of no abnormal return at the 5% nominal rejection rate. The results are clear-cut. Panel A of the table reports the non-overlapping case with zero cross-sectional correlation. As expected, all statistics reject close to the nominal rate. Panel B reports results of complete overlapping. That is, all events share the same calendar day; hence, returns are prone to cross-sectional correlation. The new
,
, and the more traditional cumulative ranks statistic,
, that account for cross-sectional correlation, reject reasonably close to the nominal rate up to event windows
and exhibit some over-rejection on the longest event window
, i.e., 21 days. Not surprisingly, the parametric, non-cross-correlation robust statistic,
, incrementally over-rejects as event windows increase in length. Panel C reports partial overlapping with events clustered randomly within 5 trading days (about a week). For event day testing also the a priori non-robust statistics perform well by rejecting at the nominal rate. However, they start to incrementally over-reject as the event window grows longer. The a priori partial overlapping robust statistics,
and
, reject close to the nominal rate up to the event window lengths of 5 days and over-reject to some extent for the longest event windows of 11 and 21 days, albeit far less than the non-robust statistics of
and
. The results are pretty much similar with the decreased overlapping in Panel D. Thus, we conclude that accounting for cross-sectional correlation is crucial to avoid biased inferences in statistical testing, not only due to complete overlapping of event windows, but also for partially overlapping cases. Regarding the latter, this paper has introduced two new test statistics that account for these cases.