1. Introduction
One key assumption behind the well known Black–Scholes (B–S) formula is a constant volatility function, which has been frequently challenged after the stock market crash in October 1987. The extent to which the market deviates from this assumption can be tested by examining the constancy of the option implied volatility, which is the empirically determined parameter that makes the B–S formula fit market prices of the options.
Option prices are characterized and empirically quoted based on “moneyness" values, defined as the ratio of strike price and the forward price for delivery at expiration, and time-to-expiration. As shown by
Aıt-Sahalia et al. (
2001) (ABS, henceforth), inverting the B–S formula makes the option implied volatility a nonparametric function of these two components. Moreover, ABS further shows that a semiparametric partially linear model permitting a flexible function of moneyness—widely known as the “volatility smile”—and a quadratic time effect is a statistically adequate depiction of the empirical option data.
The point of departure in this paper is the following. We note the partially linear specification above precludes the interactive effect between moneyness
and time-to-expiration
T. Consequently, it restricts the term structure of volatility to be constant across moneyness
1. Such a restriction seems controversial and has been rejected by several empirical studies. For instance,
Fengler (
2006) notes that as the time-to-expiration
T increases, the implied volatility for at-the-money (ATM) and out-of-the money (OTM) call options (i.e.,
) displays a increasing term structure, while the implied volatility for in-the-money (ITM) call options (
) displays a decreasing term structure
2. Extant studies have aimed at theoretically explaining these stylized facts using stochastic volatility (
Hull and White 1987;
Renault and Touzi 1996) and jump diffusion models (
Bates 1996;
Jorion 1988), there is an absence of prior work that formally tests whether the term structure of option implied volatility is constant across different levels of moneyness. This paper attempts to fill this void by developing a specification test for the partially linear structure in ABS against a semiparametric alternative that explicitly permits interaction effects.
This study relates to the econometric literature of non-/semiparametric specification test. Several studies test for a partially linear specification against a nonparametric alternative based on the unifying framework proposed by
Newey (
1985a,
1985b).
Fan and Li (
1996) proposed a conditional moment test based on the observation that the squared conditional moment of residual would equal to zero only under the null. They estimate the test statistic using kernels and derive its asymptotic distribution based on a
statistic argument. Recently,
Wang and Wang (
2019) developed a similar test with the k-nearest-neighbor method, which performs better than kernel methods when data are unevenly distributed.
Li and Wang (
1998) tests a parametric partially linear model against a semiparametric alternative based on bootstrap method.
Whang and Andrews (
1993) extends the framework of
Newey (
1985a,
1985b) by allowing for infinite dimensional parameters in the criteria function (e.g., test statistic).
A nonparametric alternative, while seems most natural in many contexts, cannot account for stylized facts and empirically-grounded restrictions in the underlying volatility process. For instance, a quadratic term structure has been widely recognized as a suitable statistical depiction of volatility data (
Aıt-Sahalia et al. 2003,
2001;
Ahn et al. 2002). To this end, we consider a semiparametric alternative in which the quadratic term structure can be imposed explicitly. This alternative specification not only permits a flexible interaction between the moneyness
and time-to-expiration
T but also effectively takes the quadratic term structure into account. As explained below, coefficients of the linear and quadratic terms can be exploited jointly to derive a Hausman-type test.
This study also adds to the existing vibrant literature on the term structure of implied volatility. See, for example,
Mixon (
2002) and
Egloff et al. (
2010). One conventional wisdom—known as the “expectation hypothesis" (
Campa and Chang 1995)—argues that the movement in long-term volatility should be consistent with the expected future short-volatility.
Byoun et al. (
2003), based on the data of foreign currency and S&P 500 stock index options, shows the empirical results on this hypothesis are mixed at best. More recently, volatility derivatives listed in major exchanges have also been levered to analyze the volatility term structure. See
Fassas and Siriopoulos (
2011);
Zhang et al. (
2010) and
Lu and Zhu (
2010) for studies based on VIX futures. Based on options data of S&P 500 index, this paper finds that option implied volatility displays a quadratic term structure for most years between 2012 and 2018.
The rest of this paper is organized as follow. In
Section 2, we describe the testing strategy and construct the Hausman statistic in a general econometric setting. We carry out some Monte Carlo experiments to study the finite sample property of the proposed test statistic in
Section 3. In
Section 4, we report the empirical results using traded option data of S&P 500 index after the 2007–2009 recent crisis. While the focus of this paper is on option implied volatility, the testing framework can be applied to other contexts where a semiparametric alternative is justified.
2. A Hausman-Type Specification Test
Consider a semi-parametric model where a scalar independent variable
Z enters the model nonparametrically and other regressors
X enter the model via an index form:
. The econometric contribution of this paper is to test for a partially linear specification, which precludes the interaction of
X and
Z, against a more general semi-nonparametric alternative. Specifically, let
be a vector of regressors and
, the null hypothesis is
and the alternative is
Enlightened by
Hausman (
1978), the idea is to search for an estimator
for
that is consistent and efficient only under the null and another estimator
that is consistent regardless of whether the partially linear structure in (
1) holds or not. If the data are indeed generated from the null DGP, then the two estimators should be close to each other in probability. See
Amini et al. (
2012) for a review on recent development on the Hausman test.
Assume that the underlying DGP contains a error term
U satisfies
. Under the null, e.g.,
, one candidate for
comes from the following “Robinson differencing” procedure
3,
Take conditional expectation of Z on the model above: ;
Subtract the above equation from the original model on both sides: ;
With and , estimate by ordinary least square: .
Robinson (
1988) shows that the final OLS estimator is root-N-consistent and asymptotically efficient. One candidate for
can be obtained by implementing the semiparametric least square (SLS) estimation on the alternative specification
.
Ichimura and Lee (
1991) develop a consistent estimator for
up to location and scale. Since the parameter
is nested in a bivariate unknown function
which needs to be estimated nonparametrically, the corresponding estimator has a bias and the variance converges to zero at a slower (than parametric) rate. To reduce bias, I employ a two-stage bias-corrected estimator proposed by
Klein and Shen (
2010) in which the conditional expectation
is estimated with a Gaussian kernel.
To ensure proper identification in the semiparametric bi-index model, we make the following assumptions on the vector X and the unknown functions H.
Assumption 1. The unknown function is differentiable and not constant on the joint support of and Z.
The smoothness condition in Assumption 1 is fairly standard. Clearly one cannot identify
up to any level if the unknown function has no variability. To broaden the applicability of the proposed test, we would like to allow some regressors in
X to be deterministically related to others (e.g,
). This extra degree of flexibility comes at a cost because usually the index parameters cannot be identified in this case
4. To regain identification, we make the following assumption,
Assumption 2. - 1.
The vector X has at least one continuous component whose regression coefficient is not zero. We rearrange regressors so that where components in can be either continuous or discrete.
- 2.
When some regressors within X are deterministically related to others, we require the vector X satisfies the regularity conditions formally stated in Assumption 4.2 of Ichimura and Lee (1991).
Ichimura and Lee (
1991) rigorously established the identification assumption based on the following notation. Let each
in
X, for
to be a function of the more fundamental regressors. That is,
is a known function from
into
R for each
where
is a vector of “underlying regressors”. In Assumption 4.2,
Ichimura and Lee (
1991) requires certain differentiability conditions on the functions
and excludes exact multicollinearity problem.
Considering the methods of
Robinson (
1988) and
Ichimura and Lee (
1991) are extremely well known, we will not report the detailed estimation procedure and direct interested readers to the original papers. The conditional expectations in both models are estimated via Gaussian kernels, with the bandwidth parameter chosen optimally based on the rule-of-thumb by
Silverman (
1986)
5. For the purpose of motivating the proposed test statistic, below we formally state the asymptotic distribution of these two estimators using the notation of this paper
6.
Proposition 1. Assume () are i.i.d and that g(·) satisfy certain differentiability and moment condition as in Robinson (1988). Under Assumption 1-2, it can be shown that for the aforementioned estimator ,where and is the variance of the regression error term U. Proposition 2. Assume that and . Let be the maximizer of the following quasi-likelihood function,in which is a trimming function on the basis of the (estimated) index with being the 1% and 99% sample quantile of 7. is a kernel-type conditional expectation estimator defined in D3 in Klein and Shen (2010). Let and denote the Gradient vector and Hessian matrix, respectively. Under Assumption 1-2,where . However,
and
are not directly comparable since
is an estimate of
up to location and scale. To conduct a feasible Hausman-type test, with
, we compute the “normalized parameter” in the partially linear model as
. From the multivariate delta method, the variance-covariance matrix of
is
where
is the
variance-covariance matrix of
derived in Proposition 1.
is the
gradient matrix. Taken together,
is
.
With the two estimators formulated above, a Hausman-type test statistic comes naturally:
where the appropriate scale matrix M is
Note also that the matrix
M is positive semi-definite because
is efficient under the null. The first component in
M,
can be estimated by
. The second component
can be estimated consistently using the aforementioned Delta-method. From Theorem 2.1 of
Hausman (
1978), under the null,
follows a
distribution. The power of this test will be evaluated below based on Monte Carlo experiment under a sequence of “local alternatives”.
3. Monte Carlo Experiments
In this section, we carry out some Monte Carlo experiments to study the finite sample properties of the test statistic. The DGP is given as:
Parameters are set to be unity: . The data are generated from , and . Some truncation is applied to ensure the X’s and Z are bounded. When , the model is partial linear. As increases, the model smoothly transforms from a series of local alternatives to a two-index model in which Z has interactive effect with .
In
Table 1 we report the Monte Carlo results from 500 replications with
. The parameter of interest here is the ratio of regression coefficients of
and
:
. In the first design with
, the Robinson differencing estimator
performs better than the two-index estimator
in terms of accuracy (MEAN of estimates with N = 2000: 1.003 vs. 1.045) and efficiency (Root-Mean-Square-Error (RMSE) with N = 2000: 0.006 vs. 0.008). As
deviates from zero, while the Robinson differencing estimator became inconsistent, the 2-index model still provides robust estimation of
. For example, when
and N = 1000, the mean of
is 0.863 while the mean of
is 0.997.
The testing results are reported in the lower panel of
Table 1. Under
(e.g.,
) with N = 2000, the rejection rate in 500 replication is 3.2% and 8.2%, given the theoretical size is 5% and 10%, respectively. When the true model deviates significantly from the partial linear model (
), the test has powers close to one in both samples. As the interaction term becomes smaller, that is, when the true model approaches a series of local alternatives, the power of the test decreases. For each alternative DGP with a different
, the power increases as the sample size increases.
4. Empirical Results
As motivated in the introduction, we study whether an option’s moneyness and time-to-maturity affect its implied volatility in an interactive fashion, e.g., do options across different moneyness value have different term structure? ABS addresses this question by testing whether the in-sample-fit of a partially linear model is statistically the same as an unrestricted, nonparametric alternative. We revisit this problem with a more recent dataset using the testing strategy developed in the previous section.
To be specific, we denote the price of a call option as
C, its strike price as
K, its time to expiration as
T, and the forward price for delivery at expiration as
F8. The B–S formula for a call option is
where
is the normal cumulative distribution function, and
for a value of the volatility parameter
constant across different “moneyness” values (defined as
) and time-to-expiration
T. For each option price
C, with characteristics (
), Equation (
8) can be inverted to produce the option’s implied volatility. This is the unique value of
that would make C(
) on the right-handside of (
8) equal to the observed market price of the option.
As shown by
Aıt-Sahalia et al. (
2001) (ABS, henceforth), inverting the B–S formula with respect to the volatility parameter would give to the following model for
,
where the unknown transformation
captures the dependency of
on
and
T and the error term
summarizes potential sources of noise, e.g., bid-ask spread. In practice, option prices (
C or
P) are quoted solely based on moneyness and time-to-maturity. As such, it suffices to only include
and
T at the right hand-side of (
10).
The empirical agenda of this paper is centered on testing the functional form of
in Equation (
10). Based on the testing framework in
Section 2, let
denotes the option implied volatility,
denotes the option’s moneyness, and
denotes the linear and quadractic term of time-to-expiration. We test
against a two-index model which permits the interaction between moneyness and time-to-maturity:
for a well-defined function
satisfying Assumption 1. Recall that the aforementioned identification by
Ichimura and Lee (
1991) allows some regressors to be deterministically related to others in the index
9. As such,
is identified.
The empirical sample consists of
N = 45,905 observations on S&P 500 index call options traded at the Chicago Board Options Exchange (CBOE) from September 2012 to December 2018. Data on traded options is retrieved from OptionMetrics at the Wharton Research Data Services (WRDS). The sample is restricted to European options with maturities from 1 to 9 months, which are the most actively traded options. We compute the option premium using the midpoint of bid and ask price, and solve for option implied volatility based on the B–S formula. Following the “Robinson differencing” and Semiparametric Least Square procedures, respectively, we obtain the following estimates
To illustrate the nonparametric components in these models, we plot
and
in
Figure 1 and
Figure 2, respectively. As can be seen from
Figure 2, the term structure of volatility is generally downward-sloping for options across different moneyness values, which provides evidence against the hypothesis that time-to-maturity and moneyness affects implied volatility in an interactive fashion. To reassure, we compute the test statistic
based on (
5), which turns out to be 0.433, leading to the acceptance of the partially linear model with a
p-value equals 0.51.
Our test result, in the bold part, is in accord with ABS. That is, options across moneyess have identical term structures. Specifically, volatility first decreases and then increases as a function of time-to-expiration. From (
13), holding moneyness constant, volatility reaches the lowest when time-to-expiration is about 17 month
10. Recall that the term structure of volatility captures the market’s expectation on the future volatility. If the term structure of CBOE index option is downward-sloping, it implies that investors expect to see the volatility (risk) of the market going down in the future.
So, why did options holders perceive the market risk to be decreasing as the time-to-expiration increases? One plausible (but speculative) explanation emerges after a closer look at the equity market condition in the period from September 2012 to December 2018. The S&P 500 index steadily grew since the second half of 2012 while the level of CBOE Volatility Index (VIX) was consistently low around 12–20. Given the expectation of a persistent rise in equity values, even the speculative OTM call options () can have intrinsic values when its time-to-maturities is “long enough”. As such, investors may be more concern about the short-run risk, making the implied volatility higher in shorter terms.
As a robustness check, we repeat the analysis for each year in the sampling period independently. Parameter estimates and test statistics are reported in
Table 2. With the
critical value being 3.84, we reject the partially linear model for the years of 2013 and 2018. In terms of the magnitude, the estimated test statistic for the year of 2018 is much greater than those from previous years. In March 2018, the US started a prolonged trade war with China, which overturned the steady growth of the S&P 500 index. As a consequence, the VIX index exceeds 30 in December 2018. Because of the tension between the two largest economies in the world, options holder become more concern about the long-run risk. This concern, in turns, is likely to make OTM call options to have an increasing term structure.