Next Article in Journal
On Applicability of Quantum Formalism to Model Decision Making: Can Cognitive Signaling Be Compatible with Quantum Theory?
Next Article in Special Issue
Feature Selection in High-Dimensional Models via EBIC with Energy Distance Correlation
Previous Article in Journal
Demonstration of the Holonomically Controlled Non-Abelian Geometric Phase in a Three-Qubit System of a Nitrogen Vacancy Center
Previous Article in Special Issue
Analysis of Longitudinal Binomial Data with Positive Association between the Number of Successes and the Number of Failures: An Application to Stock Instability Study
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Article

Testing the Intercept of a Balanced Predictive Regression Model

1
School of Statistics, Jiangxi University of Finance and Economics, Nanchang 330013, China
2
Key Laboratory of Data Science in Finance and Economics, Jiangxi University of Finance and Economics, Nanchang 330013, China
*
Author to whom correspondence should be addressed.
Entropy 2022, 24(11), 1594; https://doi.org/10.3390/e24111594
Submission received: 13 September 2022 / Revised: 29 October 2022 / Accepted: 31 October 2022 / Published: 2 November 2022
(This article belongs to the Special Issue Statistical Methods for Modeling High-Dimensional and Complex Data)

Abstract

:
Testing predictability is known to be an important issue for the balanced predictive regression model. Some unified testing statistics of desirable properties have been proposed, though their validity depends on a predefined assumption regarding whether or not an intercept term nevertheless exists. In fact, most financial data have endogenous or heteroscedasticity structure, and the existing intercept term test does not perform well in these cases. In this paper, we consider the testing for the intercept of the balanced predictive regression model. An empirical likelihood based testing statistic is developed, and its limit distribution is also derived under some mild conditions. We also provide some simulations and a real application to illustrate its merits in terms of both size and power properties.

1. Introduction

Prediction of some important variables based on the other available predictors is a practice of great interest in many applications in finance and economics. Various predictive regression models have been suggested in the last few decades. Among them, the simplest one is the following linear predictive regression model:
Y t = α + β X t 1 + U t X t = μ + ϕ X t 1 + e t , B ( L ) e t = V t ,
where α and β are unknown parameters, Y t denotes the response, and X t the predictor. In practice, Y t may denote the stock return, and X t may be some financial indicators. L i e t = e t i , B ( L ) = 1 ( i = 1 p b i L i ) , B ( L ) 0 with all of its roots are fixed and less than one in absolute value. { U t , V t } t = 1 n are independent and identically distributed (hereafter i.i.d.) random vectors with zero mean. For the predictor X t , μ denotes the intercept term, and ϕ is the autoregressive coefficient. It is known that the limit distribution of the least squares estimators of α and β differs from each other when { X t } follows: (i) | ϕ | < 1 (stationary), (ii) μ = 0 and ϕ = 1 + c / n for some constant c, (iii) μ 0 and ϕ = 1 + c / n for some constant c. When c = 0 , { X t } is a unit root AR(1) process.
Predictive regression models are widely used in financial research (see [1,2,3,4,5,6,7,8], and references therein). It is noted that, when the predictor { X t } is non-stationary, the response { Y t } can not be stationary. This is well known in the literature as the unbalanced problem for traditional predictive regression models. Due to the non-stationarity of most financial variables (see [9]), the problem of unbalance often appears in practical applications. To cover this problem, Ref. [10] proposed a balanced predictive regression (BPR) model by adding a difference term of X t to the conditional function part. In detail, it takes the form as follows:
Y t = α + β X t 1 + i = 1 p ψ i Δ X t i + U t X t = μ + ϕ X t 1 + e t , B ( L ) e t = V t ,
where Δ X t i = X t i X t i 1 , and ψ i ’s are unknown parameters needing to be estimated. Compared to the original predictive regression model (1), the existence of Δ X t i ’s in (2) can balance the unmatched problem between Y t and X t with β = 0 when { X t } is non-stationary.
Similar practices are also considered by [11], which proposes a balanced predictive regression model by adding an additional lag of the predictors to the mean function. Through simple transformation, this model is the same as that considered in this paper when p = 1 by noting that β X t 1 + ψ 1 Δ X t 1 = ( β + ψ 1 ) X t 1 + ( ψ 1 ) X t 2 . Both for predictive regression and BPR models, an importation issue of great interest is to test the predictability, i.e., testing the null hypothesis H 0 : β = 0 or H 0 : ψ 1 = = ψ p = 0 or H 0 : β = ψ 1 = = ψ p = 0 . This issue has received extensive attention in the literature. The conventional method for testing predictability is the so-called t-test. Unfortunately, the standard t-test suffers from the issue of obvious over-rejection or ineffective when predictors have strong persistence. See [12,13] for more details. Note that [14] investigated the estimation of predictive regressions when the explanatory variable is nearly integrated. However, the limiting distributions of these tests depend on which Case of (i)–(iii) the predictor X t follows. Some unified predictability tests have been developed in the last decade to avoid this problem. Among others, Ref. [15] considered the predictability test for model (1) when macroeconomic data are non-stationary or heavy-tailed. For the BPR model, Ref. [10] extended the method of [15] and proposed a unified predictability test regardless of X t being stationary, nearly integrated, or unit root.
However, two statistics were developed in [10] to test predictability, which depends on a predefined assumption on whether or not there exists an intercept term α . Although both testing statistics are shown to be chi-squared distributed asymptotically under mild conditions, their finite sample performances are quite different from each other because, when there is an intercept, their second testing statistic depends on the idea of data splitting in order to get rid of the effect of the existence of the intercept term α . Note that both tests are applicable when the underlying true model has no intercept. We found that the p-values of these two tests are different, which may cause confusion when different p-values are obtained based on the same data set.
Hence, it is necessary to do the predefined test to check whether or not there exists a non-zero intercept in model (2) before conducting a predictability test. This motivates us to consider the current research. We propose an empirical likelihood test based on the idea of data splitting and proved that the asymptotic distribution of the test statistic is χ 2 distribution, regardless of X t being stationary or nearly integrated or unit root. The empirical likelihood method was proposed by [16] and proved to have many excellent properties [17], including the distribution of the data that does not need to be assumed. Therefore, this method has been widely studied in the literature [18,19,20]. This also prompted this paper to propose a unified test of interception terms using the empirical likelihood method.
The rest of this article is organized as follows: Section 2 presents the methodologies and the main asymptotic results of our proposed test. Section 3 contains the finite sample simulation studies. An empirical application is discussed in Section 4. The detailed proofs of the main results are presented in Appendix A.

2. Methodology and Main Results

Suppose the random observations { X t , Y t } t = 1 n are generated from the model (2). The main interest of this paper is to develop a unified test for checking the intercept for model (2), i.e., the following hypothesis
H 0 : α = α 0 , versus H 1 : α α 0 .
Hereafter, denote θ 0 : = ( α 0 , β 0 , ψ 1 , 0 , , ψ p , 0 ) as the true value of θ : = ( α , β , ψ 1 , , ψ p ) without confusion.
Since the empirical likelihood method enjoys many desirable properties as discussed in the literature, in the sequel, we are interested in developing a testing method for H 0 based on this method. Note that, when θ = θ 0 , we have
E Y t α β X t 1 i = 1 p ψ i Δ X t i ξ t = 0 ,
where ξ t : = ( 1 , X t 1 , Δ X t 1 , , Δ X t p ) . Then, following [16], one may construct an empirical likelihood function by using the auxiliary vectors W t , where
W t ( θ ) : = Y t α β X t 1 i = 1 p ψ i Δ X t i ξ t .
Unfortunately, this test will suffer from the trouble of the quantity 1 n t = p + 1 n { D n 1 W t ( θ ) W t ( θ ) D n 1 } not converging in probability as n for Case (i), i.e., with μ = 0 and ϕ = 1 + c / n for some constant c, where D n = diag { 1 , n , 1 , , 1 } . Consequently, the quantity
1 n t = p + 1 n D n 1 W t ( θ ) 1 n t = p + 1 n D n 1 W t ( θ ) W t ( θ ) D n 1 1 1 n t = p + 1 n D n 1 W t ( θ )
does not converge in distribution to a chi-squared distributed variable, which in turn results in the related log-empirical likelihood function having a non-standard limit distribution.
As an improvement, although it is possible to construct a new empirical likelihood-based test statistic, which converges in a chi-squared distribution, for θ relying on the idea of data splitting as did in [21], the main interest of this paper is to test the intercept term, i.e., hypothesis H 0 , and we need to handle the redundant parameters β , ψ 1 , , ψ p by using the profile empirical likelihood method as in [17]. Once again, this profile method still does not work; see Theorem 3 in [21] for a detailed discussion.
This motivates us to consider the following testing procedure. Let ψ = ( ψ 1 , , ψ p ) , Z t ( α , β , ψ ) = ( Z t , 1 ( α , β , ψ ) , Z t , 2 ( α , β , ψ ) , , Z t , 2 + p ( α , β , ψ ) ) for t = p + 1 , 2 , , m , where m = [ n / 2 ] with [ · ] being the floor function, and
Z t , 1 ( α , β , ψ ) = Y t α β X t 1 i = 1 p ψ i Δ X t i Z t , 2 ( α , β , ψ ) = ( Y t + m α β X t + m 1 i = 1 p ψ i Δ X t + m i ) w t + m + δ ¯ t Z t , 2 + k ( α , β , ψ ) = ( Y t α β X t 1 i = 1 p ψ i Δ X t i ) Δ X t k , k = 1 , 2 , , p ,
with w t = X t 1 1 + X t 1 2 log ( e + | X t 1 | ) , and δ ¯ t = 1 N i = 1 N δ i , t with N = 2000 and δ i , t N ( 0 , σ ¯ 2 ) , which are independent of the random observations. Through data splitting technology, we use the first half of the data to estimate redundant parameters, and the second half of the data to construct a likelihood function to reduce the challenges posed by noise in the data. To balance the size and the power, in practice, one may take σ ¯ 2 = σ ^ U 2 / 2 with σ ^ U 2 being the estimated error variance. Note that here log ( e + | X t 1 | ) is designed in order to let the quantity 1 m t = p + 1 m { ( Y t + m α β X t + m 1 i = 1 p ψ i Δ X t + m i ) w t + m } to vanish when { X t } follows the non-stationary Cases (ii). A similar weighting technique has also been used in [22]. which uses the profile empirical likelihood to consider the intercept test of the predictive regression model.
Based on { Z t ( α , β , ψ ) } above, a new empirical likelihood function for θ can be defined as follows:
L ( α , β , ψ ) = sup t = p + 1 m ( m p t ) : p 1 0 , , p m 0 , t = p + 1 m p t = 1 , t = p + 1 m p t Z t ( α , β , ψ ) = 0 .
Since we are only interested in α , we further define a profile empirical likelihood function for α as
( α ) = sup ( β , ψ ) L ( α , β , ψ ) .
The following theorem shows that Wilks’ theorem holds for the above proposed profile empirical likelihood method, which depends on some regular conditions as follows:
(C1)
{ ( U t , e t ) } is a sequence of i.i.d. random vectors with mean zero, and E ( | U t | 2 + ϵ 0 ) + E ( | e t | 2 + ϵ 0 ) < for some arbitrarily small positive constant ϵ 0 ;
(C2)
The predictive variable X t , with the initial value of X 0 being constant or random variable of order o p ( m ) , belongs to one of the persistence classes following Cases (i)–(iii);
(C3)
All roots of 1 i = 1 p ψ i , 0 x i = 0 with respect to x are outside the unit circle.
Theorem 1.
Suppose that Conditions C1–C3 hold. Then, under the null hypothesis H 0 : α = α 0 , we have that 2 log ( α 0 ) converges in distribution to a chi-squared distributed variable χ 1 2 with one degree of freedom as n .
Based on the above theorem, we can reject H 0 once
2 log ( α 0 ) χ 1 2 ( 1 τ ) ,
at the significance level τ ( 0 , 1 ) , where χ 1 2 ( 1 τ ) denotes the ( 1 τ )-th quantile of a chi-squared distribution with one degree of freedom. In particular, when one is interested in testing H 0 : α = 0 , one may think that there is no intercept in model (2) if 2 log ( 0 ) χ 1 2 ( 1 τ ) at the significance level τ .
Note that, similar to [23], one may take w t = X t 1 ( 1 + X t 1 2 ) 0.75 to construct the test. Here, we however use an another weight w t = X t 1 1 + X t 1 2 log ( e + | X t 1 | ) in Z t , 2 ( α , β , ψ ) in order to increase the local power of the proposed test for the non-stationary cases. The following theorem states the power property of our proposed empirical likelihood-based test.
Theorem 2.
Suppose the same conditions of Theorem 1 hold. Then, we have as n that:
  • For Case (i), under the local alternative hypothesis H 1 : α = α 0 d 0 m for some constant d 0 R ,
    2 log ( α 0 ) d χ 1 2 ( ν 1 2 ) ,
    where ‘ d ’ denotes the convergence in distribution, and χ 1 2 ( ν 1 2 ) a non-central chi-squared distributed variable with non-central parameter ν 1 2 , where ν 1 is the second component of Σ 1 / 2 γ 1 with γ 1 = d 0 ( 1 , lim t E ( w t + m ) , μ + ( ϕ 1 ) lim t E ( X t 2 ) , , μ + ( ϕ 1 ) lim t E ( X t p 1 ) ) , and
    Σ = lim t E ( Z t ( α 0 , β 0 , ψ 0 ) Z t ( α 0 , β 0 , ψ 0 ) ) ;
  • For Case (ii), under the local alternative hypothesis H 1 : α = α 0 d 0 log ( m ) m for some constant d 0 ,
    2 log ( α 0 ) d ξ + 2 d 0 σ ¯ 1 2 s g n ( J c ( s ) ) d s 2 ,
    where sgn ( · ) denotes the sign function, ξ a standard normally distributed variable, which is independent of 1 2 s g n ( J c ( s ) ) d s , and J c ( s ) : = 0 s e ( s r ) c d W ( r ) with W ( r ) being a Gaussian process with covariance function ( s , t ) = 2 E ( e t 2 ) min ( s , t ) ;
  • ForCase (iii), under the local alternative hypothesis H 1 : α = α 0 d 0 log ( m ) m for some constant d 0 ,
    2 log ( α 0 ) d χ 1 2 ( ν 2 2 ) ,
    where ν 2 = d 2 1 2 s g n ( μ 0 s e ( s r ) c d r ) d s / σ ¯ .
Theorem 2 indicates that the local power of the proposed test for Cases (ii) and (iii) is of order O ( log ( m ) m ) , which does not achieve the normally parametric order O ( 1 / m ) but is better than O ( n a ) for any a ( 0 , 1 / 2 ) .
Remark 1.
Although we only consider in Theorems 1 and 2 the case that { U t } is a sequence of i.i.d. errors, it is possible to obtain similar results when U t follows a dependent process. In fact, when U t follows the AR(q) process, one can obtain similar limits as in Theorems 1 and 2 by taking into account the AR structure of U t .
Remark 2.
When e t follows a strictly stationary GARCH process or a linear process, Theorems 1 and 2 still hold.

3. Simulation Results

In this section, we investigate the finite sample performance of the proposed profile empirical likelihood test using simulated data sets and compare it with the traditional t-test. The random observations { X t , Y t } are generated from the predictive regression model (2), where α = 0 , β = { β 0 , β 1 } = { 0.5 , 0.3 } . μ { 0 , 0.01 } indicates that the model has no intercept item and intercept item, respectively. ϕ { 0.5 , 1 , 1 3 n } , where 0.5 indicates that X t is a stationary process, 1 indicates a unit root process, and 1 3 n indicates a near unit root process. The innovations { U t , V t } t = 1 n were considered in two cases.
Case (i): { U t , V t } t = 1 n are from a bivariate Gaussian Copula C ( F 1 ( U t ) , F 2 ( V t ) , ρ ) , whose marginal distribution F i is a Student’s t-distribution with degrees of freedom d f { 10 , 15 } . The dependence parameter ρ is set to be {−0.2, −0.4, −0.6}, respectively, to explore the performance of two innovations under different degrees of dependence.
Case (ii): { V t } t = 1 n are from GARCH(1,1) structure, i.e.,
V t = η v , t σ v , t , σ v , t 2 = ω v + a v V t 1 2 + b v σ v , t 1 2 .
where { ω v , a v , b v } = { 0.1 , 0.4 , 0.2 } . { U t } t = 1 n and { η v , t } t = 1 n are generated similarly to { U t , V t } t = 1 n in Case (i). All simulating results are repeatedly carried 10,000 times with the sample size n ranging from 200 to 1200.
Table 1 reports the size performance of the proposed method with different settings in Case (i) under the significance level τ = 0.05 . For d f = ( 10 , 10 ) , the results show that, when X t is a stationary or unit root process, regardless of whether μ is 0 or not, with the increase of sample n, size values are closer to τ , and the performance of the method is robust under different ρ . However, when X t is a near unit root process, it can be found that, when n is small, the method has the performance of oversize, especially when ρ = 0.6 , and as n increases, it gradually converges to the significance level. For the t-test when X t is a stable process, the t-test has a good size performance, and its performance is relatively stable even with high dependency. However, when X t is a unit root or near unit root process, it gradually shows over-rejecting with the increase of ρ , no matter whether there is an intercept or not. The d f = ( 15 , 15 ) behaves similarly to d f = ( 10 , 10 ) . Table 2 reports the size performance in Case (ii), in which the innovation of X t has a GARCH structure. The results show that the performances of the profile empirical likelihood and t-test are similar to that in Case (i).
We also generate data with α = d 0 n (stationary) or α = d 0 l o g ( n ) n (unit root or near the unit root), d 0 = { 0 , 1 , 2 , , 7 } , and other parameters are set the same as Case (i) and Case (ii), in order to investigate the power performance of the proposed method. The results are shown in Figure 1 ( μ = 0 ) and Figure 2 ( μ = 0.01 ). Here, we only show the case where n = 1000 . The abscissa represents the constant d 0 value, and the ordinate is the power value. Different colors represent different processes of X t and tests. We can see that, under all settings, Figure 1 and Figure 2 show the same performance. Firstly, as the value of d 0 increases, the power converges quickly to 1. Secondly, when X t is a near unit root, the convergence rate is better than the unit root, and the unit root is better than the stationary. Last but not least, when d 0 = 0 , it can be seen that the value of the t-test is significantly higher than the significance level of 0.05 under all settings. This also indicates the poor size performance of the t-test, even though it has a good convergence rate in power performance.
In summary, the proposed method has good size and power performance when checking whether intercept α exists and also has good robustness for different innovative distributions. The traditional t-test shows over-reject, which makes it easy to misjudge in real data analysis; therefore, the profile empirical likelihood method proposed in this paper is more recommended when real data are endogenous.

4. A Real Data Application

In this section, we illustrate our testing method on U.S. equity data, which has been explored by numerous scholars, such as [10,12,24], etc. We collected the data coming from the Center for Research in Security Prices (CRSP), and the sample period is January 1952 to December 2015 with monthly data ( n = 1068 ). Among them, the value-weighted excess returns of the S&P 500 index are used as the predicted variable Y t , and the other 10 financial variables are dividend–price ratio, dividend yield, earnings–price ratio, dividend payout ratio, book-to-market value ratio, T-bill rate, default yield spread, long-term yield, term spread, and net equity expansion, which are used as predicting variables, respectively.
We know that predictability is one of the critical concerns in various fields, including finance, climatology, tourism, sociology, and so on (see, e.g., [25,26,27]). However, as [10] shows, a small intercept can change the predictability dramatically when the predicting variable is nearly integrated. Hence, the intercept term test is an important pre-test before the statistical inference. On the other hand, Refs. [12,28] showed that the innovation of financial data generally contains endogeneity, which will bring bias to relevant estimates. Therefore, it is undoubtedly very vital to obtain a unified test for the intercept test, no matter whether the predictive variables are stationary, nearly integrated, unit root, or innovation have endogeneity.
Firstly, we use the a d f . t e s t function in the R package to test the stationarity of each predictor variable. Then, the least square estimation was used to fit the model (2), in which V t is estimated based on the V t = e t i = 1 12 b i e t i , in order to make V t uncorrelated. In addition, the correlation coefficient between U t and V t was tested. Before performing our test, the A r c h T e s t function was also used to test whether V t has a GARCH structure. Finally, the method proposed in this paper is used to test H 0 : α = 0 . Table 3 reports all the results for each predictor variable. The first column is the predictive variables. The second column is the estimators of intercept terms, and the third column is the estimators of the autoregressive coefficient of predictive variables. The fourth column is the p-values of the ADF test. The fifth column is the correlation coefficient between the two innovations U t and V t . The sixth column is the p-values of the GARCH test of V t sequence. The last two columns are the p-values of the test proposed in this paper and the traditional t-test.
From Table 3, we can see that the ϕ ^ of all variables are very close to 1, and the a d f . t e s t shows that only T-bill rate and long-term yield do not reject the null hypothesis of unit root, indicating that these variables are non-stationary. The results of the correlation coefficient show that U t and V t have different degrees of correlation under different predictive variables, and we have also carried out corresponding settings in the simulation. In addition, the results of the A r c h T e s t test show that V t under all predictor variables has a significant ARCH effect. The α ^ of ten variables obtained by the least square estimation are close to 0. The t-test showed that the intercept of Book-to-market, Default yield spread, and Term spread are 0. However, the results of the EL test are quite different from t-test, which shows that only the intercept of the Dividend payout ratio is not 0.
Overall, the t-test rejected the original hypothesis of alpha = 0 under most variables, which is consistent with the performance in the simulation. Notice that Ref. [12] mentioned that the t-test does not perform well when the predictor variable is persistent and its innovations are highly correlated. Furthermore, all the variables have a GARCH structure, which also will affect the t-test. In addition, Table 1 in [29] indicates the size distortion of the t-test for plausible parameter values. Hence, we recommend the EL test proposed in this paper over the traditional test.

5. Conclusions

Predictability testing is a common problem in the analysis of real data such as in the economic and financial fields. However, the validity of most existing tests depends on the assumption that the intercept term of the model exists. Therefore, the intercept term test is an important pre-test in practical application. However, it is worth noting that heteroscedasticity and endogeneity are common characteristics of financial data, which makes the existing intercept term test not perform well. In this paper, we develop a unified intercept test based on the empirical likelihood method under the balanced prediction regression model. The asymptotic distribution of the EL test is proved to be chi-squared, and the local power under stationary and non-stationary conditions is also proved. Simulation results verify the unified and effectiveness of the test and are superior to the t-test. Furthermore, in the empirical application, we performed an intercept term test on U.S. equity data. The results showed that the t-test rejected the null hypothesis in most variables, but the EL test shows the opposite. The relevant performance is consistent with the simulation. Therefore, when considering the intercept term test of financial data, the EL test is recommended.

Author Contributions

Conceptualization, X.L.; data curation, Y.F.; methodology, X.L.; investigation, Y.F.; writing—original draft preparation, Y.F., Q.W.; writing—review and editing, X.L., Y.F., Q.W., L.P.; visualization, Q.W.; supervision, L.P. All authors have read and agreed to the published version of the manuscript.

Funding

This research was funded by NNSF of China grant numbers (11971208, 11601197), NSSF of China grant number (21BTJ035), National Major Social Science Project of China grant number (21&ZD152), the 17th student research project of Jiangxi University of Finance and Economics grant number (20220929165358818).

Institutional Review Board Statement

Not applicable.

Data Availability Statement

The data used in applications can be found in the Center for Research in Security Prices.

Conflicts of Interest

The authors declare no conflict of interest.

Appendix A. Detailed Proofs of the Main Results

In this appendix, we provide detailed proofs for the main results presented in Section 2. Before proceeding further, we specify some useful lemmas as follows:
Lemma A1.
Under the same conditions of Theorem 1, we have as n that:
(a)
max p + 1 t m sup β ψ B Z t ( α 0 , β , ψ ) = o p ( m ) ,
(b)
1 m t = p + 1 m Z t ( α 0 , β , ψ ) = 1 m t = p + 1 m Z t ( α 0 , β 0 , ψ 0 ) + O p ( 1 ) holds uniformly for β ψ B ,
(c)
1 m t = p + 1 m { Z t ( α 0 , β , ψ ) Z t ( α 0 , β , ψ ) } = 1 m t = p + 1 m { Z t ( α 0 , β 0 , ψ 0 ) Z t ( α 0 , β 0 , ψ 0 ) } + o p ( 1 ) holds uniformly for β ψ B ,
where B = { β ψ : m ( k 1 ) / 2 | β β 0 | + | ψ ψ 0 | C m } for some constant C > 0 , and k = 1 , 2 , 3 correspond toCases (i)–(iii), respectively.
Proof of Lemma A1.
In the sequel, we only prove Case (ii), as the rest of the cases can be proved similarly. We first prove Part (a). Note that, by [30], we have for any s ( 0 , 2 ] that
1 m X [ m s ] D J c ( s ) : = 0 s e ( s r ) c d W ( r ) ,
for Case (ii) as n , where ‘ D ’ stands for the convergence in space, and D ( 0 , 1 ] is the space of real-valued functions on the interval ( 0 , 1 ] that are right continuous and have finite left limits. Since
max p + 1 t m sup β ψ B Z t ( α 0 , β , ψ ) k = 1 p + 2 max p + 1 t m sup β ψ B | Z t , k ( α 0 , β , ψ ) | ,
we need only to check the upper bound for each component Z t , k ( α 0 , β , ψ ) .
For the first component, rewrite Z t , 1 ( α 0 , β , ψ ) = ε t m ( β β 0 ) 1 m X t 1 i = 1 p ( ψ i ψ i 0 ) Δ X t i . Trivially, based on Conditions C1–C2, we have max p + 1 t 2 m | ε t | = o p ( m ) . Next, using (A1), it is easy to check for any ϵ > 0 that
P max p + 1 t 2 m 1 m X t 1 ϵ m 2 ϵ 2 + δ m δ 1 2 m E t = 1 2 m 1 m X t 1 2 + δ 2 ϵ 2 + δ m δ E 0 2 | J c ( s ) | 2 + δ d s 0 ,
which implies max 1 t 2 m | 1 m X t 1 | = o p ( m ) , and in turn max 1 t 2 m | ( β β 0 ) X t 1 | = o p ( 1 ) uniformly for β ψ B . Using this, by Δ X t = X t X t 1 = c n X t 1 + ε t , we have as n that
max 1 t 2 m | Δ X t | c m n max 1 t 2 m | 1 m X t | + max 1 t 2 m | ε t | = max 1 t 2 m | ε t | · { 1 + o p ( 1 ) } ,
and hence
max p + 1 t 2 m i = 1 p ( ψ i ψ i 0 ) Δ X t i = o p ( 1 ) ,
uniformly for β ψ B . This shows
max p + 1 t m sup B | Z t , 1 ( α 0 , β , ψ ) | max 1 t m | ε t | · { 1 + o p ( 1 ) } .
Similarly, we can show that max p + 1 t m sup B | Z t , l ( α 0 , β , ψ ) | max 1 t m | ε t | · { 1 + o p ( 1 ) } , for l = 2 , , p + 2 . This shows Part (a).
For Part (b), note that
Z t ( α 0 , β , ψ ) = Z t ( α 0 , β 0 , ψ 0 ) ( β β 0 ) X ˜ t 1 i = 1 p ( ψ i ψ i , 0 ) X ˜ t i * ,
where X ˜ t 1 = ( X t 1 , X t + m 1 w t + m , X t 1 , , X t 1 ) , and X ˜ t i * = ( Δ X t i , Δ X t + m i w t + m , Δ X t i Δ X t 1 , , Δ X t i Δ X t p ) . Then, by using a similar proof to that of Part (a), we have
m ( β β 0 ) · 1 m t = 1 m 1 m X ˜ t 1 = O p ( 1 ) , and i = 1 p { m ( ψ i ψ i , 0 ) · 1 m t = 1 m X ˜ t i * } = O p ( 1 ) ,
uniformly for β ψ B as n .
The proof of Part (c) is similar to that of Part (b) based on (A2). We omit the details. This completes the proof of Lemma A1. □
Lemma A2.
Under the same conditions of Theorem 1, we have as n that:
(a)
1 m t = p + 1 m Z t ( α 0 , β 0 , ψ 0 ) d N ( 0 , Σ ) , and
(b)
1 m t = p + 1 m { Z t ( α 0 , β 0 , ψ 0 ) Z t ( α 0 , β 0 , ψ 0 ) } p Σ ,
where ‘ p ’ denotes the convergence in probability. ForCase (i), Σ is specified in Theorem 2, andCases (ii) and(iii),
Σ = E ( U 1 2 ) σ ¯ 2 Γ ˜ w i t h Γ ˜ = E ( U 1 2 ) μ 2 + lim t E ( e t 1 2 ) μ 2 + lim t E ( e t p 2 ) .
Note that μ = 0 for Case (ii).
Proof of Lemma A2.
We only prove Part (a), as Part (b) follows a similar fashion. Put
Z t ( α 0 , β 0 , ψ 0 ) = ( U t , U t w t + δ ¯ t , U t Δ X t 1 , , U t Δ X t p ) .
Denote F t as the sigma field generated by { ( U s , e s ) : 1 s t , m + 1 s t + m } . Then, it is easy to check that { Z t ( α 0 , β 0 , ψ 0 ) } is a Martingale difference array with respect to the filtration F t .
For Case (i), noting that { X t } is stationary, the normality of 1 m t = p + 1 m Z t ( α 0 , β 0 , ψ 0 ) follows immediately under Conditions C1–C3 by the center limit theory for Martingale differences of [31] and the Cramér–Wold device.
For Case (ii), trivially, we have
1 m t = p + 1 m Z t , 1 ( α 0 , β 0 , ψ 0 ) d N ( 0 , E ( U 1 2 ) ) , as n .
Next, note that | X t | p , which results in 1 m t = 1 m w t 2 p 0 as n . Hence,
1 m t = p + 1 m Z t , 2 ( α 0 , β 0 , ψ 0 ) = 1 m t = p + 1 m U t + 1 w t + m 1 + 1 m t = p + 1 m δ ¯ t = 1 m t = p + 1 m δ ¯ t + o p ( 1 ) ,
which implies 1 m t = p + 1 m Z t , 2 ( α 0 , β 0 , ψ 0 ) d N ( 0 , σ ¯ 2 ) .
For 1 m t = p + 1 m Z t , 2 + i ( α 0 , β 0 , ψ 0 ) , k = 1 , 2 , , p , by using a similar proof to [30], we have
1 m t = p + 1 m E ( Z t , 2 + i 2 ( α 0 , β 0 , ψ 0 ) | F t 1 ) = E ( U 1 2 ) 1 m t = p + 1 m ( Δ X t i ) 2 = E ( U 1 2 ) 1 m t = p + 1 m c n X t i 1 + e t i 2 = E ( U 1 2 ) c 2 n 1 m t = p + 1 m 1 n X t i 1 2 + 2 c n 1 m t = p + 1 m X t i 1 e t i n + 1 m t = p + 1 m e t i 2 = E ( U 1 2 ) c 2 n 0 1 J c 2 ( s ) d s + 2 c n 1 m 0 1 J c ( s ) d W ( s ) + lim t E ( ε t i 2 ) = E ( U 1 2 ) · lim t E ( ε t i 2 ) · { 1 + o p ( 1 ) } , as n .
and, for arbitrarily small d 1 , d 2 ( 0 , ϵ 0 ) ,
1 m t = p + 1 m E ( | Z t , 2 + i ( α 0 , β 0 , ψ 0 ) | 2 I ( | Z t , 2 + i ( α 0 , β 0 , ψ 0 ) | d 1 m ) | F t 1 ) 1 d 1 d 2 m d 2 / 2 1 m t = p + 1 m E ( | Z t , 2 + i ( α 0 , β 0 , ψ 0 ) | 2 + d 2 | F t 1 ) = E ( | U 1 | 2 + d 2 ) d 1 d 2 m d 2 / 2 1 m t = p + 1 m c n X t i 1 + e t i 2 + d 2 E ( | U 1 | 2 + d 2 ) 2 1 + d 2 d 1 d 2 m d 2 / 2 1 m t = p + 1 m c n X t i 1 2 + d 2 + 1 m t = p + 1 m | e t i | 2 + d 2 = E ( | U 1 | 2 + d 2 ) 2 1 + d 2 d 1 d 2 m d 2 / 2 1 m t = p + 1 m | e t i | 2 + d 2 · { 1 + o p ( 1 ) } p 0 , as n .
Based on this and (A3), a simple application of the center limit theory for Martingale differences of [31] leads to the fact that, as n ,
1 m t = p + 1 m Z t , 2 + i ( α 0 , β 0 , ψ 0 ) d N ( 0 , E ( U 1 2 ) · lim t E ( ε t i 2 ) ) , k = 1 , 2 , , p .
By noting that { δ ¯ t } is independent of the random observations, we can show that
1 m t = p + 1 m E ( Z t , 2 ( α 0 , β 0 , ψ 0 ) Z t , j ( α 0 , β 0 , ψ 0 ) | F t 1 ) p 0 ,
for j 2 . Similar to (A3), it is easy to check that
1 m t = p + 1 m E ( Z t , i ( α 0 , β 0 , ψ 0 ) Z t , j ( α 0 , β 0 , ψ 0 ) | F t 1 ) p 0 ,
for i , j 2 as n . Hence, Part (a) follows immediately by using the Cramér–Wold device. This completes the proof of this lemma. □
Proof of Theorem 1.
We only prove Case (ii) as Cases (i) and (iii) are similar. For convenience, denote
h ( α 0 , β , ψ , x ) = 1 m t = p + 1 m Z t ( α 0 , β , ψ ) 1 + x Z t ( α 0 , β , ψ ) , and g t ( α 0 , β , ψ ) = λ Z t ( α 0 , β , ψ )
with λ being the solution to h ( α 0 , β , ψ , x ) = 0 with respect to x for fixed β , ψ .
The following proof is similar to that of Theorem 1 in [32]. Let λ = ϱ v with ϱ = λ . Then, similar to [33], we have p t = 1 m 1 1 + λ Z t ( α 0 , β , ψ ) , t = p + 1 , , m , by using the Lagrange multipliers method. Note that
0 = | λ h ( α 0 , β , ψ , λ ) | = 1 m t = p + 1 m g t ( α 0 , β , ψ ) 1 m t = p + 1 m g t 2 ( α 0 , β , ψ ) 1 + g t ( α 0 , β , ψ ) 1 m t = p + 1 m g t 2 ( α 0 , β , ψ ) 1 + g t ( α 0 , β , ψ ) 1 m t = p + 1 m g t ( α 0 , β , ψ ) 1 m t = p + 1 m g t 2 ( α 0 , β , ψ ) 1 + max p + 1 t m g t ( α 0 , β , ψ ) 1 m t = p + 1 m g t ( α 0 , β , ψ ) .
Then, by Lemmas 1 and 2, we have
λ 2 v 1 m t = p + 1 m Z t ( α 0 , β , ψ ) Z t ( α 0 , β , ψ ) v 1 + max p + 1 t m g t ( α 0 , β , ψ ) 1 m t = p + 1 m g t ( α 0 , β , ψ ) λ 1 m t = p + 1 n Z t ( α 0 , β 0 , ψ 0 ) + λ 1 m t = p + 1 m ( Z t ( α 0 , β , ψ ) Z t ( α 0 , β 0 , ψ 0 ) ) = λ × O p ( 1 / n ) ,
uniformly for β ψ B . Hence,
λ = O p ( 1 / m ) ,
uniformly for β ψ B by noting that Σ is positive definite. This, together with Part (a) of Lemma A1, shows
max p + 1 t m | g t ( α 0 , β , ψ ) | λ × max p + 1 t m Z t ( α 0 , β , ψ ) = O p ( 1 / m ) o p ( m ) = o p ( 1 ) ,
uniformly for β ψ B . In turn,
1 m t = p + 1 m Z t ( α 0 , β , ψ ) g t 2 ( α 0 , β , ψ ) 1 + g t ( α 0 , β , ψ ) λ · a max · 2 max p + 1 t m g t ( α 0 , β , ψ ) = o p ( 1 / m ) ,
where a max denotes the maximum eigenvalue of Σ .
Next, by Lemma 1, we have
0 = h ( α 0 , β , ψ , λ ) = 1 m t = p + 1 m Z t ( α 0 , β , ψ ) 1 g t 2 ( α 0 , β , ψ ) + g t 2 ( α 0 , β , ψ ) 1 + g t ( α 0 , β , ψ ) = 1 m t = p + 1 m Z t ( α 0 , β , ψ ) ( 1 g t ( α 0 , β , ψ ) ) + 1 m t = p + 1 m Z t ( α 0 , β , ψ ) g t 2 ( α 0 , β , ψ ) 1 + g t ( α 0 , β , ψ ) = 1 m t = p + 1 m Z t ( α 0 , β , ψ ) ( 1 Z t ( α 0 , β , ψ ) λ ) + o p ( 1 / m ) = 1 m t = p + 1 m Z t ( α 0 , β , ψ ) Σ λ + o p ( 1 / m ) ,
uniformly for β ψ B . Then, we have
λ = Σ 1 1 m t = p + 1 m Z t ( α 0 , β , ψ ) + o p ( 1 / m )
uniformly for β ψ B as n .
As did in the proof of Theorem 1 in [32], we can show by the Taylor expansion that
sup B 2 log L ( α 0 , β , ψ ) ξ n ( β , ψ ) Σ 1 ξ n ( β , ψ ) = o p ( 1 ) , as n ,
where ξ n ( β , ψ ) = 1 m t = p + 1 m Z t ( α 0 , β , ψ ) , based on (A4) and Lemmas 1 and 2. Using this, we have that the minimizer ( β ^ , ψ ^ ) of 2 log L ( α 0 , β , ψ ) + 2 log L ( α 0 , β 0 , ψ 0 ) must lie in B . Next, for any β ψ B , write m ( β β 0 ) ψ ψ 0 = y m with y = y 1 y 2 and y < C for some constant C given in B . Then, by (A2), a similar proof to that of Part (a) in Lemma (A1) leads to the fact that
1 m t = p + 1 m ( Z t ( α 0 , β , ψ ) Z t ( α 0 , β 0 , ψ 0 ) ) = 1 m t = p + 1 m 1 m X ˜ t 1 , X ˜ t 1 * , , X ˜ t p * m ( β β 0 ) m ( ψ ψ 0 ) + o p ( 1 ) = lim t 0 1 J c ( s ) d s 0 0 0 0 0 0 0 0 E ( e t 1 2 ) 0 0 0 0 E ( e t 2 2 ) 0 0 0 0 E ( e t p 2 ) · m ( β β 0 ) m ( ψ ψ 0 ) + o p ( 1 ) : = A y + o p ( 1 ) ,
where X ˜ t 1 , X ˜ t i * , i = 1 , 2 , , p , are defined in (A2). Hence, we have
φ ( y ) : = 2 log L α 0 , β 0 + y 1 m , ψ 0 + y 2 m + 2 log L ( α 0 , β 0 , ψ 0 ) = ξ n β 0 + y 1 m , ψ 0 + y 2 m Σ 1 ξ n β 0 + y 1 m , ψ 0 + y 2 m ξ n ( β 0 , ψ 0 ) Σ 1 ξ n ( β 0 , ψ 0 ) + o p ( 1 ) = y A Σ 1 A y + 2 y A Σ 1 ξ n ( β 0 , ψ 0 ) + o p ( 1 ) ,
uniformly for β ψ B . Then, the minimizer of φ ( y ) satisfies
y ^ = A Σ 1 A 1 A Σ 1 ξ n ( β 0 , ψ 0 ) + o p ( 1 ) .
Using this, we further have
2 log L α 0 , β 0 + y ^ 1 m , ψ 0 + y ^ 2 m = ξ n ( β 0 , ψ 0 ) Σ 1 Σ 1 A A Σ 1 A 1 A Σ 1 ξ n ( β 0 , ψ 0 ) + o p ( 1 ) = ξ n ( β 0 , ψ 0 ) ( diag { 0 , σ ¯ 2 , 0 , , 0 } ) 1 ξ n ( β 0 , ψ 0 } + o p ( 1 ) = σ ¯ 2 1 m t = 1 m Z t , 2 ( α 0 , β 0 , ψ 0 ) 2 + o p ( 1 ) d χ 1 2 ,
based on Lemmas 1 and 2 as n . This completes the proof of this Theorem. □
Proof of Theorem 2.
We only show Case (i) as the other two cases can be proved in a similar fashion. Under the local alternative hypothesis H 1 , we have
Z t ( α 0 , β , ψ ) = Z t ( α 0 C 0 m , β , ψ ) + C 0 m X ˘ t 1 ,
where X ˘ t 1 = ( 1 , w t + m , Δ X t 1 , , Δ X t p ) . Then, it is easy to show as n that
1 m t = 1 m Z t ( α 0 , β , ψ ) = 1 m t = 1 m Z t ( α 0 C 0 m , β , ψ ) + C 0 m 1 m t = 1 m X ˘ t 1 = 1 m t = 1 m Z t ( α 0 C 0 m , β , ψ ) + C 0 γ 1 + o p ( 1 ) ,
and
1 m t = 1 m Z t ( α 0 , β , ψ ) Z t ( α 0 , β , ψ ) = 1 m t = 1 m Z t ( α 0 C 0 m , β , ψ ) + C 0 m X ˘ t 1 Z t ( α 0 C 0 m , β , ψ ) + C 0 m X ˘ t 1 = 1 m t = 1 m Z t ( α 0 C 0 m , β , ψ ) Z t ( α 0 C 0 m , β , ψ ) + o p ( 1 ) .
Based on the quantities 1 m t = 1 m Z t ( α 0 C 0 m , β , ψ ) and 1 m t = 1 m Z t ( α 0 C 0 m , β , ψ ) Z t ( α 0 C 0 m , β , ψ ) given in (A6) and (A7), respectively, the rest of the proof is similar to that of Lemmas 1 and 2 and Theorem 1. We omit the details. □

References

  1. Amihud, Y. Illiquidity and stock returns cross-section and time-series effects. J. Financ. Mark. 2002, 5, 31–56. [Google Scholar] [CrossRef] [Green Version]
  2. Baker, M.; Stein, J.C. Market liquidity as a sentiment indicator. J. Financ. Mark. 2004, 7, 271–299. [Google Scholar] [CrossRef] [Green Version]
  3. Campbell, J.Y.; Shiller, R.J. The dividend-price ratio and expectations of future dividends and discount factors. Rev. Financ. Stud. 1988, 1, 195–228. [Google Scholar] [CrossRef] [Green Version]
  4. Kaul, G. Predictable components in stock returns. Handb. Stat. 1996, 14, 269–296. [Google Scholar]
  5. Keim, D.B.; Stambaugh, R.F. Predicting returns in the stock and bond markets. J. Financ. Econ. 1986, 17, 357–390. [Google Scholar] [CrossRef] [Green Version]
  6. Lewellen, J. Predicting returns with financial ratios. J. Financ. Econ. 2004, 74, 209–235. [Google Scholar] [CrossRef]
  7. Fama, E.F.; French, K.R. Business conditions and expected returns on stocks and bonds. J. Financ. Econ. 1989, 25, 23–49. [Google Scholar] [CrossRef]
  8. Zhu, F.; Liu, M.; Ling, S.; Cai, Z. Testing for structural change of predictive regression model to threshold predictive regression model. J. Bus. Econ. Stat. 2022. [Google Scholar] [CrossRef]
  9. Cai, Z.; Jing, B.; Kong, X.; Liu, Z. Nonparametric regression with nearly integrated regressors under long-run dependence. Econom. J. 2017, 20, 118–138. [Google Scholar] [CrossRef]
  10. Liu, X.; Yang, B.; Cai, Z.; Peng, L. A unified test for predictability of asset returns regardless of properties of predicting variables. J. Econom. 2019, 208, 141–159. [Google Scholar] [CrossRef]
  11. Ren, Y.; Tu, Y.; Yi, Y. Balanced predictive regressions. J. Empir. Financ. 2019, 54, 118–142. [Google Scholar] [CrossRef]
  12. Campbell, J.Y.; Yogo, M. Efficient tests of stock return predictability. J. Financ. Econ. 2006, 81, 27–60. [Google Scholar] [CrossRef] [Green Version]
  13. Lanne, M. Testing the predictability of stock returns. Rev. Econ. Stat. 2002, 84, 407–415. [Google Scholar] [CrossRef]
  14. Torous, W.; Valkanov, R.; Yan, S. On predicting stock returns with nearly integrated explanatory variables. J. Bus. 2004, 77, 937–966. [Google Scholar] [CrossRef]
  15. Zhu, F.; Cai, Z.; Peng, L. Predictive regressions for macroeconomic data. Ann. Appl. Stat. 2014, 8, 577–594. [Google Scholar] [CrossRef] [Green Version]
  16. Owen, A.B. Empirical likelihood ratio confidence intervals for a single functional. Biometrika 1988, 75, 237–249. [Google Scholar] [CrossRef]
  17. Qin, J.; Lawless, J. Empirical likelihood and general estimating equations. Ann. Stat. 1994, 22, 300–325. [Google Scholar] [CrossRef]
  18. Chen, S.X.; Peng, L.; Qin, Y.L. Effects of data dimension on empirical likelihood. Biometrika 2009, 96, 711–722. [Google Scholar] [CrossRef]
  19. Chen, S.X.; Van Keilegom, I. A review on empirical likelihood methods for regression. Test 2009, 18, 415–447. [Google Scholar] [CrossRef] [Green Version]
  20. Kitamura, Y. Empirical likelihood methods with weakly dependent processes. Ann. Stat. 1997, 25, 2084–2102. [Google Scholar] [CrossRef]
  21. Liu, X.; Liu, Y.; Lu, F. Empirical likelihood-based unified confidence region for a predictive regression model. Commun. Stat.-Simul. Comput. 2022, 51, 2122–2139. [Google Scholar] [CrossRef]
  22. Liu, X.; Liu, Y.; Rao, Y.; Lu, F. A Unified test for the Intercept of a Predictive Regression Model. Oxf. Bull. Econ. Stat. 2021, 83, 571–588. [Google Scholar] [CrossRef]
  23. Li, C.; Li, D.; Peng, L. Uniform test for predictive regression with AR errors. J. Bus. Econ. Stat. 2017, 35, 29–39. [Google Scholar] [CrossRef]
  24. Kostakis, A.; Magdalinos, T.; Stamatogiannis, M.P. Robust econometric inference for stock return predictability. Rev. Financ. Stud. 2015, 28, 1506–1553. [Google Scholar] [CrossRef] [Green Version]
  25. Klaussner, C.; Vogel, C. Temporal predictive regression models for linguistic style analysis. J. Lang. Model. 2018, 6, 175–222. [Google Scholar] [CrossRef] [Green Version]
  26. Pesaran, M.H.; Timmermann, A. Predictability of stock returns: Robustness and economic significance. J. Financ. 1995, 50, 1201–1228. [Google Scholar] [CrossRef]
  27. Ashby, S.A.; Taylor, M.A.; Chen, A.A. Statistical models for predicting rainfall in the Caribbean. Theor. Appl. Climatol. 2005, 82, 65–80. [Google Scholar] [CrossRef]
  28. Cai, Z.W.; Wang, Y.F. Testing predictive regression models with non-stationary regressors. J. Econom. 2014, 178, 4–14. [Google Scholar] [CrossRef]
  29. Elliott, G.; Stock, J.H. Inference in time series regression when the order of integration of a regressor is unknown. Econom. Theory 1994, 10, 672–700. [Google Scholar] [CrossRef] [Green Version]
  30. Phillips, P.C.B. Towards a unified asymptotic theory for autoregression. Biometrika 1987, 74, 535–547. [Google Scholar] [CrossRef]
  31. Hall, P.; Heyde, C.C. Martingale Limit Theory and Its Application; Academic Press: Boca Raton, FL, USA, 2014. [Google Scholar]
  32. Ma, Y.; Zhou, M.; Peng, L.; Zhang, R. Test for zero median of errors in an arma–garch model. Econom. Theory 2022, 38, 536–561. [Google Scholar] [CrossRef]
  33. Owen, A.B. Empirical Likelihood; Chapman and Hall/CRC: Boca Raton, FL, USA, 2001. [Google Scholar]
Figure 1. The power performance with μ = 0 .
Figure 1. The power performance with μ = 0 .
Entropy 24 01594 g001
Figure 2. The power performance with μ = 0.01 .
Figure 2. The power performance with μ = 0.01 .
Entropy 24 01594 g002
Table 1. The size performance of { U t , V t } t = 1 n under Case (i) and τ = 0.05 .
Table 1. The size performance of { U t , V t } t = 1 n under Case (i) and τ = 0.05 .
ELt-Test
ϕ μ ρ 2004008001000120020040080010001200
d f = ( 10 , 10 ) 0.50−0.20.04770.04610.05140.04970.05210.04850.05250.05360.05090.0494
−0.40.04300.04530.04860.05140.04530.05050.04930.05100.05160.0518
−0.60.04170.04420.04950.04760.04250.04960.05210.05160.05060.0505
0.01−0.20.04760.04880.05010.05150.05030.04710.04740.04950.05250.0481
−0.40.04480.04580.04770.04620.04660.05230.05360.05260.05140.0498
−0.60.04260.04340.04420.04910.04810.05600.04990.05170.05060.0519
10−0.20.05260.05110.04830.05030.04750.05860.06150.05590.05750.0574
−0.40.05010.05080.05020.04690.04570.08270.08360.08140.08810.0917
−0.60.04780.04400.04450.04810.04840.13210.12460.13330.12890.1329
0.01−0.20.05580.05080.05060.04720.05190.05520.05870.05740.06070.0594
−0.40.04830.05060.04730.04860.04980.08270.08050.08430.08670.0825
−0.60.04880.04930.04930.04320.04430.12770.12080.11840.12920.1229
1− 3 n 0−0.20.04840.04680.04650.05020.05120.05330.05130.05440.05500.0543
−0.40.04750.05070.04530.04670.04400.06830.07070.06790.06340.0703
−0.60.04630.03890.03660.04220.04030.09110.08930.08450.08760.0846
0.01−0.20.05070.04810.04890.04960.04870.05130.05780.05260.05220.0505
−0.40.04670.05010.04580.04540.04700.06950.06820.06770.06900.0658
−0.60.03860.04090.04030.03970.04150.08690.08980.08980.08350.0868
d f = ( 15 , 15 ) 0.50−0.20.04210.04970.04950.04860.04860.04710.04950.04950.05290.0495
−0.40.04270.04550.04630.04550.04990.04630.05280.04900.05160.0505
−0.60.03610.04390.04290.05190.04860.05490.04800.05310.04770.0527
0.01−0.20.04460.04980.04530.04970.04960.04990.05410.05230.05140.0517
−0.40.04350.04620.04600.04550.04710.05120.04570.04990.05280.0504
−0.60.04170.04440.04600.04510.04880.04770.05180.05130.05080.0528
10−0.20.04940.05110.05190.04970.04480.05660.06170.05670.05840.0572
−0.40.05420.04960.05050.05270.04970.08390.08810.09140.08620.0798
−0.60.05060.04920.04540.04640.04320.12260.12660.12820.12440.1248
0.01−0.20.05180.04690.05140.05330.04980.06320.05680.06070.05870.0588
−0.40.05330.04810.04860.04860.04720.08020.08180.08290.08230.0821
−0.60.04830.04630.04680.04820.04350.12840.12750.12240.13170.1304
1− 3 n 0−0.20.05110.04680.05020.05110.05080.05020.05460.05340.05250.0514
−0.40.04760.04760.04760.04570.04190.06570.06350.06700.06850.0677
−0.60.04360.04120.04350.04220.04080.09080.08310.08900.08660.0886
0.01−0.20.05390.05000.04810.04720.04920.05590.05450.05610.05060.0565
−0.40.04670.04740.04270.04760.04280.06780.06810.06590.06410.0702
−0.60.04280.04450.04380.04100.04170.08550.08440.08640.08570.0840
Table 2. The size performance of { U t , V t } t = 1 n under Case (ii) and τ = 0.05 .
Table 2. The size performance of { U t , V t } t = 1 n under Case (ii) and τ = 0.05 .
ELt-Test
ϕ μ ρ 2004008001000120020040080010001200
d f = ( 10 , 10 ) 0.50−0.20.04840.04700.05100.05030.05470.05050.05090.05070.05150.0509
−0.40.04020.04200.05000.05310.05000.04900.05230.04790.04990.0502
−0.60.04060.03820.04650.04320.04700.05320.05200.05320.05070.0500
0.01−0.20.04340.04350.04810.04800.04970.05270.04750.04930.04760.0508
−0.40.04220.04620.04850.04720.04390.05060.04630.05190.04860.0492
−0.60.04180.04190.04520.04540.04640.05240.04810.04830.05070.0500
10−0.20.05140.04840.05340.05110.04990.05650.06010.06060.06180.0566
−0.40.05330.04730.05200.05150.05110.08340.08720.08580.08960.0864
−0.60.04970.04970.04700.04520.05030.12630.12280.12790.12180.1265
0.01−0.20.05360.05060.04930.05130.04610.05810.05970.05560.05580.0577
−0.40.05230.04760.05150.04790.05010.08020.08170.08140.07880.0770
−0.60.04810.04480.04650.04670.04640.11930.12340.11980.11870.1133
1− 3 n 0−0.20.05330.04920.04890.04880.04920.05020.05400.05360.05520.0554
−0.40.05220.04390.04690.04870.04720.06750.07040.06910.06410.0659
−0.60.04450.04420.04450.03480.04060.08810.08690.08470.08860.0885
0.01−0.20.05130.04930.05040.04890.04720.05740.05120.05500.05050.0580
−0.40.04770.04670.04880.04610.04540.06540.06350.06150.06500.0658
−0.60.04640.04290.04040.04010.04220.08700.08680.08530.08450.0873
d f = ( 15 , 15 ) 0.50−0.20.04570.04710.04980.04870.04530.04910.05030.05360.05130.0523
−0.40.04190.04360.04350.04620.04630.04990.05190.04820.05450.0490
−0.60.03930.04300.04440.04390.04640.05420.05270.05160.04880.0559
0.01−0.20.04380.04790.04570.05250.04840.05150.04680.05360.05230.0485
−0.40.03940.04310.04790.04590.04830.04860.04860.05240.05360.0516
−0.60.03610.04260.04430.04810.04620.04990.05140.04870.05200.0478
10−0.20.05070.05460.05180.04820.04920.05630.06000.05540.05720.0583
−0.40.05600.04900.05010.05030.05010.08030.08410.08400.08470.0842
−0.60.05210.05080.04450.04610.04550.13220.13180.12940.12250.1258
0.01−0.20.05290.04890.05060.04900.05140.05700.05840.05650.05220.0587
−0.40.05100.04690.04940.04870.04940.07800.08120.07930.08080.0815
−0.60.04540.04910.05010.04860.04540.12370.12230.11350.11950.1121
1− 3 n 0−0.20.05130.04780.04900.05050.05220.05410.05150.05370.05340.0523
−0.40.04850.04840.04580.04610.04710.06750.06740.06290.06510.0664
−0.60.04470.04250.04100.04210.04130.08600.08510.08960.09030.0860
0.01−0.20.05270.05400.04850.04950.04860.05710.05660.05540.05630.0539
−0.40.05450.05010.04450.04820.04500.06230.06900.06530.06840.0639
−0.60.04910.04650.04070.04290.04780.08740.08200.08330.08710.0849
Table 3. Test results of 10 financial variables.
Table 3. Test results of 10 financial variables.
Predictor α ^ ϕ ^ Adf.TestCor( U t , V t )GARCH-VELt-Test
Dividend–price ratio0.03190.9928 0.0157  **−0.9618 9.6402 × 10 37  ***0.4200 0.0102  **
Dividend yield0.03570.9929 0.0239  **−0.0791 5.4329 × 10 34  ***0.4626 0.0042  ***
Earnings–price ratio0.03080.9870 0.0112  **−0.7966 7.6498 × 10 35  ***0.2730 0.0058 ***
Dividend payout ratio0.00720.9913 0.0100  ***−0.0407 1.7194 × 10 41  *** 0.0828 * 0.0487  **
Book-to-market value ratio−0.00500.9858 0.0100  ***−0.8031 1.1404 × 10 84  ***0.13220.2146
T-bill rate0.00970.99340.5266-0.0762 8.1443 × 10 68  ***0.2945 0.0001  ***
Default yield spread0.00130.9752 0.0100  ***−0.2433 3.4015 × 10 72  ***0.20270.6769
Long-term yield0.01050.99650.9568−0.1098 2.0058 × 10 44  ***0.2722 0.0033  **
Term spread0.00330.9608 0.0100  ***−0.0070 8.4860 × 10 43  ***0.21260.2350
Net equity expansion0.00900.9805 0.0100  ***−0.0619 3.1793 × 10 39  ***0.7741 1.43 × 10 5  ***
Significance levels: * p ≤ 0.1, ** p ≤ 0.05, *** p ≤ 0.01.
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Share and Cite

MDPI and ACS Style

Wang, Q.; Liu, X.; Fan, Y.; Peng, L. Testing the Intercept of a Balanced Predictive Regression Model. Entropy 2022, 24, 1594. https://doi.org/10.3390/e24111594

AMA Style

Wang Q, Liu X, Fan Y, Peng L. Testing the Intercept of a Balanced Predictive Regression Model. Entropy. 2022; 24(11):1594. https://doi.org/10.3390/e24111594

Chicago/Turabian Style

Wang, Qijun, Xiaohui Liu, Yawen Fan, and Ling Peng. 2022. "Testing the Intercept of a Balanced Predictive Regression Model" Entropy 24, no. 11: 1594. https://doi.org/10.3390/e24111594

APA Style

Wang, Q., Liu, X., Fan, Y., & Peng, L. (2022). Testing the Intercept of a Balanced Predictive Regression Model. Entropy, 24(11), 1594. https://doi.org/10.3390/e24111594

Note that from the first issue of 2016, this journal uses article numbers instead of page numbers. See further details here.

Article Metrics

Back to TopTop