Next Article in Journal
Investigating the Economic and Financial Damage around Currency Peg Failures
Next Article in Special Issue
CVaR Regression Based on the Relation between CVaR and Mixed-Quantile Quadrangles
Previous Article in Journal
Book-To-Market Decomposition, Net Share Issuance, and the Cross Section of Global Stock Returns
Previous Article in Special Issue
Determining Distribution for the Quotients of Dependent and Independent Random Variables by Using Copulas
 
 
Font Type:
Arial Georgia Verdana
Font Size:
Aa Aa Aa
Line Spacing:
Column Width:
Background:
Project Report

Examination and Modification of Multi-Factor Model in Explaining Stock Excess Return with Hybrid Approach in Empirical Study of Chinese Stock Market

Division of Business Management, Beijing Normal University-HongKong Baptist University United International College, Zhuhai 519087, China
*
Author to whom correspondence should be addressed.
J. Risk Financial Manag. 2019, 12(2), 91; https://doi.org/10.3390/jrfm12020091
Submission received: 28 April 2019 / Revised: 22 May 2019 / Accepted: 22 May 2019 / Published: 25 May 2019
(This article belongs to the Special Issue Mathematical Finance with Applications)

Abstract

:
To search significant variables which can illustrate the abnormal return of stock price, this research is generally based on the Fama-French five-factor model to develop a multi-factor model. We evaluated the existing factors in the empirical study of Chinese stock market and examined for new factors to extend the model by OLS and ridge regression model. With data from 2007 to 2018, the regression analysis was conducted on 1097 stocks separately in the market with computer simulation based on Python. Moreover, we conducted research on factor cyclical pattern via chi-square test and developed a corresponding trading strategy with trend analysis. For the results, we found that except market risk premium, each industry corresponds differently to the rest of six risk factors. The factor cyclical pattern can be used to predict the direction of seven risk factors and a simple moving average approach based on the relationships between risk factors and each industry was conducted in back-test which suggested that SMB (size premium), CMA (investment growth premium), CRMHL (momentum premium), and AMLH (asset turnover premium) can gain positive return.

1. Introduction

Financial markets are rife with uncertainties which make it difficult to forecast future market trends. Especially in the stock market, while providing investors with remarkable return, at the same time, it entails tremendous risk. As people pursue higher returns, they must bear the corresponding risks.
A pricing model with multiple factors is a promising approach to predict future stock prices. If the relationship between the risk and return can be expressed by multiple factors in a mathematical model, the standards of stock selection and corresponding investment strategies can be established. To be specific, each investor has their own risk preference, for instance, risk averse investors tend to bear lower risk and receive lower return. This specific type of investors has their preference in stock selection. In order to find suitable selection criteria, we need to quantify the relationship of risk premium factor and excess return. Investors can refer to the selection criteria to establish a corresponding trading strategy which can achieve their target excess return.
In the process of model examination, regression was conducted on single stocks which differed from the previous research which used portfolios. Especially, ridge regression was conducted instead of OLS regression. As for the process of model modification, two new factors were added to achieve higher explanatory power. Furthermore, we discussed the endogeneity and exogeneity for the risk premium factor. On the basis of economic objectives, we established a trading strategy for Chinese stock market.
Although previous research has worked well in the American stock market, these findings may be less practically applied to the Chinese market due the investor component. Since individual investors contribute nearly 80% of the trading volume, investment behavior and preference can largely impact the market average return. However, the asymmetric information and investment concepts may lead to irrational behaviors. Therefore, we use risk premium factors to explain the excess return and the coefficients to measure the sensitivity of investor reactions to the risk premium.
To begin, we conducted single stock regression to examine the effectiveness of a five-factor model in Chinese stock market. Then we compare the coefficients of different factors under specific company types to discover the leading factor. We use the t-statistic to evaluate the significance of each factor. Regarding previous articles and research, we will add new factors in the model for better performance. Moreover, we conducted inter-factor cyclical research to study the pattern of factors. We can predict the rise and fall of the coefficient on a quarterly basis.

2. Literature Review

The multi-factor risk model has undergone a series of developments which can be divided into four major steps. At the beginning, in order to figure out the leading factors for security return, several researchers have contributed to the development of an asset pricing model. Initially, Markowitz (1952) proposed a mean-variance model to illustrate the statistical relationship between security risk and return in terms of standard deviation and expected rerun. It has established the foundation of modern finance theory. However, he had not specified the factors that explain expected security returns.
Based on the modern portfolio theory, the capital asset pricing model (CAPM) was developed by Sharpe (1964) and Lintner (1965) to illustrate the linear relationship between expected return and market risk premium.
Ri = Rf + β m a r k e t ( Rm Rf )
With the empirical tests in the stock market, they managed to find out the pattern of stock returns in line with the general stock market index. To be specific, the model estimates the relationships between the return on market index (the explanatory variable) and the return on the stock (the dependent variable). The regression coefficient of the single index model is referred to as beta which is a measure of the sensitivity of a stock to general movement in the market index. This empirical research symbolized the transition from qualitative analysis to quantitative analysis which also laid the foundation for the subsequent asset pricing models.
Since then, the CAPM model has been widely applied in research and empirical testing. However, with increasing abnormal returns which cannot be explained by existing factors, the market beta was no longer sufficient to describe expected return (Fama 1996). Therefore, Fama and French proposed a multifactor model consisting of three factors for market risk (Rm − Rf), market value, and book-to-market ratio (Fama and French 1993). In this model, Ri, Rf, and Rm stands for security expected return, risk free rate, and market return. SMB and HML are the risk premium factors. They illustrated that small stocks can generate higher returns than large stocks while value stocks can generate higher returns than growth stocks.
R i R f = a + β m a r k e t ( R m R f ) + β s i z e SMB + β B M HML
It is also a supplement of arbitrage pricing theory (Ross 1976) which emphasizes that the expected return is not only affected by market risk but also a series of other factors. The generalized model (Bodie et al. 2014) illustrated the linear relationship of expected return and different factors (Bodie et al. 2017). In Equation (3), Fj represents the factors and b represents the coefficients.
r i = a i + j = 1 k b i j F j +   ϵ i ,   i = 1 ,   2 ,   N
Moreover, Banz (1981) illustrated that return of securities are affected by several index including B/M ratio and E/P ratio which represent a series of risk premium. These articles largely contribute to the development of the multi-factor model.
In latter decades, the three-factor model was faced with a series of challenges. As the three-factor model was applied to stock trading and empirical testing, it appeared that some of the phenomenon cannot be explained by the model which can lead to unpredictable abnormal return. Therefore, more specific factors need to be added into the model to improve the accuracy. Novy-Marx (2013) proved that profitability, measured by gross profits-to-assets, has roughly the same power as book-to-market value in explaining the average return. The following equation is based on the dividend discount model, Y t + τ represents the earning for period t + τ , d B t + τ is the change in book equity, r represents the expected return. It implied that higher market value leads to lower expected return, while higher earnings imply higher expected return.
M t =   τ = 1 E ( Y t + τ d B t + τ ) / ( 1 + r ) τ
Aharoni et al. (2013) recorded an insignificant but statistically reliable relationship between investment pattern and average return. Other evidence also illustrated that the profitability and investment factors can explain some of the variation in average return.
Carhart (1997) conducted a study on the common factors in stock return and investment expense by adding momentum factor in three-factor model. It measures the tendency of price changes with a portfolio of long previous-12-month return winners and short previous-12-month loser stocks, which had an 8% accumulated yield. The momentum factor can explain 6.4% of the excess return.
Therefore, Fama and French (2015) added the two new factors, investment (CMA) and profitability (RMW), to build the five-factor model for measuring the effects of company size, valuation, profitability, and investment pattern in average stock returns. In this equation, they divide both sides by book value at time t to create book-to-market ratio and present the relationship between return r and valuation factor.
R i R f   =   a   +   β m a r k e t ( R m R f ) + β s i z e SMB + β B M HML + β p r o f i t a b i l i t y RMW + β i n v e s t m e n t CMA + ε
Moreover, they use the RMW factor to represent the profitability and the CMA factor to represent investment. According to empirical study, the five-factor model has a higher effectiveness than the three-factor model, which can explain 71–94% of the variation of average return. In the test, researchers divided all the stocks with three sets of factors. The result also implied that book-to-market factor is redundant for describing average return using the American stock data from 1963 to 2013. Also, the model fails to capture the low average return for small firms with low profitability and high investment.
Sehgal and Vasishth (2015) tested the model in various emerging markets and discovered that the change of price and trading volume are partly risk based and partly behavioral. The research indicated that behavioral factors are necessary in the study of the multi-factor model. Except for the existing risk premium factors in Fama and French model, Peng et al. (2014) studied the effect of different investment sentiments on the market return from which they found out customer satisfaction is a significant factor for abnormal return. Moreover, a human capital factor was considered in terms of compensation level (Moinak and Balakrishnan 2018). Zahedi and Rounaghi (2015) applied an artificial neural network to assess the components of a multi-factor model and predict future stock prices.
Considering the features of data and the time-varying factors, a range of studies applied various methods to address problems via multi-factor models. Akter and Nobi (2018) examined the distribution and frequency distribution for both daily stock returns and volatility. Furthermore, Chen and Kawaguchi (2018) distinguish two significant regimes (a persistent bear market and a bull market) to examine market time-varying risk factors to achieve Markov regime-switching. These studies examined the model specifically in certain periods which enables the researchers to compare the model performances in different situations which can improve the applicability of the model.
Some researchers conducted modification on the basic model with specific structure. Ronzani et al. (2017) suggested that β (systematic risk) evolves over time and the model with time-varying β provide less conservative VaR measures than the static β. While Cisse et al. (2019) examined the dynamics of the model with Kalman filter and Markov switching (MS) model and proved that the former method fits better in the model. Bhattacharjee and Roy (2019) proposed a social network dependence structure to address such misspecifications. For the investment aspect, Frazzini et al. (2013) suggesting that Buffett’s returns are more due to stock selection than to his effect on management.

3. Methodology

3.1. Hypotheses

Hypothesis 1. 
Market premium has a positive relationship with excess return.
Hypothesis 2. 
Size premium has a positive relationship with excess return.
Hypothesis 3. 
Book-to-market premium has a positive relationship with excess return.
Hypothesis 4. 
Profitability premium has a positive relationship with excess return.
Hypothesis 5. 
Investment growth premium has a positive relationship with excess return.
Hypothesis 6. 
Momentum premium has positive a relationship with excess return.
Hypothesis 7. 
Asset turnover premium has a positive relationship with excess return.
Hypotheses are conducted on the relationship between dependent variable and independent variables. To be specific, the positive relationship stated that risk premium factor can generate higher excess return. For instance, a company with higher profitability can achieve higher returns than a low profitability company. Since previous researchers have not examined the effect of factors on single stocks, we decided to retest the effect of seven factors on excess return for each security and summary by industries.

3.2. Research Design

The data were collected from the Choice, Tongdaxin, and Resset database. Choice and Tongdaxin can provide historical values of stock financial ratios and Resset provides the basic information of companies and interest rates. The samples are 1097 stocks in the Chinese stock market, including Shanghai securities exchange market and Shenzhen securities exchange market from, 2007 to 2018 because large quantity of firms transformed their non-tradable shares into tradable shares in the process of Chinese Reformation of National Owned Stock from 2005 to 2006 and the time range after 2007 provides the research with more companies’ data. The type of data is quarterly data which includes 47 quarterly data points for each stock. In the process of data cleaning, those stocks with missing data will be deleted in the samples.
Even if Fama and French (2015) had delivered the five-factor model, yet they only consider the factor performance in different groups of portfolio but not the factors’ effect on each security in the stock market. Thus, by applying variable-intercept models mentioned in the book of Hsiao (2003), we ran the panel data for each stock to test the effect of these five premiums in the Model 1. The study bridged the relationships between the excess return of each security and different premium factors via OLS (ordinary least squares regression), which means each stock has their individual coefficient and there are more than 1000 regression functions (Johnson and Wichern 2008). During the research process, this paper classified the securities into different groups to calculate risk premium factors based on market value, book-to-market ratio, ROE, and the growth of investment. Then, this research considered the average coefficients’ level, average significant level, and their distribution.
In Model 2, the momentum factor and turnover factor are added into the model. We developed an innovative method to study each single stock in the market, in order to find out the general pattern of the stock market. In contrast to the former ways that examine the model with the diversified portfolio, we conducted ridge regression on single stocks and divided the stocks in industries. The 28-industry classification standards are from ShenWan industry index.
Figure 1 illustrates the research structure and their corresponding functions for the results. The research includes two major parts, which are multi-factor examination with single stock regression and time-series analysis for risk factors.
The first part involved stability test, OLS regression, ridge regression, and robustness test to find the significant correlations between stock excess return and seven risk factors. In addition, a chi-square test was conducted to examine whether the effect of factors various in different industries. Moreover, by measuring the percentage of positive and negative correlations in each industry, the significant relationships between factors and industry was discovered.
The second part of research covered chi-square test to find out the endogeneity and exogeneity (or called pattern of fluctuation) for seven risk factors and a back test of trading strategy with trend analysis. For details, for chi-square testing, we firstly recorded the rise and fall pattern in the neighboring quarters. There were four kinds of pattern (rise after rise, rise after fall, fall after rise, fall after fall). We calculated each patterns’ amount and conducted chi-square testing on the pattern of each factor. If a factor passes the test, it can be inferred that the current pattern is in accordance with the previous period. Therefore, with the endogeneity and exogeneity of the factors, we can predict the future rise and fall based on the current pattern. Moreover, a trading strategy was established based on the trend analysis of factors. As for the investment portfolio, stocks were selected according to the effect of seven factors on industries. A simple back test was also conducted to examine the performance of the trading strategy.
The result of general significant correlations and the factors’ effect in various industry can be used in some economic phenomenon explanation. Furthermore, combing the significant relationships between risk factors and corresponding industries with trend analysis, trading strategies can be built.

Factor Selection

We modified the multi-factor model to adapt for the Chinese stock market. Since some of the existing factors in the model have poor performance in the regression analysis and empirical test, it is significant to add or delete some of the factors to improve the model.
In accordance with previous studies, we selected two new factors (momentum factor and turnover factor). Firstly, the momentum effect is remarkable in the medium term of stock market. Momentum effect was proposed by Jegadeesh and Titman (1993) to illustrate the phenomenon that stock performed well in the past is more likely to achieve higher return. Regarding the composition of investor and investment behavior in stock market, we suggested that the momentum effect can explain the part of the excess return. Secondly, we added a turnover factor into the model. Asset turnover rate is a component of DuPont analysis, which is used to analyze the return of shareholder’s equity (Wild 2016). Asset turnover rate is a vital measure of a company’s operation capacity. A higher asset turnover rate indicates that a company has better operation capacity and higher efficiency. When conducting value investing, this financial ratio is primarily regarded before investment, as it can directly reflect the condition of a company, for example profitability, operation capacity, and so on.

3.3. Research Process

3.3.1. Multi-Factor Model Examination with Single Stock Regression Analysis

  • Stability Test
  • DV: Ri-Rf
  • IV: Rm-Rf, SMB, RMW, HML, CMA, CRMHL, AMLH
  • CV: Time, Time^2, Season
If the three control variables which refers to time, the square of time and season can pass the t-test with high p-level, these three variables should be put into the regression models, otherwise they can be deleted.
  • OLS Regression
We conducted OLS regression on 47 quarters for each single stock, in order to examine the multi-factor model. According to OLS, to calculate the beta (coefficient) of each independent variable, the matrix operation combines X and y. In this formula, X is a 47 × 7 matrix, in which 47 is the 47 time-series of data and 7 represents seven proposed risk factors. The y is a 47 × 1 matrix and 1 indicates the excess return of a single stock. Since there are 1097 stock, X matrix is fixed yet y matrix is the data of different stocks.
β = ( X T X ) 1 X T y
[ β 1 β 2 β 7 ] = ( [ S M B 1 R M W 1 A M L H 1 S M B 47 R M W 47 A M L H 47 ] T [ S M B 1 R M W 1 A M L H 1 S M B 47 R M W 47 A M L H 47 ] ) 1 [ S M B 1 R M W 1 A M L H 1 S M B 47 R M W 47 A M L H 47 ] T [ y 1 y 2 y 7 ]
We used computer simulation in Python, which can help us automatically run regression 1097 times. The python program would print out the result of regression and calculate the number of mean value of coefficients and t-value, which were collected in a 7 × 1097 matrix. We also drew the frequency histogram for each factor to study the distribution of the coefficients. Based on these assumptions, the majority of the coefficients should be positive. If most of them come out negative, we may infer that the five-factor model is not suitable for the Chinese stock market.
  • Ridge Regression
Since we added two more factors in the model, we applied ridge regression instead of OLS regression. On one hand, ridge regression can prevent an over-fitting result from extra factors. On the other, it can prevent multi-collinearity and increase the significance of the factors. Ridge regression is developed based on OLS regression by adding regularization term λ Ι . The regularization term can discover the factor with collinearity and force the coefficient to approach zero to ensure that the effect of multi-collinearity is minimized. To calculate the beta (coefficient) of each independent variable, the matrix operation combines X, y, λ , and I. X and y are the same as the matrix in the OLS model. λ is an optimization hyperparameter and I is a 7 × 7 identity matrix.
β = ( X T X + λ I ) 1 X T y
[ β 1 β 2 β 7 ] = ( [ S M B 1 R M W 1 A M L H 1 S M B 47 R M W 47 A M L H 47 ] T [ S M B 1 R M W 1 A M L H 1 S M B 47 R M W 47 A M L H 47 ] + [ λ 0 0 0 0 λ 0 0 0 0 0 λ ] ) 1 [ S M B 1 R M W 1 A M L H 1 S M B 47 R M W 47 A M L H 47 ] T [ y 1 y 2 y 7 ]
When we applied the ridge regression, we need to find a λ that can provide the largest sum of R-square which are the number of positive factors and percentage of positive coefficient. We set the summary of these three measures as our target function to find out the optimal λ with iteration.
λ =   argmax ( Q )
Q = P average value + number of positive coefficients + R square mean + number of variables whose p value over 0.9 (9).
  • Robustness Test
In this part, a zero-mean test for OLS and ridge regression model is adopted to check the robustness of models. Both OLS and ridge regression models are fixed-coefficient models, which means that they have an assumption that the covariance of errors between different units (stocks) are equal to zero. Thereafter, a residual covariance matrix between 1097 stocks’ regression can be drawn by Python and calculate the mean and standard deviation of all numbers in covariance matrix. Then a t-test is conducted to test whether E( u i , j ) is equal to zero.
Covariance matrix of residual
[ u 1 , 1 u 1 , 2 u 1 , 1097 u 2 , 1 u 1097 , 1 u 1097 , 1097 ]
u 1 , 2   means the covariance between stock 1 and 2.
  T - value = E ( u i , j ) S T D E ( u i , j )

3.3.2. Time-Series Analysis for Risk Factors

  • Chi-Square Test
We use chi-square in two steps respectively. Firstly, it was used to examine the different effect of factors in different industries. In the test we divided the significance of factors based on industries and calculated the total value   χ 2 .
Secondly, it was applied to find out the pattern in the inter-factor direction prediction analysis. We divided the pattern into four types with a combination of increase and decrease. Then we used chi-square to find out whether there is a pattern for factor change direction. With the increase and decrease pattern, we can predict the next quarter movement based on the current situation. As we can see in Figure 2, suppose “0” represents the increase of risk factor in one term (or one season) and “1” represents the decrease of risk factors.
R = Max(F1, F2) + Max(F3, F4)
W = Min(F1, F2) + Min(F3, F4)
In Figure 3, if investors spontaneously make investment decision, the frequencies of right and wrong decisions are equitable. Both are T/2. (T is number of total transaction time)
H0
There is no relationship between right or wrong investment decisions and direction prediction rules.
H1
There is a relationship between right or wrong investment decisions and direction prediction rules.
In Figure 4, the expected frequency of right decisions ( R e ) and wrong decisions ( W e ) made by the rule can be calculated by Equations (15) and (16). Also, the expected frequency of right or wrong decision made by random selection ( ( T / 2 ) e ) can be calculated by Equation (17).
R e = ( R + W ) × ( R + T / 2 ) / ( R + W + T )
W e = ( R + W ) × ( W + T / 2 ) / ( R + W + T )
( T / 2 ) e = T × ( R + T / 2 ) / ( R + W + T )
According to the significant table (Figure 5) of chi-square test, if total chi-square level is >2.706, we have 90% confidence to reject the null hypothesis (H0). If total chi-square level is >3.841, we have 95% confidence to reject the null hypothesis (H0). If total chi-square level is >6.635, we have 99% confidence to reject the null hypothesis (H0).
Total   chi - square   level = ( R R e ) 2 / R e + ( W W e ) 2 / W e + 2 [ T / 2 ( T / 2 ) e 2 ] / ( T / 2 ) e
The degree of freedom = (column − 1)(row − 1) = (2 − 1)(2 − 1) = 1
  • Back-Test for Trading Strategy
After factors’ cyclical research and trend analysis for fluctuation of risk factors, the trading strategy based on moving average approach and correlations between factors and industries can be formed. By deciding the time to do the transactions and stocks which should be bought or sold, the float return from 2007S4 to 2018S3 and annual expected return can then be calculated.

3.4. Assumptions for Multi-Factor Examination

The following assumptions are made in applying the multi-factor model:
  • Perfect market: there are no tax and transaction costs.
  • People are risk averse or rational.
  • People can lend or borrow money at a risk-free rate freely.
  • There is a trade-off between risk and return for all securities.
  • Everyone can obtain market information equally and freely.
  • Investors have the same expectations for this market.
The intuition and assumption behind the hypotheses are presented in the Appendix A with measurements of factors and graphs.

3.5. Models and Variable Definitions

Model 1:
R i R f = a + β m a r k e t ( R m R f ) + β s i z e SMB + β B M HML + β p r o f i t a b i l i t y RMW + β i n v e s t m e n t CMA + ε
Model 2:
R i R f = a + β m a r k e t ( R m R f ) + β s i z e SMB + β B M HML + β p r o f i t a b i l i t y RMW + β i n v e s t m e n t CMA + β m o m e n t u m CRMHL + β t u r n o v e r AMHL + ε
The details of the factors (Rm-Rf, SMB, HML, RMW, CMA, CRMHL, AMHL) including explanation and graphs are presented in Appendix A.
Table 1 displays the dependent variable, independent variables and the corresponding labels and explanations.

4. Results

4.1. Multi-Factor Model Examination with Single Stock Regression Analysis

4.1.1. Stability Test

  • DV: Ri-Rf
  • IV: Rm-Rf, SMB, RMW, HML, CMA, CRMHL, AMLH
  • CV: Time, Time^2, Season
According to the result of stability test (presented in Appendix E), the control variable including Time, Time^2, Season did not pass the t-test, the significance level is far less than 0.9. Thus, they are removed from the model. It means that excess return is not affected by time.

4.1.2. Regression Models

  • OLS Regression
Model 1: Five-factor model
  • DV: Ri-Rf
  • IV: Rm-Rf, SMB, RMW, HML, CMA
Model 2: Seven-factor model
  • DV: Ri-Rf
  • IV: Rm-Rf, SMB, RMW, HML, CMA, CRMHL, AMLH
Model 3: Modified optimal model
  • DV: Ri-Rf
  • IV: Rm-Rf, RMW, HML, CMA, CRMHL, AMLH
  • Ridge Regression
Model 4: Ridge regression model
  • DV: Ri-Rf
  • IV: Rm-Rf, SMB, RMW, HML, CMA, CRMHL, AMLH
We conducted OLS regression on Model 1, 2, and 3, and ridge regression on Model 4. After conducting the regression analysis, we recorded each factors’ coefficients and mean of R square. We created frequency histograms to observe the distribution of coefficients. The graphs provide a general view of the coefficients.
In Table 2, above each histogram, there are two numbers, mean of coefficients and the proportion of coefficients that passed the t-test (in the parenthesis).
To be specific, we divided the coefficients into uncorrelated, positively correlated, and uncorrected, based on 95% confidence interval. Table 3 shows the percentage of coefficient. SMB, CMA, Rm-Rf, CRMHL, and AMHL have more positive coefficients. Whereas RMW and HML have more negative coefficients.
In ridge regression, we obtain the λ by calculating the maximum of the objective function Q. When λ equals to 0.008, the result of ridge regression is maximized. In Figure 6, we can find out the largest Q value at 0.008.
Some factors have negative effects on the excess return, which deviates from our assumption. Moreover, regarding the factors that have an even distribution. We need to classify them in industry groups to discover a further pattern.

4.1.3. Robustness Test: Zero Mean Residual Testing

According to Table 4, since t-values in four models are less than 2.021, we have 95% confidence not to reject the null hypothesis, which means the residuals are randomly independent for all stocks, both the OLS and ridge regression model pass the zero mean residual test.

4.1.4. Industry Analysis

According to the result of chi-square test, each industry corresponds differently to the factors. Therefore, effect of factors needs to be discussed based on different industries respectively. The details of the test results are presented in the Appendix B.
We classify the stocks in 28 industries. Also, we divided the coefficients in positive correlation and negative correlation. As for data cleaning, we only preserved the factors that contained over 70% coefficients are of the same signs. The specific table is presented in the Appendix C.
In the Table 5, the industries are classified in manifold groups based on the significance level and the relationship with each factor.
According to the table, there is no industry that can be explained by seven factors simultaneously. The banking and steel industries are best fitted in the seven-factors model, since they have six significant factors and the highest sum of percentage of significant coefficients.
Each factor has different relationships with different industries. For market premium (Rm-Rf), it can apply to all industries. For SMB, it is positively related with seven industries (Group A), yet SMB is negatively correlated with Steel industry. With respect to RMW, it has negative relationships with most industries (Group D). However, RMW has significant positive relationships with bank industry. HML is negative related to multiple industries (Group F). Also, HML has significant positive relationships with the banking industry. CMA is positively correlated with five industries (Group G). Meanwhile, CMA is negatively correlated to three industries (Group H) including extractive, banking, and nonferrous metal. CRMHL is positive correlated with most of industries (Group I) except media, electrical equipment, non-bank finance, animal husbandry and fishery, commercial, and comprehensive industries. AMLH has a negative correlation with Group K. At the same time, AMLH is positively related with three other industries (Group J) including housing, non-bank finance, and banking.
Moreover, the industries with the same pattern are regarded as coordinated industry. Additionally, the industries with different pattern are regarded as deviated industries. To be specific, the banking industry is considered as a deviated industry since it has a positive RMW and HML factor, while other industries have negative factors. The graph shows four groups of coordinated industry—including telecommunication and electronics, chemistry and automobile, light manufacturing and defense, and nonferrous metal and extractive—while banking is regarded as a deviated industry.
Considering the trend of the factors, we can formulate corresponding strategies. To be specific, based on the condition of each factor and the classification from Group A to L which consist of different industries presented in Table 5, we formulated manifold strategies in Table 6.
Here we can take an example to illustrate the above table, when SMB is positive, it means in this market, small market value (MV) companies outperform big companies, thus for those industries which is significantly positive correlated with SMB, we should buy small MV stocks and sell large MV stocks. However, for those industries which is negatively correlated with SMB, the strategy should purchase large MV stocks and sell small MV stocks. When SMB is negative, vice versa.
According to the significant level and correlation effect between 7 factors and 28 industries, the trading portfolio and strategy can be conducted. Since the strategic making depends on the positive or negative condition of seven factors, if we can forecast the conditions of seven factors, the trading strategies can be easily made. Thus, for the next two parts, we explored the fluctuation of seven factors to answer two questions:
  • From analysis of factors’ changing pattern, can we find the reasons or elements to illustrate the fluctuation of seven factors? (if we find out the driving factor which stimulate the moving of other factors, we can explain the most essential ratio that investor may focus on.)
  • Can we find an approach to forecast the moving of each factor and apply it to do the back test for trading strategies?

4.2. Time-Series Analysis for Risk Factors

4.2.1. Endogeneity and Exogeneity for Factors’ Cyclical Research

  • Endogeneity
According to Pearson Correlation matrix, last term data of SMB, RMW, HML, CMA, CRMHL, and AMLH has significant correlation with this term at the level of 0.01. Nevertheless, for the market factor (Rm-Rf), its last term data are nearly independent with this term, which means except market risk factor, other factors are possible to forecast themselves with last term data.
However, to test the endogeneity of these factors, chi-square test are necessary for examination and application of these relationships. After the test, the results show that only SMB, RMW pass the test with 99% confidence level and CMA also has a significance level of 0.05, which means these three factors exist endogeneity and can be truly used to forecast their direction and investment cycle.
  • Exogeneity
Based on Pearson correlation matrix, the fluctuation of HML, CRHML, and AMLH depend upon the history data of other factors. For details, HML is related to the AMLH of last term. CRHML has a small correlation with HML AMLH bridge its relationships with RMW of last term. Finally, with respect to Rm-Rf, its volatility is partly connected to last term’s RMW and AMLH.
Figure 7 summarizes the endogeneity and exogeneity for predicting direction of seven risk factors. In the chi-square test, RMW, SMB, and CMA factor can be only predicted by the last term direction of themselves. With confidence level of 95%, CRHML’s direction can forecasted by its history data with HML and the increase and decrease of HML can be predicted by last term direction and AMLH at 90% confidence level. For market factor (Rm-Rf)—even if single stock history data cannot predict next term—with RMW and AMLH, market factor can estimate the direction of its next term with 95% confidence. With a confidence level of 90%, AMLH’s fluctuation direction can forecast by RMW. The details of prediction are presented in Appendix D.

4.2.2. Trading Strategy with Trend Analysis

In the last two parts, with the examination of chi-square test, the directions of fluctuation of seven factors are predictable, yet the range of fluctuation is still unknown.
Therefore, in order to forecast the positive and negative levels of each risk factor, trend analysis is conducted.
The above seven graphs in Figure 8 describe the trend of each factor. The orange line is the four-season moving average. SMB, HML, and CMA are mostly below 0. By contrast, RMW and CRMHL are above 0. While AMLH and Rm-Rf fluctuate around 0.4.

4.2.3. Back-Testing

Table 7 illustrates the results of the back-testing from 2007S4 to 2018S3 for the trading system. With the strategy, we can achieve positive annual return in SMB, CMA, CRMHL, and AMLH for 15.43%, 3.23%, 29.13%, and 8.4% respectively. To some extent, the result proved the practicability of the seven-factor model, especially for the factor CRMHL in terms of trading strategy. Consequently, with alternative trading strategies which can predict the factor fluctuation more precisely, we can witness a greater margin of improvement.

5. Discussion

5.1. Analysis of Multi-Factor Model

We compared three multi-factor models in OLS regression and proceed to ridge regression for higher significance and explanatory power. In the seven-factor model, only RMW and HML factor have negative effects on the stock excess return, which is deviated from the assumption. To be specific, negative RMW means that a company with lower ROE can achieve higher excess return than higher ROE company. Also, negative HML means lower book-to-market ratio result in higher excess return.
According to the previous empirical studies in Chinese stock market, the low level of ROE often corresponds to low stock price. On the contrary, relatively high ROE means that the stock price has reached a periodic top. We can infer that stocks with relatively low ROE can attract investment which can boost the stock price. As for the CMA, a company with lower growth of investments has higher excess return. We can infer that investors prefer purchase stock with lower growth of investment.

5.2. Industry Analysis

We merely discussed the industries with more than four significant factors. Since they are consistent with the model to the higher extent. As we can see in Figure A10 in Appendix C, among 12 industries, there are four groups of coordinated industries and one group of deviated industry. We found out that the coordinated industries are usually cyclical industries which are highly related to the economic wave. Cyclical industry primarily consists of two categories, resources and industrial raw material. They are closely related to macro economy cycle and supply–demand relationship such as manufacturing, automobile, metal and chemistry industry. Moreover, the coordinated industry is distributed at upstream and downstream; for example, electronics is the upstream industry for telecommunication. Thereby, the chain effect is transmitted from the upstream industry through a series of companies all the way to the downstream industry.
To be specific, regarding different factors, industries may react differently in terms of positive and negative relationship. For SMB, it has a negative effect on steel industry, while six industries stand in the opposite position including electrical equipment, electronics, computers, construction material, telecommunication, etc. There are two reasons for the adverse effect in steel industry. Firstly, steel industry is widely regarded as traditional industry. Secondly, it is dominated by a few giant companies which are supported by the government. Whereas the six companies with positive effects are in opposite condition. They are relatively small in terms of market value. Most companies in electrical equipment and computer industry are in early stage which have potential to achieve higher excess return.
As for RMW, over half of the industry are in a negative relationship, while banking is positive. Since banking is closely related to the macro-economic policy. Therefore, it responds to market situations like ROE more swiftly. Moreover, we ran the regression on the same period which may reveal the lag of the information dissemination in some of the industry. Since the market is imperfect, the companies may not respond to the information in the exact same period.
For HML, over 70% of the companies have a negative effect, while banking has a positive relationship. A higher book-to-market ratio reflects that a bank is increasing the amount of loan issue. Since the major source of incomes of banking come from loan issue, thus a bank with higher book-to-market ratio can achieve a higher excess return. While other industries with low book-to-market ratio can achieve higher excess return.
As for CMA, five companies—including textiles & garments, utilities, and light manufacturing—consist of manufacturing processes. They have constant cash flow to maintain daily operation and dividend payment. They are also relatively conservative in investment. However, banking and non-bank finance is negative related to CMA factor. It can be inferred that they are more aggressive in investment to achieve higher excess return.
For CRMHL, none of the companies are in negative relationship, while over 70% of the industries are positive to the momentum factor. This means that the momentum effect widely applied to industries, which can result in the similar movement in the next period.
When it comes to AMLH, the housing and finance industries are in a positive relationship. In these industries, assets in terms of land and money reserves are placed in the foremost position, which means that lower asset turnover rates can result in higher excess return. However, for industries with a negative relationship, since asset turnover rate is the reflection of operating capacity, a higher asset turnover rate can lead to higher excess return.

5.3. Factor Cyclical Research

According to the result of chi-square test, the future direction of SMB and CMA can be merely predicted based on the previous period. While RMW has the initial effect to the rest of the factors, this chain effect conveys through AMLH and HML to Rm-Rf and CRMHL respectively. Although the size of this effect cannot be predicted, we can find the pattern of investment in this chain effect which can reflect investor behavior in the stock market. Firstly, investors primarily focus on the ROE of the companies. Together with the asset turnover rate, we can predict the future direction of the market premium (Rm-Rf). Secondly, the future direction of CRMHL can also be predicted by HML and AMLH. It can suggest that investors usually refer to the profitability and operation capacity of companies before making investment. These two factors can affect the company evaluation and eventually reflected on the change of momentum factor. According to the investment pattern, we can establish a corresponding investment strategy.
Since the direction of market premium factor (Rm-Rf) can be predicted, to some extent, we can find out the pattern of the index. Therefore, it can be also be applied in the investment of index futures and options through call and put.

5.4. Trading Strategy and Back Test

According to the trend analysis, a moving average model is applied in the model (Hanke and Wichern 2014). However, the span of moving average is not specifically decided, which can affect the result of back-testing. We only conducted a four-season moving average as an example. In order to find out the optimal moving average, iteration can be applied to figure out the optimal trading strategies.

5.5. Significance and Limitations of Research

With the comparison of previous work and our research, academic significance can be illustrated in two aspects.
For one thing, prior studies mainly focus on the effect of factors on excess return from a portfolio aspect. Stocks are classified into different groups based on their characteristics. For example, the companies are divided into two groups based on size, then each group is divided into two groups based on the book-to-market ratio. By comparing the average return of the four groups, they can find out the relationship between factors and return. Thus, their methods can only be applied to investments of a specific portfolio, which may lack of practicability and explanatory power in application to a single stock or specific industries. For this research, nevertheless, we focus on discovering different effects of risk premium factors to the excess return of single stock via OLS and ridge regression and then summarize the significant correlations between factors and different industries, which means our research is more practical. For example, based on our model, investors can judge the factors which significantly affect a specific stock or industry.
Additionally, previous works have never discussed the risk premium factors from the aspect of time-series analysis. While forecasting the investment cycle, prior research only investigated from a macro-economic level or technical analysis. For one example, in Business Cycles (Lars 2006), the author merely analyzed the cycle of housing, credit, and inventory, which belonged to macro-economic level. For another example, technical analyses, like Elliott wave principle and candlestick charts, aim to describe the market price wave pattern on time series. Thereafter, there is an academic gap of discussing investment cycle of fundamental analysis. In this paper, however, the fluctuations of seven factors are studied by chi-square test of endogeneity and exogeneity of factors. Moreover, it is a new type of cycle analysis because it explains the companies’ value behind the investment decision.
The contribution consists of four points. Firstly, we applied a novel method to find out the correlations between seven risk factors and each single stock. Secondly, we find out and explain the correlations between seven risk factors and each industry and specify the situations (positive or negative) of risk factors to buy or sell the industries’ stocks. Thirdly, the result of cyclical analysis can be applied in forecasting the direction of risk factors, especially the market risk factor (Rm-Rf) which can be used in transaction of options and futures of market index. Lastly, a back-test was conducted in a simple trading system which suggested that SMB (size premium), CMA (investment growth premium), CRMHL (momentum premium), and AMLH (asset turnover premium) can gain positive returns.
As for the limitations, this research conducted a four-seasons-moving average method to forecast the level of seven factors. Even if it proved that SMB, CMA, CRMHL, and AMLH can be applied in trading system. However, more forecasting methods like ARIMA, VAR, and ANN can be constructed to form better trading strategies. More limitations are presented with our direction for further study.

6. Conclusions and Further Study

In the research of relationship between risk premium and excess return, the hybrid approach takes a primary position. In the examination and modification of multi-factor model, OLS and ridge regression are conducted on Models 1 to 4. The seven-factor model has an optimal combination of p level and R square. In chi-square test, the effect of factor was responded differently in each industry. Therefore, p level was calculated based on the industry in order to find out the well-fitted coordinated and deviated industry. Bank and steel industry are well-fitted in the model, while industries within the same stream—e.g., telecommunication and electronics—are found to be coordinated industries. Moreover, cyclical industry is usually coordinated except for banking.
In the factor cyclical research, chi-square and correlation tests are applied to find out the endogeneity and exogeneity. SMB, RMW, and CMA have endogeneity while the remaining four factors have both endogeneity and exogeneity. With the pattern of direction in factors, an investment strategy was established. To be specific, when moving average SMB is positive, small market value (MV) companies outperform big companies, thus for those industries which are significantly positively correlated with SMB, it would be better to buy small MV stocks and sell large MV stocks. With the strategy, we can achieve a positive return in SMB, CMA, CRMHL, and AMLH respectively in the back test.
As for the further study, the proportion of individual investors in the Chinese stock market contribute nearly 80% of the trading volume. Therefore, investor preference and the irrational behavior should be considered. The investor sentiment factor may be added to improve the explanatory power.
Previous studies also illustrated that the reversal effect exists in the long-term stock market. If our period is set to be longer than one year, we can add the reversal factor to explain the reversal effect.
Dynamic analysis can be applied to data processing, since the economic environment changes over time. We can also monitor the stock market and provide suggestion in stock selection which can fit for the target return.
Moreover, we can further study the inter-factor drive relationship, in order to establish more investment strategies. To be specific, by optimizing the back-test ratio and Sharpe ratio, we can construct a better investment portfolio.

Funding

This research received no external funding form government or any institution.

Conflicts of Interest

We declare that there is no conflict of interest in this research.

Appendix A. Intuition and Assumption Behind the Hypotheses

There is a trade-off between return and risk. In order to find the corresponding risk for the excess return, according to CAPM model, there is a market risk. To illustrate the abnormal return, Fama used three-factor model which added size premium and book-to-market premium in 1993. In 2013, in their five-factor model, profitability and investment growth are also considered to be significant coefficients. Next, the intuition behind these premiums will be explain one by one.
1. Market Premium
Figure A1. Positive relationship between expected return and market premium.
Figure A1. Positive relationship between expected return and market premium.
Jrfm 12 00091 g0a1
Market premium is represented by the difference between market return and risk-free rate. Since the fluctuation of stock market’s expected return is higher than the risk-free rate, namely stock market has higher risk, the expected return of stock market should higher than risk-free rate.
2. Size Premium
Figure A2. Positive relationship between expected return and size premium.
Figure A2. Positive relationship between expected return and size premium.
Jrfm 12 00091 g0a2
Size premium is the difference between small companies’ average return and big companies’ average return in a diversified portfolio or in the market. Normally, small companies have higher returns than the bigger ones. Because at the same period of time, small companies’ profit is easy to grow faster than big companies and the growth rate of their dividend also is higher.
According to the DDM model, the price of security depends on the discounted present value of future dividend. In this formula, ‘P’ is the expected present price of one stock. ‘D’ is the dividend of this year and ‘g’ is the constant growth rate of the dividend (it also is the growth rate of profit). For the last, ‘i’ is the dividend interest rate, which may refer to risk-free rate.
P = lim n [ D ( 1 + g ) 1 + i + D ( 1 + g ) 2 ( 1 + i ) 2 + D ( 1 + g ) 3 ( 1 + i ) 3 + + D ( 1 + g ) n ( 1 + i ) n ] = lim n [ D ( 1 + g ) 1 + i ( 1 + g 1 + i ) n 1 1 + g 1 + i 1 ]
Given by i > g ,
( 1 + g 1 + i ) n 0
So, P = D ( 1 + g ) i g
P 1 = D 1 ( 1 + g 1 ) i g 1 ,   P 2 = D 2 ( 1 + g 1 ) i g 1
If we assume D 1 = D 2
Hence, the return rate of the stock can be represented by
R i = P 2 P 1 P 1 = ( g 2 g 1 ) ( i + 1 ) ( 1 + g 1 ) ( i g 2 )
In terms of small companies and big companies, g 1 ( t h e   g r o w t h   r a t e   o f   f i r s t   t e r m ) a n d   i   ( r i s k f r e e   r a t e )   a r e   t h e   s a m e .
However, in the second term, with the control of other effects, growth rate ( g 2 ) of small companies is higher than the big one, so the expected return ( R i ) of small companies is generally higher than the large one.
In other words, high returns of small companies indicate they also carry higher risk. If investors can suffer the risk brought by small companies, they can gain the risk premium which is called ‘size premium’.
3. Book-to-Market Premium
Figure A3. Positive relationship between expected return and book-to-market premium.
Figure A3. Positive relationship between expected return and book-to-market premium.
Jrfm 12 00091 g0a3
Book-to-market premium is the difference between high B/M ratio companies’ average return and low B/M ratio companies’ average return in a diversified portfolio or in the market.
There is an effect named B/M effect which indicates that higher B/M ratio companies has higher excess return. It can be illustrated by prospect theory easily. However, in this case, the x-axis is MV and the y-axis is the value.
Figure A4. Relationship between true and cognitive value of stocks with the growth of MV/B.
Figure A4. Relationship between true and cognitive value of stocks with the growth of MV/B.
Jrfm 12 00091 g0a4
For the higher B/M ratio companies, the MV/B is relative lower and people always overprice the true value of a stock, it leads to the demand for those companies is higher. With the growth of demand and stock price, the return of those stocks also is higher.
Vice versa, higher B/M ratio companies has lower expected return and excess return (expected return minus risk of free rate).
Therefore, if the investor prefers higher B/M ratio companies, they take the risk of B/M effect on one hand, they gain the B/M premium on the other hand.
4. Profitability Premium
Figure A5. Positive relationship between expected return and profitability premium.
Figure A5. Positive relationship between expected return and profitability premium.
Jrfm 12 00091 g0a5
Profitability premium is the difference between robust profitability (higher ROE) companies’ average return and weak profitability (lower ROE) companies’ average return in a diversified portfolio or in the market.
With the control of other effects, companies with robust profitability—measured by the level of ROE (return of equity)—outperform in their expected rate of return and take greater variance. This is because companies with high profits also distribute high dividends.
P   = D ( 1 + g ) i g
According DDM (dividend discount model) formula, higher dividend means higher price and demand which will enhance the level of expected return. If an investor purchases companies with robust profit, they may get higher excess return and fluctuation at the same time. Vice versa, weak profitability companies bring people low excess return and risk.
5. Investment Growth Premium
Figure A6. Positive relationship between expected return and investment growth premium.
Figure A6. Positive relationship between expected return and investment growth premium.
Jrfm 12 00091 g0a6
Investment growth premium is the difference between conservative (lower growth rate of investment or lower growth rate of assets) companies’ average return and aggressive (higher growth rate of investment or higher growth rate of assets) companies average return in a diversified portfolio or in the market.
The reason why aggressive companies may have low excess return and risk is that these kinds of firms allocate more profit into reinvestment rather than dividends, thus it decreases the expected price and return, which leads to low risk. Vice versa, conservative companies bring higher excess return and risk because of larger amount of dividend rather investment.
6. Momentum Premium
Figure A7. Positive relationship between expected return and momentum premium.
Figure A7. Positive relationship between expected return and momentum premium.
Jrfm 12 00091 g0a7
Momentum premium is the difference between higher momentum (higher accumulated return) companies’ average return and lower momentum (lower accumulated return) companies’ average return in a diversified portfolio or in the market.
Momentum is the accumulated return in one quarter. The higher one means the stock is popular with high return and risk. Vice versa, people invest in low momentum companies with low premium and risk.
7. Asset Turnover Premium
Figure A8. Positive relationship between expected return and asset turnover premium.
Figure A8. Positive relationship between expected return and asset turnover premium.
Jrfm 12 00091 g0a8
Asset turnover premium is the difference between higher asset turnover companies’ average return and lower asset turnover companies’ average return in a diversified portfolio in the market.
Asset turnover is the total revenue divided by total asset. The higher one means the stock is popular with high return and risk. Vice versa, people invest in low momentum companies with low premium and risk. The regression analysis was conducted with a opposite direction.

Appendix B. Chi-Square Test of Industry in Different Factors

Table A1 recorded the total chi-square value of 28 industries for seven factors. By conducting the chi-square test, we can find out the different effect of factors to various industries. When the total chi-square value is larger than the critical value (when df = 27), it means that each industry is responding differently to the specific factors. According to the result, six factors passed the test, while Rm-Rf is insignificant. Therefore, we need to discuss the effect of factors based on different industry.
Table A1. Chi-square test of industry in different factors.
Table A1. Chi-square test of industry in different factors.
FactorsChi-Square ( χ 2 )
SMB97.58155
RMW102.9963
HML134.683
CMA85.7257
Rm-Rf38.17769
CRMHL66.58794
AMLH179.6194

Appendix C. Significance Level and Correlation Effect

Table A2. Significance level of factors in different industries.
Table A2. Significance level of factors in different industries.
TitleSMBRMWHMLCMACRMHLAMLHRm-Rf
Extractive0−0.76−0.8−0.720.72−0.721
Media0−0.78571−10001
Electrical equipment0.741935−0.80645−0.935480001
Electronics0.8333330−0.87500.729167−0.708331
Housing00−0.7252700.7142860.7912091
Textiles & garments0−0.70833−0.8750.7083330.91666701
Non-bank finance000000.7142860.964286
Steel−0.73684−0.947370.78947401−0.894741
Utilities0000.7105260.82894701
Defense0−0.8−0.720.80.8−0.841
Chemistry0−0.74699−0.7831300.795181−0.843371
Mechanical equipment0−0.76667−0.7666700.83333301
Computer0.7777780−0.9259300.77777801
Domestic appliance00000.857143−0.76191
Construction material 0.714286−0.7619000.90476201
Construction ornament00000.93333301
Transportation0−0.81633000.85714301
Animal husbandry and fishery00−0.714290001
Automobile0−0.86364−0.7500.818182−0.751
Light manufacturing0−0.73077−0.769230.7307690.730769−0.807691
Commercial 00−0.885250.754098001
Food0.7941180000.794118−0.764711
Telecommunication0.7272730−0.7727300.772727−0.772731
Leisure service0.8125−0.875−0.812500.8125−0.81251
Medical00−0.8041200.835052−0.742270.989691
Bank00.7142861−0.71429111
Nonferrous metal0−0.88372−0.74419−0.837210.790698−0.860471
Comprehensive0−0.92308−0.807690001
In this table, 0 represents that the factor is insignificant to the industry. The absolute value represents the percentage of the significant coefficients. A positive number represents that the factor has positive effect on the industry, while negative number represents that the factor has a negative effect on the industry.

Appendix D. Forecasting the Direction of Factors

Figure A9. Use SMB to forecast the direction of SMB.
Figure A9. Use SMB to forecast the direction of SMB.
Jrfm 12 00091 g0a9
Figure A10. Use RMW to forecast the direction of RMW.
Figure A10. Use RMW to forecast the direction of RMW.
Jrfm 12 00091 g0a10
Figure A11. Use CMA to forecast the direction of CMA.
Figure A11. Use CMA to forecast the direction of CMA.
Jrfm 12 00091 g0a11
Figure A12. Use AMLH and HML to forecast the direction of HML.
Figure A12. Use AMLH and HML to forecast the direction of HML.
Jrfm 12 00091 g0a12
Figure A13. Use HML and CRMHL to forecast the direction of CRMHL.
Figure A13. Use HML and CRMHL to forecast the direction of CRMHL.
Jrfm 12 00091 g0a13
Figure A14. Use RMW and AMLH to forecast the direction of AMLH.
Figure A14. Use RMW and AMLH to forecast the direction of AMLH.
Jrfm 12 00091 g0a14
Figure A15. Use RMW, AMLH, and Rm-Rf to forecast the direction of Rm-Rf.
Figure A15. Use RMW, AMLH, and Rm-Rf to forecast the direction of Rm-Rf.
Jrfm 12 00091 g0a15

Appendix E. Result of Stability Test

Figure A16. Significance level of variables.
Figure A16. Significance level of variables.
Jrfm 12 00091 g0a16
The figures in the red frame are the significance level of time, time^2 and season.

References

  1. Aharoni, Gil, Bruce Grundy, and Qi Zeng. 2013. Stock returns and the miller modigliani valuation formula: Revisiting the fama French analysis. Social Science Electronic Publishing 110: 347–57. [Google Scholar] [CrossRef]
  2. Akter, Nahida, and Ashadun Nobi. 2018. Investigation of the financial stability of s&p 500 using realized volatility and stock returns distribution. Journal of Risk and Financial Management 11: 22. [Google Scholar]
  3. Banz, Rolf W. 1981. The relationship between return and market value of common stocks. Journal of Financial Economics 9: 3–18. [Google Scholar] [CrossRef] [Green Version]
  4. Bhattacharjee, Arnab, and Sudipto Roy. 2019. Abnormal Returns or Mismeasured Risk? Network Effects and Risk Spillover in Stock Returns. Journal of Risk and Financial Management 12: 50. [Google Scholar] [CrossRef]
  5. Bodie, Zvi, Alex Kane, Alan J. Marcus, and Ravi Jain. 2014. Investments. New York: Mc Graw Hill Education. [Google Scholar]
  6. Bodie, Zvi, Alex Kane, and Alan J. Marcus. 2017. Essentials of Investments. New York: McGraw-Hill Education. [Google Scholar]
  7. Carhart, Mark M. 1997. On persistence in mutual fund performance. The Journal of Finance 52: 26. [Google Scholar] [CrossRef]
  8. Chen, Jieting, and Yuichiro Kawaguchi. 2018. Multi-factor asset-pricing models under markov regime switches: Evidence from the chinese stock market. International Journal of Financial Studies 6: 54. [Google Scholar] [CrossRef]
  9. Cisse, Mamadou, Mamadou Konte, Mohamed Toure, and Smael A. Assani. 2019. Contribution to the Valuation of BRVM’s Assets: A Conditional CAPM Approach. Journal of Risk and Financial Management 12: 27. [Google Scholar] [CrossRef]
  10. Fama, Eugene F. 1996. Multifactor portfolio efficiency and multifactor asset pricing. The Journal of Financial and Quantitative Analysis 31: 441–65. [Google Scholar] [CrossRef]
  11. Fama, Eugene F., and Kenneth R. French. 1993. Common risk factors in the returns on stocks and bonds. Journal of Financial Economics 33: 3–56. [Google Scholar] [CrossRef]
  12. Fama, Eugene F., and Kenneth R. French. 2015. A five-factor asset pricing model. Journal of Financial Economics 116: 1–22. [Google Scholar] [CrossRef] [Green Version]
  13. Frazzini, Andrea, David Kabiller, and Lasse H. Pedersen. 2013. Buffett’s alpha. CEPR Discussion Papers 3: 583–90. [Google Scholar]
  14. Hanke, John E., and Dean W. Wichern. 2014. Business Forecasting. Zug: Pearson Schweiz Ag. [Google Scholar]
  15. Hsiao, Cheng. 2003. Analysis of Panel Data. Cambridge: Cambridge University Press. [Google Scholar]
  16. Jegadeesh, Narasimhan, and Sheridan Titman. 1993. Returns to buying winners and selling losers: Implications for stock market efficiency. The Journal of Finance 48: 65–91. [Google Scholar] [CrossRef]
  17. Johnson, Richard Arnold, and Dean W. Wichern. 2008. Applied Multivariate Statistical Analysis. Beijing: Tsinghua University Press. [Google Scholar]
  18. Lars, Tvede. 2006. Business Cycles: History, Theory and Investment Reality. Hoboken: Wiley. [Google Scholar]
  19. Lintner, John. 1965. The valuation of risk assets and the Selection of Risky Investments in Stock Portfolios and Capital Budgets. The Review of Economics and Statistics 47: 13–37. [Google Scholar] [CrossRef]
  20. Markowitz, Harry. 1952. Portfolio selection. Journal of Finance 7: 77–91. [Google Scholar]
  21. Moinak, Maiti, and A. Balakrishnan. 2018. Is human capital the sixth factor? Journal of Economic Studies 45: 710–37. [Google Scholar]
  22. Novy-marx, Robert. 2013. The other side of value: The gross profitability premium. Journal of Financial Economics 108: 1–28. [Google Scholar] [CrossRef] [Green Version]
  23. Peng, Chi-Lu, Kuan-Ling Lai, Maio-Ling Chen, and An-Pin Wei. 2014. Investor sentiment, customer satisfaction and stock returns. Social Science Electronic Publishing 49: 827–50. [Google Scholar] [CrossRef]
  24. Ronzani, André, Osvaldo Candido, and Wilfredo Maldonado. 2017. Goodness-of-fit versus significance: A capm selection with dynamic betas applied to the brazilian stock market. International Journal of Financial Studies 5: 33. [Google Scholar] [CrossRef]
  25. Ross, Stephen A. 1976. The arbitrage theory of capital asset pricing. Journal of Economic Theory 13: 341–60. [Google Scholar] [CrossRef]
  26. Sehgal, Sanjay, and Vibhuti Vasishth. 2015. Past price changes, trading volume and prediction of portfolio returns. Journal of Advances in Management Research 12: 330–56. [Google Scholar] [CrossRef]
  27. Sharpe, William F. 1964. Capital asset prices: A theory of market equilibrium under conditions of risk. The Journal of Finance 19: 18. [Google Scholar]
  28. Wild, John J. 2016. Fundamental Accounting Principles. New York: McGraw-Hill Education. [Google Scholar]
  29. Zahedi, Javad, and Mohammad Mahdi Rounaghi. 2015. Application of artificial neural network models and principal component analysis method in predicting stock prices on tehran stock exchange. Physica A: Statistical Mechanics and its Applications 438: 178–87. [Google Scholar] [CrossRef]
Figure 1. Multi-factor model examination with single stock regression and time-series analysis.
Figure 1. Multi-factor model examination with single stock regression and time-series analysis.
Jrfm 12 00091 g001
Figure 2. Pattern of factor’s change.
Figure 2. Pattern of factor’s change.
Jrfm 12 00091 g002
Figure 3. Observed frequencies: fo.
Figure 3. Observed frequencies: fo.
Jrfm 12 00091 g003
Figure 4. Expected frequencies: fe.
Figure 4. Expected frequencies: fe.
Jrfm 12 00091 g004
Figure 5. Chi-square level: χ 2 = (fofe)2 f.
Figure 5. Chi-square level: χ 2 = (fofe)2 f.
Jrfm 12 00091 g005
Figure 6. Target function optimization.
Figure 6. Target function optimization.
Jrfm 12 00091 g006
Figure 7. Summary of endogeneity and exogeneity for predicting direction of seven risk factors.
Figure 7. Summary of endogeneity and exogeneity for predicting direction of seven risk factors.
Jrfm 12 00091 g007
Figure 8. Four seasons moving average of seven factors.
Figure 8. Four seasons moving average of seven factors.
Jrfm 12 00091 g008
Table 1. Dependent variable and independent variables.
Table 1. Dependent variable and independent variables.
Name LabelNote
DV
Excess return R i R f The return of security mines risk-free rate of return
IV
Abnormal returnaThe constant term of formula
Market premium R m R f The return of market index (in this model, market index is Shanghai stock exchange market index) mines risk-free rate of return
Size premium
(Small minus Big)
SMBThe return on a diversified set of small stocks minus the return on a diversified set of big stocks.
Book-to-market premium (High minus low)HMLThe difference between the returns on diversified portfolios of high and low B/M stocks.
Profitability premium (Robust minus weak)RMWThe difference between the returns on diversified portfolios of stocks with robust and weak profitability.
Investment growth premium
(Conservative minus aggressive)
CMAThe difference between the returns on diversified
portfolios of the stocks of low and high investment firms, which we label as conservative and aggressive.
Momentum premium (High momentum minus low momentum)CRMHLThe difference between higher momentum (higher accumulated return) companies’ average return and lower momentum (lower accumulated return) companies’ average return in a diversified portfolio or in the market.
Asset turnover premium
(Low turnover rate minus high turnover rate)
AMLHThe difference between low asset turnover companies’ average return and higher turnover companies’ average return or in the market.
Table 2. R square mean, coefficients’ mean and p-level for four models.
Table 2. R square mean, coefficients’ mean and p-level for four models.
OLSRidge Regression
Model 1Model 2Model 3Model 4
Mean of R square 0.5692
Jrfm 12 00091 i001
0.6078
Jrfm 12 00091 i002
0.5911
Jrfm 12 00091 i003
0.6048
Jrfm 12 00091 i004
Mean of coefficient (p-level)Rm-Rf−0.29
(0.9535)
Jrfm 12 00091 i005
0.29
(0.9991)
Jrfm 12 00091 i006
0.82
(0.9991)
Jrfm 12 00091 i007
0.21
(0.9991)
Jrfm 12 00091 i008
SMB−0.29
(0.8669)
Jrfm 12 00091 i009
0.34
(0.1641)
Jrfm 12 00091 i010
0.29
(0.8368)
Jrfm 12 00091 i011
RMW−0.74
(0.9189)
Jrfm 12 00091 i012
0.24
(0.9891)
Jrfm 12 00091 i013
0.40
(0.9763)
Jrfm 12 00091 i014
0.22
(0.9298)
Jrfm 12 00091 i015
HML0.13
(0.3874)
Jrfm 12 00091 i016
0.30
(0.9690)
Jrfm 12 00091 i017
−0.27
(0.9444)
Jrfm 12 00091 i018
0.22
(0.9681)
Jrfm 12 00091 i019
CMA−0.11
(0.9335)
Jrfm 12 00091 i020
0.58
(0.9900)
Jrfm 12 00091 i021
−0.50
(0.9763)
Jrfm 12 00091 i022
0.38
(0.9909)
Jrfm 12 00091 i023
CRMHL −0.14
(0.9617)
Jrfm 12 00091 i024
0.33
(0.9243)
Jrfm 12 00091 i025
0.02
(0.9535)
Jrfm 12 00091 i026
AMLH −0.04
(0.9991)
Jrfm 12 00091 i027
−0.72
(0.9991)
Jrfm 12 00091 i028
0.06
(0.9991)
Jrfm 12 00091 i029
Table 3. Correlations between seven factors and excess return of stocks.
Table 3. Correlations between seven factors and excess return of stocks.
SMBRMWHMLCMAAMLHCRMHLRm-Rf
Positive61.0%24.2%23.0%56.2%62.4%77.8%99.8%
Negative22.7%68.7%73.8%42.8%37.5%17.6%0.1%
Uncorrelated16.3%7.0%3.2%0.9%0.1%0.1%0.1%
Table 4. Zero mean residual testing.
Table 4. Zero mean residual testing.
OLS t-ValueRidge Regression t-Value
Model 1Model 2Model 3Model 4
0.58110.53930.54160.5387
Table 5. Relationships of industries and seven factors.
Table 5. Relationships of industries and seven factors.
Positive RelationshipsNegative Relationships
SMB(Group A: 7)
Electrical Equipment, Electronics, Computer, Construction Material, Food, Telecommunication, and Leisure Service
(Group B: 1)
Steel
RMW(Group C: 1)
Banking
(Group D: 16)
Extractive, Media, Electrical Equipment, Textiles & Garments, Steel, Defense, Chemistry, Mechanical Equipment, Computer, Construction Material, Transportation, Automobile, Light Manufacturing, Leisure Service, Nonferrous Metal, and Comprehensive Industries.
HML(Group E: 1)
Banking
(Group F: 19)
Extractive, Media, Electrical Equipment, Electronics, Housing, Textiles & Garments, Steel, Defense, Chemistry, Mechanical equipment, Computer, Animal Husbandry and Fishery, Automobile, Light Manufacturing, Commercial, Telecommunication, Leisure Service, Medical, Nonferrous Metal, and Comprehensive Industries.
CMA(Group G: 5)
Textiles & Garments, Utilities, Defense, Light Manufacturing, and Commercial
(Group H: 3)
Extractive, Banking, and Nonferrous Metal
CRMHL(Group I: 20)
Extractive, Electronics, Housing, Textiles & Garments, Steel, Utilities, Defense, Chemistry, Mechanical Equipment, Computer, Domestic Appliance, Construction material, Transportation, Automobile, Light Manufacturing, Telecommunication, Leisure Service, Medical, Banking, and Nonferrous Metal.
AMLH(Group J: 3)
Housing, Non-Bank Finance, and Banking.
(Group K: 14)
Extractive, Electrical Equipment, Electronics, Steel, Defense, Chemistry, Domestic appliance, Automobile, Light Manufacturing, Food, Telecommunication, Leisure Service, Medical and Nonferrous Metal.
Rm-Rf(Group L: 28)
All industries
Table 6. Trading strategies based on condition of factors.
Table 6. Trading strategies based on condition of factors.
ConditionWhen Factor Is PositiveWhen Factor Is Negative
StrategiesBUYSELLBUYSELL
SMBSmall MV companies in
Group A
Big MV companies in
Group B
Big MV
companies in
Group A
Small MV companies in
Group B
Big MV
companies in
Group A
Small MV companies in
Group B
Small MV companies in
Group A
Big MV companies in
Group B
RMWRobust ROE companies in
Group C
Weak ROE companies in
Group D
Weak ROE companies in
Group C
Robust ROE companies in
Group D
Weak ROE companies in
Group C
Robust ROE companies in
Group D
Robust ROE companies in
Group C
Weak ROE companies in
Group D
HMLHigh B/M companies in
Group E
Low B/M companies in
Group F
Low B/M companies in
Group E
High B/M companies in
Group F
Low B/M companies in
Group E
High B/M companies in
Group F
High B/M companies in
Group E
Low B/M companies in
Group F
CMAConservative (low growth rate of assets) companies in
Group G
Aggressive (High growth rate of assets)
companies in
Group H
Aggressive companies in
Group G
Conservative
companies in
Group H
Aggressive companies in
Group G
Conservative
companies in
Group H
Conservative companies in
Group G
Aggressive
companies in
Group H
CRMHLHigh CR
companies in
Group I
High CR
companies in
Group I
AMLHLow Asset turnover companies in
Group J
High Asset turnover companies in
Group K
High Asset turnover companies in
Group J
Low Asset turnover companies in
Group K
High Asset turnover companies in
Group J
Low Asset turnover companies in
Group K
Low Asset turnover companies in
Group J
High Asset turnover companies in
Group K
Rm-RfAll companies
(Group L)
All companies
(Group L)
Table 7. Back-testing float return from 2007S4 to 2018S3 and expected annual return for factors.
Table 7. Back-testing float return from 2007S4 to 2018S3 and expected annual return for factors.
FactorsFloat Return Expected Annual Return
SMB165.89%15.43%
RMW−6.525%−0.61%
HML−22.29%−2.07%
CMA34.67%3.23%
CRMHL313.09%29.13%
AMLH90.26%8.40%

Share and Cite

MDPI and ACS Style

Huang, J.; Liu, H. Examination and Modification of Multi-Factor Model in Explaining Stock Excess Return with Hybrid Approach in Empirical Study of Chinese Stock Market. J. Risk Financial Manag. 2019, 12, 91. https://doi.org/10.3390/jrfm12020091

AMA Style

Huang J, Liu H. Examination and Modification of Multi-Factor Model in Explaining Stock Excess Return with Hybrid Approach in Empirical Study of Chinese Stock Market. Journal of Risk and Financial Management. 2019; 12(2):91. https://doi.org/10.3390/jrfm12020091

Chicago/Turabian Style

Huang, Jian, and Huazhang Liu. 2019. "Examination and Modification of Multi-Factor Model in Explaining Stock Excess Return with Hybrid Approach in Empirical Study of Chinese Stock Market" Journal of Risk and Financial Management 12, no. 2: 91. https://doi.org/10.3390/jrfm12020091

APA Style

Huang, J., & Liu, H. (2019). Examination and Modification of Multi-Factor Model in Explaining Stock Excess Return with Hybrid Approach in Empirical Study of Chinese Stock Market. Journal of Risk and Financial Management, 12(2), 91. https://doi.org/10.3390/jrfm12020091

Article Metrics

Back to TopTop