A Machine Learning Approach for Investigating the Determinants of Stock Price Crash Risk: Exploiting Firm and CEO Characteristics
Abstract
:1. Introduction
2. Research Methodology
2.1. Data and Sample
2.2. Measuring Stock Crash Risk
2.3. Measuring Determinants
3. Results
3.1. Evaluation Criterion
3.2. Model Evaluation and Comparison
3.3. Model Interpretation
4. Discussion
5. Conclusions
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Barber, B.M.; Odean, T. The behavior of individual investors. Handbook Econ. Financ. 2013, 2, 1533–1570. [Google Scholar] [CrossRef]
- Bleck, A.; Liu, X. Market transparency and the accounting regime. J. Account. 2007, 45, 229–256. [Google Scholar] [CrossRef]
- Jin, L.; Myers, S.C. R2 around the world: New theory and new tests. J. Financ. Econ. 2006, 79, 257–292. [Google Scholar] [CrossRef]
- Hutton, A.P.; Marcus, A.J.; Tehranian, H. Opaque financial reports, R2, and crash risk. J. Financ. Econ. 2009, 94, 67–86. [Google Scholar] [CrossRef]
- Liu, C.; Chen, Y.; Huang, S.; Chen, X.; Liu, F. Assessing the Determinants of Corporate Risk-Taking Using Machine Learning Algorithms. Systems 2023, 11, 263. [Google Scholar] [CrossRef]
- Deng, S.; Zhu, Y.; Duan, S.; Fu, Z.; Liu, Z. Stock Price Crash Warning in the Chinese Security Market Using a Machine Learning-Based Method and Financial Indicators. Systems 2022, 10, 108. [Google Scholar] [CrossRef]
- Chen, J.; Hong, H.; Stein, J.C. Forecasting crashes: Trading volume, past returns, and conditional skewness in stock prices. J. Financ. Econ. 2001, 61, 345–381. [Google Scholar] [CrossRef]
- Baik BO, K.; Farber, D.B.; Lee SA, M. CEO ability and management earnings forecasts. Contemp. Account. Res. 2011, 28, 1645–1668. [Google Scholar] [CrossRef]
- Xie, W.; Ye, C.; Wang, T.; Shen, Q. M&A goodwill, information asymmetry and stock price crash risk. Econ. Res.-Ekon. Istraz. 2020, 33, 3385–3405. [Google Scholar] [CrossRef]
- Andreou, P.C.; Louca, C.; Petrou, A.P. CEO age and stock price crash risk. Int. Rev. Financ. 2017, 21, 1287–1325. [Google Scholar] [CrossRef]
- Jiang, X.; Zhu, J.; Akbar, A.; Hou, Z.; Bao, X. The dark side of executives’ professional background: Evidence from Chinese firm’s stock price crash risk. Manag. Decis. Econ. 2022, 43, 3771–3784. [Google Scholar] [CrossRef]
- Kim, J.B.; Liao, S.; Liu, Y. Married CEOs and stock price crash risk. Eur. Financ. Manag. 2022, 28, 1376–1412. [Google Scholar] [CrossRef]
- Li, Y.; Zeng, Y. The impact of top executive gender on asset prices: Evidence from stock price crash risk. J. Corp. Financ. 2019, 58, 528–550. [Google Scholar] [CrossRef]
- Al Mamun, M.; Balachandran, B.; Duong, H.N. Powerful CEOs and stock price crash risk. J. Corp. Financ. 2020, 62, 101582. [Google Scholar] [CrossRef]
- Kim, J.B.; Wang, Z.; Zhang, L. CEO overconfidence and stock price crash risk. Contemp. Account. Res. 2016, 33, 1720–1749. [Google Scholar] [CrossRef]
- Habib, A.; Hasan, M.M.; Jiang, H. Stock price crash risk: Review of the empirical literature. Account. Financ. 2018, 58, 211–251. [Google Scholar] [CrossRef]
- Zhu, Y.; Xie, C.; Wang, G.J.; Yan, X.G. Comparison of individual, ensemble and integrated ensemble machine learning methods to predict China’s SME credit risk in supply chain finance. Neural Comput. Appl. 2017, 28, 41–50. [Google Scholar] [CrossRef]
- Choudhury, P.; Allen, R.T.; Endres, M.G. Machine learning for pattern discovery in management research. Strateg. Manag. J. 2021, 42, 30–57. [Google Scholar] [CrossRef]
- Liu, F.; Wang, R.; Fang, M. Mapping green innovation with machine learning: Evidence from China. Technol. Forecast. Soc. Chang. 2024, 200, 123107. [Google Scholar] [CrossRef]
- Liu, F.; Long, X.; Dong, L.; Fang, M. What makes you entrepreneurial? Using machine learning to investigate the determinants of entrepreneurship in China. China Econ. Rev. 2023, 81, 102029. [Google Scholar] [CrossRef]
- Liu, F.; Huang, W.; Zhang, J.; Fang, M. Corporate social responsibility in family business: Using machine learning to uncover who is doing good. Technol. Soc. 2024, 76, 102453. [Google Scholar] [CrossRef]
- Chatzis, S.P.; Siakoulis, V.; Petropoulos, A.; Stavroulakis, E.; Vlachogiannakis, N. Forecasting stock market crisis events using deep and statistical machine learning techniques. Expert Syst. Appl. 2018, 112, 353–371. [Google Scholar] [CrossRef]
- Wang, M.; Yu, Y.; Liu, F. Does digital transformation curb the formation of zombie firms? A machine learning approach. Technol. Anal. Strat. Manag. 2023, 1–17. [Google Scholar] [CrossRef]
- Zhang, J.; Zhu, M.; Liu, F. Find who is doing social good: Using machine learning to predict corporate social responsibility performance. Oper. Manag. Res. 2023, 17, 253–266. [Google Scholar] [CrossRef]
- Dimson, E. Risk measurement when shares are subject to infrequent trading. J. Financ. Econ. 1979, 7, 197–226. [Google Scholar] [CrossRef]
- Ben-Nasr, H.; Bouslimi, L.; Zhong, R. Do patented innovations reduce stock price crash risk? Inter. Rev. Financ. 2021, 21, 3–36. [Google Scholar] [CrossRef]
- Chen, Y.; Fan, Q.; Yang, X.; Zolotoy, L. CEO early-life disaster experience and stock price crash risk. J. Corp. Financ. 2021, 68, 101928. [Google Scholar] [CrossRef]
- Hasan, M.M.; Taylor, G.; Richardson, G. Brand capital and stock price crash risk. Manag. Sci. 2022, 68, 7221–7247. [Google Scholar] [CrossRef]
- Baek, Y.; Kim, H.Y. ModAugNet: A new forecasting framework for stock market index value with an overfitting prevention LSTM module and a prediction LSTM module. Expert Syst. Appl. 2018, 113, 457–480. [Google Scholar] [CrossRef]
- Lv, P.; Shu, Y.; Xu, J.; Wu, Q. Modal decomposition-based hybrid model for stock index prediction. Expert Syst. Appl. 2022, 202, 117252. [Google Scholar] [CrossRef]
- Lundberg, S.M.; Lee, S.I. A unified approach to interpreting model predictions. In Proceedings of the 31st International Conference on Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017. [Google Scholar]
- Zhang, M.; Xie, L.; Xu, H. Corporate philanthropy and stock price crash risk: Evidence from China. J. Bus. Ethics. 2016, 139, 595–617. [Google Scholar] [CrossRef]
- Xu, J.; Zou, L. The impact of CEO pay and its disclosure on stock price crash risk: Evidence from China. China Financ. Rev. Int. 2019, 9, 479–497. [Google Scholar] [CrossRef]
Panel A. Distribution of Sample Firms Across Industries | ||
---|---|---|
Industry | Frequency | Percentage |
Computers, Communication Equipment, and Other Electronic Equipment Manufacturing | 201 | 10.06% |
Pharmaceutical Manufacturing | 168 | 8.40% |
Electrical Machinery and Equipment Manufacturing | 138 | 6.90% |
Software and Information Technology Services | 136 | 6.80% |
Raw Chemical Materials and Chemical Products Manufacturing | 134 | 6.70% |
Special-purpose Machinery Manufacturing | 134 | 6.70% |
Real Estate | 77 | 3.85% |
General-purpose Machinery Manufacturing | 68 | 3.40% |
Automobile Manufacturing | 57 | 2.85% |
Retail | 53 | 2.65% |
Nonmetal Mineral Products | 48 | 2.40% |
Rubber and Plastic Products | 45 | 2.25% |
Wholesale | 41 | 2.05% |
Internet and Other Related Services | 41 | 2.05% |
Smelting and Pressing of Nonferrous Metals | 36 | 1.80% |
Metal Products | 34 | 1.70% |
Instrument and Apparatus Manufacturing | 34 | 1.70% |
Wine, Beverage, and Refined Tea Manufacturing | 30 | 1.50% |
Railway, Shipping, Aerospace, and Other Transport Equipment Manufacturing | 29 | 1.45% |
Agricultural and Sideline Product Processing | 28 | 1.40% |
Business Services | 27 | 1.35% |
Food Manufacturing | 26 | 1.30% |
Ecological Protection and Environmental Governance | 24 | 1.20% |
Civil Engineering Construction | 23 | 1.15% |
Textile Garments and Clothing | 22 | 1.10% |
Production and Supply of Electric Power and Heat Power | 20 | 1.00% |
Others | 325 | 16.26% |
Total | 1999 | 100.00% |
Panel B. Distribution of Firm-year Observations Across Industries | ||
Industry | Frequency | Percentage |
Computers, Communication Equipment, and Other Electronic Equipment Manufacturing | 1117 | 9.37% |
Pharmaceutical Manufacturing | 1040 | 8.73% |
Electrical Machinery and Equipment Manufacturing | 807 | 6.77% |
Software and Information Technology Services | 796 | 6.68% |
Raw Chemical Materials and Chemical Products Manufacturing | 761 | 6.39% |
Special-purpose Machinery Manufacturing | 757 | 6.35% |
Real Estate | 560 | 4.70% |
General-purpose Machinery Manufacturing | 400 | 3.36% |
Retail | 367 | 3.08% |
Automobile Manufacturing | 333 | 2.79% |
Nonmetal Mineral Products | 311 | 2.61% |
Wholesale | 281 | 2.36% |
Rubber and Plastic Products | 256 | 2.15% |
Metal Products | 243 | 2.04% |
Wine, Beverage, and Refined Tea Manufacturing | 230 | 1.93% |
Internet and Other Related Services | 222 | 1.86% |
Smelting and Pressing of Nonferrous Metals | 220 | 1.85% |
Agricultural and Sideline Product Processing | 186 | 1.56% |
Instrument and Apparatus Manufacturing | 176 | 1.48% |
Business Services | 170 | 1.43% |
Textile Garments and Clothing | 147 | 1.23% |
Railway, Shipping, Aerospace and Other Transport Equipment Manufacturing | 147 | 1.23% |
Food Manufacturing | 138 | 1.16% |
Civil Engineering Construction | 138 | 1.16% |
Textiles | 120 | 1.01% |
Ecological Protection and Environmental Governance | 119 | 1.00% |
Production and Supply of Electric Power and Heat Power | 105 | 0.88% |
Others | 1768 | 14.84% |
Total | 11,915 | 100.00% |
Variable | Definition | Mean | SD |
---|---|---|---|
Firm characteristics | |||
Firm age | Years since the firm’s founding | 18.535 | 5.531 |
IPO age | Years since the firm’s IPO | 9.795 | 6.872 |
LogSize | ln(Total assets) | 22.055 | 1.145 |
Leverage | Total debt/Total assets | 0.413 | 0.207 |
Goodwill | Goodwill/Total assets; Zero replaces the missing values in goodwill | 0.038 | 0.082 |
Brand capital | Advertising expenses/Total assets | 0.038 | 0.082 |
Cash | (Cash+Short-term Investments)/Total assets | 0.167 | 0.129 |
ROA | Net income/Total assets | 0.055 | 0.070 |
ROE | Net income/Total equity | 0.059 | 0.141 |
Sigma | Standard deviation of weekly stock revenue over a year | 0.052 | 0.020 |
RET | Average weekly returns of a specific firm over a year | 0.000 | 0.007 |
DTURN | Current-year mean monthly share turnover–Last-year mean monthly share turnover | 0.140 | 24.296 |
CEO characteristics | |||
CEO gender | “1” if the CEO is male and “0” if the CEO is female | 0.930 | 0.256 |
CEO age | Age of the CEO | 49.724 | 6.508 |
CEO education | “1” if the CEO holds a postgraduate degree and “0” otherwise | 0.465 | 0.499 |
CEO MBA | “1” if the CEO holds an MBA degree and “0” otherwise. | 0.077 | 0.267 |
CEO duality | “1” if the CEO also serves as chairman and “0” otherwise | 0.314 | 0.464 |
CEO tenure | Number of years in a CEO position with a particular company | 1.315 | 0.798 |
CEO pay | ln(Total Annual Salary + 1) | 13.127 | 1.813 |
CEO shareholdings | ln(Outstanding Shares held by CEO + 1) | 9.055 | 7.903 |
CEO board experience | “1” if the CEO is also a director and “0” otherwise | 0.918 | 0.274 |
CEO academic experience | “1” if the CEO has the experience of (a) teaching at a college, (b) working at a research laboratory, and (c) researching at an institute, and “0” otherwise | 0.234 | 0.423 |
CEO overseas experience | “1” if the CEO has overseas experience and “0” otherwise | 0.094 | 0.292 |
CEO production | “1” if the CEO has career experience in the production area and “0” otherwise | 0.127 | 0.333 |
CEO RD | “1” if the CEO has career experience in the R&D area and “0” otherwise | 0.266 | 0.442 |
CEO design | “1” if the CEO has career experience in the design area and “0” otherwise | 0.031 | 0.172 |
CEO HRM | “1” if the CEO has career experience in the human resource management area and “0” otherwise | 0.022 | 0.146 |
CEO administration | “1” if the CEO has career experience in the administration area and “0” otherwise | 1.000 | 0.000 |
CEO marketing | “1” if the CEO has career experience in the marketing area and “0” otherwise | 0.291 | 0.454 |
CEO finance | “1” if the CEO has career experience in the finance area and “0” otherwise | 0.136 | 0.342 |
CEO accounting | “1” if the CEO has career experience in the accounting area and “0” otherwise | 0.105 | 0.306 |
Model | MSE | MSE kf1 | MSE kf2 | MSE kf3 | MSE kf4 | MSE kf5 | Mean |
---|---|---|---|---|---|---|---|
Panel A. Using NCSKEW as the measure | |||||||
XGBoost | 0.4557 | 0.4883 | 0.4654 | 0.5337 | 0.5222 | 0.5518 | 0.5123 |
GBDT | 0.4578 | 0.4831 | 0.4853 | 0.5522 | 0.5036 | 0.5372 | 0.5123 |
AdaBoost | 0.4522 | 0.5032 | 0.4898 | 0.5417 | 0.5109 | 0.5502 | 0.5191 |
Bagging | 0.4582 | 0.5055 | 0.4939 | 0.5503 | 0.5168 | 0.5553 | 0.5243 |
Random forest | 0.5196 | 0.4971 | 0.5305 | 0.5722 | 0.5430 | 0.6794 | 0.5644 |
Extra-Trees | 0.5385 | 0.5402 | 0.5351 | 0.5810 | 0.5647 | 0.6586 | 0.5759 |
Lasso | 0.5241 | 0.6591 | 0.6528 | 0.5668 | 0.5531 | 0.6562 | 0.6176 |
Ridge | 0.5248 | 0.6666 | 0.6528 | 0.5675 | 0.5550 | 0.6551 | 0.6194 |
MLPRegressor | 0.9078 | 0.7692 | 0.6892 | 0.7641 | 0.7241 | 0.8027 | 0.7499 |
Elastic net | 0.8124 | 0.7649 | 0.8174 | 0.9246 | 0.8383 | 0.8271 | 0.8345 |
Decision tree | 1.0060 | 1.0545 | 1.0502 | 0.9974 | 1.1352 | 1.1662 | 1.0807 |
Panel B. Using DUVOL as the measure | |||||||
XGBoost | 0.2186 | 0.2459 | 0.2657 | 0.2589 | 0.2472 | 0.2293 | 0.2494 |
GBDT | 0.2215 | 0.2366 | 0.2819 | 0.2715 | 0.2295 | 0.2302 | 0.2499 |
AdaBoost | 0.2235 | 0.2559 | 0.2851 | 0.2717 | 0.2377 | 0.2491 | 0.2599 |
Bagging | 0.2243 | 0.2555 | 0.2808 | 0.2717 | 0.2333 | 0.2585 | 0.2600 |
Random forest | 0.2683 | 0.2501 | 0.3257 | 0.3008 | 0.2610 | 0.3519 | 0.2979 |
Extra-Trees | 0.2938 | 0.2915 | 0.3245 | 0.3298 | 0.2878 | 0.3279 | 0.3123 |
Lasso | 0.2847 | 0.4207 | 0.4454 | 0.3260 | 0.2801 | 0.3447 | 0.3634 |
Ridge | 0.2853 | 0.4245 | 0.4458 | 0.3259 | 0.2808 | 0.3437 | 0.3641 |
MLPRegressor | 0.6372 | 0.4650 | 0.5544 | 0.6304 | 0.4168 | 0.4075 | 0.4948 |
Elastic net | 0.4766 | 0.5588 | 0.5457 | 0.5194 | 0.4912 | 0.4720 | 0.5174 |
Decision tree | 0.5534 | 0.5155 | 0.5811 | 0.6509 | 0.5470 | 0.4683 | 0.5526 |
Variable | SHAP1 | Effects1 | Rank1 | SHAP2 | Effects2 | Rank2 | Average Rank |
---|---|---|---|---|---|---|---|
RET | 0.4565 | −0.9212 | 1 | 0.4441 | −0.9330 | 1 | 1 |
Sigma | 0.1206 | −0.9230 | 2 | 0.0738 | −0.8016 | 2 | 2 |
IPO age | 0.0779 | −0.8919 | 3 | 0.0641 | −0.8554 | 3 | 3 |
LogSize | 0.0624 | −0.8781 | 4 | 0.0518 | −0.9182 | 4 | 4 |
DTURN | 0.0296 | −0.1388 | 5 | 0.0215 | −0.5543 | 5 | 5 |
Brand capital | 0.0257 | −0.1839 | 6 | 0.0178 | −0.3067 | 7 | 6.5 |
CEO pay | 0.0193 | −0.4289 | 9 | 0.0203 | −0.5393 | 6 | 7.5 |
Leverage | 0.0169 | 0.3330 | 10 | 0.0155 | 0.5731 | 8 | 9 |
Cash | 0.0217 | −0.1021 | 7 | 0.0081 | −0.1376 | 12 | 9.5 |
Goodwill | 0.0115 | 0.2983 | 11 | 0.0119 | 0.7344 | 9 | 10 |
ROE | 0.0197 | −0.0529 | 8 | 0.0080 | −0.2135 | 13 | 10.5 |
Firm age | 0.0104 | −0.3905 | 13 | 0.0096 | −0.7212 | 10 | 11.5 |
ROA | 0.0110 | 0.5675 | 12 | 0.0085 | 0.6158 | 11 | 11.5 |
CEO accounting | 0.0086 | 0.8266 | 15 | 0.0046 | 0.7113 | 15 | 15 |
CEO shareholdings | 0.0079 | 0.3512 | 17 | 0.0058 | −0.2465 | 14 | 15.5 |
CEO age | 0.0081 | 0.1521 | 16 | 0.0045 | 0.4151 | 16 | 16 |
CEO marketing | 0.0090 | −0.7914 | 14 | 0.0041 | −0.8544 | 18 | 16 |
CEO tenure | 0.0067 | 0.2150 | 18 | 0.0039 | 0.3601 | 19 | 18.5 |
CEO education | 0.0016 | −0.8944 | 21 | 0.0042 | −0.7742 | 17 | 19 |
CEO academic experience | 0.0018 | 0.6689 | 19 | 0.0015 | 0.8396 | 20 | 19.5 |
CEO RD | 0.0006 | −0.0171 | 24 | 0.0006 | −0.4033 | 21 | 22.5 |
CEO duality | 0.0007 | −0.3051 | 23 | 0.0004 | 0.0793 | 26 | 24.5 |
CEO board experience | 0.0004 | 0.4694 | 26 | 0.0005 | 0.0919 | 24 | 25 |
CEO design | 0.0017 | −0.5892 | 20 | 0.0000 | −0.0667 | 30 | 25 |
CEO finance | 0.0004 | 0.4053 | 27 | 0.0005 | 0.5884 | 23 | 25 |
CEO HRM | 0.0010 | 0.7714 | 22 | 0.0001 | 0.4582 | 28 | 25 |
CEO production | 0.0006 | −0.5677 | 25 | 0.0004 | −0.3166 | 25 | 25 |
CEO gender | 0.0002 | 0.3752 | 30 | 0.0005 | −0.7119 | 22 | 26 |
CEO overseas experience | 0.0002 | −0.3707 | 29 | 0.0002 | −0.2451 | 27 | 28 |
CEO MBA | 0.0003 | 0.5626 | 28 | 0.0001 | 0.5475 | 29 | 28.5 |
CEO administration | 0.0000 | 0.0000 | 31 | 0.0000 | 0.0000 | 31 | 31 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Li, Y.; Xue, H.; Wei, S.; Wang, R.; Liu, F. A Machine Learning Approach for Investigating the Determinants of Stock Price Crash Risk: Exploiting Firm and CEO Characteristics. Systems 2024, 12, 143. https://doi.org/10.3390/systems12050143
Li Y, Xue H, Wei S, Wang R, Liu F. A Machine Learning Approach for Investigating the Determinants of Stock Price Crash Risk: Exploiting Firm and CEO Characteristics. Systems. 2024; 12(5):143. https://doi.org/10.3390/systems12050143
Chicago/Turabian StyleLi, Yan, Huiyuan Xue, Shiyu Wei, Rongping Wang, and Feng Liu. 2024. "A Machine Learning Approach for Investigating the Determinants of Stock Price Crash Risk: Exploiting Firm and CEO Characteristics" Systems 12, no. 5: 143. https://doi.org/10.3390/systems12050143
APA StyleLi, Y., Xue, H., Wei, S., Wang, R., & Liu, F. (2024). A Machine Learning Approach for Investigating the Determinants of Stock Price Crash Risk: Exploiting Firm and CEO Characteristics. Systems, 12(5), 143. https://doi.org/10.3390/systems12050143