1. Introduction
Option is an important tool for investors to obtain benefits and avoid risks in financial derivatives, and option pricing has always been a hot research objective of scholars. The effective pricing of options contributes to the stability of the financial market and the sustained development of the national economy. In order to improve the accuracy of option pricing, scholars have performed a variety of research. Currently, option pricing models are mainly divided into parametric models and non-parametric models. In 1973, Black and Scholes [
1] studied and obtained the famous Black–Scholes (B-S) pricing formula and established the classical parametric pricing model. The B-S formula can lead all investors to a risk-neutral world with risk-free interest rate as the rate of return, and it can predict the price of options better, regardless of their preferences. However, the formula has made many assumptions in advance, for example, the volatility and interest rate of options are assumed to be a constant; the underlying asset follow geometric Brownian motion, etc., is not completely consistent with the actual market situation; thus, the option price calculated by the formula is far from the actual situation. Later, many scholars made corresponding improvements to the model, such as adjusting the time course of the evolution of the underlying asset price, and the interest rate and volatility were subject to a random process [
2,
3,
4,
5,
6,
7,
8,
9,
10,
11,
12,
13,
14,
15] so as to establish option pricing models closer to the actual situation, such as the CIR model [
16,
17] and Heston model [
18,
19,
20,
21,
22].
The above parametric pricing model follows strict assumptions. Once there is a discrepancy between the actual situation and assumed conditions, the pricing result will have a large error with the real price, which greatly affects the accuracy and stability of the model. In order to estimate option prices more accurately, in recent years, some scholars have begun to try to apply machine learning methods to option pricing problems, such as the Non-parametric modular Neural Network [
23] (NNN), Support Vector Machine [
24] (SVM), Decision Tree [
25] (DT), Artificial Neural Network [
26] (ANN), etc. In this method, the model only needs to focus on the relationship between the features in the option data, without considering the complex economic principles and rigorous mathematical derivation, which provides a new modeling idea for option pricing. However, machine learning-based methods have a common shortcoming, which is the manual extraction of data features. This process is complicated and tedious, and the nonlinear fitting ability of some methods is insufficient, which results in a lack of implicit information in the extracted features. In addition, financial option data often have high dimensional and nonlinear characteristics, which results in unsatisfactory classification effects, thus affecting the overall performance of its pricing model.
It is worth noting that deep learning has received great attention in the field of financial time series analysis due to its strong nonlinear fitting ability and feature capture ability. As an important branch of neural networks in machine learning, deep learning can conduct in-depth feature screening and learning of data, desalting irrelevant factors and strengthening relevant factors while learning heterogeneity information. In particular, CNN in deep learning has excellent performance in this aspect. It can automatically extract features. To a certain extent, the more layers set, the more advanced features extracted, and the more information contained, and the better the classification effect. CNN is widely used in graphic and image processing because it requires less hyperparameters and less computation [
27,
28]. LSTM is a temporal recursive neural network, which was first proposed by Hochreiter and Schmidhuber [
29] in 1997. Originating from recurrent neural networks (RNNs), LSTM overcomes the problem of gradient disappearance in the training process of RNN and is an effective tool for long-term prediction. In recent years, the application of LSTM to time-series-dependent data prediction has gradually become popular, and LSTM has relatively mature studies in options pricing [
30], volatility prediction in option market [
31], stock price prediction [
32,
33] and so on. Some scholars try to combine CNN and LSTM to establish CNN-LSTM hybrid models for medical [
34] and stock price prediction [
35], etc. A large number of empirical results show that the single neural network prediction model is often difficult for accurately predicting option prices due to the impact of the discontinuity of trading. Due to complex and random evolution paths of interest rate and volatility that affect option prices in the real market, CIR and Heston models effectively break through the confinement of constant interest rate and constant volatility assumed by the traditional B-S pricing model and can simultaneously meet dual requirements of the market for volatility and interest rate and better adapt to the real option market. Therefore, combining the classical option pricing parameter model with the deep learning non-parametric model while establishing a hybrid pricing model has become a new research direction The dual-hybrid model can effectively overcome problems existing in parametric model and non-parametric model, such as poor nonlinear fitting ability and poor interpretation.
With the advent of the era of Big Data, a large amount of data is generated in the process of option trading, as well as characteristic data that may affect the final price of options. In machine learning experiments, especially the pricing process of financial derivatives, feature engineering has always been the focus of scholars’ research. Effective analysis of option pricing feature data can reduce the complexity of the pricing model and can greatly improve the prediction accuracy and stability of the model. Some scholars screen features in order to reduce their dimensionality to study and discuss important features. The main methods adopted are principal component analysis [
36] (PCA), random forest [
37,
38] (RF), factor analysis [
39] (FA), etc. Random forest is a data mining classification algorithm based on statistical sampling theory (Bootstrap), which is subordinate to ensemble learning. There are two types of random forest regression and random forest classification. Another group of scholars focused on the internal relations between features and adopts, for example, the least squares method (LS) performs feature correlation analysis. Feature-processed data input not only greatly reduces the running time of the model but also effectively improves prediction accuracy, which is more suitable for the option pricing in the real market.
In summary, in order to price options more effectively, improve prediction accuracy and model performance and maintain the sustainable and stable development of the financial market, this paper combines the CIR-Heston stochastic mixed parameter model and the CNN-LSTM neural network mixed non-parametric model to propose a new dual-mixed model for option pricing. This is conducted to better explore the performance of the dual-hybrid model in option price prediction; effectively predict the price trend; avoid losses; provide investors with reference; perform random forest feature importance sorting on the original input feature variables of CSI 300ETF option historical data; filter out the main features; use historical data processed by feature engineering as experimental data to train and test the above-mentioned dual-mixed model; to compare with the prediction results of reference models; and to integrate ridge regression, which completes empirical analysis. The dual-hybrid model and the reference model were tested for model robustness to verify the overall performance of the model. The article provides a new method of thinking for pricing financial derivatives by considering the establishment of a dual-mixed model and adopting different economic models based on the characteristics of experimental data to more closely fit the real financial market.
Author Contributions
Conceptualization, K.Z. and J.Z.; methodology, K.Z.; software, K.Z.; validation, K.Z. and J.Z.; formal analysis, K.Z.; investigation, K.Z.; resources, K.Z.; data curation, K.Z.; writing—original draft preparation, K.Z.; writing—review and editing, K.Z., J.Z. and Q.L.; visualization, K.Z.; supervision, K.Z., J.Z. and Q.L; project administration, J.Z.; funding acquisition, J.Z. All authors have read and agreed to the published version of the manuscript.
Funding
This research was funded by National Nature Science Foundation of China, Grant Number 51675161.
Data Availability Statement
Conflicts of Interest
The authors declare no conflict of interest.
References
- Black, F.; Scholes, M. The pricing of options and corporate liabilities. J. Political Econ. 1973, 81, 637–659. [Google Scholar]
- Han, Y.C.; Liu, C.Y.; Song, Q.S. Pricing double volatility barriers option under stochastic volatility. Stochastics 2020, 93, 625–645. [Google Scholar]
- Liu, Z.P. Option Pricing Formulas in a new uncertain mean-reverting stock model with floating interest rate. Discret. Dyn. Nat. Soc. 2020, 3, 1–8. [Google Scholar]
- Zhang, L.D.; Sun, Y.; Meng, X. European spread option pricing with the floating interest rate for uncertain financial market. Math. Probl. Eng. 2020, 8, 1–8. [Google Scholar]
- He, X.J.; Lin, S. A fractional Black-Scholes model with stochastic volatility and European option pricing. Expert Syst. Appl. 2021, 178, 114983. [Google Scholar]
- Kim, D.H.; Choi, S.Y.; Yoon, J.H. Pricing of vulnerable options under hybrid stochastic and local volatility. Chaos Solitons Fractals 2021, 146, 110–118. [Google Scholar]
- Wang, L.; Zhang, S. Pricing the Asian option under vasier interest rate. Acat Math. Appl. Sin. 2003, 26, 15–24. [Google Scholar]
- Li, S.; Li, S.H. Exotic options pricing formulae with stochastic interest rates. Acta Math. Sin. 2008, 51, 299–310. [Google Scholar]
- Huang, W.; Tao, X.; Li, S. Pricing formulae for european options under the fractional vasicek interest rate model. Acta Math. Sin. 2012, 55, 219–230. [Google Scholar]
- Geng, Y.J.; Zhou, S.W.; Mathematics, D.O. Pricing Asian option under mixed jump-fraction process. J. East China Norm. Univ. 2017, 3, 29–38. [Google Scholar]
- Zhou, X.; Pan, J. Pricing of power options based on mixed fractional Hull-White interest rate model. J. Hefei Univ. Technol. 2017, 40, 847–853. [Google Scholar]
- Wang, Y.; Xue, H. Vulnerable Option Pricing under Bi-fractional Jump—Diffusion Process. J. Hangzhou Norm. Univ. 2018, 17, 437–442. [Google Scholar]
- Yang, Z. Pricing European Lookback Option in a Special Kind of Mixed Jump-Diffusion Black-Scholes Model. Acta Math. Sci. 2019, 39, 1514–1531. [Google Scholar]
- Yang, Y.; Liu, G.; Wang, Y. Barrier Option Pricing Based on CIR Stochastic Volatility Model. Nat. Sci. J. Harbin Norm. Univ. 2019, 35, 1–4. [Google Scholar]
- Djeutcha, E.; Fono, L.A. Pricing for options in a Hull-White-Vasicek volatility and interest rate model. Appl. Math. Sci. 2021, 15, 377–384. [Google Scholar]
- Cox, J.C.; Ingersolli, J.E.; Ross, A. A Theory of the Term Structure of Interest Rates. Econometrica 1985, 53, 385–407. [Google Scholar]
- Chang, Y.; Wang, Y.; Zhang, S. Option Pricing under Double Heston Jump-Diffusion Model with Approximative Fractional Stochastic Volatility. Math. Probl. Eng. 2021, 9, 126. [Google Scholar]
- Heston, S.L. A closed-form solution for options with stochastic volatility with applications to bonds and currency options. Rev. Financ. Stud. 1993, 6, 327–343. [Google Scholar]
- Heston, S.; Nandi, S.A. Closed-Form GARCH Option Valuation Model. Rev. Financ. Stud. 2000, 13, 585–625. [Google Scholar]
- Zhang, L.J.; Zhang, W.Y.; Economics, S.O. Option pricing model by applying hybrid neural network based on heston model and genetic algorithm. J. Ind. Eng. Eng. Manag. 2018, 32, 142–149. [Google Scholar]
- Sun, Y.; Tian, J.; Chen, Y. Pricing of Arithmetic Average Asian Option under the Fractional Jump Diffusion Heston Model. J. Hangzhou Norm. Univ. 2019, 18, 629–635. [Google Scholar]
- Huang, C. An Empirical Study on the Volatility of CSI 300 Index based on Arch Model. In Proceedings of the 3rd Guangzhou International Forum on Finance (GZIFF), Guangzhou, China, 19 November 2020; pp. 292–299. [Google Scholar]
- Gradojevic, N.; Ramazan, G.; Dragan, K. Option pricing with modular neural networks. IEEE Trans. Neural Netw. 2009, 20, 626–637. [Google Scholar] [PubMed] [Green Version]
- Nikon, M.; Mansourfar, G.; Bagherzad, E.H.J. Stock price prediction using DEEP learning algorithm and its comparison with machine learning algorithms. Intelligent Systems in Accounting. Financ. Manag. 2019, 26, 164–174. [Google Scholar]
- Ivacu, C.F. Option pricing using Machine Learning. Expert Syst. Appl. 2020, 163, 5–12. [Google Scholar]
- Tan, D.D. S&P 500 Index Option Pricing Based on the BP Neural Networks. Stat. Inf. Forum 2008, 11, 40–43. [Google Scholar]
- Ikram, A.; Liu, Y. Skeleton Based Dynamic Hand Gesture Recognition using LSTM and CNN. In Proceedings of the 2020 2nd International Conference on Image Processing and Machine Vision (IPMV), Bangkok, Thailand, 5 August 2020; pp. 63–68. [Google Scholar]
- Xu, P.C.; Liu, B.Y. Interactive Behavior Recognition based on Image Enhancement and Deep CNN Learning. Commun. Technol. 2019, 52, 701–706. [Google Scholar]
- Li, C.J.; Qu, Z.; Wang, S.Y.; Liu, L. A Method of Cross-layer Fusion Multi-object Detection and Recognition Based on Improved Faster R-CNN Model in Complex Traffic Environment. Pattern Recognit. Lett. 2021, 145, 127–134. [Google Scholar]
- Hochreiter, S.; Schmidhuber, J. Long Short-Term Memory. Neural Comput. 1997, 9, 1735–1780. [Google Scholar]
- Xie, H.L.; You, T. Research on European Stock Index Options Pricing based on Deep Learning Algorithm:Evidence from 50ETF Options Markets. Stat. Inf. Forum 2018, 33, 99–106. [Google Scholar]
- Jun, R.; Jianhua, W.; Chuanmei, W.; Jianxiang, W. Stock Index Forecast based on Regularized LSTM Model. Comput. Appl. Softw. 2018, 35, 44–48. [Google Scholar]
- Feng, Y.X.; Li, Y.M. A Research on The CSI 300 Index Prediction Model Based on Lstm Neural Network. Math. Pract. Theory 2019, 49, 308–315. [Google Scholar]
- Rai, H.M.; Chatterjee, K. Hybrid CNN-LSTM deep learning model and ensemble technique for automatic detection of myocardial infarction using big ECG data. Appl. Intell. 2021, 1–19. [Google Scholar] [CrossRef]
- Anuradha, J. Big data based stock trend prediction using deep CNN with reinforcement-LSTM model. Int. J. Syst. Assur. Eng. Manag. 2021, 1, 1–11. [Google Scholar] [CrossRef]
- Ashok, J.; Rajan, E.G. Principal Component Analysis Based Image Recognition. Int. J. Comput. Sci. Inf. Technol. 2010, 1, 11–24. [Google Scholar]
- Breiman, L. Random forests. Mach. Learn 2001, 45, 5–32. [Google Scholar]
- Abdulkareem, N.M.; Abdulazeez, A.M. Machine Learning Classification Based on Radom Forest Algorithm: A Review. Int. J. Sci. Bus. 2021, 5, 128–142. [Google Scholar]
- Yadav, K.; Sircar, A. Modeling parameters influencing city gas distribution sector based on factor analysis method. Pet. Res. 2021, 1, 1–11. [Google Scholar]
- Alheety, M.I.; Kibria, B.M. A new version of unbiased ridge regression estimator under the stochastic restricted linear regression model. Commun. Stat. Simul. Comput. 2021, 50, 1–11. [Google Scholar]
- Yaman, I.; Dalkili, T.E. A hybrid approach to cardinality constraint portfolio selection problem based on nonlinear neural network and genetic algorithm. Expert Syst. Appl. 2020, 169, 114517. [Google Scholar]
- Spelta, A.; Pecora, N.; Pagnottoni, P. Chaos based portfolio selection: A nonlinear dynamics approach. Expert Syst. Appl. 2022, 188, 16005. [Google Scholar]
- Peter, J.M.; Yuehua, W.; Hong, X. Portfolio Optimization for Binary Options Based on Relative Entropy. Entropy 2020, 22, 752. [Google Scholar]
- Ma, Y.L.; Han, R.Z.; Wang, W.Z. Portfolio optimization with return prediction using deep learning and machine learning. Expert Syst. Appl. 2021, 165, 113973. [Google Scholar]
| Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).