Predicting Success of Outbound Telemarketing in Insurance Policy Loans Using an Explainable Multiple-Filter Convolutional Neural Network
Abstract
:1. Introduction
1.1. Objectives
1.2. Contributions
- First research in the field of insurance policy loans: We propose an explainable deep learning model based on a CNN architecture to predict the success of outbound telemarketing using insurance policy loan data. To the best of our knowledge, the present work is the first in the field.
- New dataset configuration: We utilize newly constructed data to predict the success of outbound telemarketing of insurance policy loans. This dataset comprised 153 variables extracted from 44,412 customers.
- Information loss minimization: We used high-dimensional insurance policy loan data consisting of more than 200 input dimensions without feature selection, which allowed the advantages of a deep learning model to be exploited by extracting features from input data and minimizing information loss due to feature selection.
- Performance superiority and feasibility in practice: We confirmed that the proposed XmCNN model significantly outperformed the machine learning and deep learning models used for comparison. In particular, an ensemble model built with the proposed deep learning model showed the lowest false positive rate and the highest F1-score. Therefore, the experimental results indicate that our proposed model can reduce negative outbound telemarketing outcomes, which are detrimental to customer experience.
- Improvement of model explanatory power: The proposed interpretable model exhibited the ability to identify important variables applicable in practical operations.
2. Background
2.1. Outbound Telemarketing
2.2. Deep Neural Networks
2.3. Convolutional Neural Networks
2.4. Ensemble Classifier
3. Method
3.1. Data Description and Preprocessing
3.2. Proposed Method
3.3. Comparative Machine Learning Models
3.4. Comparative Deep Learning Models
3.5. Ensemble Approaches
3.6. Evaluation Criteria
4. Experimental Results
4.1. Comparison of Machine Learning Model Results
4.2. Performance Analysis of the Proposed Model and Deep Learning Models
4.3. Investigation Results of Ensemble Models
4.4. Feature Importance
5. Discussion
6. Implications
6.1. Practical Implications
6.2. Academic Implications
7. Conclusions
Author Contributions
Funding
Institutional Review Board Statement
Informed Consent Statement
Data Availability Statement
Acknowledgments
Conflicts of Interest
Abbreviations
Abbreviation | Description |
ANN | Artificial Neural Network |
CCP | Customer Churn Prediction |
CNN | Convolutional Neural Network |
DCNN | Deep Convolutional Neural Network |
DL | Deep Learning |
DNN | Deep Neural Network with fully connected layers. |
DT | Decision Tree |
FN | False Negative |
FNR | False Negative Rate |
FP | False Positive |
FPR | False Positive Rate |
GBM | Gradient Boosting Machine |
LightGBM | Light Gradient Boosting Machine |
LR | Logistic Regression |
ML | Machine Learning |
MLP | Multi-layer Perceptron |
RF | Random Forest |
SMOTE | Synthetic Minority Oversampling Technique |
SVM | Support Vector Machine |
TN | True Negative |
TP | True Positive |
XGBoost | eXtreme gradient boosting |
XmCNN | eXplainable Multiple-filter Convolutional Neural Network |
Appendix A
Num | Variable | Mean | Std. Dev | Skewness | Kurtosis |
---|---|---|---|---|---|
1 | Number of months since security card issuance | 0.3006 | 0.2879 | 0.53 | −0.97 |
2 | Number of lapsed contracts | 0.0054 | 0.0361 | 10.65 | 161.20 |
3 | Number of persistent contracts | 0.0675 | 0.0618 | 2.88 | 14.33 |
4 | Numbers of cancelled contracts | 0.0171 | 0.0407 | 5.86 | 61.69 |
5 | Number of total contracts (without cancelled contracts) | 0.0404 | 0.0587 | 3.44 | 21.47 |
6 | Number of total contracts (including cancelled contracts) | 0.0434 | 0.0626 | 2.77 | 12.91 |
7 | Number of persistent contracts applied in the last year | 0.0094 | 0.0348 | 6.69 | 80.34 |
8 | Number of months since initial contract (including cancelled contracts) | 0.3475 | 0.1513 | 0.10 | −0.66 |
9 | Number of months since initial contract (without cancelled contracts) | 0.3257 | 0.1521 | 0.11 | −0.64 |
10 | Number of lapse in the last year | 0.0005 | 0.0148 | 37.34 | 1809.45 |
11 | Number of reinstatement in the last year | 0.0008 | 0.0200 | 31.49 | 1195.51 |
12 | Percentage of insurance contracts (without annuity) | 0.7621 | 0.3573 | −1.27 | 0.12 |
13 | Percentage of annuity contracts | 0.2372 | 0.3568 | 1.27 | 0.13 |
14 | Number of annuity contracts | 0.0229 | 0.0366 | 3.36 | 32.74 |
15 | Number of insurance contracts (without annuity) | 0.0597 | 0.0618 | 2.56 | 11.38 |
16 | Fractional premiums of contracts applied in the last year (monthly payment) | 0.0013 | 0.0168 | 32.62 | 1362.61 |
17 | Fractional premiums of contracts applied in the last year (total payment) | 0.0010 | 0.0146 | 37.06 | 1796.87 |
18 | Lump sum premium of contracts applied in the last year | 0.0002 | 0.0117 | 68.02 | 5020.47 |
19 | Total premiums of contracts applied in the last year | 0.0011 | 0.0147 | 36.28 | 1738.72 |
20 | Fractional premiums of persistent contracts (monthly payment) | 0.0047 | 0.0196 | 19.74 | 589.15 |
21 | Fractional premiums of persistent contracts (total payment) | 0.0097 | 0.0237 | 11.65 | 261.33 |
22 | Lump sum premium of persistent contracts | 0.0009 | 0.0132 | 37.00 | 2034.87 |
23 | Total premiums of persistent contracts | 0.0098 | 0.0239 | 11.49 | 254.66 |
24 | Number of overdue premiums in the last year | 0.0259 | 0.0483 | 4.22 | 34.69 |
25 | Total premiums | 0.0098 | 0.0239 | 11.49 | 254.66 |
26 | Premiums of annuity contracts | 0.0026 | 0.0159 | 28.51 | 1143.59 |
27 | Premiums of insurance contracts (without annuity) | 0.0155 | 0.0367 | 6.35 | 62.98 |
28 | Percentage of insurance contract premiums (without annuity) | 0.7417 | 0.3828 | −1.09 | −0.51 |
29 | Percentage of annuity contract premiums | 0.2577 | 0.3824 | 1.09 | −0.51 |
30 | Amount of withdrawals in the last year | 0.0002 | 0.0060 | 148.51 | 24,342.78 |
31 | Amount of withdrawals in the last three months | 0.0001 | 0.0066 | 123.16 | 17,663.96 |
32 | Number of withdrawals in the last three months | 0.0011 | 0.0193 | 28.26 | 1012.06 |
33 | Number of withdrawals in the last year | 0.0017 | 0.0188 | 22.24 | 719.69 |
34 | Total amount of withdrawals | 0.0002 | 0.0061 | 146.03 | 23,787.52 |
35 | Total number of withdrawals | 0.0040 | 0.0233 | 17.43 | 493.54 |
36 | Minimum amount of withdrawals | 0.0025 | 0.0209 | 26.97 | 972.92 |
37 | Maximum amount of withdrawals | 0.0033 | 0.0248 | 23.23 | 719.68 |
38 | Average amount of withdrawals | 0.0024 | 0.0154 | 23.72 | 999.05 |
39 | Total number of the insured | 0.0914 | 0.1306 | 1.46 | 1.76 |
40 | Total number of contracts for the insured | 0.0676 | 0.0463 | 2.93 | 19.38 |
41 | The difference between the number of contracts and the number of the insured | 0.1230 | 0.0452 | 3.41 | 22.70 |
42 | Number of days since the latest new loan | 0.1844 | 0.1708 | 0.62 | −0.54 |
43 | Average duration of policy loans in the last year | 0.0236 | 0.0884 | 5.31 | 33.03 |
44 | Minimum duration of policy loans in the last year | 0.0176 | 0.0808 | 6.38 | 46.36 |
45 | Maximum duration of policy loans in the last year | 0.0326 | 0.1114 | 4.34 | 20.66 |
46 | Number of new loans in the last year | 0.0038 | 0.0144 | 18.65 | 867.61 |
47 | Number of new additional loans in the last year | 0.0067 | 0.0225 | 11.53 | 273.93 |
48 | Number of new loans in the last three months | 0.0025 | 0.0145 | 20.93 | 979.93 |
49 | Number of new additional loans in the last three months | 0.0051 | 0.0222 | 11.10 | 240.79 |
50 | Average amount of policy loans per day in the last year | 0.0023 | 0.0099 | 45.47 | 3846.50 |
51 | Maximum amount of policy loans per day in the last year | 0.0020 | 0.0097 | 57.96 | 5269.69 |
52 | Minimum amount of policy loans per day in the last year | 0.0016 | 0.0130 | 34.13 | 1995.08 |
53 | Days of application for loan execution in the last year | 0.0110 | 0.0345 | 9.10 | 129.16 |
54 | Number of loan executions or repayments in the last year | 0.0035 | 0.0147 | 30.21 | 1651.34 |
55 | Number of loan executions or repayments in the last three months | 0.0035 | 0.0162 | 26.65 | 1310.40 |
56 | Number of overdues for credit or mortgage loans in the last year | 0.0018 | 0.0258 | 22.76 | 638.32 |
57 | Number of overdues for credit or mortgage loans in the last three months | 0.0019 | 0.0333 | 21.18 | 508.23 |
58 | Number of overdues for policy loans in the last year | 0.0028 | 0.0205 | 17.69 | 506.66 |
59 | Number of overdues for policy loans in the last three months | 0.0018 | 0.0158 | 20.52 | 765.31 |
60 | Average amount of repayments per day in the last year | 0.0023 | 0.0099 | 43.40 | 3790.84 |
61 | Maximum amount of repayments per day in the last year | 0.0041 | 0.0154 | 22.17 | 1025.47 |
62 | Minimum amount of repayments per day in the last year | 0.0025 | 0.0197 | 16.02 | 412.54 |
63 | Days of application for loan execution or repayment in the last year | 0.0052 | 0.0186 | 20.63 | 858.26 |
64 | Number of loan executions in the call center in the last year | 0.0039 | 0.0215 | 15.33 | 395.71 |
65 | Number of loan executions in the customer center in the last year | 0.0003 | 0.0065 | 120.69 | 18,230.18 |
66 | Number of loan executions through ARS in the last year | 0.0010 | 0.0146 | 34.22 | 1688.28 |
67 | Number of loan executions through ATM in the last year | 0.0004 | 0.0094 | 55.44 | 4633.79 |
68 | Number of loan executions through mobile in the last year | 0.0041 | 0.0189 | 16.26 | 532.14 |
69 | Number of loan executions through PC in the last year | 0.0032 | 0.0236 | 17.84 | 452.49 |
70 | Balance of policy loans | 0.0053 | 0.0151 | 19.24 | 929.07 |
71 | Variance of policy loan balance in the last year | 0.0005 | 0.0077 | 88.57 | 10,180.49 |
72 | Mean of policy loan balance in the last year | 0.0113 | 0.0256 | 7.29 | 132.88 |
73 | Skewness of policy loan balance in the last year | 0.2377 | 0.0522 | 3.93 | 24.15 |
74 | Kurtosis of policy loan balance in the last year | 0.0140 | 0.0284 | 11.52 | 205.91 |
75 | Maximum of policy loan balance in the last year | 0.0084 | 0.0184 | 12.99 | 499.93 |
76 | Minimum of policy loan balance in the last year | 0.0037 | 0.0177 | 15.61 | 540.52 |
77 | Variance of policy loan balance in the last three months | 0.0006 | 0.0094 | 80.79 | 8158.41 |
78 | Mean of policy loan balance in the last three months | 0.0108 | 0.0266 | 7.68 | 132.74 |
79 | Skewness of policy loan balance in the last three months | 0.2111 | 0.0447 | 5.08 | 36.07 |
80 | Kurtosis of policy loan balance in the last three months | 0.0273 | 0.0243 | 12.64 | 279.84 |
81 | Maximum of policy loan balance in the last three months | 0.0077 | 0.0183 | 11.03 | 376.43 |
82 | Minimum of policy loan balance in the last three months | 0.0047 | 0.0209 | 12.71 | 330.79 |
83 | Sum of credit or mortgage loan balance | 0.0055 | 0.0305 | 14.42 | 277.35 |
84 | Number of loan executions for 2 times in the current month | 0.0009 | 0.0174 | 29.26 | 1087.64 |
85 | Number of loan executions for 3 times in the current month | 0.0058 | 0.0345 | 11.19 | 181.01 |
86 | Number of loan executions for 1 time in the last year | 0.0057 | 0.0229 | 10.44 | 221.56 |
87 | Number of loan executions for 2 times in the last year | 0.0051 | 0.0209 | 12.43 | 329.46 |
88 | Number of loan executions for 3 times in the last year | 0.0069 | 0.0238 | 10.42 | 210.37 |
89 | Number of call center uses in the last year | 0.0216 | 0.0309 | 4.97 | 67.76 |
90 | Number of call center uses in the last three months | 0.0099 | 0.0263 | 6.71 | 109.37 |
91 | Number of mobile uses in the last three months | 0.0156 | 0.0395 | 7.55 | 94.47 |
92 | Number of website uses in the last three months | 0.0048 | 0.0253 | 14.25 | 319.68 |
93 | Number of channels for insurance contract | 0.2798 | 0.0986 | 1.91 | 5.92 |
94 | Number of contracts through financial planner | 0.0453 | 0.0585 | 3.40 | 21.12 |
95 | Number of contracts through general agent | 0.0052 | 0.0200 | 11.34 | 299.56 |
96 | Number of contracts through bank (bancassurance) | 0.0013 | 0.0108 | 33.33 | 2540.77 |
97 | Number of contracts through direct marketing | 0.0093 | 0.0327 | 6.29 | 71.21 |
98 | Number of contracts through other channels | 0.0043 | 0.0290 | 12.13 | 219.39 |
99 | Percentage of contracts through financial planner | 0.7483 | 0.4058 | −1.15 | −0.51 |
100 | Percentage of contracts through general agent | 0.0977 | 0.2784 | 2.73 | 5.78 |
101 | Percentage of contracts through bank (bancassurance) | 0.0097 | 0.0665 | 7.77 | 66.60 |
102 | Percentage of contracts through direct marketing | 0.1006 | 0.2823 | 2.65 | 5.33 |
103 | Percentage of contracts through other channels | 0.0239 | 0.1370 | 6.21 | 38.79 |
104 | Percentage of contracts through face−to−face | 0.8796 | 0.3066 | −2.33 | 3.67 |
105 | Average duration of policy loans in the last three years | 0.0280 | 0.0903 | 5.02 | 30.08 |
106 | Minimum duration of policy loans in the last three years | 0.0198 | 0.0821 | 6.14 | 43.37 |
107 | Maximum duration of policy loans in the last three years | 0.0400 | 0.1147 | 3.89 | 16.92 |
108 | Average amoun of repayments with other services per day in the last year | 0.0013 | 0.0158 | 30.74 | 1303.93 |
109 | Maximum amount of repayments with other services per day in the last year | 0.0019 | 0.0182 | 23.16 | 797.77 |
110 | Minimum amount of repayments with other services per day in the last year | 0.0007 | 0.0148 | 41.28 | 2146.83 |
111 | Number of applications for repayment with other services per day in the last year | 0.0047 | 0.0259 | 12.00 | 283.97 |
112 | Average rate of policy loans | 0.5867 | 0.1574 | 0.86 | 0.18 |
113 | Maximum rate of policy loans | 0.6526 | 0.1854 | 0.46 | −0.78 |
114 | Minimum rate of policy loans | 0.5295 | 0.1681 | 1.14 | 0.62 |
115 | Loan limit exhaustion rate | 0.2471 | 0.3669 | 1.04 | −0.66 |
116 | Number of loan completion in the last year | 0.0038 | 0.0145 | 18.51 | 855.27 |
117 | Number of policy loan executions | 0.0041 | 0.0125 | 22.04 | 1370.04 |
118 | Average days since new additional loans in the last year | 0.0139 | 0.0625 | 6.85 | 57.88 |
119 | Percentage of consecutive loan executions in the last year | 0.0634 | 0.2084 | 3.42 | 10.77 |
120 | Percentage of consecutive loan executions and repayments in the last year | 0.0618 | 0.2067 | 3.56 | 11.83 |
121 | Percentage of recurring loan repayments in the last year | 0.0727 | 0.2194 | 3.29 | 9.98 |
122 | Percentage of one−time loan execution in the last year | 0.0653 | 0.2363 | 3.57 | 11.07 |
123 | Percentage of one−time loan repayment in the last year | 0.0445 | 0.1990 | 4.47 | 18.31 |
124 | Percentage of one−time loan repayment with other services in the last year | 0.0083 | 0.0852 | 11.02 | 122.82 |
125 | Number of marketing campaigns in the current month | 0.0007 | 0.0159 | 29.07 | 1053.85 |
126 | Number of marketing campaigns in the last three months | 0.0021 | 0.0261 | 16.03 | 326.40 |
127 | Number of marketing campaigns in the last year | 0.0059 | 0.0408 | 10.11 | 138.04 |
128 | Age | 46.1099 | 8.6803 | 0.00 | −0.37 |
References
- Moro, S.; Cortez, P.; Rita, P. A data-driven approach to predict the success of bank telemarketing. Decis. Support Syst. 2014, 62, 22–31. [Google Scholar] [CrossRef] [Green Version]
- Kim, K.H.; Lee, C.S.; Jo, S.M.; Cho, S.B. Predicting the success of bank telemarketing using deep convolutional neural network. In Proceedings of the 2015 7th International Conference of Soft Computing and Pattern Recognition (SoCPaR), Fukuoka, Japan, 13–15 November 2015; pp. 314–317. [Google Scholar]
- Frempong, A.J.; Jayabalan, M. Predicting customer response to bank direct telemarketing campaign. In Proceedings of the 2017 International Conference on Engineering Technology and Technopreneurship (ICE2T), Kuala Lumpur, Malaysia, 18–20 September 2017; pp. 1–4. [Google Scholar]
- Koumétio, C.S.T.; Cherif, W.; Hassan, S. Optimizing the prediction of telemarketing target calls by a classification technique. In Proceedings of the 2018 6th International Conference on Wireless Networks and Mobile Communications (WINCOM), Marrakesh, Morocco, 16–19 October 2018; pp. 1–6. [Google Scholar]
- Ghatasheh, N.; Faris, H.; AlTaharwa, I.; Harb, Y.; Harb, A. Business Analytics in Telemarketing: Cost-Sensitive Analysis of Bank Campaigns Using Artificial Neural Networks. Appl. Sci. 2020, 10, 2581. [Google Scholar] [CrossRef] [Green Version]
- Turkmen, E. Deep Learning Based Methods for Processing Data in Telemarketing-Success Prediction. In Proceedings of the 2021 Third International Conference on Intelligent Communication Technologies and Virtual Mobile Networks (ICICV), Tirunelveli, India, 4–6 February 2021; pp. 1161–1166. [Google Scholar]
- Cortes, C.; Vapnik, V. Support-vector networks. Mach. Learn. 1995, 20, 273–297. [Google Scholar] [CrossRef]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
- Chawla, N.V.; Bowyer, K.W.; Hall, L.O.; Kegelmeyer, W.P. SMOTE: Synthetic Minority Over-Sampling Technique. J. Artif. Intell. Res. 2002, 16, 321–357. [Google Scholar] [CrossRef]
- LEE, H. A Method of Bank Telemarketing Customer Prediction based on Hybrid Sampling and Stacked Deep Networks. J. Korea Soc. Digit. Ind. Inf. Manag. 2019, 15, 197–206. [Google Scholar]
- Hosein, P.; Ramoudith, S.; Rahaman, I. On the Optimal Allocation of Resources for a Marketing Campaign. In Proceedings of the 10th International Conference on Operations Research and Enterprise Systems (ICORES 2021), Vienna, Austria, 4–6 February 2021; pp. 169–176. [Google Scholar]
- Hosseini, S. A decision support system based on machined learned Bayesian network for predicting successful direct sales marketing. J. Manag. Anal. 2021, 8, 295–315. [Google Scholar]
- Krishna, C.L.; Reddy, P.V.S. Deep Neural Networks for the Classification of Bank Marketing Data using Data Reduction Techniques. Int. J. Recent Technol. Eng. (IJRTE) 2019, 8, 3. [Google Scholar]
- Moro, S.; Cortez, P.; Rita, P. A divide-and-conquer strategy using feature relevance and expert knowledge for enhancing a data mining approach to bank telemarketing. Expert Syst. 2018, 35, e12253. [Google Scholar] [CrossRef] [Green Version]
- Srivastava, N.; Hinton, G.; Krizhevsky, A.; Sutskever, I.; Salakhutdinov, R. Dropout: A simple way to prevent neural networks from overfitting. J. Mach. Learn. Res. 2014, 15, 1929–1958. [Google Scholar]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Arenas-Márquez, F.J.; Martinez-Torres, R.; Toral, S. Convolutional neural encoding of online reviews for the identification of travel group type topics on TripAdvisor. Inf. Process. Manag. 2021, 58, 102645. [Google Scholar] [CrossRef]
- Behera, R.K.; Jena, M.; Rath, S.K.; Misra, S. Co-LSTM: Convolutional LSTM model for sentiment analysis in social big data. Inf. Process. Manag. 2021, 58, 102435. [Google Scholar] [CrossRef]
- Fu, X.; Ouyang, T.; Chen, J.; Luo, X. Listening to the investors: A novel framework for online lending default prediction using deep learning neural networks. Inf. Process. Manag. 2020, 57, 102236. [Google Scholar] [CrossRef]
- Goldani, M.H.; Safabakhsh, R.; Momtazi, S. Convolutional neural network with margin loss for fake news detection. Inf. Process. Manag. 2021, 58, 102418. [Google Scholar] [CrossRef]
- Song, C.; Ning, N.; Zhang, Y.; Wu, B. A multimodal fake news detection model based on crossmodal attention residual and multichannel convolutional neural networks. Inf. Process. Manag. 2021, 58, 102437. [Google Scholar] [CrossRef]
- Neagoe, V.E.; Ciotec, A.D.; Cucu, G.S. Deep convolutional neural networks versus multilayer perceptron for financial prediction. In Proceedings of the 2018 International Conference on Communications (COMM), Bucharest, Romania, 14–16 June 2018; pp. 201–206. [Google Scholar]
- Zhang, X.; Wu, F.; Li, Z. Application of convolutional neural network to traditional data. Expert Syst. Appl. 2021, 168, 114185. [Google Scholar] [CrossRef]
- Chen, T.; Guestrin, C. Xgboost: A scalable tree boosting system. In Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, New York, NY, USA, 13–17 August 2016; pp. 785–794. [Google Scholar]
- Kvamme, H.; Sellereite, N.; Aas, K.; Sjursen, S. Predicting mortgage default using convolutional neural networks. Expert Syst. Appl. 2018, 102, 207–217. [Google Scholar] [CrossRef] [Green Version]
- De Caigny, A.; Coussement, K.; De Bock, K.W.; Lessmann, S. Incorporating textual information in customer churn prediction models based on a convolutional neural network. Int. J. Forecast. 2020, 36, 1563–1578. [Google Scholar] [CrossRef]
- Weng, C.H.; Huang, T.C.K.; Han, R.P. Disease prediction with different types of neural network classifiers. Telemat. Inform. 2016, 33, 277–292. [Google Scholar] [CrossRef]
- Borisov, V.; Haug, J.; Kasneci, G. Cancelout: A layer for feature selection in deep neural networks. In Proceedings of the International Conference on Artificial Neural Networks, Munich, Germany, 17–19 September 2019; pp. 72–83. [Google Scholar]
- Kim, Y. Convolutional Neural Networks for Sentence Classification. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), Doha, Qatar, 25–29 October 2014; pp. 1746–1751. [Google Scholar]
- Yin, W.; Schütze, H. Multichannel Variable-size Convolution for Sentence Classification. In Proceedings of the Nineteenth Conference on Computational Natural Language Learning, Beijing, China, 30–31 July 2015; pp. 204–214. [Google Scholar]
- Szegedy, C.; Liu, W.; Jia, Y.; Sermanet, P.; Reed, S.; Anguelov, D.; Erhan, D.; Vanhoucke, V.; Rabinovich, A. Going deeper with convolutions. In Proceedings of the IEEE Computer Society Conference on Computer Vision and Pattern Recognition, Boston, MA, USA, 7–12 June 2015; pp. 1–9. [Google Scholar]
- Friedman, J.H. Greedy function approximation: A gradient boosting machine. Ann. Stat. 2001, 29, 1189–1232. [Google Scholar] [CrossRef]
- Ke, G.; Meng, Q.; Finley, T.; Wang, T.; Chen, W.; Ma, W.; Ye, Q.; Liu, T.Y. Lightgbm: A highly efficient gradient boosting decision tree. In Proceedings of the Advances in Neural Information Processing Systems, Long Beach, CA, USA, 4–9 December 2017; pp. 3146–3154. [Google Scholar]
- Kingma, D.P.; Ba, J. Adam: A Method for Stochastic Optimization. In Proceedings of the 3rd International Conference for Learning Representations, San Diego, CA, USA, 7–9 May 2015. [Google Scholar]
- Arik, S.O.; Pfister, T. Tabnet: Attentive interpretable tabular learning. arXiv 2019, arXiv:1908.07442. [Google Scholar]
Authors | Title | Models | Number of Input Features |
---|---|---|---|
Moroet al. [1] | A data-driven approach to predict the success of bank telemarketing | - Logistic regression - Decision tree - Artificial neural networks - Support vector machine | 22 |
Kim et al. [2] | Predicting the success of bank telemarketing using deep convolutional neural network | - Deep convolutional neural networks | 16 |
Asare-Frempong et al. [3] | Predicting customer response to bank direct telemarketing campaign | - Artificial neural networks - Decision tree - Logistic regression | 16 |
Koumétio et al. [4] | Optimizing the prediction of telemarketing target calls by a classification technique | - Naïve Bayes classifiers - Decision tree - Artificial neural networks - Support vector machine | 21 |
Ghatasheh et al. [5] | Business analytics in telemarketing: cost-sensitive analysis of bank campaigns using artificial neural networks | - Artificial neural networks - Support vector machine - Random forest | 16 |
Turkmen [6] | Deep learning-based methods for processing data in telemarketing-success prediction | - Long short-term memory - Gated recurrent unit - Simple recurrent neural networks | 20 |
Authors of the present study | Predicting Success of Outbound Telemarketing in Insurance Policy Loans Using an Explainable Multiple-Filter Convolutional Neural Network | - Random forest - Support vector machine - Gradient boosting machine - eXtreme gradient boosting - light gradient boosting machine - Deep neural networks - Deep convolutional neural networks | 210 |
Stage | Marketing Targets | Attempted Call | Completed Call | Loan Information Completed | Loan Execution 1 |
---|---|---|---|---|---|
Number of customers | 171,424 | 64,379 | 49,727 | 45,155 | 8530 |
Category | Descriptions | Number of Items |
---|---|---|
Customer characteristics | Age, occupation, region of residence, usage channel, number of complaints, etc. | 72 |
Insurance transaction | Number of contracts, insurance type, payment amount, number of overdue, withdrawal amount, contract channel, insured information, etc. | 55 |
Insurance policy loan | Loan experience, execution frequency, limit exhaustion rate, interest rate, usage period, balance, repayment amount, overdue, loan channel, etc. | 66 |
General loan | Loan balance, number of overdue in the last 3 months, number of overdue in the last 1 year, etc. | 3 |
Campaign execution | Recent campaign experience, number of recent campaign executions, number of campaign executions in the current month, etc. | 9 |
Call list | Mobile experience, ARS experience, existing call list groups, etc. | 5 |
Total | 210 |
Loans Not Executed | Loans Executed | Total | |
---|---|---|---|
Before SMOTE | 25,118 (80.8%) | 5971 (19.2%) | 31,089 (100.0%) |
After SMOTE | 29,855 (50.0%) | 29,855 (50.0%) | 59,710 (100.0%) |
Measures | Formulation |
---|---|
False Positive Rate (FPR) | |
False Negative Rate (FNR) | |
Recall | |
Precision | |
Accuracy | |
F1-score |
Predict | |||
---|---|---|---|
Positive | Negative | ||
Actual | Positive | TP (True Positive) | FN (False Negative) |
Negative | FP (False Positive) | TN (True Negative) |
Model | FPR | FNR | Recall | Precision | Accuracy | F1-Score |
---|---|---|---|---|---|---|
RF | 0.0632 | 0.7506 | 0.5931 | 0.6629 | 0.8030 | 0.6073 |
SVM | 0.4658 | 0.0154 | 0.7594 | 0.6656 | 0.6218 | 0.5990 |
GBM | 0.1497 | 0.6369 | 0.6067 | 0.6081 | 0.7554 | 0.6074 |
XGBoost | 0.1352 | 0.4710 | 0.6968 | 0.6847 | 0.7993 | 0.6903 |
LightGBM | 0.0821 | 0.3264 | 0.6635 | 0.7541 | 0.8384 | 0.6903 |
Model | FPR | FNR | Recall | Precision | Accuracy | F1-Score |
---|---|---|---|---|---|---|
DNN | 0.1143 | 0.1870 | 0.8493 | 0.7918 | 0.8715 | 0.8143 |
0.0898 | 0.1941 | 0.8581 | 0.8177 | 0.8899 | 0.8352 | |
0.0887 | 0.1902 | 0.8605 | 0.8201 | 0.8915 | 0.8376 | |
0.0887 | 0.1889 | 0.8612 | 0.8204 | 0.8918 | 0.8387 | |
0.0756 | 0.1916 | 0.8664 | 0.8366 | 0.9018 | 0.8502 |
Ensemble Model | FPR | FNR | Recall | Precision | Accuracy | F1-Score |
---|---|---|---|---|---|---|
RF, SVM, GBM, XGBoost, LightGBM | 0.0831 | 0.3631 | 0.7769 | 0.7810 | 0.8624 | 0.7789 |
SVM, GBM, XGBoost, LightGBM | 0.0739 | 0.3714 | 0.7773 | 0.7921 | 0.8682 | 0.7843 |
SVM, GBM, LightGBM | 0.0584 | 0.3856 | 0.7780 | 0.8138 | 0.8779 | 0.7938 |
SVM, GBM, XGBoost | 0.1073 | 0.2622 | 0.8152 | 0.7790 | 0.8625 | 0.7945 |
RF, SVM, GBM | 0.0840 | 0.3168 | 0.7996 | 0.7928 | 0.8707 | 0.7961 |
0.0537 | 0.1992 | 0.8735 | 0.8671 | 0.9179 | 0.8703 | |
0.0519 | 0.2031 | 0.8725 | 0.8693 | 0.9187 | 0.8709 | |
0.0527 | 0.1992 | 0.8741 | 0.8689 | 0.9188 | 0.8714 | |
0.0525 | 0.1999 | 0.8738 | 0.8690 | 0.9188 | 0.8714 | |
0.0517 | 0.1992 | 0.8745 | 0.8704 | 0.9196 | 0.8724 |
Number | Feature |
---|---|
1 | Percentage of one-time loan execution in the last year |
2 | Percentage of contracts through financial planner |
3 | Number of channels for insurance contract |
4 | Percentage of insurance contract premiums (without annuity) |
5 | Maximum duration of policy loan in the last three years |
6 | Minimum rate of policy loans |
7 | Total premium |
8 | Maximum amount of policy loans per day in the last year |
9 | Number of call center uses in the last year |
Feature Importance Top-N (%) | Model | |||||
---|---|---|---|---|---|---|
XGBoost | LightGBM | XmCNN | ||||
Accuracy | F1-Score | Accuracy | F1-Score | Accuracy | F1-Score | |
20 | 0.6741 | 0.6055 | 0.8059 | 0.5756 | 0.8702 | 0.8044 |
40 | 0.7953 | 0.6746 | 0.8290 | 0.6702 | 0.8890 | 0.8338 |
60 | 0.8074 | 0.6934 | 0.8176 | 0.6624 | 0.8892 | 0.8344 |
80 | 0.7690 | 0.6653 | 0.8306 | 0.6896 | 0.8922 | 0.8395 |
100 | 0.7993 | 0.6903 | 0.8384 | 0.6903 | 0.9018 | 0.8502 |
Publisher’s Note: MDPI stays neutral with regard to jurisdictional claims in published maps and institutional affiliations. |
© 2021 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Gu, J.; Na, J.; Park, J.; Kim, H. Predicting Success of Outbound Telemarketing in Insurance Policy Loans Using an Explainable Multiple-Filter Convolutional Neural Network. Appl. Sci. 2021, 11, 7147. https://doi.org/10.3390/app11157147
Gu J, Na J, Park J, Kim H. Predicting Success of Outbound Telemarketing in Insurance Policy Loans Using an Explainable Multiple-Filter Convolutional Neural Network. Applied Sciences. 2021; 11(15):7147. https://doi.org/10.3390/app11157147
Chicago/Turabian StyleGu, Jinmo, Jinhyuk Na, Jeongeun Park, and Hayoung Kim. 2021. "Predicting Success of Outbound Telemarketing in Insurance Policy Loans Using an Explainable Multiple-Filter Convolutional Neural Network" Applied Sciences 11, no. 15: 7147. https://doi.org/10.3390/app11157147
APA StyleGu, J., Na, J., Park, J., & Kim, H. (2021). Predicting Success of Outbound Telemarketing in Insurance Policy Loans Using an Explainable Multiple-Filter Convolutional Neural Network. Applied Sciences, 11(15), 7147. https://doi.org/10.3390/app11157147