Prediction of Customer Churn Behavior in the Telecommunication Industry Using Machine Learning Models
Abstract
:1. Introduction
2. Literature Review
2.1. Customer Relationship Management and Predictive Marketing
- Strategic CRM, which is the use of customer data through systematic analysis as a means of marketing management [13];
- Operational CRM, which supports company operations whereby information on employees, customers, and leads is stored;
- Analytical CRM, which is a strategy whereby customer and market information is analyzed to aid the decision-making process for business management [14];
- Collaborative CRM, which allows for multi-way communications between companies and their customers, with the goal of facilitating more profitable retention of customers [15].
2.2. Customer Churn Prediction Review
2.3. Customer Churn Prediction Model for Telecom Industry
2.4. Explainable Artificial Intelligence (AI) in Churn Analysis
3. Research Methodology
3.1. The Crisp Model
3.2. Dataset
3.3. Machine Learning Algorithms
3.4. Model Performance Evaluation
3.4.1. Confusion Matrix
- TN stands for true negative. Here, the customers are observed as not being churners, and the model has also classified the customers as non-churners.
- FP stands for false positive. Here, customers are observed as being non-churners, but the model has classified the customers as churners.
- FN stands for false negative. Customers are observed as being churners, but the model has classified the customers as non-churners.
- TP stands for true positive. Here, customers are both observed and classified as churners.
3.4.2. Four Evaluation Metrics
3.4.3. Other Evaluation Metrics
3.5. Explainable AI Techniques
4. Analysis and Results
4.1. Descriptive Statistics
4.2. Results Based on Confusion Matrix
4.3. Results of Analysis and Discussion
4.4. Model Interpretation
4.4.1. Local Interpretable Model-Agnostic Explanations (LIME)
4.4.2. SHapley Additive exPlanation (SHAP)
5. Discussion and Recommendations
5.1. Relevance to This Emerging Area and Its Contributions
5.2. Implications Relevance to This Special Issue
5.3. Limitations of the Study
6. Conclusions and Recommendations
Author Contributions
Funding
Data Availability Statement
Acknowledgments
Conflicts of Interest
References
- Eklof, J.; Podkorytova, O.; Malova, A. Linking customer satisfaction with financial performance: An empirical study of Scandinavian banks. Total Qual. Manag. Bus. Excell. 2020, 31, 1684–1702. [Google Scholar] [CrossRef]
- Madhani, P.M. Enhancing Customer Value Creation with Market Culture: Developing 7Cs Framework. IUP J. Manag. Res. 2018, 17, 46–64. [Google Scholar]
- Chouiekh, A. Deep convolutional neural networks for customer churn prediction analysis. Int. J. Cogn. Inform. Nat. Intell. 2020, 14, 1–16. [Google Scholar] [CrossRef]
- Duval, A. Explainable Artificial Intelligence (XAI); MA4K9 Scholarly Report; Mathematics Institute, The University of Warwick: Coventry, UK, 2019; pp. 1–53. [Google Scholar]
- Reilly, J. The Machine Learning Revolution: Telco Customer Churn Prediction. Technical Paper. 2023. Available online: https://www.akkio.com/post/telecom-customer-churn (accessed on 13 April 2024).
- Yulianti, Y.; Saifudin, A. Sequential feature selection in customer churn prediction based on Naive Bayes. IOP Conf. Ser. Mater. Sci. Eng. 2020, 879, 012090. [Google Scholar] [CrossRef]
- Leung, C.K.; Pazdor, A.G.; Souza, J. Explainable artificial intelligence for data science on customer churn. In Proceedings of the 2021 IEEE 8th International Conference on Data Science and Advanced Analytics (DSAA), Porto, Portugal, 6–9 October 2021; pp. 1–10. [Google Scholar]
- Mishra, A.; Reddy, U.S. A comparative study of customer churn prediction in telecom industry using ensemble based classifiers. In Proceedings of the 2017 International Conference on Inventive Computing and Informatics (ICICI), Coimbatore, India, 23–24 November 2017; pp. 721–725. [Google Scholar]
- De Caigny, A.; Coussement, K.; De Bock, K.W. A new hybrid classification algorithm for customer churn prediction based on logistic regression and decision trees. Eur. J. Oper. Res. 2018, 269, 760–772. [Google Scholar] [CrossRef]
- Lalwani, P.; Mishra, M.K.; Chadha, J.S.; Sethi, P. Customer churn prediction system: A machine learning approach. Computing 2022, 104, 271–294. [Google Scholar] [CrossRef]
- Yu, R.; An, X.; Jin, B.; Shi, J.; Move, O.A.; Liu, Y. Particle classification optimization-based BP network for telecommunication customer churn prediction. Neural Comput. Appl. 2018, 29, 707–720. [Google Scholar] [CrossRef]
- Fujo, S.W.; Subramanian, S.; Khder, M.A. Customer churn prediction in telecommunication industry using deep learning. Inf. Sci. Lett. 2022, 11, 24. [Google Scholar]
- Kampani, N.; Jhamb, D. Analyzing the role of e-crm in managing customer relations: A critical review of the literature. J. Crit. Rev. 2020, 7, 221–226. [Google Scholar]
- Nelson, C.A.; Walsh, M.F.; Cui, A.P. The role of analytical CRM on salesperson use of competitive intelligence. J. Bus. Ind. Mark. 2020, 35, 2127–2137. [Google Scholar] [CrossRef]
- Al-Homery, H.; Asharai, H.; Ahmad, A. The core components and types of CRM. Pak. J. Human. Soc. Sci. 2019, 7, 121–145. [Google Scholar] [CrossRef]
- Calzada-Infante, L.; Óskarsdóttir, M.; Baesens, B. Evaluation of customer behavior with temporal centrality metrics for churn prediction of prepaid contracts. Expert Syst. Appl. 2020, 160, 113553. [Google Scholar] [CrossRef]
- Hendriyani, C.; Auliana, L. Transformation from relationship marketing to electronic customer relationship management: A literature study. Rev. Integr. Bus. Econ. Res. 2018, 7, 116–124. [Google Scholar]
- Chagas, B.N.R.; Viana, J.A.N.; Reinhold, O.; Lobato, F.; Jacob, A.F.; Alt, R. Current applications of machine learning techniques in CRM: A literature review and practical implications. In Proceedings of the 2018 IEEE/WIC/ACM International Conference on Web Intelligence (WI), Santiago, Chile, 3–6 December 2018; pp. 452–458. [Google Scholar]
- Eachempati, P.; Srivastava, P.R.; Kumar, A.; de Prat, J.M.; Delen, D. Can customer sentiment impact firm value? An integrated text mining approach. Technol. Forecast. Soc. Chang. 2022, 174, 121265. [Google Scholar] [CrossRef]
- Lamrhari, S.; El Ghazi, H.; Oubrich, M.; El Faker, A. A social CRM analytic framework for improving customer retention, acquisition, and conversion. Technol. Forecast. Soc. Chang. 2022, 174, 121275. [Google Scholar] [CrossRef]
- Mahmoud, M.A.; Hinson, R.E.; Adika, M.K. The effect of trust, commitment, and conflict handling on customer retention: The mediating role of customer satisfaction. J. Relat. Mark. 2018, 17, 257–276. [Google Scholar] [CrossRef]
- De Caigny, A.; Coussement, K.; De Bock, K.W.; Lessmann, S. Incorporating textual information in customer churn prediction models based on a convolutional neural network. Int. J. Forecast. 2020, 36, 1563–1578. [Google Scholar] [CrossRef]
- De Caigny, A.; Coussement, K.; Verbeke, W.; Idbenjra, K.; Phan, M. Uplift modeling and its implications for B2B customer churn prediction: A segmentation-based modeling approach. Ind. Mark. Manag. 2021, 99, 28–39. [Google Scholar] [CrossRef]
- Seymen, O.F.; Dogan, O.; Hiziroglu, A. Customer Churn Prediction Using Deep Learning. In Proceedings of the 12th International Conference on Soft Computing and Pattern Recognition (SoCPaR 2020). SoCPaR 2020; Advances in Intelligent Systems and Computing; Springer: Cham, Switzerland, 2021; Volume 1383. [Google Scholar] [CrossRef]
- Guo, W.W.; Xue, H. Crop yield forecasting using artificial neural networks: A comparison between spatial and temporal models. Math. Probl. Eng. 2014, 2014, 857865. [Google Scholar] [CrossRef]
- Shahi, T.B.; Shrestha, A.; Neupane, A.; Guo, W. Stock price forecasting with deep learning: A comparative study. Mathematics 2020, 8, 1441. [Google Scholar] [CrossRef]
- Aldunate, Á.; Maldonado, S.; Vairetti, C.; Armelini, G. Understanding customer satisfaction via deep learning and natural language processing. Expert Syst. Appl. 2022, 209, 118309. [Google Scholar] [CrossRef]
- Raza, K. Improving the prediction accuracy of heart disease with ensemble learning and majority voting rule. In U-Healthcare Monitoring Systems; Academic Press: Devon, UK, 2019; pp. 179–196. [Google Scholar]
- Ahmad, I.; Yousaf, M.; Yousaf, S.; Ahmad, M.O. Fake news detection using machine learning ensemble methods. Complexity 2020, 2020, 8885861. [Google Scholar] [CrossRef]
- Sahoo, A.K.; Pradhan, C.; Das, H. Performance evaluation of different machine learning methods and deep-learning based convolutional neural network for health decision making. Nat. Inspired Comput. Data Sci. 2020, 871, 201–212. [Google Scholar]
- Wei, C.P.; Chiu, I.T. Turning telecommunications call details to churn prediction: A data mining approach. Expert Syst. Appl. 2002, 23, 103–112. [Google Scholar] [CrossRef]
- Anees, R.T.; Nordin, N.A.; Anjum, T.; Cavaliere, L.P.L.; Heidler, P. Evaluating the Impact of Customer Relationship Management (CRM) Strategies on Customer Retention. Bus. Manag. Strateg. 2020, 11, 117–133. [Google Scholar] [CrossRef]
- Shrestha, Y.R.; Ben-Menahem, S.M.; Von Krogh, G. Organizational decision-making structures in the age of artificial intelligence. Calif. Manag. Rev. 2019, 61, 66–83. [Google Scholar] [CrossRef]
- Buchanan, B.G.; Wright, D. The impact of machine learning on UK financial services. Oxf. Rev. Econ. Pol. 2021, 37, 537–563. [Google Scholar] [CrossRef] [PubMed]
- Fernández-Rovira, C.; Valdés, J.Á.; Molleví, G.; Nicolas-Sans, R. The digital transformation of business. Towards the datafication of the relationship with customers. Technol. Forecast. Soc. Chang. 2021, 162, 120339. [Google Scholar] [CrossRef]
- Bauer, K.; von Zahn, M.; Hinz, O. Expl(AI)ned: The impact of explainable artificial intelligence on users’ information processing. Inf. Syst. Res. 2023, 34, 1582–1602. [Google Scholar] [CrossRef]
- Teng, Y.; Zhang, J.; Sun, T. Data-driven decision-making model based on artificial intelligence in higher education system of colleges and universities. Expert Syst. 2023, 40, e12820. [Google Scholar] [CrossRef]
- Djeffal, C. The Normative Potential of the European Rule on Automated Decisions: A New Reading for Art. 22 GDPR. Z. Ausländisches Öffentl. Recht Völkerr. 2020, 81, 847–879. [Google Scholar]
- Nkolele, R.; Wang, H. Explainable Machine Learning: A Manuscript on the Customer Churn in the Telecommunications Industry. In Proceedings of the 2021 Ethics and Explainability for Responsible Data Science (EE-RDS), Johannesburg, South Africa, 27–28 October 2021; pp. 1–7. [Google Scholar]
- Shapley, L.S. A value for n-person games. In Contributions to the Theory of Games; Princeton University Press: Princeton, NJ, USA, 1953; Volum 2, pp. 307–317. [Google Scholar]
- Štrumbelj, E.; Kononenko, I. Explaining prediction models and individual predictions with feature contributions. Knowl. Inf. Syst. 2014, 41, 647–665. [Google Scholar] [CrossRef]
- Gramegna, A.; Giudici, P. SHAP and LIME: An evaluation of discriminative power in credit risk. Front. Artif. Intell. 2021, 4, 752558. [Google Scholar] [CrossRef] [PubMed]
- Kwon, O.; Lee, N.; Shin, B. Data quality management, data usage experience and acquisition intention of big data analytics. Int. J. Inf. Manag. 2014, 34, 387–394. [Google Scholar] [CrossRef]
- Provost, F.; Fawcett, T. Data science and its relationship to big data and data-driven decision making. Big Data 2013, 1, 51–59. [Google Scholar] [CrossRef] [PubMed]
- Martínez-Plumed, F.; Contreras-Ochando, L.; Ferri, C.; Hernández-Orallo, J.; Kull, M.; Lachiche, N.; Ramírez-Quintana, M.J.; Flach, P. CRISP-DM twenty years later: From data mining processes to data science trajectories. IEEE Trans. Knowl. Data Eng. 2019, 33, 3048–3061. [Google Scholar] [CrossRef]
- Hakkoum, H.; Idri, A.; Abnane, I. Artificial neural networks interpretation using LIME for breast cancer diagnosis. In Trends and Innovations in Information Systems and Technologies; Springer: Cham, Switzerland, 2020; pp. 15–24. [Google Scholar]
- Hutter, F.; Xu, L.; Hoos, H.H.; Leyton-Brown, K. Algorithm runtime prediction: Methods & evaluation. Artif. Intell. 2014, 206, 79–111. [Google Scholar]
- Antwarg, L.; Miller, R.M.; Shapira, B.; Rokach, L. Explaining anomalies detected by autoencoders using Shapley Additive Explanations. Expert Syst. Appl. 2021, 186, 115736. [Google Scholar] [CrossRef]
- Lessmann, S.; Baesens, B.; Seow, H.V.; Thomas, L.C. Benchmarking state-of-the-art classification algorithms for credit scoring: An update of research. Eur. J. Oper. Res. 2015, 247, 124–136. [Google Scholar] [CrossRef]
Observed 0 | Observed 1 | |
---|---|---|
Estimated 0 | TN | FN |
Estimated 1 | FP | TP |
Count | Mean | Std | Min | 25% | 50% | 75% | Max | |
---|---|---|---|---|---|---|---|---|
Age | 4601 | 47.89307 | 17.362 | 19 | 33 | 47 | 62 | 80 |
Number of Dependents | 4601 | 0.380569 | 0.880541 | 0 | 0 | 0 | 0 | 8 |
Number of Referrals | 4601 | 1.947403 | 2.957352 | 0 | 0 | 0 | 3 | 11 |
Tenure in Months | 4601 | 34.63117 | 24.19849 | 1 | 12 | 32 | 58 | 72 |
Avg Monthly Long-Distance Charges | 4601 | 25.5811 | 14.26574 | 1.01 | 13.02 | 25.84 | 37.97 | 49.99 |
Avg Monthly GB Download | 4601 | 26.12889 | 19.53716 | 2 | 13 | 21 | 30 | 85 |
Monthly Charge | 4601 | 81.20272 | 21.14967 | −10 | 69.9 | 83.75 | 96.2 | 118.75 |
Total Charges | 4601 | 3042.595 | 2391.057 | 42.9 | 847.3 | 2564.3 | 4968 | 8684.8 |
Total Refunds | 4601 | 2.163306 | 8.286778 | 0 | 0 | 0 | 0 | 49.57 |
Total Extra Data Charges | 4601 | 8.943708 | 28.62071 | 0 | 0 | 0 | 0 | 150 |
Total Long-Distance Charges | 4601 | 888.9163 | 866.5069 | 1.13 | 178.89 | 582 | 1417.92 | 3536.64 |
Total Revenue | 4601 | 3938.291 | 3054.189 | 46.92 | 1119.4 | 3378.79 | 6412.05 | 11979.34 |
Model | Accuracy | AUC | Sensitivity | Specificity |
---|---|---|---|---|
Logistic Regression | 0.7553 | 0.84 | 0.7400 | 0.7704 |
KNN | 0.7129 | 0.81 | 0.8467 | 0.5805 |
Naïve Bayes | 0.7984 | 0.88 | 0.8227 | 0.7744 |
Decision Tree | 0.8004 | 0.80 | 0.7907 | 0.8100 |
Random Forest | 0.8694 | 0.95 | 0.8547 | 0.8839 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Chang, V.; Hall, K.; Xu, Q.A.; Amao, F.O.; Ganatra, M.A.; Benson, V. Prediction of Customer Churn Behavior in the Telecommunication Industry Using Machine Learning Models. Algorithms 2024, 17, 231. https://doi.org/10.3390/a17060231
Chang V, Hall K, Xu QA, Amao FO, Ganatra MA, Benson V. Prediction of Customer Churn Behavior in the Telecommunication Industry Using Machine Learning Models. Algorithms. 2024; 17(6):231. https://doi.org/10.3390/a17060231
Chicago/Turabian StyleChang, Victor, Karl Hall, Qianwen Ariel Xu, Folakemi Ololade Amao, Meghana Ashok Ganatra, and Vladlena Benson. 2024. "Prediction of Customer Churn Behavior in the Telecommunication Industry Using Machine Learning Models" Algorithms 17, no. 6: 231. https://doi.org/10.3390/a17060231
APA StyleChang, V., Hall, K., Xu, Q. A., Amao, F. O., Ganatra, M. A., & Benson, V. (2024). Prediction of Customer Churn Behavior in the Telecommunication Industry Using Machine Learning Models. Algorithms, 17(6), 231. https://doi.org/10.3390/a17060231