Deep Learning for Predicting Attrition Rate in Open and Distance Learning (ODL) Institutions
Abstract
:1. Introduction
2. Related Works
3. Methodology
3.1. Data Cleaning
3.2. Feature Selection
3.3. Train–Test Set
3.4. Deep Learning Models
3.4.1. Long Short-Term Memory (LSTM)
3.4.2. One-Dimensional Convolutional Neural Network (1DCNN)
3.5. Proposed Model
4. Results and Discussion
Institutional Challenge Dataset—Course Materials
5. Conclusions
5.1. Limitations of This Study
5.2. Recommendation for Further Studies
Author Contributions
Funding
Data Availability Statement
Conflicts of Interest
References
- Uliyan, D.; Aljaloud, A.S.; Alkhalil, A.; Al Amer, H.S.; Mohamed, M.A.E.A.; Alogali, A.F.M. Deep Learning Model to Predict Students Retention Using BLSTM and CRF. IEEE Access 2021, 9, 135550–135558. [Google Scholar] [CrossRef]
- Akmal, A. Predicting Dropout on E-Learning Using Machine Learning. J. Appl. Data Sci. 2020, 1, 29–34. [Google Scholar] [CrossRef]
- Alamri, A.; Sun, Z.; Cristea, A.I.; Senthilnathan, G.; Shi, L.; Stewart, C. Is mooc learning different for dropouts? a visually driven, multi-granularity explanatory ml approach. In International Conference on Intelligent Tutoring Systems; Springer: Cham, Switzerland, 2020; pp. 353–363. [Google Scholar]
- Lainjo, B. Mitigating Academic Institution Dropout Rates with Predictive Analytics Algorithms. Int. J. Educ. Teach. Soc. Sci. 2023, 3, 29–49. [Google Scholar] [CrossRef]
- López-Pernas, S.; Kleimola, R.; Väisänen, S.; Hirsto, L. Early detection of dropout factors in vocational education: A large-scale case study from Finland. In Proceedings of the Finnish Learning Analytics and Artificial Intelligence in Education Conference (FLAIEC22), Joensuu, Finland, 29–30 September 2022; pp. 44–50. [Google Scholar]
- Akour, M.; Alsghaier, H.; Al Qasem, O. The effectiveness of using deep learning algorithms in predicting students’ achievements. Indones. J. Electr. Eng. Comput. Sci. 2020, 19, 387–393. [Google Scholar] [CrossRef]
- Taye, M.M. Understanding of Machine Learning with Deep Learning: Architectures, Workflow, Applications and Future Directions. Computers 2023, 12, 91. [Google Scholar] [CrossRef]
- Zhang, X.; Cao, Z. A framework of an intelligent education system for higher education based on deep learning. Int. J. Emerg. Technol. Learn. (Online) 2021, 16, 233. [Google Scholar] [CrossRef]
- Aggarwal, K.; Singh, S.K.; Chopra, M.; Kumar, S.; Colace, F. Deep learning in robotics for strengthening industry 4.0.: Opportunities, challenges and future directions. In Robotics and AI for Cybersecurity and Critical Infrastructure in Smart Cities; Springer: Cham, Switzerland, 2022; pp. 1–19. [Google Scholar]
- Suganyadevi, S.; Seethalakshmi, V.; Balasamy, K. A review on deep learning in medical image analysis. Int. J. Multimed. Inf. Retr. 2022, 11, 19–38. [Google Scholar] [CrossRef]
- Dargan, S.; Kumar, M.; Ayyagari, M.R.; Kumar, G. A survey of deep learning and its applications: A new paradigm to machine learning. Arch. Comput. Methods Eng. 2020, 27, 1071–1092. [Google Scholar] [CrossRef]
- Pierrakeas, C.; Koutsonikos, G.; Lipitakis, A.D.; Kotsiantis, S.; Xenos, M.; Gravvanis, G.A. The variability of the reasons for student dropout in distance learning and the prediction of dropout-prone students. In Machine Learning Paradigms: Advances in Learning Analytics; The Variability of the Reasons for Student Dropout in Distance Learning and the Prediction of Dropout-Prone Students; Springer: Cham, Switzerland, 2020; pp. 91–111. [Google Scholar]
- Ashour, S. Analysis of the attrition phenomenon through the lens of university dropouts in the United Arab Emirates. J. Appl. Res. High. Educ. 2020, 12, 357–374. [Google Scholar] [CrossRef]
- Del Bonifro, F.; Gabbrielli, M.; Lisanti, G.; Zingaro, S.P. Student dropout prediction. In Proceedings of the Artificial Intelligence in Education: 21st International Conference, AIED 2020, Ifrane, Morocco, 6–10 July 2020; Proceedings, Part I 21. Springer International Publishing: New York, NY, USA, 2020; pp. 129–140. [Google Scholar] [CrossRef]
- Aldowah, H.; Al-Samarraie, H.; Alzahrani, A.I.; Alalwan, N. Factors affecting student dropout in MOOCs: A cause and effect decision-making model. J. Comput. High. Educ. 2020, 32, 429–454. [Google Scholar] [CrossRef]
- Kim, D.; Kim, S. Sustainable education: Analyzing the determinants of university student dropout by nonlinear panel data models. Sustainability 2018, 10, 954. [Google Scholar] [CrossRef]
- Park, C. Development of prediction model to improve dropout of cyber university. J. Korea Acad.-Ind. Coop. Soc. 2020, 21, 380–390. [Google Scholar]
- Sun, Z.; Harit, A.; Yu, J.; Cristea, A.I.; Shi, L. A Brief Survey of Deep Learning Approaches for Learning Analytics on MOOCs. In Proceedings of the Intelligent Tutoring Systems: 17th International Conference, ITS 2021, Virtual Event, 7–11 June 2021; pp. 1–11. [Google Scholar]
- Dake, D.K.; Buabeng-Andoh, C. Using Machine Learning Techniques to Predict Learner Drop-out Rate in Higher Educational Institutions. Edited by Robin Singh Bhadoria. Mob. Inf. Syst. 2022, 2022, 2670562. [Google Scholar] [CrossRef]
- Yılmaz, B.A.; Karataş, S. Why do open and distance education students drop out? Views from various stakeholders. Int. J. Educ. Technol. High. Educ. 2022, 19, 28. [Google Scholar] [CrossRef]
- Queiroga, E.M.; Lopes, J.L.; Kappel, K.; Aguiar, M.; Araújo, R.M.; Munoz, R.; Villarroel, R.; Cechinel, C. A Learning Analytics Approach to Identify Students at Risk of Dropout: A Case Study with a Technical Distance Education Course. Appl. Sci. 2020, 10, 3998. [Google Scholar] [CrossRef]
- Obaido, G.; Agbo, F.J.; Alvarado, C.; Oyelere, S.S. Analysis of attrition studies within the computer sciences. IEEE Access 2023, 11, 53736–53748. [Google Scholar] [CrossRef]
- Stephenson, C.; Miller, A.D.; Alvarado, C.; Barker, L.; Barr, V.; Camp, T.; Frieze, C.; Lewis, C.; Mindell, E.C.; Limbird, L.; et al. Retention in Computer Science Undergraduate Programs in the Us: Data Challenges and Promising Interventions; ACM: New York, NY, USA, 2018. [Google Scholar]
- Mubarak, A.A.; Cao, H.; Hezam, I.M. Deep Analytic Model for Student Dropout Prediction in Massive Open Online Courses. Comput. Electr. Eng. 2021, 93, 107271. [Google Scholar] [CrossRef]
- Bañeres, D.; Rodríguez, M.E.; Guerrero-Roldán, A.E.; Karadeniz, A. An early warning system to detect at-risk students in online higher education. Appl. Sci. 2020, 10, 4427. [Google Scholar] [CrossRef]
- Isidro, C.; Carro, R.M.; Ortigosa, A. Dropout detection in MOOCs: An exploratory analysis. In Proceedings of the 2018 International Symposium on Computers in Education (SIIE), Cádiz, Spain, 19–21 September 2018; pp. 1–6. [Google Scholar] [CrossRef]
- Raga, R.C.; Raga, J.D. Early prediction of student performance in blended learning courses using deep neural networks. In Proceedings of the 2019 International Symposium on Educational Technology (ISET), Hradec Králové, Czech Republic, 2–4 July 2019; pp. 39–43. [Google Scholar]
- Ndunagu, J.; Ukwandu, E.; Ezema, M.; Olebara, C.; Ugwu, M.; Igwe, J.; Oyewola, D.; Fapohunda, S. Future of remote learning: The virtual laboratory perspective. Univ. Ib. J. Sci. Log. ICT Res. 2022, 8, 55–68. [Google Scholar]
- Xing, W.; Du, D. Dropout prediction in MOOCs: Using deep learning for personalized intervention. J. Educ. Comput. Res. 2019, 57, 547–570. [Google Scholar] [CrossRef]
- Cristea, A.I.; Alamri, A.; Kayama, M.; Stewart, C.; Alsheri, M.; Shi, L. Earliest predictor of dropout in moocs: A longitudinal study of futurelearn courses. In Information Systems Development: Designing Digitalization; Association for Information Systems: Atlanta, GA, USA, 2018. [Google Scholar]
- Kőrösi, G.; Farkas, R. Mooc performance prediction by deep learning from raw clickstream data. In International Conference on Advances in Computing and Data Sciences; Springer: Singapore, 2020; pp. 474–485. [Google Scholar]
- Li, K.C.; Wong, B.T. Factors related to student persistence in open universities: Changes over the years. Int. Rev. Res. Open Distrib. Learn. 2019, 20, 132–151. [Google Scholar]
- Hassan, W.; Thalho, N.P.; Shar, M.M.; Aqeel, C.; Soomro, U.; Fatima, S.S. Causes of Students’ Dropout at Elementary Level in Pakistan. Palarch’s J. Archaeol. Egypt/Egyptol. 2020, 17, 4308–4325. [Google Scholar]
- Schmidhuber, J. Deep Learning in Neural Networks: An Overview. Neural Netw. 2015, 61, 85–117. [Google Scholar]
- Yosinski, J.; Clune, J.; Bengio, Y.; Lipson, H. How transferable are features in deep neural networks? Adv. Neural Inf. Process. Syst. 2014, 4, 3320–3328. [Google Scholar]
- LeCun, Y.; Bengio, Y.; Hinton, G. Deep learning. Nature 2015, 521, 436–444. [Google Scholar] [CrossRef]
- Bengio, Y. Learning Deep Architectures for AI. Found. Trends Mach. Learn. 2009, 2, 1–127. [Google Scholar] [CrossRef]
- Kannan, K.R.; Abarna, K.T.M.; Vairachilai, S. Graph Neural Networks for Predicting Student Performance: A Deep Learning Approach for Academic Success Forecasting. Int. J. Intell. Syst. Appl. Eng. 2024, 12, 228–232. [Google Scholar]
- Fauszt, T.; Erdélyi, K.; Dobák, D.; Bognár, L.; Kovács, E. Design of a Machine Learning Model to Predict Student Attrition. Int. J. Emerg. Technol. Learn. 2023, 18, 184. [Google Scholar] [CrossRef]
- Abunasser, B.S.; AL-Hiealy, M.R.; Barhoom, A.M.; Almasri, A.R.; Abu-Naser, S.S. Prediction of Instructor Performance using Machine and Deep Learning Techniques. (IJACSA) Int. J. Adv. Comput. Sci. Appl. 2022, 13, 78–83. [Google Scholar] [CrossRef]
- Arqawi, S.A.; Rumman, M.A.; Zitawi, E.A.; Rabaya, A.H.; Sadaqa, A.S.; Abunasser, B.S.; Abu-Naser, S.S. Predicting Employee Attrition and Performance Using Deep Learning. J. Theor. Appl. Inf. Technol. 2022, 100, 6526–6536. [Google Scholar]
- Barramuño, M.; Meza-Narváez, C.; Gálvez-García, G. Prediction of student attrition risk using machine learning. J. Appl. Res. High. Educ. 2022, 14, 974–986. [Google Scholar] [CrossRef]
- Lokhande, D.B.; Chavan, T.A.; Patil, D.P. Deep neural network in prediction of student performance. Gradiva Rev. J. 2022, 8, 811–816. [Google Scholar]
- Nabil, A.; Seyam, M.; Abou-Elfetouh, A. Prediction of Students’ Academic Performance Based on Courses’ Grades Using Deep Neural Networks. IEEE Access 2021, 9, 140731–140746. [Google Scholar] [CrossRef]
- Opazo, D.; Moreno, S.; Álvarez-Miranda, E.; Pereira, J. Analysis of first-year university student dropout through machine learning models: A comparison between universities. Mathematics 2021, 9, 2599. [Google Scholar] [CrossRef]
- Shilbayeh, S.; Abonamah, A. Predicting student enrollments and attrition patterns in higher educational institutions using machine learning. Int. Arab J. Inf. Technol. 2021, 18, 562–567. [Google Scholar]
- He, Y.; Chen, R.; Li, X.; Hao, C.; Liu, S.; Zhang, G.; Jiang, B. Online At-Risk Student Identification Using RNN-GRU Joint Neural Networks. Information 2020, 11, 474. [Google Scholar] [CrossRef]
- Kemper, L.; Vorhoff, G.; Wigger, B.U. Predicting student dropout: A machine learning approach. Eur. J. High. Educ. 2020, 10, 28–47. [Google Scholar] [CrossRef]
- Aguilar-Gonzalez, S.; Palafox, L. Prediction of Student Attrition Using Machine Learning. In Advances in Soft Computing, Proceedings of the 18th Mexican International Conference on Artificial Intelligence, MICAI 2019, Xalapa, Mexico, 27 October–2 November 2019; Proceedings 18; Springer International Publishing: Cham, Switzerland, 2019; pp. 212–222. [Google Scholar]
- Oyewola, D.O.; Dada, E.G.; Omotehinwa, T.O.; Emebo, O.; Oluwagbemi, O.O. Application of Deep Learning Techniques and Bayesian Optimization with Tree Parzen Estimator in the Classification of Supply Chain Pricing Datasets of Health Medications. Appl. Sci. 2022, 12, 10166. [Google Scholar] [CrossRef]
- Oyewola, D.O.; Oladimeji, L.A.; Julius, S.O.; Kachalla, L.B.; Dada, E.G. Optimizing Sentiment Analysis of Nigerian 2023 Presidential Election Using Two-Stage Residual Long Short Term Memory. Heliyon 2023, 9, e14836. [Google Scholar] [CrossRef]
- Sabri, M.; El Bouhdidi, J.; Chkouri, M.Y. A Proposal for a Deep Learning Model to Enhance Student Guidance and Reduce Dropout BT—Artificial Intelligence and Industrial Applications; Masrour, T., El Hassani, I., Cherrafi, A., Eds.; Springer International Publishing: Cham, Switzerland, 2021; pp. 158–165. [Google Scholar]
- Moghar, A.; Hamiche, M. Stock Market Prediction Using LSTM Recurrent Neural Network. Procedia Comput. Sci. 2020, 170, 1168–1173. [Google Scholar] [CrossRef]
- Qazi, E.U.; Almorjan, A.; Zia, T. A One-Dimensional Convolutional Neural Network (1D-CNN) Based Deep Learning System for Network Intrusion Detection. Appl. Sci. 2022, 12, 7986. [Google Scholar] [CrossRef]
- Wu, C.; Jiang, P.; Ding, C.; Feng, F.; Chen, T. Intelligent Fault Diagnosis of Rotating Machinery Based on One-Dimensional Convolutional Neural Network. Comput. Ind. 2019, 108, 53–61. [Google Scholar] [CrossRef]
- Harbola, S.; Coors, V. One Dimensional Convolutional Neural Network Architectures for Wind Prediction. Energy Convers. Manag. 2019, 195, 70–75. [Google Scholar] [CrossRef]
- Sharma, Y.; Singh, B.K. One-Dimensional Convolutional Neural Network and Hybrid Deep-Learning Paradigm for Classification of Specific Language Impzhanaired Children Using Their Speech. Comput. Methods Programs Biomed. 2022, 213, 106487. [Google Scholar] [CrossRef]
- Moitra, D.; Mandal, R.K. Classification of Non-Small Cell Lung Cancer Using One-Dimensional Convolutional Neural Network. Expert Syst. Appl. 2020, 159, 113564. [Google Scholar] [CrossRef]
S/N | Author | Objective of Study | Classification Techniques Studied | Recommendation of Classification Techniques by Author(s) |
---|---|---|---|---|
1 | [38] | This research work demonstrates the effectiveness of this approach in predicting student performance, and ablation feature analysis is conducted to gain insights into the underlying factors that contribute to performance prediction. | Graph Neural Network (GNN) and Convolutional Neural Networks (CNNs), | Graph Neural Network (GNN) |
2 | [39] | This study used machine learning model in predicting student dropout rates in the Business Informatics BSc course at the Faculty of Finance and Accounting of Budapest Business School using data extracted from the administration system. | Boosted Decision tree | Boosted Decision tree as best suited for predicting attrition rate. |
3 | [40] | To find how effective the instructor in the higher education systems is, a group of machine and deep learning algorithms were applied to predict instructor performance in higher education systems. | Machine and Deep Learning Techniques | Deep Learning Techniques |
4 | [41] | The aim of this study was to utilize machine and deep learning models to predict employee attrition with a high accuracy, furthermore, to identify the most influential factors affecting employee attrition. | Machine Learning and Deep Learning Techniques | Deep Learning |
5 | [42] | Implemented a method to predict student attrition in the upper years of a physiotherapy program with 23.58% males and 17.39% females’ population in the attrition student group. | KNN and boosted Decision tree | KNN |
6 | [43] | The proposed deep neural network model outperforms existing machine learning methods in terms of accuracy, achieving up to 85.4% accuracy | Machine Learning and Deep Neural Network | Deep Neural Network—DNN |
7 | [44] | The main goal of this paper is to explore the efficiency of deep learning in the field of EDM, especially in predicting students’ academic performance, to identify students at risk of failure | A deep neural network (DNN), decision tree, random forest, gradient boosting, logistic regression, support vector classifier, and K-nearest neighbor | Deep Neural Network—DNN |
8 | [45] | This study predicts student dropout in two Chilean universities using machine learning models. It focused on finding out variables that trigger first-year engineering student probability of dropout. | KNN, SVM, Decision tree, Random Forest, Gradient-boosting decision tree, Naive Bayes, Logistic regression and neural network. | Gradient-boosting decision trees reports the best model. |
9 | [46] | This study was focused on students at Abu Dhabi School of Management (ADSM) in the UAE that are at the risk of dropping out. | Decision tree | The use of decision tree has high significance in predicting students at risk of dropping out. |
10 | [18] | The study mainly analyze the trends of feature processing and the model design in dropout prediction, respectively | Recurrent Neural Networks (RNN), Convolutional Neural Networks (CNN), Graph Neural Networks (GNN) and other deep learning models | Recurrent Neural Networks (RNN) |
11 | [1] | The model has diverse features which can be utilized to assess how new students will perform and thus contributes to early prediction of student retention and dropout. | BLSTM and CRF deep learning techniques | BLSTM and CRF deep learning techniques. |
12 | [47] | This study aims to predict students’ performance in a specific course as it is continuously running, using the statistic personal biographical information and sequential behavior data with VLE. | simple RNN, GRU, and LSTM. | GRU and simple RNN |
13 | [48] | This study aimed at predicting student dropout at the Karlsruhe Institute of Technology (KIT) | logistic regressions and decision trees | Decision trees produced slightly better results than logistic regressions |
14 | [31] | Deep learning algorithms could be applied directly on raw input data, and this could spare the most time-consuming process of feature engineering | SVM, LOGREG or MLP and RNN—Deep learning Technique | RNN |
15 | [49] | This study explored the relationship between attrition and entrance examination, student place of origin and grades up to the point of abandonment of the major. | ||
16 | [29] | this study proposes to use the deep learning algorithm to construct the dropout prediction model and further produce the predicted individual student dropout probability | K-nearest neighbors (KNN), support vector machines (SVM), decision tree and Deep Learning techniques | Deep Learning techniques |
Column | Data Type |
---|---|
attrition | object |
Difficulty-Course-materials | object |
Frustration-information-NOUN | object |
No-Social-Networking | object |
Poor-Academic-Performance | object |
Inadequate-Communication | object |
Column | Data Type |
---|---|
attrition | object |
Family-Challenges | object |
Financial-Reasons | object |
Sickness | object |
Others | object |
MODEL | Training (%) | Test (%) |
---|---|---|
LSTM | 57.29 | 56.75 |
CNN | 49.91 | 50.50 |
MODEL | Training (%) | Test (%) |
---|---|---|
LSTM | 0.6765 | 0.6852 |
CNN | 0.6730 | 0.6782 |
MODEL | Training (%) | Test (%) |
---|---|---|
LSTM | 50.98 | 46.19 |
CNN | 50.98 | 46.19 |
MODEL | Training (%) | Test (%) |
---|---|---|
LSTM | 0.6928 | 0.6953 |
CNN | 0.6929 | 0.6951 |
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to people or property resulting from any ideas, methods, instructions or products referred to in the content. |
© 2024 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/).
Share and Cite
Ndunagu, J.N.; Oyewola, D.O.; Garki, F.S.; Onyeakazi, J.C.; Ezeanya, C.U.; Ukwandu, E. Deep Learning for Predicting Attrition Rate in Open and Distance Learning (ODL) Institutions. Computers 2024, 13, 229. https://doi.org/10.3390/computers13090229
Ndunagu JN, Oyewola DO, Garki FS, Onyeakazi JC, Ezeanya CU, Ukwandu E. Deep Learning for Predicting Attrition Rate in Open and Distance Learning (ODL) Institutions. Computers. 2024; 13(9):229. https://doi.org/10.3390/computers13090229
Chicago/Turabian StyleNdunagu, Juliana Ngozi, David Opeoluwa Oyewola, Farida Shehu Garki, Jude Chukwuma Onyeakazi, Christiana Uchenna Ezeanya, and Elochukwu Ukwandu. 2024. "Deep Learning for Predicting Attrition Rate in Open and Distance Learning (ODL) Institutions" Computers 13, no. 9: 229. https://doi.org/10.3390/computers13090229
APA StyleNdunagu, J. N., Oyewola, D. O., Garki, F. S., Onyeakazi, J. C., Ezeanya, C. U., & Ukwandu, E. (2024). Deep Learning for Predicting Attrition Rate in Open and Distance Learning (ODL) Institutions. Computers, 13(9), 229. https://doi.org/10.3390/computers13090229