Intelligent Machine Learning Approach for Effective Recognition of Diabetes in E-Healthcare Using Clinical Data
Abstract
:1. Introduction
- To propose a Filter based DT-(ID3) algorithm for features selection. The proposed algorithm should select more appropriate features from the dataset. Two ensemble algorithms, Ada Boost and Random Forest, are used for feature selection and compared the performance of DT on the proposed feature selection algorithm with these two FS algorithms and wrapper based feature selection methods.
- The Classification performance of the classifier has been checked according to original feature sets and on selected feature sets with cross validation methods, such as Hold out, K-fold, and LOSO. The LOSO is more suitable then train/test and k-folds validations. The classifier performance with the LOSO validation method is high in terms of accuracy of selected features compared to other validation methods such as Hold out and k-folds. Additional performance evaluation metrics results are very high with LOSO validation.
2. Related Work
3. Materials and Method of Research
3.1. Dataset
3.2. Problem Statement of Feature Selection
3.2.1. Proposed Filter Based Decision Tree Approach for Feature Selection
Filter Based Decision Tree Iterative Dichotomiser 3 (DT-ID3) Feature Selection Algorithm
Algorithm 1: Filter Based DT-ID3 Approach for Feature Selection. |
Ada Boost Feature Selection Algorithm
Random Forest Feature Selection Algorithm
Algorithm 2: Ensemble Decision Tree Ada Boost FS algorithm. |
Algorithm 3: Ensemble Random Forest FS Algorithm. |
1 Randomly select f features from F feature set where |
2 The node d is computed using the best split point in features f |
3 Divide the nodes into sub nodes by using the best splits |
4 Repeat the steps 1 to 3 until I number of nodes is reached |
5 Create forest by repeating steps 1 to 4 for n number times to generate N number of trees. |
3.2.2. Wrapper Based Feature Selection Using Sequential Backward Selection Algorithm
Algorithm 4: Wrapper based Sequential Backward Selection of Feature FS Algorithm. |
1 Algorithm starting with , the d is dimensional of feature full space |
2 Eliminate feature , that maximizes the criterion: |
3, Where |
4 Eliminate feature from feature space: |
5 |
6 |
7 Finish if k reached the required features, if not then repeat step 2 |
3.3. Classification Algorithm
3.4. Cross Validation Methods
3.4.1. Hold Out
3.4.2. K-Folds
3.4.3. Leave One Subject Out
3.5. Performance Evalution Matrix
3.6. Methodology of the Proposed Technique for Diabetes Disease Detection
Algorithm 5: Proposed method for Diabetes detection. |
1 Begin |
2 Preprocessing of the Dataset using Different Statistical Techniques |
3 Feature selection using DT (ID3) algorithm; |
4 Using hold out, k folds and LOSO cross validation techniques for tuning hyper parameters and best model selection |
5 Classification of diabetes and healthy people using DT classifier |
6 Computes different performance evaluation metrics for model evaluation |
7 Finish |
4. Experiments and Results Discussion
4.1. Experimental Setup
4.2. Experimental Results
4.2.1. Results of Pre-Processing Operations on the Dataset
4.2.2. Experimental Results of Feature Selection Algorithm Filter Based DT (ID3)
4.2.3. Experimental Results of Ensemble Ada Boost FS Algorithm
4.2.4. Experimental Results of Ensemble Random Forest FS Algorithm
4.2.5. Experimental Result of Wrapper Based Sequential Backward Selection of Feature FS Algorithm
4.2.6. Classification Performance of Classifier DT with Individual Feature
4.2.7. Classification Performance on Full Features Set and on Selected Features Sets Selected by Filter-Based Dt (Id3), Ada Boost And Random Forest
4.2.8. Performance of Classifier on selected features set selected by Wrapper based Sequential Backward Selection algorithm
4.2.9. Performance Comparison of Our Method with Previous Methods for Diabetess Detection
5. Conclusions
Author Contributions
Funding
Conflicts of Interest
References
- Alberti, K.G.; Zimmet, P.S.J. International Diabetes Federation: A consensus on Type 2 diabetes prevention. Diabetes Med. 2007, 24, 51–63. [Google Scholar] [CrossRef] [PubMed]
- Inzucchi, S.; Bergenstal, R.; Fonseca, V.; Gregg, E.; Mayer-Davis, B.; Spollett, G.; Wender, R. Diagnosis and classification of diabetes mellitus. Diabetes Care 2010, 33, S62–S69. [Google Scholar]
- World Health Organization. World Health Statistics 2016: Monitoring Health for the SDGs Sustainable Development Goals; World Health Organization: Geneva, Switzerland, 2016. [Google Scholar]
- Mathers, C.D.; Loncar, D. Projections of global mortality and burden of disease from 2002 to 2030. PLoS Med. 2006, 3, e442. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Franciosi, M.; De Berardis, G.; Rossi, M.C.; Sacco, M.; Belfiglio, M.; Pellegrini, F.; Tognoni, G.; Valentini, M.; Nicolucci, A. Use of the diabetes risk score for opportunistic screening of undiagnosed diabetes and impaired glucose tolerance: The IGLOO (Impaired Glucose Tolerance and Long-Term Outcomes Observational) study. Diabetes Care 2005, 28, 1187–1194. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Freund, Y.; Schapire, R.; Abe, N. A short introduction to boosting. J. Jpn. Soc. Artif. Intell. 1999, 14, 1612. [Google Scholar]
- Breiman, L. Random forests. Mach. Learn. 2001, 45, 5–32. [Google Scholar] [CrossRef] [Green Version]
- Hospital Frankfurt Germany. Diabetes Data Set. Available online: https://www.kaggle.com/johndasilva/diabetes (accessed on 15 September 2019).
- Ramezani, R.; Maadi, M.; Khatami, S.M. A novel hybrid intelligent system with missing value imputation for diabetes diagnosis. Alex. Eng. J. 2018, 57, 1883–1891. [Google Scholar] [CrossRef]
- Pourpanah, F.; Lim, C.P.; Saleh, J.M. A hybrid model of fuzzy ARTMAP and genetic algorithm for data classification and rule extraction. Expert Syst. Appl. 2016, 49, 74–85. [Google Scholar] [CrossRef]
- Patil, B.M.; Joshi, R.C.; Toshniwal, D. Hybrid prediction model for type-2 diabetic patients. Expert Syst. Appl. 2010, 37, 8102–8108. [Google Scholar] [CrossRef]
- Yilmaz, N.; Inan, O.; Uzer, M.S. A new data preparation method based on clustering algorithms for diagnosis systems of heart and diabetes diseases. J. Med. Syst. 2014, 38, 1–12. [Google Scholar] [CrossRef]
- Alić, B.; Gurbeta, L.; Badnjević, A. Machine learning techniques for classification of diabetes and cardiovascular diseases. In Proceedings of the IEEE 6th Mediterranean Conference on Embedded Computing, Bar, Montenegro, 11–15 June 2017; pp. 1–4. [Google Scholar]
- Kayaer, K.; Yildirim, T. Medical diagnosis on Pima Indian diabetes using general regression neural networks. In Proceedings of the International Conference on Artificial Neural Networks and Neural Information Processing, Istanbul, Turkey, 26–29 June 2003; Volume 181, p. 184. [Google Scholar]
- Temurtas, H.; Yumusak, N.; Temurtas, F. A comparative study on diabetes disease diagnosis using neural networks. Expert Syst. Appl. 2009, 36, 8610–8615. [Google Scholar] [CrossRef]
- Polat, K.; Güneş, S. An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease. Digit. Signal Process. 2007, 17, 702–710. [Google Scholar] [CrossRef]
- Sagir, A.M.; Sathasivam, S. Design of a modified adaptive neuro fuzzy inference system classifier for medical diagnosis of Pima Indians Diabetes. In Proceedings of the 24th National Symposium on Mathematical Sciences: Mathematical Sciences Exploration for the Universal Preservation (AIP Conference Proceedings 1870), Kuala Terengganu, Malaysia, 27–29 September 2016; p. 040048. [Google Scholar]
- Kahramanli, H.; Allahverdi, N. Design of a hybrid system for the diabetes and heart diseases. Expert Syst. Appl. 2008, 35, 82–89. [Google Scholar] [CrossRef]
- Polat, K.; Güneş, S.; Arslan, A. A cascade learning system for classification of diabetes disease: Generalized discriminant analysis and least square support vector machine. Expert Syst. Appl. 2008, 34, 482–487. [Google Scholar] [CrossRef]
- Guo, Y.; Bai, G.; Hu, Y. Using bayes network for prediction of type-2 diabetes. In Proceedings of the IEEE International Conference for Internet Technology and Secured Transactions, London, UK, 10–12 December 2012; pp. 471–472. [Google Scholar]
- Aslam, M.W.; Zhu, Z.; Nandi, A.K. Feature generation using genetic programming with comparative partner selection for diabetes classification. Expert Syst. Appl. 2013, 40, 5402–5412. [Google Scholar] [CrossRef]
- Wettayaprasit, W.; Sangket, U. Linguistic knowledge extraction from neural networks using maximum weight and frequency data representation. In Proceedings of the IEEE Conference on Cybernetics and Intelligent Systems, Bangkok, Thailand, 7–9 June 2006; pp. 1–6. [Google Scholar]
- Ganji, M.F.; Abadeh, M.S. Using fuzzy ant colony optimization for diagnosis of diabetes disease. In Proceedings of the IEEE 18th Iranian Conference on Electrical Engineering, Isfahan, Iran, 11–13 May 2010; pp. 501–505. [Google Scholar]
- Beloufa, F.; Chikh, M.A. Design of fuzzy classifier for diabetes disease using Modified Artificial Bee Colony algorithm. Comput. Methods Programs Biomed. 2013, 112, 92–103. [Google Scholar] [CrossRef]
- Li, W.; Li, Y.; Hu, C.; Chen, X.; Dai, H. Point process analysis in brain networks of patients with diabetes. Neurocomputing 2014, 145, 182–189. [Google Scholar] [CrossRef]
- Cheruku, R.; Edla, D.R.; Kuppili, V. SM-RuleMiner: Spider monkey based rule miner using novel fitness function for diabetes classification. Comput. Biol. Med. 2017, 81, 79–92. [Google Scholar] [CrossRef]
- Zangooei, M.H.; Habibi, J.; Alizadehsani, R. Disease Diagnosis with a hybrid method SVR using NSGA-II. Neurocomputing 2014, 136, 14–29. [Google Scholar] [CrossRef]
- Ani, R.; Krishna, S.; Anju, N.; Aslam, M.S.; Deepa, O. Iot based patient monitoring and diagnostic prediction tool using ensemble classifier. In Proceedings of the IEEE International Conference on Advances in Computing, Communications and Informatics, Udupi, India, 13–16 September 2017; pp. 1588–1593. [Google Scholar]
- Yang, Z.; Zhou, Q.; Lei, L.; Zheng, K.; Xiang, W. An IoT-cloud based wearable ECG monitoring system for smart healthcare. J. Med. Syst. 2016, 40, 286. [Google Scholar] [CrossRef]
- Khan, J.; Li, J.P.; Ahamad, B.; Parveen, S.; Haq, A.U.; Khan, G.A.; Sangaiah, A.K. SMSH: Secure Surveillance Mechanism on Smart Healthcare IoT System With Probabilistic Image Encryption. IEEE Access 2020, 8, 15747–15767. [Google Scholar] [CrossRef]
- Migliorelli, L.; Moccia, S.; Avellino, I.; Fiorentino, M.C.; Frontoni, E. MyDi application: Towards automatic activity annotation of young patients with Type 1 diabetes. In Proceedings of the 2019 IEEE 23rd International Symposium on Consumer Technologies (ISCT), Ancona, Italy, 19–21 June 2019; pp. 220–224. [Google Scholar]
- Spänig, S.; Emberger-Klein, A.; Sowa, J.P.; Canbay, A.; Menrad, K.; Heider, D. The virtual doctor: An interactive clinical-decision-support system based on deep learning for non-invasive prediction of diabetes. Artif. Intell. Med. 2019, 100, 101706. [Google Scholar] [CrossRef] [PubMed] [Green Version]
- Kotsiantis, S.; Kanellopoulos, D.; Pintelas, P. Data preprocessing for supervised leaning. Int. J. Comput. Sci. 2006, 1, 111–117. [Google Scholar]
- Alasadi, S.A.; Bhaya, W.S. Review of data preprocessing techniques in data mining. J. Eng. Appl. Sci. 2017, 12, 4102–4107. [Google Scholar]
- Chen, J.; Luo, D.-L.; Mu, F.-X. An improved ID3 decision tree algorithm. In Proceedings of the IEEE 4th International Conference on Computer Science &Education, Nanning, China, 25–28 July 2009; pp. 127–130. [Google Scholar]
- Valencia, R.; Andrade-Cetto, J. Mapping, Planning and Exploration with Pose SLAM; Springer: Berlin/Heidelberg, Germany, 2018; Volume 74. [Google Scholar]
- Ferri, F.; Pudil, P.; Hatef, M.; Kittler, J. Comparative study of techniques for large-scale feature selection. In Machine Intelligence and Pattern Recognition; Elsevier: Amsterdam, The Netherlands, 1994; Volume 16, pp. 403–413. [Google Scholar]
- Pudil, P.; Novovičová, J.; Kittler, J. Floating search methods in feature selection. Pattern Recognit. Lett. 1994, 15, 1119–1125. [Google Scholar] [CrossRef]
- Rother, C.; Kolmogorov, V.; Blake, A. “GrabCut” interactive foreground extraction using iterated graph cuts. ACM Trans. Graph. (TOG) 2004, 23, 309–314. [Google Scholar] [CrossRef]
- Wu, X.; Kumar, V.; Quinlan, J.R.; Ghosh, J.; Yang, Q.; Motoda, H.; McLachlan, G.J.; Ng, A.; Liu, B.; Philip, S.Y.; et al. Top 10 algorithms in data mining. Knowl. Inf. Syst. 2008, 14, 1–37. [Google Scholar] [CrossRef] [Green Version]
- Quinlan, J. Induction of Decision Trees. Mach. Learn. 1986, 1, 81–106. [Google Scholar] [CrossRef] [Green Version]
- Haq, A.U.; Li, J.P.; Memon, M.H.; Nazir, S.; Sun, R. A hybrid intelligent system framework for the prediction of heart disease using machine learning algorithms. Mob. Inf. Syst. 2018, 2018. [Google Scholar] [CrossRef]
- Safavian, S.R.; Landgrebe, D. A survey of decision tree classifier methodology. IEEE Trans. Syst. Man Cybern. 1991, 21, 660–674. [Google Scholar] [CrossRef] [Green Version]
- Mingers, J. An empirical comparison of pruning methods for decision tree induction. Mach. Learn. 1989, 4, 227–243. [Google Scholar] [CrossRef] [Green Version]
- Pal, M.; Mather, P.M. An assessment of the effectiveness of decision tree methods for land cover classification. Remote Sens. Environ. 2003, 86, 554–565. [Google Scholar] [CrossRef]
- Shouman, M.; Turner, T.; Stocker, R. Using decision tree for diagnosing heart disease patients. In Proceedings of the Ninth Australasian Data Mining Conference, Ballarat, Australia, 1–2 December 2011; Volume 121, pp. 23–30. [Google Scholar]
- Chasmer, L.; Hopkinson, C.; Veness, T.; Quinton, W.; Baltzer, J. A decision-tree classification for low-lying complex land cover types within the zone of discontinuous permafrost. Remote Sens. Environ. 2014, 143, 73–84. [Google Scholar] [CrossRef]
- Haq, A.U.; Li, J.; Memon, M.H.; Khan, J.; Din, S.U.; Ahad, I.; Sun, R.; Lai, Z. Comparative analysis of the classification performance of machine learning classifiers and deep neural network classifier for prediction of Parkinson disease. In Proceedings of the IEEE 15th International Computer Conference on Wavelet Active Media Technology and Information Processing, Chengdu, China, 14–16 December 2018; pp. 101–106. [Google Scholar]
- Haq, A.U.; Li, J.P.; Memon, M.H.; Malik, A.; Ahmad, T.; Ali, A.; Nazir, S.; Ahad, I.; Shahid, M. Feature selection based on L1-norm support vector machine and effective recognition system for Parkinson’s disease using voice recordings. IEEE Access 2019, 7, 37718–37734. [Google Scholar] [CrossRef]
- Tsanas, A.; Little, M.A.; McSharry, P.E.; Spielman, J.; Ramig, L.O. Novel speech signal processing algorithms for high-accuracy classification of Parkinson’s disease. IEEE Trans. Biomed. Eng. 2012, 59, 1264–1271. [Google Scholar] [CrossRef] [Green Version]
- Naranjo, L.; Pérez, C.J.; Martín, J.; Campos-Roca, Y. A two-stage variable selection and classification approach for Parkinson’s disease detection by using voice recording replications. Comput. Methods Programs Biomed. 2017, 142, 147–156. [Google Scholar] [CrossRef]
- Cai, Z.; Gu, J.; Chen, H.L. A new hybrid intelligent framework for predicting Parkinson’s disease. IEEE Access 2017, 5, 17188–17200. [Google Scholar] [CrossRef]
- Wang, Z.; Li, M.; Wang, H.; Jiang, H.; Yao, Y.; Zhang, H.; Xin, J. Breast cancer detection using extreme learning machine based on feature fusion with CNN deep features. IEEE Access 2019, 7, 105146–105158. [Google Scholar] [CrossRef]
- Everitt, B.S. The Analysis of Contingency Tables; CRC Press: Boca Raton, FL, USA, 1992. [Google Scholar]
- Ul Haq, A.; Li, J.; Memon, M.H.; Khan, J.; Ud Din, S. A novel integrated diagnosis method for breast cancer detection. J. Intell. Fuzzy Syst. 2020, 38, 2383–2398. [Google Scholar] [CrossRef]
- Kohli, P.S.; Arora, S. Application of Machine Learning in Disease Prediction. In Proceedings of the IEEE 4th International Conference on Computing Communication and Automation, Greater Noida, India, 14–15 December 2018; pp. 1–4. [Google Scholar]
- Dey, S.K.; Hossain, A.; Rahman, M.M. Implementation of a web application to predict diabetes disease: An approach using machine learning algorithm. In Proceedings of the IEEE 21st International Conference of Computer and Information Technology, Dhaka, Bangladesh, 21–23 December 2018; pp. 1–5. [Google Scholar]
- Aofa, F.; Sasongko, P.S.; Adzani, W.A. Early Detection System Of Diabetes Mellitus Disease Using Artificial Neural Network Backpropagation With Adaptive Learning Rate And Particle Swarm Optimization. In Proceedings of the 2018 2nd International Conference on Informatics and Computational Sciences (ICICoS), Semarang, Indonesia, 30–31 October 2018; pp. 1–5. [Google Scholar]
- Fitriyani, N.L.; Syafrudin, M.; Alfian, G.; Rhee, J. Development of Disease Prediction Model Based on Ensemble Learning Approach for Diabetes and Hypertension. IEEE Access 2019, 7, 144777–144789. [Google Scholar] [CrossRef]
- Wang, Y.S.; Wang, Y. A gradient-based approach for optimal plant controller co-design. In Proceedings of the IEEE American Control Conference (ACC), Chicago, IL, USA, 1–3 July 2015; pp. 3249–3254. [Google Scholar]
Symbol | Description |
---|---|
H | Data set |
S | Subset |
F | Feature set |
n | Number of instances in dataset |
X | Input features in dataset |
Y | Predicted output classes label |
b | Bais is offset value from the origin |
w | d-dimensional coefficient vector |
i | i is ith sample in data set |
ith instance of dataset sample X | |
Target labels to x | |
R | Training set |
T | Test set |
t | Finite set |
IG(F) | Information gain |
p-value | Test probability value |
Degree of freedom | |
f | Feature in dataset |
MI | Mutual information |
ith feature in dataset | |
Empty sect | |
p | probability |
Null hypothesis | |
Alternate hypothesis |
Feature Name | Feature Code | Description | Min-Max | Mean, (±) STD |
---|---|---|---|---|
Pregnancies | PG | Number of period pregnant | 0.000000–17.000000 | 3.703500, (±) 3.306063 |
Glucose | GL | Plasma glucose concentrations | 0.000000–199.000000 | 121.182500, (±) 32.068636 |
Blood Pressure | BP | Blood pressures (mm Hg) | 0.000000–122.000000 | 69.145500, (±)19.188315 |
Skin Thickness | ST | Triceps skin fold thickness(mm) | 0.000000–110.000000 | 20.935000, (±) 16.103243 |
Insulin | IS | Serum insulin concentration | 0.000000–744.000000 | 80.254000, (±)111.180534 |
BMI | BMI | Blood mass index | 0.000000–80.600000 | 32.193000, (±) 8.149901 |
Diabetes Pedigree Function | DPF | Diabetes pedigree function | 0.078000–2.420000 | 0.470930, (±) 0.323553 |
Age | AGE | Age in years | 21.000000–81.000000 | 33.090500, (±)11.786423 |
Outcome | 1 = yes | Diabetes = 1 | 0.000000–1.000000 | 0.342000, (±) 0.474498 |
0 = no | Healthy = 0 |
S.No | Feature Label | Ranking | Score |
---|---|---|---|
1 | PG | IS | 0.07605 |
2 | GL | ST | 0.07947 |
3 | BP | BP | 0.10179 |
4 | ST | PG | 0.11071 |
5 | IS | DPF | 0.11491 |
6 | BMI | BMI | 0.13829 |
7 | DPF | AGE | 0.14366 |
8 | AGE | GL | 0.23511 |
S.NO | Feature Set | Feature Selection Algorithm | ||
---|---|---|---|---|
DT(ID3) | Ada Boost | Random Forest | ||
1 | PG | GL | GL | BP |
2 | GL | AGE | BMI | GL |
3 | BP | IS | DPF | AGE |
4 | ST | DPF | BP | ST |
5 | IS | BMI | AGE | IS |
6 | BMI | BP | IS | BMI |
7 | DPF | PG | DPE | |
8 | AGE |
Classifier | Feature | Acc (%) | Sn (%) | Sp (%) | MCC (%) | ROC-AUC (%) | K-Fold (%) | LOSO (%) | Time (s) |
---|---|---|---|---|---|---|---|---|---|
DT | GL | 75 | 45 | 88 | 67 | 67 | 77 | 76 | 0.001 |
BP | 68 | 8 | 74 | 52 | 53 | 67 | 66 | 0.005 | |
BMI | 74 | 45 | 88 | 66 | 66 | 73 | 72 | 0.005 | |
DPF | 84 | 66 | 87 | 78 | 78 | 84 | 83 | 0.002 | |
IS | 73 | 34 | 92 | 64 | 63 | 73 | 73 | 0.001 | |
ST | 68 | 14 | 95 | 54 | 54 | 65 | 66 | 0.001 | |
PG | 69 | 27 | 90 | 59 | 58 | 69 | 70 | 0.0009 | |
AGE | 70 | 40 | 85 | 62 | 63 | 70 | 71 | 0.0018 | |
Full with GL | 98.2 | 100 | 97 | 99 | 99 | 99 | 99.8 | 0.006 | |
Without GL | 97 | 75 | 82 | 97 | 97 | 99.5 | 99.7 | 0.005 |
Feature Set Selection | Acc (%) | Sn (%) | Sp (%) | MCC (%) | Pre (%) | Rec (%) | F1 (%) | ROC (%) | K-Folds (%) | LOSO (%) | Time (S) |
---|---|---|---|---|---|---|---|---|---|---|---|
Full set | 98.2 | 98 | 97 | 97 | 99.8 | 98 | 98.6 | 98 | 99.2 | 99.6 | 0.006 |
ID3 | 99 | 100 | 98 | 99 | 100 | 100 | 100 | 99.8 | 99.8 | 99.9 | 0.005 |
Ada Boost | 98.5 | 98 | 99 | 98 | 98 | 98 | 99 | 98.6 | 99.3 | 99.6 | 0.004 |
Random Forest | 98.3 | 98 | 98 | 98 | 95 | 98 | 99 | 98.7 | 99.4 | 99.7 | 0.006 |
Feature Set Selection | Acc (%) | Sn (%) | Sp (%) | MCC (%) | Pre (%) | F1 (%) | ROC (%) | K-Fold (%) | LOSO (%) | Time (s) |
---|---|---|---|---|---|---|---|---|---|---|
SBS | 98 | 99 | 98 | 98 | 99 | 98 | 97.6 | 98.5 | 98.9 | 0.007 |
Reference | Method | Accuracy (%) | p-Value |
---|---|---|---|
[9] | LANFIS | 88.05 | 0.87 |
[26] | SM-Rule-Miner | 89.87 | 0.92 |
[10] | TSHDE | 91.91 | 0.21 |
[11] | C4.5 algorithm | 92.38 | 0.69 |
[12] | Modified K-Means Clustering +SVM (10-FC) | 96.71 | 0.07 |
[56] | Support Vector Machine | 97.14 | 0.06 |
[57] | Artificial Neural Network (ANN) | 82.35 | 1.23 |
[58] | SBNN + PSO + ALR | 88.75 | 0.31 |
[59] | DPM | 96.74 | 0.08 |
[60] | DNN | 95.6 | 0.09 |
[13] | BN | 99.51 | 0.06 |
DT(ID3) + DT | 99 (Hold out) | 0.04 | |
Our study | DT(ID3) + DT | 99.8 (K-fold) | |
DT(ID3) + DT | 99.9 (LOSO) |
© 2020 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (http://creativecommons.org/licenses/by/4.0/).
Share and Cite
Haq, A.U.; Li, J.P.; Khan, J.; Memon, M.H.; Nazir, S.; Ahmad, S.; Khan, G.A.; Ali, A. Intelligent Machine Learning Approach for Effective Recognition of Diabetes in E-Healthcare Using Clinical Data. Sensors 2020, 20, 2649. https://doi.org/10.3390/s20092649
Haq AU, Li JP, Khan J, Memon MH, Nazir S, Ahmad S, Khan GA, Ali A. Intelligent Machine Learning Approach for Effective Recognition of Diabetes in E-Healthcare Using Clinical Data. Sensors. 2020; 20(9):2649. https://doi.org/10.3390/s20092649
Chicago/Turabian StyleHaq, Amin Ul, Jian Ping Li, Jalaluddin Khan, Muhammad Hammad Memon, Shah Nazir, Sultan Ahmad, Ghufran Ahmad Khan, and Amjad Ali. 2020. "Intelligent Machine Learning Approach for Effective Recognition of Diabetes in E-Healthcare Using Clinical Data" Sensors 20, no. 9: 2649. https://doi.org/10.3390/s20092649
APA StyleHaq, A. U., Li, J. P., Khan, J., Memon, M. H., Nazir, S., Ahmad, S., Khan, G. A., & Ali, A. (2020). Intelligent Machine Learning Approach for Effective Recognition of Diabetes in E-Healthcare Using Clinical Data. Sensors, 20(9), 2649. https://doi.org/10.3390/s20092649